Article

REMEDIAL-HwR: Tackling multilabel imbalance through label decoupling and data resampling hybridization

Journal

NEUROCOMPUTING
Volume 326, Pages 110-122

Publisher

ELSEVIER
DOI: 10.1016/j.neucom.2017.01.118

Keywords

Multilabel classification; Imbalanced learning; Resampling algorithms; Label concurrence

Funding

  1. Spanish Ministry of Economy, Industry and Competitiveness [TIN2014-57251-P, TIN2015-68454-R]
  2. Andalusian regional project [P11-TIC-7765]

Learning from imbalanced data is a widely studied problem in standard classification and, more recently, in multilabel classification as well. A handful of multilabel resampling methods have been proposed in recent years, aiming to balance the label distribution. However, these methods face a new obstacle that is specific to multilabel data: the joint appearance of minority and majority labels in the same data patterns. We recently presented a new algorithm designed to decouple imbalanced labels concurring in the same instance, called REMEDIAL (REsampling MultilabEl datasets by Decoupling highly ImbAlanced Labels). The goal of this work is to propose REMEDIAL-HwR (REMEDIAL Hybridization with Resampling), a procedure that hybridizes this method with some of the best resampling algorithms available in the literature, including random oversampling, heuristic undersampling and synthetic sample generation techniques. These hybrid methods are then analyzed empirically to determine how their behavior is influenced by the label decoupling process. The analysis of results shows that the proposed method improves the performance of certain classifiers when it is applied to imbalanced datasets with label concurrence. In addition, a set of guidelines on the combined use of these techniques can be drawn from the experiments conducted. (C) 2017 Elsevier B.V. All rights reserved.
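The two-stage idea described in the abstract (first decouple concurring minority and majority labels, then resample) can be illustrated with a minimal Python sketch. This is not the authors' implementation. It assumes a dataset stored as a list of `(features, label_set)` pairs, treats a label as "minority" when its count falls below the mean label count (a simplified stand-in for the imbalance and concurrence measures used in the paper), and uses plain random oversampling as the resampling step. All function names are hypothetical.

```python
import random
from collections import Counter


def label_counts(dataset):
    """Count how many instances carry each label."""
    return Counter(label for _, labels in dataset for label in labels)


def split_minority_majority(dataset):
    """Assumed rule: labels rarer than the mean label count are minority."""
    counts = label_counts(dataset)
    if not counts:
        return set(), set()
    mean_count = sum(counts.values()) / len(counts)
    minority = {label for label, c in counts.items() if c < mean_count}
    return minority, set(counts) - minority


def decouple(dataset):
    """Split every instance that mixes minority and majority labels in two.

    One copy keeps only the majority labels and the other keeps only the
    minority labels; the features are shared unchanged.
    """
    minority, majority = split_minority_majority(dataset)
    result = []
    for features, labels in dataset:
        minor, major = labels & minority, labels & majority
        if minor and major:
            result.append((features, major))
            result.append((features, minor))
        else:
            result.append((features, labels))
    return result


def random_oversample(dataset, extra, seed=0):
    """Append `extra` randomly chosen clones of minority-label instances."""
    minority, _ = split_minority_majority(dataset)
    candidates = [inst for inst in dataset if inst[1] & minority]
    if not candidates:
        return list(dataset)
    rng = random.Random(seed)
    return list(dataset) + [rng.choice(candidates) for _ in range(extra)]


def decouple_then_resample(dataset, extra, seed=0):
    """Hybrid pipeline: label decoupling followed by resampling."""
    return random_oversample(decouple(dataset), extra, seed)
```

Running the decoupling step first means that the clones added by oversampling carry only minority labels, so raising the frequency of a rare label does not also raise the frequency of the common labels it used to co-occur with.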