4.7 Article

ECC plus plus : An algorithm family based on ensemble of classifier chains for classifying imbalanced multi-label data

Journal

EXPERT SYSTEMS WITH APPLICATIONS
Volume 236, Issue -, Pages -

Publisher

PERGAMON-ELSEVIER SCIENCE LTD
DOI: 10.1016/j.eswa.2023.121366

Keywords

Multi-label learning; Class imbalance; Ensemble of classifier chains; Sampling; Cost-sensitive learning; Threshold-moving

Ask authors/readers for more resources

Multi-label learning has a wide range of real-world applications and the problem of class imbalance in multilabel data has been less addressed. This study proposes the ECC++ algorithm family, which combines the ensemble classifier chain algorithm with binary-class imbalance learning techniques to tackle the challenges of class imbalance and label correlations. Experimental results demonstrate the effectiveness and superiority of ECC++ over existing class imbalance multi-label learning algorithms.
Multi-label learning has attracted a great deal of research interests as it has a wide range of real-world applications. Although many multilabel learning methods have been proposed, very few of them have addressed the problem of class imbalance distribution in multilabel data. Moreover, most of the existing class imbalance multilabel learning algorithms only focus on solving the class imbalance problem, without taking into account the correlations among labels. To address these issues simultaneously, we propose to combine the well-known ensemble of classifier chain (ECC) algorithm with various binary-class imbalance learning techniques such as sampling, cost-sensitive learning, and threshold moving. This approach creates a new algorithm family called ECC++, designed specifically for class imbalance multi-label learning. ECC is already an excellent ensemble high-order binary relevance multi-label learning algorithm that is well-suited to exploiting correlations among labels. Combining it with binary-class imbalance learning techniques enables each link in a classifier chain (CC) to overcome the negative effect of skewed data distribution. ECC++ is a dynamic algorithm family that can be extended arbitrarily by applying any new binary-class imbalance learning techniques. To demonstrate the effectiveness and superiority of the proposed ECC++ algorithm family, we developed several ECC++ family members using some popular binary-class imbalance learning techniques. We then compared them with several state-of-the-art class imbalance multi-label learning algorithms on twelve benchmark and four real-world multilabel datasets. Our experimental results showed the effectiveness and superiority of the proposed ECC++ algorithm family over existing class imbalance multi-label learning algorithms. In conclusion, the proposed ECC++ algorithm family combines the strengths of the wellestablished ECC algorithm and binary-class imbalance learning techniques, resulting in a superior methodology for class imbalance multi-label learning.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available