4.7 Article

What makes multi-class imbalanced problems difficult? An experimental study

Journal

EXPERT SYSTEMS WITH APPLICATIONS
Volume 199, Issue -, Pages -

Publisher

PERGAMON-ELSEVIER SCIENCE LTD
DOI: 10.1016/j.eswa.2022.116962

Keywords

Imbalanced data; Classification; Learning from multiple classes; Data difficulty factors

Funding

  1. Polish National Science Centre [2016/22/E/ST6/00299]
  2. TAILOR - EU Horizon 2020 research and innovation programme [952215]

Ask authors/readers for more resources

This study experimentally investigates the impact of various multi-class imbalanced difficulty factors on the performance of classifiers. The results reveal that class overlapping and class size configurations are important difficulties.
Multi-class imbalanced classification is more difficult and less frequently studied than its binary counterpart. Moreover, research on the causes of the difficulty of multi-class imbalanced data is quite limited and insufficient. Therefore, we experimentally study the impact of various multi-class imbalanced difficulty factors on the performance of three popular classifiers. The results demonstrated a strong influence of the class overlapping with the extent of its impact related to the types of overlapped classes. In particular, overlapping between minority and majority classes was more difficult than the others. The type of the class size configuration turned out to be another important factor, highlighting the special role of the configurations with classes of intermediate sizes. The obtained results could support studying the nature of the multi-class imbalanced data as well as the development of new methods for improving classifiers.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available