4.7 Article

What makes multi-class imbalanced problems difficult? An experimental study

期刊

EXPERT SYSTEMS WITH APPLICATIONS
卷 199, 期 -, 页码 -

出版社

PERGAMON-ELSEVIER SCIENCE LTD
DOI: 10.1016/j.eswa.2022.116962

关键词

Imbalanced data; Classification; Learning from multiple classes; Data difficulty factors

资金

  1. Polish National Science Centre [2016/22/E/ST6/00299]
  2. TAILOR - EU Horizon 2020 research and innovation programme [952215]

向作者/读者索取更多资源

This study experimentally investigates the impact of various multi-class imbalanced difficulty factors on the performance of classifiers. The results reveal that class overlapping and class size configurations are important difficulties.
Multi-class imbalanced classification is more difficult and less frequently studied than its binary counterpart. Moreover, research on the causes of the difficulty of multi-class imbalanced data is quite limited and insufficient. Therefore, we experimentally study the impact of various multi-class imbalanced difficulty factors on the performance of three popular classifiers. The results demonstrated a strong influence of the class overlapping with the extent of its impact related to the types of overlapped classes. In particular, overlapping between minority and majority classes was more difficult than the others. The type of the class size configuration turned out to be another important factor, highlighting the special role of the configurations with classes of intermediate sizes. The obtained results could support studying the nature of the multi-class imbalanced data as well as the development of new methods for improving classifiers.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据