期刊
EXPERT SYSTEMS WITH APPLICATIONS
卷 199, 期 -, 页码 -出版社
PERGAMON-ELSEVIER SCIENCE LTD
DOI: 10.1016/j.eswa.2022.116962
关键词
Imbalanced data; Classification; Learning from multiple classes; Data difficulty factors
类别
资金
- Polish National Science Centre [2016/22/E/ST6/00299]
- TAILOR - EU Horizon 2020 research and innovation programme [952215]
This study experimentally investigates the impact of various multi-class imbalanced difficulty factors on the performance of classifiers. The results reveal that class overlapping and class size configurations are important difficulties.
Multi-class imbalanced classification is more difficult and less frequently studied than its binary counterpart. Moreover, research on the causes of the difficulty of multi-class imbalanced data is quite limited and insufficient. Therefore, we experimentally study the impact of various multi-class imbalanced difficulty factors on the performance of three popular classifiers. The results demonstrated a strong influence of the class overlapping with the extent of its impact related to the types of overlapped classes. In particular, overlapping between minority and majority classes was more difficult than the others. The type of the class size configuration turned out to be another important factor, highlighting the special role of the configurations with classes of intermediate sizes. The obtained results could support studying the nature of the multi-class imbalanced data as well as the development of new methods for improving classifiers.
作者
我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。
推荐
暂无数据