4.7 Article

NI-MWMOTE: An improving noise-immunity majority weighted minority oversampling technique for imbalanced classification problems

期刊

EXPERT SYSTEMS WITH APPLICATIONS
卷 158, 期 -, 页码 -

出版社

PERGAMON-ELSEVIER SCIENCE LTD
DOI: 10.1016/j.eswa.2020.113504

关键词

Imbalanced classification; Noise-immunity; MWMOTE; Clustering; Oversampling

资金

  1. National Natural Science Foundation of China [51865004]
  2. Major Science and Technology Plan of Guizhou Province [3002]
  3. Natural Science Foundation of Guizhou Province [5781]
  4. Guizhou Graduate Research Fund Project [Qianjiaohe YJSC XJH(2019)035]

向作者/读者索取更多资源

Oversampling techniques have been favored by researchers because of their simplicity and versatility in dealing with imbalanced classification problems. For oversampling techniques appeared in recent years (e.g. Majority Weighted Minority Oversampling Technique (MWMOTE)), noise processing plays an important role. This is because the processing of noise directly affects the distribution of new synthetic instances. MWMOTE and many other oversampling techniques use knn based noise processing method. While the knn method can effectively handle partial noise when the neighborhood parameter k value is reasonable, it may lead to under-recognition or over-recognition without prior experience. Therefore, we propose an improving noise-immunity majority weighted minority oversampling technique abbreviated NI-MWMOTE. NI-MWMOTE uses an adaptive noise processing scheme, which combines Euclidean distance and neighbor density to rank the probability that suspected noise (knn method) is true noise, and then adaptively selects the best noise processing scheme through iteration and misclassification error. Then, aggregative hierarchical clustering (AHC) method is used to cluster minority instances. And, in each sub-cluster, the sampling size of new samples is adaptively determined by classification complexity and cross-validation. NI-MWMOTE not only avoids the generation of new noise, but also effectively overcomes both between-class imbalances and within-class imbalances. Results demonstrate that NI-MWMOTE achieves significantly better results in most imbalanced datasets than eight popular oversampling algorithms. (c) 2020 Elsevier Ltd. All rights reserved.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据