4.5 Article

Majority-to-minority resampling for boosting-based classification under imbalanced data

期刊

APPLIED INTELLIGENCE
卷 53, 期 4, 页码 4541-4562

出版社

SPRINGER
DOI: 10.1007/s10489-022-03585-2

关键词

Classification; Class imbalance; Sampling; Boosting

向作者/读者索取更多资源

This paper proposes a hybrid strategy called Majority-to-Minority Resampling (MMR) to solve the problem of class imbalance by adaptively sampling potential instances from the majority class to augment the minority class. To reduce the loss of information after sampling, a Majority-to-Minority Boosting (MMBoost) algorithm is also proposed to dynamically adjust the weights of the sampled instances for classification. Extensive experiments using real-world datasets demonstrate that the proposed framework achieves competitive performance in dealing with imbalanced data.
Classification is a classical research field due to its broad applications in data mining such as event extraction, spam detection, and medical treatment. However, class imbalance is an unavoidable problem in many real-world applications. It is challenging for conventional learning algorithms to deal with imbalanced datasets, since they tend to be biased towards the majority class, while the minority class is crucial as well. Many previous studies have been explored to solve class imbalance, such as data sampling and class switching. In this paper, we propose a hybrid strategy named Majority-to-Minority Resampling (MMR) to select switched instances, which adaptively samples potential instances from the majority class to augment the minority class. To reduce the loss of information after sampling, we also propose a Majority-to-Minority Boosting (MMBoost) algorithm for classification by dynamically adjusting weights of the sampled instances. We conduct extensive experiments using real-world datasets. Experimental results demonstrate that the proposed framework achieves competitive performance for dealing with imbalanced data compared to several strong baselines across different common metrics.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.5
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据