4.7 Article

Resampling-Based Ensemble Methods for Online Class Imbalance Learning

期刊

出版社

IEEE COMPUTER SOC
DOI: 10.1109/TKDE.2014.2345380

关键词

Class imbalance; resampling; online learning; ensemble learning; Bagging

资金

  1. European Commission FP7 Grants [270428, 257906]
  2. EPSRC Grant [EP/J017515/1]
  3. Royal Society Wolfson Research Merit Award
  4. EPSRC [EP/J017515/1] Funding Source: UKRI
  5. Engineering and Physical Sciences Research Council [EP/J017515/1] Funding Source: researchfish

向作者/读者索取更多资源

Online class imbalance learning is a new learning problem that combines the challenges of both online learning and class imbalance learning. It deals with data streams having very skewed class distributions. This type of problems commonly exists in real-world applications, such as fault diagnosis of real-time control monitoring systems and intrusion detection in computer networks. In our earlier work, we defined class imbalance online, and proposed two learning algorithms OOB and UOB that build an ensemble model overcoming class imbalance in real time through resampling and time-decayed metrics. In this paper, we further improve the resampling strategy inside OOB and UOB, and look into their performance in both static and dynamic data streams. We give the first comprehensive analysis of class imbalance in data streams, in terms of data distributions, imbalance rates and changes in class imbalance status. We find that UOB is better at recognizing minority-class examples in static data streams, and OOB is more robust against dynamic changes in class imbalance status. The data distribution is a major factor affecting their performance. Based on the insight gained, we then propose two new ensemble methods that maintain both OOB and UOB with adaptive weights for final predictions, called WEOB1 and WEOB2. They are shown to possess the strength of OOB and UOB with good accuracy and robustness.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据