4.7 Article

Reinforcement Online Active Learning Ensemble for Drifting Imbalanced Data Streams

期刊

出版社

IEEE COMPUTER SOC
DOI: 10.1109/TKDE.2020.3026196

关键词

Labeling; Classification algorithms; Learning systems; Uncertainty; Training; Heuristic algorithms; Bagging; Online active learning; reinforcement; ensemble learning; concept drift; class imbalance

资金

  1. China Advance Research Fund [9140C830304150C83352]

向作者/读者索取更多资源

This paper proposes a novel approach named ROALE-DI to handle the challenges of concept drift and class imbalance. By applying a reinforcement mechanism to increase the weight of dynamic classifiers, the classification performance is improved. The hybrid labeling strategy is introduced to determine the real label of instances, reducing the labeling cost.
Applications challenged by the joint problem of concept drift and class imbalance are attracting increasing research interest. This paper proposes a novel Reinforcement Online Active Learning Ensemble for Drifting Imbalanced data stream (ROALE-DI). The ensemble classifier has a long-term stable classifier and a dynamic classifier group which applies a reinforcement mechanism to increase the weight of the dynamic classifiers, which perform better on the minority class, and decreases the weight of the opposite. When the data stream is class imbalanced, the classifiers will lack the training samples of the minority class. To supply training samples, when creating a new classifier, the labeled instances buffer is used to provide instances of the minority class. Then, a hybrid labeling strategy that combines the uncertainty strategy and imbalance strategy is proposed to define whether to obtain the real label of an instance. An experimental evaluation compares the classification performance of the proposed method with semi-supervised and supervised algorithms on both real-world and synthetic data streams. The results show that the ROALE-DI achieves higher Area Under the ROC Curve (AUC) and accuracy values with even fewer real labels, and the labeling cost dynamically adjusts according to the concept drift and class imbalance ratio.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据