期刊
KNOWLEDGE-BASED SYSTEMS
卷 215, 期 -, 页码 -出版社
ELSEVIER
DOI: 10.1016/j.knosys.2021.106778
关键词
Online active learning; Multiclass imbalance; Concept drift; Data stream
资金
- National Natural Science Fund of China [71971212]
The paper proposed a comprehensive active learning method (CALMID) for handling multiclass imbalanced streaming data with concept drift. Novel uncertainty strategies and sample weight formulas were designed, and experimental results showed that CALMID outperformed existing algorithms in various imbalance and drift scenarios.
A challenge to many real-world applications is multiclass imbalance with concept drift. In this paper, we propose a comprehensive active learning method for multiclass imbalanced streaming data with concept drift (CALMID). First, we design a comprehensive online active learning framework that includes an ensemble classifier, a drift detector, a label sliding window, sample sliding windows and an initialization training sample sequence. Next, a variable threshold uncertainty strategy based on an asymmetric margin threshold matrix is designed to comprehensively address the problem that a given class can simultaneously be a majority to a given subset of classes while also being a minority to others. Last but not least, we design a novel sample weight formula that comprehensively considers the class imbalance ratio of the sample's category and the prediction difficulty. On 10 multiclass synthetic streams with different imbalance ratios and concept drifts, and on 5 real-world imbalanced streams with 7 to 55 classes and unknown drifts, the experimental results demonstrate that the proposed CALMID is more effective and efficient than several state-of-the-art learning algorithms. (C) 2021 The Author(s). Published by Elsevier B.V.
作者
我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。
推荐
暂无数据