4.6 Article

Unsupervised concept drift detection for multi-label data streams

期刊

ARTIFICIAL INTELLIGENCE REVIEW
卷 56, 期 3, 页码 2401-2434

出版社

SPRINGER
DOI: 10.1007/s10462-022-10232-2

关键词

Big data; Multi-label data stream; Multi-label classification; Concept drift; Drift detection

向作者/读者索取更多资源

Many real-world applications adopt multi-label data streams as the need for algorithms to deal with rapidly changing data increases. We propose a novel algorithm called Label Dependency Drift Detector (LD3) for unsupervised concept drift detection in multi-label data streams. Our study shows that LD3 provides better predictive performance than other detectors on both real-world and synthetic data streams.
Many real-world applications adopt multi-label data streams as the need for algorithms to deal with rapidly changing data increases. Changes in data distribution, also known as concept drift, cause existing classification models to rapidly lose their effectiveness. To assist the classifiers, we propose a novel algorithm called Label Dependency Drift Detector (LD3), an unsupervised concept drift detector using label dependencies within the data for multi-label data streams. Our study exploits the dynamic temporal dependencies between labels using a label influence ranking method, which leverages a data fusion algorithm and uses the produced ranking to detect concept drift. LD3 is the first unsupervised concept drift detection algorithm in the multi-label classification problem area. In this study, we perform an extensive evaluation of LD3 by comparing it with 14 prevalent supervised concept drift detection algorithms that we adapt to the problem area using 15 datasets and a baseline classifier. The results show that LD3 provides between 16.9 and 56% better predictive performance than comparable detectors on both real-world and synthetic data streams.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据