Article

Online ensemble learning with abstaining classifiers for drifting and noisy data streams

APPLIED SOFT COMPUTING (2018)

Journal

APPLIED SOFT COMPUTING

Volume 68, Issue -, Pages 677-692

Publisher

ELSEVIER

DOI: 10.1016/j.asoc.2017.12.008

Keywords

Machine learning; Data stream mining; Concept drift; Ensemble learning; Abstaining classifier; Diversity

Abstract

Mining data streams is among the most vital contemporary topics in machine learning. Such a scenario requires adaptive algorithms that can process constantly arriving instances, adapt to potential changes in the data, use limited computational resources, and remain robust to any atypical events that may appear. Ensemble learning has proven itself to be an effective solution, as combining learners leads to improved predictive power, more flexible drift handling, and ease of implementation in high-performance computing environments. In this paper, we propose an enhancement of popular online ensembles by augmenting them with an abstaining option. Instead of relying on traditional voting, classifiers are allowed to abstain from contributing to the final decision. Their confidence level is monitored for each incoming instance, and only learners that exceed a certain threshold are selected. We introduce a dynamic and self-adapting threshold that adapts to changes in the data stream by monitoring the outputs of the ensemble, allowing it to exploit the underlying diversity in order to efficiently anticipate drifts. Additionally, we show that forcing uncertain classifiers to abstain from making a prediction is especially useful for noisy data streams. Our proposal is a lightweight enhancement that can be applied to any online ensemble method, improving its robustness to drifts and noise. Thorough experimental analysis validated through statistical tests proves the usefulness of the proposed approach. (C) 2017 Elsevier B.V. All rights reserved.
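
The abstaining idea described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names `abstaining_vote` and `update_threshold`, the fallback to a full vote when every learner abstains, and the fixed-step threshold update are all assumptions made for illustration, since the abstract does not specify them.

```python
from typing import List, Sequence


def abstaining_vote(probas: Sequence[Sequence[float]], threshold: float) -> int:
    """Majority vote over the learners whose confidence exceeds `threshold`.

    `probas` holds one class-probability vector per ensemble member.
    Assumption: if every learner abstains, fall back to a vote over all of them.
    """
    n_classes = len(probas[0])

    def tally(members: Sequence[Sequence[float]]) -> List[int]:
        votes = [0] * n_classes
        for p in members:
            # Each learner votes for its most probable class.
            votes[max(range(n_classes), key=lambda c: p[c])] += 1
        return votes

    # Learners whose top-class probability does not exceed the threshold abstain.
    confident = [p for p in probas if max(p) > threshold]
    votes = tally(confident if confident else probas)
    # Ties are broken in favour of the lowest class index.
    return max(range(n_classes), key=lambda c: votes[c])


def update_threshold(threshold: float, correct: bool, step: float = 0.01) -> float:
    """Hypothetical update rule (an assumption, not taken from the paper):
    lower the threshold after a correct ensemble decision, raise it after
    an error, and keep the result within [0, 1]."""
    new_value = threshold - step if correct else threshold + step
    return min(1.0, max(0.0, new_value))


# Three learners, two classes: only the first learner is highly confident.
outputs = [[0.9, 0.1], [0.45, 0.55], [0.4, 0.6]]
strict = abstaining_vote(outputs, threshold=0.7)   # only learner 0 votes
lenient = abstaining_vote(outputs, threshold=0.5)  # all three learners vote
```

With the strict threshold the two uncertain learners abstain and the decision follows the single confident one; with the lenient threshold the decision reverts to an ordinary majority vote.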