4.7 Review

A survey on machine learning for recurring concept drifting data streams

相关参考文献

注意:仅列出部分参考文献,下载原文获取全部文献信息。
Article Computer Science, Artificial Intelligence

An extensive study of C-SMOTE, a Continuous Synthetic Minority Oversampling Technique for Evolving Data Streams

Alessio Bernardo et al.

Summary: This paper investigates binary classification in the presence of concept drift by rebalancing imbalanced data streams. The authors propose a pipeline based on C-SMOTE, which is combined with SML classification algorithms. Through experiments on synthetic and real data streams, the paper provides statistical evidence that using C-SMOTE pipelines can improve the performance of minority classes without significantly affecting the majority class performance.

EXPERT SYSTEMS WITH APPLICATIONS (2022)

Article Computer Science, Artificial Intelligence

ROSE: robust online self-adjusting ensemble for continual learning on imbalanced drifting data streams

Alberto Cano et al.

Summary: This article introduces a novel online ensemble classifier called ROSE, which is capable of handling challenges in data streams such as concept drift and class imbalance. ROSE features online training of base classifiers, online detection of concept drift, sliding window per class to handle imbalance, and self-adjusting bagging. Experimental results demonstrate that ROSE performs well in various data stream mining tasks.

MACHINE LEARNING (2022)

Article Computer Science, Artificial Intelligence

Adaptive ensemble of self-adjusting nearest neighbor subspaces for multi-label drifting data streams

Gavin Alberghini et al.

Summary: This paper introduces a novel ensemble method AESAKNNS for multi-label drifting streams, which adapts to concept drift by training base classifiers on different subspaces and monitoring drift occurrences. Experimental results support the better performance of AESAKNNS compared to other classifiers in diverse multi-label datasets.

NEUROCOMPUTING (2022)

Article Computer Science, Artificial Intelligence

A Diversity Framework for Dealing With Multiple Types of Concept Drift Based on Clustering in the Model Space

Chun Wai Chiu et al.

Summary: This study proposes using diversity as a framework to handle multiple types of concept drift and utilizes clustering in the model space to build a diverse ensemble and identify recurring concepts, thereby accelerating the adaptation to new concepts. Experimental results show that the framework usually achieves similar or better predictive accuracy compared to existing approaches in data streams with different types of drift.

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS (2022)

Article Computer Science, Information Systems

Meta-ADD: A meta-learning based pre-trained model for concept drift active detection

Hang Yu et al.

Summary: Concept drift is a common phenomenon in data streams that needs to be detected in order to understand and identify the types of changes in the target variable over time. Most current detection methods rely on hypothesis testing, which requires setting a hypothesis and cannot determine the type of drift. In this paper, a novel framework called Meta-ADD is proposed, which uses meta learning to pre-train a model on data streams with known drifts and then fine-tune it online for improved detection accuracy. Meta-ADD does not require hypothesis testing and can automatically detect and identify the types of concept drift, thereby directly supporting drift understanding.

INFORMATION SCIENCES (2022)

Review Computer Science, Information Systems

Learning Under Concept Drift for Regression-A Systematic Literature Review

Marilia Lima et al.

Summary: The amount and diversity of data have increased greatly in recent years, leading to the problem of Concept Drift (CD) where trained Machine Learning models may not perform well on testing data. While there is a wealth of research on CD detection and mitigation, most of it focuses on classification tasks. This systematic literature review (SLR) aims to explore the detection and technique construction of CD specifically in the context of regression. The results show that ensembles and neural networks with highlight OS-ELM are the most frequently used methods with superior performance, although only two papers statistically confirm their superiority. The SLR highlights the existing literature on CD applied to regression, providing insights for further research.

IEEE ACCESS (2022)

Review Computer Science, Artificial Intelligence

Data stream clustering: a review

Alaettin Zubaroglu et al.

Summary: The number of connected devices generating data streams is increasing steadily, sparking interest in real-time processing despite challenges. Clustering is a suitable method for real-time data stream processing, requiring less prior information and no labeled instances. Data stream clustering presents unique challenges compared to traditional clustering, including concept drift, data structures, and outlier detection.

ARTIFICIAL INTELLIGENCE REVIEW (2021)

Article Computer Science, Artificial Intelligence

Using spectral entropy and bernoulli map to handle concept drift

Rohgi Toshio Meneses Chikushi et al.

Summary: Data stream mining involves extracting information from continuously evolving large amounts of data, where learning algorithms can combine a classifier and drift detector to identify changes in prediction error distribution. While many methods are based on error rate distribution, empirical studies have shown that error rate can be influenced by temporal dependence. New approaches, including using dynamical system tools, have been proposed for concept drift detection in unsupervised scenarios with temporal dependencies.

EXPERT SYSTEMS WITH APPLICATIONS (2021)

Article Computer Science, Information Systems

Improving the performance of bagging ensembles for data streams through mini-batching

Guilherme Cassales et al.

Summary: This paper proposes a mini-batching strategy to improve memory access locality and performance of several ensemble algorithms for stream mining in multi-core environments. Experiments show that this strategy can significantly reduce cache misses, leading to speedups of up to 5X on 8-core processors.

INFORMATION SCIENCES (2021)

Article Computer Science, Artificial Intelligence

Adaptive online incremental learning for evolving data streams

Si -si Zhang et al.

Summary: Recent years have seen a growing interest in online incremental learning, but there are three major challenges - concept drift, catastrophic forgetting, and learning of latent representation. An Adaptive Online Incremental Learning algorithm (AOIL) is proposed to address these difficulties by utilizing auto-encoder with memory module and self-attention mechanism. Extensive experiments show that AOIL outperforms other state-of-the-art methods, demonstrating promising results.

APPLIED SOFT COMPUTING (2021)

Article Computer Science, Artificial Intelligence

Preprocessed dynamic classifier ensemble selection for highly imbalanced drifted data streams

Pawel Zyblewski et al.

Summary: This work focuses on connecting two rarely combined research directions - non-stationary data stream classification and data analysis with skewed class distributions. By proposing a novel framework that employs stratified bagging for training base classifiers and integrating data preprocessing and dynamic ensemble selection methods, the study aims to improve the classification of imbalanced data streams.

INFORMATION FUSION (2021)

Article Computer Science, Artificial Intelligence

Semi-supervised classification on data streams with recurring concept drift and concept evolution

Xiulin Zheng et al.

Summary: Mining non-stationary streams poses challenges due to their infinite length, dynamic characteristics, concept drift, concept evolution, and limited labeled data. Existing supervised methods may result in poor performance and efficiency in the presence of scarce labeled data. This paper proposes a semi-supervised framework ESCR to detect recurring concept drifts and concept evolution in data streams with partially labeled data. The framework utilizes clustering-based classifiers, Jensen-Shannon divergence for change detection, and outlier monitoring for concept evolution, while also improving efficiency through recursive function and dynamic programming. Extensive experiments show the effectiveness and efficiency of ESCR compared to other semi-supervised methods.

KNOWLEDGE-BASED SYSTEMS (2021)

Article Computer Science, Artificial Intelligence

Self-adjusting k nearest neighbors for continual learning from multi-label drifting data streams

Martha Roseberry et al.

Summary: Drifting data streams and multi-label data present challenging scenarios, each with their own unique difficulties, and when combined together, additional challenges arise. This paper proposes a self-adapting algorithm for drifting, multi-label data streams that can adapt to various concept drifts, and is robust to data-level difficulties. The algorithm mitigates the need to tune multiple parameters, adjusts the window of retained instances self, and adapts k value for each label individually, showing superior performance compared to existing state-of-the-art methods in extensive experiments.

NEUROCOMPUTING (2021)

Article Computer Science, Artificial Intelligence

Hyperparameter self-tuning for data streams

Bruno Veloso et al.

Summary: This article introduces SSPT, an extension of the Self Parameter Tuning (SPT) optimization algorithm for data streams, which dynamically adjusts sample sizes during a single pass over data and converges to optimal settings with relatively few hyperparameter configurations. The algorithm automatically readjusts hyperparameters when concept drift occurs, and experiments show that it outperforms previous human expert efforts in hyperparameter tuning.

INFORMATION FUSION (2021)

Article Computer Science, Artificial Intelligence

Online ensemble learning algorithm for imbalanced data stream

Du Hongle et al.

Summary: This study proposes a cost-sensitive online ensemble learning algorithm for imbalanced data streams, which reduces the impact of imbalanced data streams and enhances classification performance through various equalization methods.

APPLIED SOFT COMPUTING (2021)

Article Computer Science, Artificial Intelligence

Automated adaptation strategies for stream learning

Rashid Bakirov et al.

Summary: This paper proposes the use of flexible adaptive mechanism deployment for automated development of adaptation strategies to address the issue in automated machine learning model development. Experimental results confirm the viability of these strategies, achieving better or comparable performance to custom adaptation strategies and repeated deployment of any single adaptive mechanism on 36 datasets.

MACHINE LEARNING (2021)

Article Computer Science, Artificial Intelligence

Kappa Updated Ensemble for drifting data stream mining

Alberto Cano et al.

MACHINE LEARNING (2020)

Article Computer Science, Artificial Intelligence

Handling concept drift via model reuse

Peng Zhao et al.

MACHINE LEARNING (2020)

Article Computer Science, Information Systems

Exploiting evolving micro-clusters for data stream classification with emerging class detection

Salah Ud Din et al.

INFORMATION SCIENCES (2020)

Article Computer Science, Artificial Intelligence

Analyzing concept drift: A case study in the financial sector

Andres R. Masegosa et al.

INTELLIGENT DATA ANALYSIS (2020)

Article Computer Science, Theory & Methods

Generalizing from a Few Examples: A Survey on Few-shot Learning

Yaqing Wang et al.

ACM COMPUTING SURVEYS (2020)

Article Computer Science, Artificial Intelligence

Evaluating time series forecasting models: an empirical study on performance estimation methods

Vitor Cerqueira et al.

MACHINE LEARNING (2020)

Article Computer Science, Artificial Intelligence

Towards explainable deep neural networks (xDNN)

Plamen Angelov et al.

NEURAL NETWORKS (2020)

Proceedings Paper Computer Science, Artificial Intelligence

C-SMOTE: Continuous Synthetic Minority Oversampling for Evolving Data Streams

Alessio Bernardo et al.

2020 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA) (2020)

Article Computer Science, Theory & Methods

Learning in the presence of concept recurrence in data stream clustering

K. Namitha et al.

JOURNAL OF BIG DATA (2020)

Article Computer Science, Artificial Intelligence

On learning guarantees to unsupervised concept drift detection on data streams

Rodrigo F. de Mello et al.

EXPERT SYSTEMS WITH APPLICATIONS (2019)

Article Computer Science, Artificial Intelligence

Evolving rule-based classifiers with genetic programming on GPUs for drifting data streams

Alberto Cano et al.

PATTERN RECOGNITION (2019)

Article Computer Science, Information Systems

Multi-Label Punitive kNN with Self-Adjusting Memory for Drifting Data Streams

Martha Roseberry et al.

ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA (2019)

Article Computer Science, Information Systems

Optimizing Data Stream Representation: An Extensive Survey on Stream Clustering Algorithms

Matthias Carnein et al.

BUSINESS & INFORMATION SYSTEMS ENGINEERING (2019)

Article Physics, Multidisciplinary

Incremental Market Behavior Classification in Presence of Recurring Concepts

Andres L. Suarez-Cetrulo et al.

ENTROPY (2019)

Article Computer Science, Artificial Intelligence

Recurring concept meta-learning for evolving data streams

Robert Anderson et al.

EXPERT SYSTEMS WITH APPLICATIONS (2019)

Article Computer Science, Artificial Intelligence

An overview and comprehensive comparison of ensembles for concept drift

Roberto Souto Maior de Barros et al.

INFORMATION FUSION (2019)

Article Computer Science, Artificial Intelligence

Enhancing stock market prediction with extended coupled hidden Markov model over multi-sourced data

Xi Zhang et al.

KNOWLEDGE AND INFORMATION SYSTEMS (2019)

Review Multidisciplinary Sciences

Data stream mining: methods and challenges for handling concept drift

Scott Wares et al.

SN APPLIED SCIENCES (2019)

Article Computer Science, Artificial Intelligence

A novel online ensemble approach to handle concept drifting data streams: diversified dynamic weighted majority

Parneeta Sidhu et al.

INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS (2018)

Article Computer Science, Artificial Intelligence

Modeling recurring concepts in data streams: a graph-based framework

Zahra Ahmadi et al.

KNOWLEDGE AND INFORMATION SYSTEMS (2018)

Article Computer Science, Artificial Intelligence

Online ensemble learning with abstaining classifiers for drifting and noisy data streams

Bartosz Krawczyk et al.

APPLIED SOFT COMPUTING (2018)

Article Computer Science, Artificial Intelligence

Adapting dynamic classifier selection for concept drift

Paulo R. L. Almeida et al.

EXPERT SYSTEMS WITH APPLICATIONS (2018)

Article Computer Science, Information Systems

A large-scale comparison of concept drift detectors

Roberto Souto Maior Barros et al.

INFORMATION SCIENCES (2018)

Article Computer Science, Artificial Intelligence

Concept Drift Adaptation by Exploiting Historical Knowledge

Yu Sun et al.

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS (2018)

Article Physics, Multidisciplinary

Statistical Mechanics of On-Line Learning Under Concept Drift

Michiel Straat et al.

ENTROPY (2018)

Article Computer Science, Artificial Intelligence

RDDM: Reactive drift detection method

Roberto S. M. Barros et al.

EXPERT SYSTEMS WITH APPLICATIONS (2017)

Article Computer Science, Artificial Intelligence

On the reliable detection of concept drift from streaming unlabeled data

Tegjyot Singh Sethi et al.

EXPERT SYSTEMS WITH APPLICATIONS (2017)

Article Computer Science, Artificial Intelligence

Multidimensional surrogate stability to detect data stream concept drift

Fausto G. da Costa et al.

EXPERT SYSTEMS WITH APPLICATIONS (2017)

Article Computer Science, Artificial Intelligence

Ensemble learning for data stream analysis: A survey

Bartosz Krawczyk et al.

INFORMATION FUSION (2017)

Article Computer Science, Artificial Intelligence

Data driven modelling based on Recurrent Interval-Valued Metacognitive Scaffolding Fuzzy Neural Network

Mahardhika Pratama et al.

NEUROCOMPUTING (2017)

Article Computer Science, Artificial Intelligence

A survey on data preprocessing for data stream mining: Current status and future directions

Sergio Ramirez-Gallego et al.

NEUROCOMPUTING (2017)

Article Computer Science, Theory & Methods

A Survey on Ensemble Learning for Data Stream Classification

Heitor Murilo Gomes et al.

ACM COMPUTING SURVEYS (2017)

Article Computer Science, Software Engineering

Capturing recurring concepts using discrete Fourier transform

Sripirakas Sakthithasan et al.

CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE (2016)

Article Computer Science, Artificial Intelligence

Characterizing concept drift

Geoffrey I. Webb et al.

DATA MINING AND KNOWLEDGE DISCOVERY (2016)

Review Computer Science, Artificial Intelligence

Using dynamical systems tools to detect concept drift in data streams

F. G. da Costa et al.

EXPERT SYSTEMS WITH APPLICATIONS (2016)

Article Mathematics, Interdisciplinary Applications

Model-Based Clustering

Paul D. McNicholas

JOURNAL OF CLASSIFICATION (2016)

Article Management

Clustering financial time series: New insights from an extended hidden Markov model

Jose G. Dias et al.

EUROPEAN JOURNAL OF OPERATIONAL RESEARCH (2015)

Article Computer Science, Artificial Intelligence

Predicting stock market index using fusion of machine learning techniques

Jigar Patel et al.

EXPERT SYSTEMS WITH APPLICATIONS (2015)

Article Computer Science, Artificial Intelligence

Predicting stock and stock price index movement using Trend Deterministic Data Preparation and machine learning techniques

Jigar Patel et al.

EXPERT SYSTEMS WITH APPLICATIONS (2015)

Article Computer Science, Artificial Intelligence

Clustering by growing incremental self-organizing neural network

Hao Liu et al.

EXPERT SYSTEMS WITH APPLICATIONS (2015)

Article Computer Science, Artificial Intelligence

Evaluating multiple classifiers for stock price direction prediction

Michel Ballings et al.

EXPERT SYSTEMS WITH APPLICATIONS (2015)

Article Computer Science, Artificial Intelligence

Learning in Nonstationary Environments: A Survey

Gregory Ditzler et al.

IEEE COMPUTATIONAL INTELLIGENCE MAGAZINE (2015)

Article Computer Science, Artificial Intelligence

pClass: An Effective Classifier for Streaming Examples

Mahardhika Pratama et al.

IEEE TRANSACTIONS ON FUZZY SYSTEMS (2015)

Article Computer Science, Artificial Intelligence

Online and Non-Parametric Drift Detection Methods Based on Hoeffding's Bounds

Isvani Frias-Blanco et al.

IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING (2015)

Article Computer Science, Artificial Intelligence

Recovery analysis for adaptive learning from non-stationary data streams: Experimental design and case study

Ammar Shaker et al.

NEUROCOMPUTING (2015)

Article Computer Science, Artificial Intelligence

Towards cost-sensitive adaptation: When is it worth updating your predictive model?

Indre Zliobaite et al.

NEUROCOMPUTING (2015)

Article Computer Science, Artificial Intelligence

Evaluation methods and decision theory for classification of streaming data with temporal dependence

Indre Zliobaite et al.

MACHINE LEARNING (2015)

Article Computer Science, Theory & Methods

A Survey on Concept Drift Adaptation

Joao Gama et al.

ACM COMPUTING SURVEYS (2014)

Article Computer Science, Artificial Intelligence

Proposal of a new stability concept to detect changes in unsupervised data streams

Rosane M. M. Vallim et al.

EXPERT SYSTEMS WITH APPLICATIONS (2014)

Article Computer Science, Artificial Intelligence

A comparative study on concept drift detectors

Paulo M. Goncalves et al.

EXPERT SYSTEMS WITH APPLICATIONS (2014)

Article Automation & Control Systems

SURVIVAL ANALYSIS ON DATA STREAMS: ANALYZING TEMPORAL EVENTS IN DYNAMICALLY CHANGING ENVIRONMENTS

Ammar Shaker et al.

INTERNATIONAL JOURNAL OF APPLIED MATHEMATICS AND COMPUTER SCIENCE (2014)

Article Computer Science, Artificial Intelligence

Mining Recurring Concepts in a Dynamic Feature Space

Joao Bartolo Gomes et al.

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS (2014)

Article Computer Science, Artificial Intelligence

Reacting to Different Types of Concept Drift: The Accuracy Updated Ensemble Algorithm

Dariusz Brzezinski et al.

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS (2014)

Article Computer Science, Artificial Intelligence

An online incremental learning support vector machine for large-scale data

Jun Zheng et al.

NEURAL COMPUTING & APPLICATIONS (2013)

Article Computer Science, Artificial Intelligence

Opposite Maps: Vector Quantization Algorithms for Building Reduced-Set SVM and LSSVM Classifiers

Ajalmar R. R. Neto et al.

NEURAL PROCESSING LETTERS (2013)

Article Computer Science, Artificial Intelligence

RCD: A recurring concept drift framework

Paulo Mauricio Goncalves et al.

PATTERN RECOGNITION LETTERS (2013)

Article Computer Science, Artificial Intelligence

Just-In-Time Classifiers for Recurrent Concepts

Cesare Alippi et al.

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS (2013)

Article Computer Science, Artificial Intelligence

An incremental learning vector quantization algorithm for pattern classification

Ye Xu et al.

NEURAL COMPUTING & APPLICATIONS (2012)

Article Computer Science, Artificial Intelligence

Meta-cognitive Neural Network for classification problems in a sequential learning framework

G. Sateesh Babu et al.

NEUROCOMPUTING (2012)

Article Computer Science, Artificial Intelligence

Learning from concept drifting data streams with unlabeled data

Xindong Wu et al.

NEUROCOMPUTING (2012)

Article Business, Finance

Regime Shifts: Implications for Dynamic Strategies

Mark Kritzman et al.

FINANCIAL ANALYSTS JOURNAL (2012)

Article Computer Science, Artificial Intelligence

Incremental Learning of Concept Drift in Nonstationary Environments

Ryan Elwell et al.

IEEE TRANSACTIONS ON NEURAL NETWORKS (2011)

Review Computer Science, Artificial Intelligence

Ensemble-based classifiers

Lior Rokach

ARTIFICIAL INTELLIGENCE REVIEW (2010)

Article Computer Science, Artificial Intelligence

The Impact of Diversity on Online Ensemble Learning in the Presence of Concept Drift

Leandro L. Minku et al.

IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING (2010)

Article Computer Science, Artificial Intelligence

A Survey on Transfer Learning

Sinno Jialin Pan et al.

IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING (2010)

Article Computer Science, Artificial Intelligence

Tracking recurring contexts using ensemble classifiers: an application to email filtering

Ioannis Katakis et al.

KNOWLEDGE AND INFORMATION SYSTEMS (2010)

Article Computer Science, Artificial Intelligence

Evolving Fuzzy-Rule-Based Classifiers From Data Streams

Plamen P. Angelov et al.

IEEE TRANSACTIONS ON FUZZY SYSTEMS (2008)

Article Computer Science, Artificial Intelligence

Mining in anticipation for concept change: Proactive-reactive prediction in data streams

Ying Yang et al.

DATA MINING AND KNOWLEDGE DISCOVERY (2006)

Article Computer Science, Artificial Intelligence

Prototype selection for dissimilarity-based classifiers

E Pekalska et al.

PATTERN RECOGNITION (2006)

Article Automation & Control Systems

An approach to Online identification of Takagi-Suigeno fuzzy models

PP Angelov et al.

IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS (2004)

Article Computer Science, Artificial Intelligence

Dynamic topology representing networks

J Si et al.

NEURAL NETWORKS (2000)