4.5 Article

Processing data stream with chunk-similarity model selection

Journal

APPLIED INTELLIGENCE
Volume 53, Issue 7, Pages 7931-7956

Publisher

SPRINGER
DOI: 10.1007/s10489-022-03826-4

Keywords

Data stream; Classifier selection; Classification; Pattern recognition

Ask authors/readers for more resources

The paper proposes a new ensemble method - Covariance-signature Concept Selector - for the classification of data stream susceptible to the concept drift phenomenon. Unlike existing methods, this method performs static classifier selection by assessing model similarity to the currently processed data chunk as a concept selector. Experimental analysis demonstrates the advantage of this method over state-of-the-art methods in addressing specific problems and its high potential in practical applications.
The classification of data stream susceptible to the concept drift phenomenon has been a field of intensive research for many years. One of the dominant strategies of the proposed solutions is the application of classifier ensembles with the member classifiers validated on their actual prediction quality. This paper is a proposal of a new ensemble method - Covariance-signature Concept Selector - which, like state-of-the-art solutions, uses both the model accumulation paradigm and the detection of changes in the data posterior probability, but in the integrated procedure. However, instead of ensemble fusion, it performs a static classifier selection, where model similarity assessment to the currently processed data chunk serves as a concept selector. The proposed method was subjected to a series of computer experiments assessing its temporal complexity and efficiency in classifying streams with synthetic and real concepts. The conducted experimental analysis allows concluding the advantage of this proposal over state-of-the-art methods in the identified pool of problems and high potential in practical applications.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.5
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available