3.8 Review

Learning from streaming data with concept drift and imbalance: an overview

期刊

PROGRESS IN ARTIFICIAL INTELLIGENCE
卷 1, 期 1, 页码 89-101

出版社

SPRINGERNATURE
DOI: 10.1007/s13748-011-0008-0

关键词

Class imbalance; Concept drift; Data streams; Classification

资金

  1. NSF [ECCS-0926170, ECCS-092159]
  2. Notebaert Premier Fellowship
  3. Div Of Electrical, Commun & Cyber Sys
  4. Directorate For Engineering [0926170, 0926159] Funding Source: National Science Foundation

向作者/读者索取更多资源

The primary focus of machine learning has traditionally been on learning from data assumed to be sufficient and representative of the underlying fixed, yet unknown, distribution. Such restrictions on the problem domain paved the way for development of elegant algorithms with theoretically provable performance guarantees. As is often the case, however, real-world problems rarely fit neatly into such restricted models. For instance class distributions are often skewed, resulting in the class imbalance problem. Data drawn from non-stationary distributions is also common in real-world applications, resulting in the concept drift or non-stationary learning problem which is often associated with streaming data scenarios. Recently, these problems have independently experienced increased research attention, however, the combined problem of addressing all of the above mentioned issues has enjoyed relatively little research. If the ultimate goal of intelligent machine learning algorithms is to be able to address a wide spectrum of real-world scenarios, then the need for a general framework for learning from, and adapting to, a non-stationary environment that may introduce imbalanced data can be hardly overstated. in this paper, we first present an overview of each of these challenging areas, followed by a comprehensive review of recent research for developing such a general framework.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

3.8
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据