4.6 Article Proceedings Paper

Study of the impact of resarnpling methods for contrast pattern based classifiers in imbalanced databases

期刊

NEUROCOMPUTING
卷 175, 期 -, 页码 935-947

出版社

ELSEVIER SCIENCE BV
DOI: 10.1016/j.neucom.2015.04.120

关键词

Supervised classification; Resampling methods; Imbalanced databases; Contrast patterns

资金

  1. National Council of Science and Technology of Mexico [CB2008-106366, 370272]

向作者/读者索取更多资源

The class imbalance problem is a challenge in supervised classification, since many classifiers are sensitive to class distribution, biasing their prediction towards the majority class. Usually, in imbalanced databases, contrast pattern miners extract a very large collection of patterns from the majority class but only a few patterns (or none) from the minority class. It causes that minority class objects have low support and they could be identified as noise and consequently discarded by the contrast pattern based classifier biasing the results towards the majority class. In the literature, the class imbalance problem is commonly faced by applying resampling methods. Therefore, in this paper, we present a study about the impact of using resampling methods for improving the performance of contrast pattern based classifiers in class imbalance problems. Experimental results using standard imbalanced databases show that there are statistically significant differences between using the classifier before and after applying resampling methods. Moreover, from this study, we provide a guide based on the class imbalance ratio for selecting a resampling method that jointly with a contrast pattern based classifier allows us to have good results in a class imbalance problem. (C) 2015 Elsevier B.V. All rights reserved.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据