4.7 Article

Noise Reduction A Priori Synthetic Over-Sampling for class imbalanced data sets

Journal

INFORMATION SCIENCES
Volume 408, Issue -, Pages 146-161

Publisher

ELSEVIER SCIENCE INC
DOI: 10.1016/j.ins.2017.04.046

Keywords

NRAS; SMOTE; OUPS; Class imbalance; Classification

Ask authors/readers for more resources

In real world data set the underlying data distribution may be highly skewed. Building accurate classifiers for predicting group membership is made difficult because the classifier has a tendency to be biased towards the over represented or majority group as a result. This problem is referred to as a class imbalance problem. Re-sampling techniques that produce new samples by means of over-sampling aim to combat class imbalance by increasing the number of members that belong to the minority group. This paper introduces a new over-sampling technique that focuses on noise reduction and selective sampling of the minority group which results in improvement for prediction of minority group membership. Experiments are conducted across a wide range of data sets, learners and over sampling methods. The results for this new method show improvement for Sensitivity and Gmean measures over the compared approaches. (C) 2017 Elsevier Inc. All rights reserved.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available