4.7 Article

A novel SMOTE-based resampling technique trough noise detection and the boosting procedure

Journal

EXPERT SYSTEMS WITH APPLICATIONS
Volume 200, Issue -, Pages -

Publisher

PERGAMON-ELSEVIER SCIENCE LTD
DOI: 10.1016/j.eswa.2022.117023

Keywords

Oversampling; SMOTE; Class imbalance; Noisy data

Ask authors/readers for more resources

Most classification methods assume balanced class observations, but class imbalance is a common issue. This study discusses the challenges of resampling methods and proposes a solution by combining a new noise detection method and SMOTE.
Most of the classification methods assume that the numbers of class observations are balanced. In such cases, models are predicted by giving biased weight to the the class with more observations. Therefore, the classifiers ignore the class with smaller number of observations and the majority class makes biased predictions. There are some advised performance measures to be used in datasets, as well as recommended approaches to solve class imbalance problem. One of the most widely used methods is resampling method. In this study, the difficulties relevant to random oversampling (ROS) and synthetic minority oversampling technique (SMOTE), which are some of the oversampling methods, are discussed. This study aims to propose a combination of a new noise detection method and SMOTE to overcome those difficulties. Using the boosting procedure in ensemble algo-rithms, noise detection is possible with the proposed SMOTE with boosting (SMOTEWB) method, which makes use of this information to determine the appropriate number of neighbors for each observation within SMOTE algorithm.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available