4.5 Article

DBSMOTE: Density-Based Synthetic Minority Over-sampling TEchnique

Journal

APPLIED INTELLIGENCE
Volume 36, Issue 3, Pages 664-684

Publisher

SPRINGER
DOI: 10.1007/s10489-011-0287-y

Keywords

Classification; Class imbalance; Over-sampling; Density-based

Funding

  1. Commission on Higher Education, Thailand

Ask authors/readers for more resources

A dataset exhibits the class imbalance problem when a target class has a very small number of instances relative to other classes. A trivial classifier typically fails to detect a minority class due to its extremely low incidence rate. In this paper, a new over-sampling technique called DBSMOTE is proposed. Our technique relies on a density-based notion of clusters and is designed to over-sample an arbitrarily shaped cluster discovered by DBSCAN. DBSMOTE generates synthetic instances along a shortest path from each positive instance to a pseudo-centroid of a minority-class cluster. Consequently, these synthetic instances are dense near this centroid and are sparse far from this centroid. Our experimental results show that DBSMOTE improves precision, F-value, and AUC more effectively than SMOTE, Borderline-SMOTE, and Safe-Level-SMOTE for imbalanced datasets.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.5
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available