4.6 Article

KDE-Based Ensemble Learning for Imbalanced Data

Journal

ELECTRONICS
Volume 11, Issue 17, Pages -

Publisher

MDPI
DOI: 10.3390/electronics11172703

Keywords

imbalanced data; kernel density estimate; ensemble method; data sampling

Ask authors/readers for more resources

This paper proposes a novel ensemble classification method that deals with imbalanced data by training each tree in the ensemble using uniquely generated synthetically balanced data. Data balancing is achieved through kernel density estimation, resulting in a lower variance of the model estimator. The proposed classifier significantly outperforms benchmark methods in various datasets.
Imbalanced class distribution affects many applications in machine learning, including medical diagnostics, text classification, intrusion detection and many others. In this paper, we propose a novel ensemble classification method designed to deal with imbalanced data. The proposed method trains each tree in the ensemble using uniquely generated synthetically balanced data. The data balancing is carried out via kernel density estimation, which offers a natural and effective approach to generating new sample points. We show that the proposed method results in a lower variance of the model estimator. The proposed method is tested against benchmark classifiers on a range of simulated and real-life data. The results of experiments show that the proposed classifier significantly outperforms the benchmark methods.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available