4.7 Article

The effect of imbalanced data sets on LDA: A theoretical and empirical analysis

Journal

PATTERN RECOGNITION
Volume 40, Issue 2, Pages 557-562

Publisher

PERGAMON-ELSEVIER SCIENCE LTD
DOI: 10.1016/j.patcog.2006.01.009

Keywords

imbalanced data sets; linear discriminant analysis (LDA); random sampling; tomek links; smote

Ask authors/readers for more resources

This paper demonstrates that the imbalanced data sets have a negative effect on the performance of LDA theoretically. This theoretical analysis is confirmed by the experimental results: using several sampling methods to rebalance the imbalanced data sets, it is found that the performances of LDA on balanced data sets are superior to those of LDA on imbalanced data sets. (c) 2006 Pattern Recognition Society. Published by Elsevier Ltd. All rights reserved.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available