4.7 Article

CEGAN: Classification Enhancement Generative Adversarial Networks for unraveling data imbalance problems

Journal

NEURAL NETWORKS
Volume 133, Issue -, Pages 69-86

Publisher

PERGAMON-ELSEVIER SCIENCE LTD
DOI: 10.1016/j.neunet.2020.10.004

Keywords

Imbalanced classification; Data augmentation; Generative adversarial networks; Classification enhancement; Ambiguous classes

Funding

  1. Korea Institute of Science and Technology Europe Institutional Program [12020]

Ask authors/readers for more resources

The research introduces a classification enhancement generative adversarial networks (CEGAN) for improving prediction accuracy in data imbalanced conditions by enhancing the quality of generated synthetic minority data. Additionally, an ambiguity reduction method using the generated synthetic minority data is proposed. Results from five benchmark datasets demonstrate significant improvements in classification performance when approximating the real data distribution using CEGAN compared to standard data augmentation methods.
The data imbalance problem in classification is a frequent but challenging task. In real-world datasets, numerous class distributions are imbalanced and the classification result under such condition reveals extreme bias in the majority data class. Recently, the potential of GAN as a data augmentation method on minority data has been studied. In this paper, we propose a classification enhancement generative adversarial networks (CEGAN) to enhance the quality of generated synthetic minority data and more importantly, to improve the prediction accuracy in data imbalanced condition. In addition, we propose an ambiguity reduction method using the generated synthetic minority data for the case of multiple similar classes that are degenerating the classification accuracy. The proposed method is demonstrated with five benchmark datasets. The results indicate that approximating the real data distribution using CEGAN improves the classification performance significantly in data imbalanced conditions compared with various standard data augmentation methods. (c) 2020 Elsevier Ltd. All rights reserved.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available