4.7 Article

A conditional variational autoencoder based self-transferred algorithm for imbalanced classification

Journal

KNOWLEDGE-BASED SYSTEMS
Volume 218, Issue -, Pages -

Publisher

ELSEVIER
DOI: 10.1016/j.knosys.2021.106756

Keywords

Highly imbalanced classification; Over-sampling; Variational autoencoders; Knowledge transfer

Funding

  1. Fundamental Research Funds for the Central Universities, PR China [20D110323, CUSF-DH-D-2018096]
  2. Natural Science Foundation of Shanghai, PR China [20ZR1400400, 19ZR1402300]

Ask authors/readers for more resources

In this paper, a conditional variational autoencoder-based self-transferred algorithm is proposed to address highly imbalanced classification problems. The method utilizes information from both majority and minority classes for domain knowledge transfer, while conducting distribution learning, image generation, and dataset rebalancing in a unified framework. Experimental results demonstrate the effectiveness of the approach in both imbalanced datasets and real-world industrial applications.
In this paper, we propose a conditional variational autoencoder-based self-transferred (CVAE_SeTred) algorithm to solve the highly imbalanced classification problem, where the training instances of the minority classes are rare. Our method belongs to an over-sampling technique that utilizes variational autoencoders (VAEs) to generate training samples for the minority classes. Traditional over-sampling methods mainly rely on minority classes themselves, our approach exploits the information from both the majority and minority classes and aims to transfer instructional knowledge from the majority classes to the minority classes, where the majority and minority classes are analogized as the self-transferred (SeTred) source and target domain, respectively. Specifically, our model comprises two encoders, one decoder, and one domain classifier and can simultaneously conduct distribution learning, SeTred learning, image generation, and dataset rebalancing in a joint and unified framework. The proposed method can not only learn domain-invariant and multivariate Gaussian distributed latent variables but also generate discriminative samples for the minority class according to designated labels. We verify the effectiveness of the CVAE_SeTred model on both imbalanced datasets constructed from benchmark datasets and a more challenging real-world industrial application, such as imbalanced classification for fabric defects. Experimental results indicate that our method outperforms other comparative methods and can generate samples with better diversity. (C) 2021 Elsevier B.V. All rights reserved.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available