4.7 Article

Novel regularization method for the class imbalance problem

Journal

EXPERT SYSTEMS WITH APPLICATIONS
Volume 188, Issue -, Pages -

Publisher

PERGAMON-ELSEVIER SCIENCE LTD
DOI: 10.1016/j.eswa.2021.115974

Keywords

Regularization; Class imbalance; Sentence classification; Image classification

Funding

  1. Institute of Information & Communications Technology Planning & Evaluation (IITP) grant - Korea Government (MSIT) [2020-0-00368]
  2. National Research Foundation of Korea (NRF) - Korea Government (MSIT) [NRF-2020R1A2C2100362]

Ask authors/readers for more resources

In neural network models, obtaining high-quality datasets is crucial. To address the problem of class imbalance, a novel regularization method is proposed, which significantly improves the performance on multiple datasets, achieving state-of-the-art results.
In neural network models, obtaining a high-quality dataset is critical because they are generally reliant on training data. A common problem that occurs is class imbalance, in which models tend to be biased to the majority class when the training data is not balanced. To overcome this problem, we propose a novel regularization method that provides a penalty to the loss function, using two facets of the distribution of the model's output p((y) over cap vertical bar x): (1) skewed mean and (2) variance divergence between p((y) over cap vertical bar x is an element of D+) and p((y) over cap vertical bar x is an element of D_). The experimental results demonstrate that our methods consistently improve the performance on imbalanced datasets. Moreover, the combination of two regularization methods provides a substantial performance improvement on five sentence classification datasets and also an image classification dataset; notably, state-of-the-art performances are achieved on the WikiQA and SelQA datasets.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available