期刊
EXPERT SYSTEMS WITH APPLICATIONS
卷 188, 期 -, 页码 -出版社
PERGAMON-ELSEVIER SCIENCE LTD
DOI: 10.1016/j.eswa.2021.115974
关键词
Regularization; Class imbalance; Sentence classification; Image classification
类别
资金
- Institute of Information & Communications Technology Planning & Evaluation (IITP) grant - Korea Government (MSIT) [2020-0-00368]
- National Research Foundation of Korea (NRF) - Korea Government (MSIT) [NRF-2020R1A2C2100362]
In neural network models, obtaining high-quality datasets is crucial. To address the problem of class imbalance, a novel regularization method is proposed, which significantly improves the performance on multiple datasets, achieving state-of-the-art results.
In neural network models, obtaining a high-quality dataset is critical because they are generally reliant on training data. A common problem that occurs is class imbalance, in which models tend to be biased to the majority class when the training data is not balanced. To overcome this problem, we propose a novel regularization method that provides a penalty to the loss function, using two facets of the distribution of the model's output p((y) over cap vertical bar x): (1) skewed mean and (2) variance divergence between p((y) over cap vertical bar x is an element of D+) and p((y) over cap vertical bar x is an element of D_). The experimental results demonstrate that our methods consistently improve the performance on imbalanced datasets. Moreover, the combination of two regularization methods provides a substantial performance improvement on five sentence classification datasets and also an image classification dataset; notably, state-of-the-art performances are achieved on the WikiQA and SelQA datasets.
作者
我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。
推荐
暂无数据