期刊
2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP)
卷 -, 期 -, 页码 3224-3228出版社
IEEE
DOI: 10.1109/ICASSP43922.2022.9746299
关键词
Long-tailed learning; mixup; data augmentation; class-balanced sampler; vision and sound recognition
资金
- National Key R&D Program of China [2021YFF0602101]
- National Science Foundation of China [NSFC 61906194]
- Commercialization of Research Findings Fund of Inner Mongolia Autonomous Region [2020CG0075]
Mixup is a popular data augmentation technique, but may face label suppression issue when applied to long-tailed data. To address this, Label-Occurrence-Balanced Mixup is proposed, using class-balanced samplers to generate new data and improve the adaptability of mixup to imbalanced data.
Mixup is a popular data augmentation method, with many variants subsequently proposed. These methods mainly create new examples via convex combination of random data pairs and their corresponding one-hot labels. However, most of them adhere to a random sampling and mixing strategy, without considering the frequency of label occurrence in the mixing process. When applying mixup to long-tailed data, a label suppression issue arises, where the frequency of label occurrence for each class is imbalanced and most of the new examples will be completely or partially assigned with head labels. The suppression effect may further aggravate the problem of data imbalance and lead to a poor performance on tail classes. To address this problem, we propose Label-Occurrence-Balanced Mixup to augment data while keeping the label occurrence for each class statistically balanced. In a word, we employ two independent class-balanced samplers to select data pairs and mix them to generate new data. We test our method on several long-tailed vision and sound recognition benchmarks. Experimental results show that our method significantly promotes the adaptability of mixup method to imbalanced data and achieves superior performance compared with state-of-the-art long-tailed learning methods.
作者
我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。
推荐
暂无数据