4.7 Article

Speech emotion recognition based on meta-transfer learning with domain adaption

Journal

APPLIED SOFT COMPUTING
Volume 147, Issue -, Pages -

Publisher

ELSEVIER
DOI: 10.1016/j.asoc.2023.110766

Keywords

Speech emotion recognition; Few-shot learning; Meta-transfer learning; Domain adaption

Ask authors/readers for more resources

In this study, a few-shot learning method based on meta-transfer learning with domain adaption is proposed for speech emotion recognition (SER). It effectively reduces the over-fitting phenomenon and solves the target domain adaptability problem.
Deep learning often requires large amounts of labeled data to train the model, which is not always readily available in the field of speech emotion recognition (SER). Related research work on SER in few shot conditions has reported problem with overfifitting and domain transfer of training. In this study, a few-shot learning method based on meta-transfer learning with domain adaption (MTLDA) is proposed for SER. It not only effectively reduces the over-fitting phenomenon of deep neural networks (DNN) trained with a small number of samples, but also solves the forgetting problem in meta-learning and the target domain adaptability problem in transfer learning. Experiments on three databases (i.e., CASIA is used for pre-training, Emo-DB and SAVEE are used for few-shot learning) are performed for few-shot learning of SER, from which the WAR is 65.12% and UAR is 64.50% on Emo-DB, and the WAR is 58.84% and UAR is 53.26% on SAVEE.& COPY; 2023 Elsevier B.V. All rights reserved.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available