4.6 Review

A Review on Speech Emotion Recognition Using Deep Learning and Attention Mechanism

期刊

ELECTRONICS
卷 10, 期 10, 页码 -

出版社

MDPI
DOI: 10.3390/electronics10101163

关键词

speech emotion recognition; deep learning; attention mechanism; recurrent neural network; long short-term memory

向作者/读者索取更多资源

Emotions are crucial in human interactions, and SER modules are important for HCI applications. Attention-based DNNs are effective in mining emotional information, and recent advancements have incorporated attention mechanisms to emphasise emotional salient information. This paper reviews the development of SER and evaluates the impact of various attention mechanisms on SER performance, with a comparison on the IEMOCAP benchmark database.
Emotions are an integral part of human interactions and are significant factors in determining user satisfaction or customer opinion. speech emotion recognition (SER) modules also play an important role in the development of human-computer interaction (HCI) applications. A tremendous number of SER systems have been developed over the last decades. Attention-based deep neural networks (DNNs) have been shown as suitable tools for mining information that is unevenly time distributed in multimedia content. The attention mechanism has been recently incorporated in DNN architectures to emphasise also emotional salient information. This paper provides a review of the recent development in SER and also examines the impact of various attention mechanisms on SER performance. Overall comparison of the system accuracies is performed on a widely used IEMOCAP benchmark database.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据