4.6 Article

Speech Emotion Recognition via Multi-Level Attention Network

期刊

IEEE SIGNAL PROCESSING LETTERS
卷 29, 期 -, 页码 2278-2282

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/LSP.2022.3219352

关键词

MFCC; multi-scale feature; attention mechanism; speech emotion recognition

资金

  1. National Key Research and Development Program of China [2020YFC1523302]
  2. National Natural Science Foundation of China [62002290]

向作者/读者索取更多资源

The aim of this research is to improve the performance of human speech emotion recognition. The proposed multi-level attention network (MLAnet) extracts low-level emotion features from the popular mel-scale frequency cepstral coefficient (MFCC) and weights these features using a multi-unit attention module. Experimental results show that this method outperforms other state-of-the-art approaches.
Aiming to improve the performance of human speech emotion recognition (SER), the existing work has made great progress based on the popular mel-scale frequency cepstral coefficient (MFCC). However, the existing work rarely pays attention to the low-level emotion related features in MFCC, such as the underlying interactive relations. In this letter, we propose a novel multi-level attention network (MLAnet), which contains a multi-scale low-level feature (MLF) extractor and a multi-unit attention (MUA) module. Within the MLF extractor, we minimize the task-irrelevant information which harms the performance of SER by applying the attention mechanism. Since the features extracted by the MLF extractor contain rich domain-specific emotion information, we further present a MUA module to simultaneously weight the features in terms of time, frequency and channel dimensions. In this way, the discriminative emotion features in different dimensions can be extracted by corresponding weighting blocks. Experimental results on two benchmark datasets demonstrate that the proposed method outperforms other state-of-the-art approaches.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据