☆ 4.2 Article

Comparison of the Effects of Mel Coefficients and Spectrogram Images via Deep Learning in Emotion Classification

TRAITEMENT DU SIGNAL (2020)

期刊

TRAITEMENT DU SIGNAL

卷 37, 期 1, 页码 51-57

出版社

INT INFORMATION & ENGINEERING TECHNOLOGY ASSOC

DOI: 10.18280/ts.370107

关键词

speech emotion recognition; Deep Neural Network (DNN); Convolutional Neural Network (CNN); deep learning algorithm; Mel-Frequency Cepstrum Coefficients (MFCC)

类别

Computer Science, Artificial Intelligence Engineering, Electrical & Electronic

资金

Konya Technical University Scientific Research Projects
Selcuk University Scientific Research Projects
TUBITAK

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

摘要

In the present paper, an approach was developed for emotion recognition from speech data using deep learning algorithms, a problem that has gained importance in recent years. Feature extraction manually and feature selection steps were more important in traditional methods for speech emotion recognition. In spite of this, deep learning algorithms were applied to data without any data reduction. The study implemented the triple emotion groups of EmoDB emotion data: Boredom, Neutral, and Sadness-BNS; and Anger, Happiness, and Fear-AHF. Firstly, the spectrogram images resulting from the signal data after preprocessing were classified using AlexNET. Secondly, the results formed from the MelFrequency Cepstrum Coefficients (MFCC) extracted by feature extraction methods to Deep Neural Networks (DNN) were compared. The importance and necessity of using manual feature extraction in deep learning was investigated, which remains a very important part of emotion recognition. The experimental results show that emotion recognition through the implementation of the AlexNet architecture to the spectrogram images was more discriminative than that through the implementation of DNN to manually extracted features.

Comparison of the Effects of Mel Coefficients and Spectrogram Images via Deep Learning in Emotion Classification

期刊

TRAITEMENT DU SIGNAL

出版社

INT INFORMATION & ENGINEERING TECHNOLOGY ASSOC

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Comparison of the Effects of Mel Coefficients and Spectrogram Images via Deep Learning in Emotion Classification

期刊

TRAITEMENT DU SIGNAL

出版社

INT INFORMATION & ENGINEERING TECHNOLOGY ASSOC

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文