☆ 4.7 Article

Connecting Subspace Learning and Extreme Learning Machine in Speech Emotion Recognition

IEEE TRANSACTIONS ON MULTIMEDIA (2019)

期刊

IEEE TRANSACTIONS ON MULTIMEDIA

卷 21, 期 3, 页码 795-808

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC

DOI: 10.1109/TMM.2018.2865834

关键词

Speech emotion recognition; extreme learning machine; subspace learning; graph embedding; spectral regression

类别

Computer Science, Information Systems Computer Science, Software Engineering Telecommunications

资金

China Scholarship Council
European Union [338164]
European Union's Horizon 2020 Research and Innovation Programme [645378, 645094]
Natural Science Foundation of China [61673108, 61231002, 11701290]
Natural Science Foundation for Jiangsu Higher Education Institutions [16KJB510031, 17KJB110012]
NUPTSF [NY217149, NY217150]

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

摘要

Speech emotion recognition (SER) is a powerful tool for endowing computers with the capacity to process information about the affective states of users in human-machine interactions. Recent research has shown the effectiveness of graph embedding-based subspace learning and extreme learning machine applied to SER, but there are still various drawbacks in these two techniques that limit their application. Regarding subspace learning, the change from linearity to nonlinearity is usually achieved through kernelization, whereas extreme learning machines only take label information into consideration at the output layer. In order to overcome these drawbacks, this paper leverages extreme learning machines for dimensionality reduction and proposes a novel framework to combine spectral regression-based subspace learning and extreme learning machines. The proposed framework contains three stages-data mapping, graph decomposition, and regression. At the data mapping stage, various mapping strategies provide different views of the samples. At the graph decomposition stage, specifically designed embedding graphs provide a possibility to better represent the structure of data through generating virtual coordinates. Finally, at the regression stage, dimension-reduced mappings are achieved by connecting the virtual coordinates and data mapping. Using this framework, we propose several novel dimensionality reduction algorithms, apply them to SER tasks, and compare their performance to relevant state-of-the-art methods. Our results on several paralinguistic corpora show that our proposed techniques lead to significant improvements.

Connecting Subspace Learning and Extreme Learning Machine in Speech Emotion Recognition

期刊

IEEE TRANSACTIONS ON MULTIMEDIA

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Connecting Subspace Learning and Extreme Learning Machine in Speech Emotion Recognition

期刊

IEEE TRANSACTIONS ON MULTIMEDIA

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文