Article

A social emotion classification approach using multi-model fusion

Publisher

Elsevier
DOI: 10.1016/j.future.2019.07.007

Keywords

Multimodal fusion; Emotion analysis; 3D convolutional neural network; Recurrent neural network

Funding

  1. Beijing Institute of Technology Research Fund Program for Young Scholars, China
  2. National Natural Science Foundation of China [61772099]
  3. Program for Innovation Team Building at Institutions of Higher Education in Chongqing, China [CXTDG201602010]
  4. University Outstanding Achievements Transformation Funding Project of Chongqing, China [KJZH17116]
  5. Artificial Intelligence Technology Innovation Important Subject Projects of Chongqing, China [cstc2017rgzn-zdyf0140]
  6. Innovation and Entrepreneurship Demonstration Team Cultivation Plan of Chongqing, China [cstc2017kjrc-cxcytd0063]
  7. China Postdoctoral Science Foundation [2014M562282]
  8. Postdoctoral Project Supported in Chongqing, China [Xm2014039]
  9. Wenfeng Leading Top Talent Project in CQUPT, China
  10. New Research Area Development Programme [A201544]
  11. Science and Technology Research Project of Chongqing Municipal Education Committee, China [KJ1400422, KJ1500441, KJ1704089, KJ1704081]
  12. Chongqing Research Program of Basic Research and Frontier Technology, China [cstc2017jcyjAX0270, cstc2018jcyjA0672, cstc2017jcyjAX0071]
  13. Industry Important Subject Projects of Chongqing, China [CSTC2018JSZX-CYZTZX0178, CSTC2018JSZX-CYZTZX0185]

Abstract

With the proliferation of online video publishing, the amount of multimodal content on the Internet has grown exponentially, and research on emotion analysis has developed from traditional single-modality analysis to complex multimodal analysis. Even among recent studies that consider multiple modalities, however, most have paid little attention to the emotion information obtained by merging visual and audio cues at the feature or decision level. In this paper, we extract visual, textual, and audio information from video and propose a multimodal emotion classification framework to capture the emotions of users in social networks. We design a 3DCLS (3D Convolutional-Long Short-Term Memory) hybrid model that classifies visual emotions, as well as a CNN-RNN hybrid model that classifies text-based emotions; the visual, audio, and text modalities are then combined to generate the final classification results. Experiments on the MOUD and IEMOCAP emotion datasets show that the proposed framework outperforms existing models in multimodal emotion analysis. (C) 2019 Elsevier B.V. All rights reserved.
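
The paper itself specifies the exact architectures; as a rough illustration of the kind of pipeline the abstract describes, the PyTorch sketch below wires together a 3D-CNN + LSTM ("3DCLS"-style) visual branch, a CNN-RNN text branch, and decision-level fusion of per-modality class scores. All layer sizes, the four-class label set, the stand-in audio scores, and the equal fusion weights are illustrative assumptions, not the authors' configuration.

```python
# Minimal sketch of a 3D-CNN+LSTM visual branch, a CNN-RNN text branch,
# and decision-level fusion. Hyperparameters are assumptions for illustration.
import torch
import torch.nn as nn

NUM_CLASSES = 4  # assumption: a 4-class emotion setup


class Visual3DCLS(nn.Module):
    """3D convolutions over a video clip, then an LSTM over clip features."""
    def __init__(self, num_classes=NUM_CLASSES):
        super().__init__()
        self.conv = nn.Sequential(
            nn.Conv3d(3, 32, kernel_size=3, padding=1), nn.ReLU(),
            nn.MaxPool3d((1, 2, 2)),
            nn.Conv3d(32, 64, kernel_size=3, padding=1), nn.ReLU(),
            nn.AdaptiveAvgPool3d((None, 1, 1)),  # keep time axis, pool space away
        )
        self.lstm = nn.LSTM(input_size=64, hidden_size=128, batch_first=True)
        self.head = nn.Linear(128, num_classes)

    def forward(self, clips):                     # clips: (batch, 3, time, H, W)
        feats = self.conv(clips)                  # (batch, 64, time, 1, 1)
        feats = feats.flatten(2).transpose(1, 2)  # (batch, time, 64)
        _, (h_n, _) = self.lstm(feats)
        return self.head(h_n[-1])                 # per-class scores


class TextCNNRNN(nn.Module):
    """1D convolution over word embeddings, then a GRU over conv features."""
    def __init__(self, vocab_size=20000, emb_dim=100, num_classes=NUM_CLASSES):
        super().__init__()
        self.emb = nn.Embedding(vocab_size, emb_dim)
        self.conv = nn.Conv1d(emb_dim, 64, kernel_size=3, padding=1)
        self.rnn = nn.GRU(64, 128, batch_first=True)
        self.head = nn.Linear(128, num_classes)

    def forward(self, tokens):                        # tokens: (batch, seq_len)
        x = self.emb(tokens).transpose(1, 2)          # (batch, emb_dim, seq_len)
        x = torch.relu(self.conv(x)).transpose(1, 2)  # (batch, seq_len, 64)
        _, h_n = self.rnn(x)
        return self.head(h_n[-1])                     # per-class scores


def decision_fusion(score_list, weights=None):
    """Late fusion: weighted average of per-modality softmax scores."""
    probs = [torch.softmax(s, dim=-1) for s in score_list]
    if weights is None:
        weights = [1.0 / len(probs)] * len(probs)  # assumption: equal weights
    fused = sum(w * p for w, p in zip(weights, probs))
    return fused.argmax(dim=-1)


if __name__ == "__main__":
    video = torch.randn(2, 3, 16, 64, 64)        # 2 clips of 16 RGB frames
    text = torch.randint(0, 20000, (2, 30))      # 2 token-id sequences
    audio_scores = torch.randn(2, NUM_CLASSES)   # stand-in for an audio branch
    v, t = Visual3DCLS()(video), TextCNNRNN()(text)
    print(decision_fusion([v, t, audio_scores])) # fused emotion labels
```

Feature-level fusion, which the abstract also mentions as an alternative, would instead concatenate the branch features before a shared classifier rather than averaging per-branch softmax scores.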
