4.7 Article

I-GCN: Incremental Graph Convolution Network for Conversation Emotion Detection

期刊

IEEE TRANSACTIONS ON MULTIMEDIA
卷 24, 期 -, 页码 4471-4481

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TMM.2021.3118881

关键词

Emotion recognition; Emotion detection; Sentiment analysis; Incremental GCN; GCN

资金

  1. National Natural Science Foundation of China [61872267, 61702471, 61772359]
  2. 2019 Tianjin New Generation Artificial Intelligence Major Program [18ZXZNGX00150, 19ZXZNGX00110]
  3. Open Project Program of the State Key Laboratory of CAD & CG, Zhejiang University [A2005, A2012]
  4. Tianjin Science Foundation for Young Scientists of China [19JC-QNJC00500]

向作者/读者索取更多资源

Sentiment analysis and emotion detection in conversation have attracted increasing attention in the fields of social robot, social network, and intelligent voice assistant. In this paper, an incremental graph convolution network (I-GCN) is proposed to handle emotion detection in conversation. The graph structure is utilized to represent the semantic correlation information of utterances, while the incremental graph structure is used to preserve the temporal change information. Two types of GCN, namely utterance-level GCN (U-GCN) and speaker-level GCN (S-GCN), are introduced to learn the features of utterances. The parameters of the model are fine-tuned with new utterances to enhance the contribution of temporal change information. Experimental results show the effectiveness of the proposed method in three conversation corpuses.
Sentiment analysis and emotion detection in conversation are becoming hot topics in regard to several applications. With the development of the social robot, social network, and intelligent voice assistant, emotion detection is attracting more attention as a key component in these research fields. Many approaches have been proposed to handle this problem in recent years. However, these previous approaches focus on either the temporal change information of the conversation or the semantic correlation information of the dialogue but ignore the combination of temporal information and semantic correlation information. In this paper, we propose an incremental graph convolution network (I-GCN) to handle emotion detection in conversation. We first utilize the graph structure to represent conversation at different times, which can represent the semantic correlation information of utterances. Then, we apply the incremental graph structure to imitate the process of dynamic conversation, which can preserve the temporal change information of conversation. Especially, for the first step of the process, we creatively propose utterance-level GCN (U-GCN) and speaker-level GCN (S-GCN) to learn the features of utterances for emotion detection. U-GCN focuses on the correlations among utterances and applies the multi-head attention model to find latent correlation information among utterances, which aims to further enhance the guidance of semantic relevance for feature learning. S-GCN focuses on the correlation between speaker and utterances, which can provide a different angle to guide the feature learning of utterances. In the learning of model parameters, we constantly utilize the new utterances to fine-tune the parameters of GNN for enhancement of the contribution of temporal change information. Detailed evaluations of the proposed method on three published conversation corpuses demonstrate the great effectiveness of our approach over several conventional competitive baselines.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据