Article

Visuo-auditory Multimodal Emotional Structure to Improve Human-Robot-Interaction

INTERNATIONAL JOURNAL OF SOCIAL ROBOTICS (2012)

Journal

INTERNATIONAL JOURNAL OF SOCIAL ROBOTICS

Volume 4, Issue 1, Pages 29-51

Publisher

SPRINGER

DOI: 10.1007/s12369-011-0134-7

Keywords

Visual perception; Auditory perception; Emotion recognition; Multimodal interaction; Social behavior profile; Bayesian networks

Category

Robotics

Funding

Institute of Systems and Robotics at University of Coimbra (ISR-UC)
Portuguese Foundation for Science and Technology (FCT) [SFRH/BD/60954/2009, PTDC/SAU-BEB/100147/2008]
Polytechnical Institute of Leiria
Fundação para a Ciência e a Tecnologia [PTDC/SAU-BEB/100147/2008, SFRH/BD/60954/2009] Funding Source: FCT

Abstract

We propose an approach to analyze and synthesize a set of human facial and vocal expressions, and then use the classified expressions to decide the robot's response in a human-robot interaction. During a human-to-human conversation, a person senses the interlocutor's face and voice, perceives his or her emotional expressions, and processes this information to decide which response to give. The response may be aggressive, funny (henceforth meaning humorous), or neutral, depending not only on the observed emotions but also on the personality of the person. The purpose of the proposed structure is to endow robots with the capability to model human emotions, which requires solving several subproblems: feature extraction, classification, decision, and synthesis. The proposed approach integrates two classifiers for emotion recognition, one from audio and one from video, and then uses a new method to fuse their outputs with the social behavior profile. To keep the person engaged in the interaction, after each iteration of analysis the robot synthesizes a human voice with both lip synchronization and facial expressions. The social behavior profile governs the personality of the robot. The structure and workflow of the synthesis and decision stages are addressed, and the Bayesian networks are discussed. We also study how to analyze and synthesize emotion from facial and vocal expressions. A new probabilistic structure that enables a higher level of interaction between a human and a robot is proposed.
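As a rough illustration of the pipeline the abstract outlines (two emotion classifiers, fusion, then a response chosen under a social behavior profile), the sketch below may help. It is not the authors' method: the emotion labels, the profile names ("humorous", "stern"), the probability tables, and the functions `fuse` and `decide` are all hypothetical, and the fusion shown is a plain naive-Bayes product that assumes the audio and video outputs are conditionally independent given the emotion.

```python
# Hypothetical label sets; the paper's actual emotion classes are not given here.
EMOTIONS = ["happy", "sad", "angry", "neutral"]
RESPONSES = ["aggressive", "funny", "neutral"]  # response types named in the abstract

# Hypothetical table P(response | emotion, profile); all values are illustrative only.
RESPONSE_TABLE = {
    "humorous": {
        "happy":   {"aggressive": 0.0, "funny": 0.7, "neutral": 0.3},
        "sad":     {"aggressive": 0.0, "funny": 0.4, "neutral": 0.6},
        "angry":   {"aggressive": 0.1, "funny": 0.3, "neutral": 0.6},
        "neutral": {"aggressive": 0.0, "funny": 0.5, "neutral": 0.5},
    },
    "stern": {
        "happy":   {"aggressive": 0.1, "funny": 0.1, "neutral": 0.8},
        "sad":     {"aggressive": 0.1, "funny": 0.0, "neutral": 0.9},
        "angry":   {"aggressive": 0.6, "funny": 0.0, "neutral": 0.4},
        "neutral": {"aggressive": 0.1, "funny": 0.0, "neutral": 0.9},
    },
}


def fuse(p_audio, p_video, prior=None):
    """Combine two per-emotion score dicts with a naive-Bayes product rule.

    Assumes conditional independence of the two modalities given the emotion,
    and treats each classifier's output as a likelihood for that emotion.
    """
    if prior is None:
        prior = {e: 1.0 / len(EMOTIONS) for e in EMOTIONS}
    unnormalized = {e: prior[e] * p_audio[e] * p_video[e] for e in EMOTIONS}
    total = sum(unnormalized.values())
    if total == 0:
        raise ValueError("classifier outputs are incompatible (all products are zero)")
    return {e: v / total for e, v in unnormalized.items()}


def decide(posterior, profile):
    """Marginalize over emotions to score each response under the given profile."""
    table = RESPONSE_TABLE[profile]
    scores = {
        r: sum(posterior[e] * table[e][r] for e in EMOTIONS)
        for r in RESPONSES
    }
    best = max(scores, key=scores.get)
    return best, scores


if __name__ == "__main__":
    audio = {"happy": 0.6, "sad": 0.2, "angry": 0.1, "neutral": 0.1}
    video = {"happy": 0.5, "sad": 0.1, "angry": 0.2, "neutral": 0.2}
    posterior = fuse(audio, video)
    for profile in RESPONSE_TABLE:
        print(profile, decide(posterior, profile)[0])
```

The same fused posterior can lead to different responses under different profiles, which is the role the abstract assigns to the social behavior profile. A full Bayesian network, as discussed in the paper, could encode richer dependencies than this two-step product-and-marginalize sketch.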
