Article

Multi-modal feature fusion for better understanding of human personality traits in social human-robot interaction

ROBOTICS AND AUTONOMOUS SYSTEMS (2021)

Journal

ROBOTICS AND AUTONOMOUS SYSTEMS

Volume 146, Issue -, Pages -

Publisher

ELSEVIER

DOI: 10.1016/j.robot.2021.103874

Keywords

Human-robot interaction; Human personality traits; Multi-modal feature fusion; Machine learning

Categories

Automation & Control Systems; Computer Science, Artificial Intelligence; Robotics

Funding

Air Force Office of Scientific Research, United States [AFOSR-AOARD/FA2386-19-1-4015]
Shibuya Science, Culture, and Sports Foundation 2019 Grant Program, Japan

Summary

This study addressed three main challenges in human-robot interaction: fusing different interaction modalities, integrating variable-length feature vectors, and compensating for camera shake caused by robot movements. By extracting key visual and audio features and applying a multi-layer Hidden Markov Model, it improved the classification accuracy of personality traits, which is expected to enhance the communicative competence of social robots.

Abstract

As human-robot interaction becomes increasingly prevalent in daily life, there is great demand for robots that better understand human personality traits and that encourage humans to engage more fully in the interaction. In this work, we design the human-robot interaction paradigm to be as close to real situations as possible and address three main problems: (1) fusion of visual and audio features of human interaction modalities, (2) integration of variable-length feature vectors, and (3) compensation for shaky camera motion caused by the robot's communicative gestures. Specifically, the three most important human visual features (head motion, gaze, and body motion) were extracted from a camera mounted on the robot while it performed verbal and body gestures during the interaction. Our system then fused these visual features with several types of vocal features, such as voice pitch, voice energy, and Mel-Frequency Cepstral Coefficients, while handling multiple feature vectors of variable length. Lastly, considering the unknown patterns and sequential characteristics of human communicative behavior, we proposed a multi-layer Hidden Markov Model that improved the classification accuracy of personality traits and offered notable advantages in fusing the multiple features. The results were thoroughly analyzed and supported by psychological studies. The proposed multi-modal fusion approach is expected to deepen the communicative competence of social robots interacting with humans from different cultures and backgrounds. (C) 2021 Elsevier B.V. All rights reserved.
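
The abstract does not give implementation details, so the following is only a minimal sketch of one way that variable-length feature sequences from two modalities could be scored and fused with Hidden Markov Models. It is not the authors' multi-layer model: it uses one single-layer, discrete-output HMM per modality and class, and fuses them by summing log-likelihoods (late fusion). The function names, the class labels "high" and "low", the modality names, and every probability value are hypothetical and chosen purely for illustration.

```python
import math


def log_likelihood(obs, start, trans, emit):
    """Scaled forward algorithm: log P(obs | HMM) for a discrete-output HMM.

    obs   : list of symbol indices, any length >= 1
    start : start[i]    = P(first state is i)
    trans : trans[i][j] = P(next state is j | current state is i)
    emit  : emit[i][k]  = P(symbol k | state i)
    """
    n = len(start)
    alpha = [start[i] * emit[i][obs[0]] for i in range(n)]
    total = sum(alpha)
    log_p = math.log(total)
    alpha = [a / total for a in alpha]  # rescale to avoid underflow
    for symbol in obs[1:]:
        alpha = [
            sum(alpha[i] * trans[i][j] for i in range(n)) * emit[j][symbol]
            for j in range(n)
        ]
        total = sum(alpha)
        log_p += math.log(total)
        alpha = [a / total for a in alpha]
    return log_p


def classify(sequences, models):
    """Late fusion: sum per-modality log-likelihoods, return the best class.

    sequences : {modality: observation sequence}; lengths may differ
    models    : {class_label: {modality: (start, trans, emit)}}
    """
    scores = {
        label: sum(log_likelihood(sequences[m], *hmm) for m, hmm in per_mod.items())
        for label, per_mod in models.items()
    }
    return max(scores, key=scores.get), scores


# Hypothetical 2-state, 2-symbol HMMs (symbol 1 = "active" frame, 0 = "quiet").
HIGH = ([0.5, 0.5], [[0.7, 0.3], [0.3, 0.7]], [[0.3, 0.7], [0.1, 0.9]])
LOW = ([0.5, 0.5], [[0.7, 0.3], [0.3, 0.7]], [[0.9, 0.1], [0.7, 0.3]])

MODELS = {
    "high": {"visual": HIGH, "audio": HIGH},
    "low": {"visual": LOW, "audio": LOW},
}

# The two modalities have different lengths (6 vs. 4 observations).
SEQUENCES = {"visual": [1, 1, 0, 1, 1, 1], "audio": [1, 0, 1, 1]}

label, scores = classify(SEQUENCES, MODELS)
print(label, scores)
```

Because each HMM returns a single log-likelihood regardless of sequence length, modalities sampled at different rates can be combined without padding or truncation. That is the property this sketch is meant to show.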
