☆ 4.7 Article

Multimodal estimation and communication of latent semantic knowledge for robust execution of robot instructions

INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH (2020)

期刊

INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH

卷 39, 期 10-11, 页码 1279-1304

出版社

SAGE PUBLICATIONS LTD

DOI: 10.1177/0278364920917755

关键词

Human-robot collaboration; semantic state estimation; Bayesian modeling; multimodal interaction; natural language understanding

类别

Robotics

资金

Robotics Consortium of the US Army Research Laboratory under the Collaborative Technology Alliance Program (RCTA
ARO grant) [W911NF-151-0402]
Toyota Research Institute (TRI) [LPC000765-SR]
Lockheed Martin Co.

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

摘要

The goal of this article is to enable robots to perform robust task execution following human instructions in partially observable environments. A robot's ability to interpret and execute commands is fundamentally tied to its semantic world knowledge. Commonly, robots use exteroceptive sensors, such as cameras or LiDAR, to detect entities in the workspace and infer their visual properties and spatial relationships. However, semantic world properties are often visually imperceptible. We posit the use of non-exteroceptive modalities including physical proprioception, factual descriptions, and domain knowledge as mechanisms for inferring semantic properties of objects. We introduce a probabilistic model that fuses linguistic knowledge with visual and haptic observations into a cumulative belief over latent world attributes to infer the meaning of instructions and execute the instructed tasks in a manner robust to erroneous, noisy, or contradictory evidence. In addition, we provide a method that allows the robot to communicate knowledge dissonance back to the human as a means of correcting errors in the operator's world model. Finally, we propose an efficient framework that anticipates possible linguistic interactions and infers the associated groundings for the current world state, thereby bootstrapping both language understanding and generation. We present experiments on manipulators for tasks that require inference over partially observed semantic properties, and evaluate our framework's ability to exploit expressed information and knowledge bases to facilitate convergence, and generate statements to correct declared facts that were observed to be inconsistent with the robot's estimate of object properties.

Multimodal estimation and communication of latent semantic knowledge for robust execution of robot instructions

期刊

INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH

出版社

SAGE PUBLICATIONS LTD

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Multimodal estimation and communication of latent semantic knowledge for robust execution of robot instructions

期刊

INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH

出版社

SAGE PUBLICATIONS LTD

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文