4.6 Article

Learning Goal Conditioned Socially Compliant Navigation From Demonstration Using Risk-Based Features

期刊

IEEE ROBOTICS AND AUTOMATION LETTERS
卷 6, 期 2, 页码 651-658

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/LRA.2020.3048657

关键词

Navigation; Trajectory; Reinforcement learning; Entropy; Computational modeling; Two dimensional displays; Robot sensing systems; Inverse reinforcement learning; learning from demonstration; motion and path planning; robot navigation; social navigation

类别

资金

  1. Samsung Electronics

向作者/读者索取更多资源

This letter presents a learning-based solution for socially compliant navigation of mobile robots, inferring navigational policies from human examples and validating its effectiveness through comparisons with classical algorithms and reinforcement learning agents. The proposed method and feature representation are found to produce higher quality trajectories and play a critical role in successful navigation.
One of the main challenges of operating mobile robots in social environments is the safe and fluid navigation therein, specifically the ability to share a space with other human inhabitants by complying with the explicit and implicit rules that we humans follow during navigation. While these rules come naturally to us, they resist simple and explicit definitions. In this letter, we present a learning-based solution to address the question of socially compliant navigation, which is to navigate while maintaining adherence to the navigational policies a person might use. We infer these policies by learning from human examples using inverse reinforcement learning techniques. In particular, this letter contributes an efficient sampling-based approximation to enable model-free deep inverse reinforcement learning, and a goal conditioned risk-based feature representation that adequately captures local information surrounding the agent. We validate our approach by comparing against a classical algorithm and a reinforcement learning agent and evaluate our feature representation against similar feature representations from the literature. We find that the combination of our proposed method and our feature representation produce higher quality trajectories and that our proposed feature representation plays a critical role in successful navigation.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据