4.5 Article

Efficient Online Interest-Driven Exploration for Developmental Robots

期刊

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TCDS.2020.3001633

关键词

Data driven; developmental robot; efficient learning; intrinsic motivation; inverse kinematics (IK); inverse robot models; inverse statics; online learning

资金

  1. Deutscher Akademischer Austauschdienst DAAD-Research Grants-Doctoral Programme in Germany Scholarship

向作者/读者索取更多资源

This article introduces the challenge of high sample complexity in online robot learning and proposes new methods to overcome this problem. The author utilizes exploration and intrinsic motivation signals to drive robot online learning and introduces an episodic online mental replay method to accelerate the learning process. The efficiency and applicability of these methods are demonstrated with experiments on a physical robot.
A major challenge for online and data-driven model learning in robotics is the high sample complexity. This hinders its efficiency and practical feasibility for lifelong learning, in particular, for developmental robots that autonomously bootstrap their sensorimotor skills in an open-ended environment. In this work, we propose new methods to mediate this problem in order to permit the learning of robot models online, from scratch, and in learning while behaving fashion. Exploration is utilized and autonomously driven by a novel intrinsic motivation signal which combines knowledge-based and competence-based elements and surpasses other state-of-the-art methods. In addition, we propose an episodic online mental replay to accelerate online learning, to ensure sample efficiency, and to update the model online rapidly. The efficiency as well as the applicability of our methods are demonstrated with a physical 7-DoF Baxter manipulator. We show that our learning schemes are able to drastically reduce the sample complexity and learn the data-driven model online, even within a limited time frame.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.5
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据