4.7 Article

Robot Motor Skill Transfer With Alternate Learning in Two Spaces

期刊

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TNNLS.2020.3021530

关键词

Task analysis; Robot motion; Planning; Optimization; Trajectory; Dynamics; Alternate learning in two spaces (ALTS); improved locally weighted regression (iLWR); motor skill acquisition; policy improvement with path integral by dual perturbation (PI2-DP)

资金

  1. National Natural Science Foundation of China [61773299, 51575412]
  2. Excellent Dissertation Cultivation Funds of the Wuhan University of Technology [2017-YS-066, 2017-YS-067]

向作者/读者索取更多资源

The research proposes a framework based on improved locally weighted regression and policy improvement with path integral for robot intelligent trajectory planning, aiming to address the challenge of generating new robot motions automatically to perform new tasks.
Recent research achievements in learning from demonstration (LfD) illustrate that the reinforcement learning is effective for the robots to improve their movement skills. The current challenge mainly remains in how to generate new robot motions automatically to perform new tasks, which have a similar preassigned performance indicator but are different from the demonstration tasks. To deal with the abovementioned issue, this article proposes a framework to represent the policy and conduct imitation learning and optimization for robot intelligent trajectory planning, based on the improved locally weighted regression (iLWR) and policy improvement with path integral by dual perturbation (PI2-DP). Besides, the reward-guided weight searching and basis function's adaptive evolving are performed alternately in two spaces, i.e., the basis function space and the weight space, to deal with the abovementioned problem. The alternate learning process constructs a sequence of two-tuples that join the demonstration task and new one together for motor skill transfer, so that the robot gradually acquires motor skill, from the task similar to demonstration to dissimilar tasks with different performance metrics. Classical via-points trajectory planning experiments are performed with the SCARA manipulator, a 10-degree of freedom (DOF) planar, and the UR robot. These results show that the proposed method is not only feasible but also effective.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据