4.8 Article

Learning quadrupedal locomotion on deformable terrain

期刊

SCIENCE ROBOTICS
卷 8, 期 74, 页码 -

出版社

AMER ASSOC ADVANCEMENT SCIENCE
DOI: 10.1126/scirobotics.ade2256

关键词

-

类别

向作者/读者索取更多资源

Simulation-based reinforcement learning approaches are at the forefront of legged robot control innovation. However, they are still unable to perform well on soft and deformable terrains, especially at high speed. To address this issue, we propose a versatile and computationally efficient granular media model for reinforcement learning, coupled with an adaptive control architecture that can identify terrain properties and enhance the locomotion performance of the legged robot.
Simulation-based reinforcement learning approaches are leading the next innovations in legged robot control. However, the resulting control policies are still not applicable on soft and deformable terrains, especially at high speed. The primary reason is that reinforcement learning approaches, in general, are not effective beyond the data distribution: The agent cannot perform well in environments that it has not experienced. To this end, we introduce a versatile and computationally efficient granular media model for reinforcement learning. Our model can be parameterized to represent diverse types of terrain from very soft beach sand to hard asphalt. In addition, we introduce an adaptive control architecture that can implicitly identify the terrain properties as the robot feels the terrain. The identified parameters are then used to boost the locomotion performance of the legged robot. We applied our techniques to the Raibo robot, a dynamic quadrupedal robot developed in-house. The trained networks demonstrated high-speed locomotion capabilities on deformable terrains: The robot was able to run on soft beach sand at 3.03 meters per second although the feet were completely buried in the sand during the stance phase. We also demonstrate its ability to generalize to different terrains by presenting running experiments on vinyl tile flooring, athletic track, grass, and a soft air mattress.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.8
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据