Article

Hybrid autonomous controller for bipedal robot balance with deep reinforcement learning and pattern generators

Journal

ROBOTICS AND AUTONOMOUS SYSTEMS
Volume 146, Issue -, Pages -

Publisher

ELSEVIER
DOI: 10.1016/j.robot.2021.103891

Keywords

Bipedal robot; Pattern generator; Reinforcement learning; Hybrid controller

Funding

  1. Engineering and Physical Sciences Research Council (EPSRC) Center for Doctoral Training in Embedded Intelligence (CDT-EI) [EP/L014998/1]


The paper proposes a hybrid autonomous controller that hierarchically combines two separate systems so that bipedal robots working in close collaboration with humans can recover their balance after a push. Combining a hardcoded controller with a reinforcement learning controller balances speed against adaptability, allowing the system to maintain efficient control in new dynamic environments.
Recovering after an abrupt push is essential for bipedal robots in real-world applications, particularly in environments where humans must collaborate closely with robots. There are several balancing algorithms for bipedal robots in the literature; however, most of them rely on either hard coding or power-hungry algorithms. We propose a hybrid autonomous controller that hierarchically combines two separate, efficient systems to address this problem. The lower-level system is a reliable, high-speed, full-state controller that was hardcoded on a microcontroller to be power efficient. The higher-level system is a low-speed reinforcement learning controller implemented on a low-power onboard computer. While one controller offers speed, the other provides trainability and adaptability. Together they form an efficient controller that does not sacrifice adaptability to new dynamic environments. Additionally, as the higher-level system is trained via deep reinforcement learning, the robot could learn after deployment, which is ideal for real-world applications. The system's performance is validated with a real robot recovering after a random push in less than 5 s, with minimal steps from its initial position. The training was conducted using simulated data. © 2021 The Author(s). Published by Elsevier B.V.
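
The two-rate hierarchy described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the plant is a toy linearised inverted pendulum, the gains, time step and update ratio are hand-picked assumptions, and `high_level_policy` is a hypothetical placeholder standing in for a trained reinforcement learning policy. It only shows how a slow supervisory loop can set targets for a fast full-state feedback loop.

```python
import numpy as np


def low_level_control(state, setpoint, gain):
    """Fast loop: full-state feedback u = -K (x - x_ref)."""
    return float(-gain @ (state - setpoint))


def high_level_policy(state):
    """Slow loop: hypothetical stand-in for a learned policy.

    A trained RL policy would map the observed state to a new target;
    here the target is always the upright rest state (assumption).
    """
    return np.zeros_like(state)


def simulate(initial_state, steps=2000, dt=0.001, ratio=20):
    """Run the fast loop every step and the slow loop every `ratio` steps."""
    # Toy plant, x = [angle, angular velocity], g / l with l = 1 m (assumed).
    A = np.array([[0.0, 1.0], [9.81, 0.0]])
    B = np.array([0.0, 1.0])
    gain = np.array([30.0, 8.0])  # hand-picked stabilising gains (assumed)

    state = np.array(initial_state, dtype=float)
    setpoint = np.zeros(2)
    for k in range(steps):
        if k % ratio == 0:
            # Low-rate update from the higher-level controller.
            setpoint = high_level_policy(state)
        # High-rate update from the lower-level controller.
        u = low_level_control(state, setpoint, gain)
        # Forward-Euler integration of the linearised dynamics.
        state = state + dt * (A @ state + B * u)
    return state


if __name__ == "__main__":
    # A "push" is modelled as a non-zero initial angle and angular velocity.
    print(simulate([0.2, 0.5]))
```

In this sketch the fast loop runs 20 times per slow-loop update, mirroring the split between a microcontroller and an onboard computer. The actual rates, state vector and policy used in the paper are not given in the abstract.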
