4.4 Article

Learning Humanoid Robot Running Motions with Symmetry Incentive through Proximal Policy Optimization

Journal

JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS
Volume 102, Issue 3, Pages -

Publisher

SPRINGER
DOI: 10.1007/s10846-021-01355-9

Keywords

Deep reinforcement learning; Robotics; Proximal Policy optimization

Ask authors/readers for more resources

This study introduces a methodology based on deep reinforcement learning to improve running skills in a humanoid robot, achieving remarkable results with Proximal Policy Optimization. The approach outperforms existing technologies by approximately 50% in terms of sprint speed in the RoboCup 3D Soccer Simulation competition. Evaluation of training procedures and controllers in terms of speed, reliability, and human similarity were conducted, with a focus on encouraging symmetry in movements for top speed running policies. Key factors leading to surpassing previous results and suggestions for future research are discussed.
This article contributes with a methodology based on deep reinforcement learning to develop running skills in a humanoid robot with no prior knowledge. Specifically, the algorithm used for learning is the Proximal Policy Optimization (PPO). The chosen application domain is the RoboCup 3D Soccer Simulation (Soccer 3D), a competition where teams composed by 11 autonomous agents each compete in simulated soccer matches. In our approach, the state vector used as the neural network's input consists of raw sensor measurements or quantities which could be obtained through sensor fusion, while the actions are the joint positions, which are sent to joint controllers. Our running behavior outperforms the state-of-the-art in terms of sprint speed by approximately 50%. We present results regarding the training procedure and also evaluate the controllers in terms of speed, reliability, and human similarity. Since the running policies with top speed display asymmetric motions, we also investigate a technique to encourage symmetry in the sagittal plane. Finally, we discuss key factors that lead us to surpass previous results in the literature and share some ideas for future research.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.4
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available