4.6 Article

Understanding the stability of deep control policies for biped locomotion

Journal

VISUAL COMPUTER
Volume 39, Issue 1, Pages 473-487

Publisher

SPRINGER
DOI: 10.1007/s00371-021-02342-9

Keywords

Biped locomotion; Deep reinforcement learning; Gait analysis; Physically based simulation; Push-recovery stability

Ask authors/readers for more resources

The primary goal of this study is to address questions regarding the robustness of deep control policies compared with human walking, and to evaluate the effectiveness of different variants of DRL algorithms.
Achieving stability and robustness is the primary goal of biped locomotion control. Recently, deep reinforcement learning (DRL) has attracted great attention as a general methodology for constructing biped control policies and demonstrated significant improvements over the previous state-of-the-art control methods. Although deep control policies are more advantageous compared with previous controller design approaches, many questions remain: Are deep control policies as robust as human walking? Does simulated walking involve strategies similar to human walking for maintaining balance? Does a particular gait pattern affect human and simulated walking similarly? What do deep policies learn to achieve improved gait stability? The goal of this study is to address these questions by evaluating the push-recovery stability of deep policies compared with those of human subjects and a previous feedback controller. Furthermore, we conducted experiments to evaluate the effectiveness of variants of DRL algorithms.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available