4.8 Article

Fuzzy Double Deep Q-Network-Based Gait Pattern Controller for Humanoid Robots

Journal

IEEE TRANSACTIONS ON FUZZY SYSTEMS
Volume 30, Issue 1, Pages 147-161

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TFUZZ.2020.3033141

Keywords

Humanoid robots; Legged locomotion; Robot sensing systems; Control systems; Neural networks; Force; Adaptive-network-based fuzzy inference system (ANFIS); double deep Q-network (DDQN); dynamic gait balance; humanoid robot

Funding

  1. Ministry of Science and Technology, Taiwan [MOST 106-2218-E-153-001-MY3, MOST 106-2221-E-006-009-MY3]

Ask authors/readers for more resources

This article introduces a method combining the adaptive-network-based fuzzy inference system (ANFIS) with the double deep Q-network (DDQN) to create a fuzzy DDQN (FDDQN) that enables a humanoid robot to generate a real-time gait pattern based on a linear inverted pendulum model. The proposed FDDQN not only allows immediate correction of the gait pattern, but also enhances the stability of the robot.
In this article, the adaptive-network-based fuzzy inference system (ANFIS) is combined with the double deep Q-network (DDQN) to realize a fuzzy DDQN (FDDQN) such that a humanoid robot can generate a linear inverted pendulum model-based gait pattern in real time. The FDDQN not only allows the humanoid robot to correct the gait pattern instantly but also improves its stability. The proposed scheme is designed and implemented in a toddler-sized humanoid robot called Louis. First, four pressure sensors are installed on the bottom of the sole and one inertial measurement unit is set up on the trunk of the robot. A wireless communication chip is employed to transfer the data to a computer to determine the required parameters for the robot. Next, a control system based on the Linux operating system is developed. The values of the center of pressure and acceleration obtained with the ANFIS are adopted to train the DDQN. The proposed neural network comprises four layers, and the model is cautiously selected to avoid overfitting. The proposed scheme is verified using a robot simulator and then real-time-tested on Louis. The experimental results indicate that the FDDQN can provide the robot timely feedback during walking as well as helps it in adjusting the gait pattern independently. The balancing of the robot through effective dynamic feedback is similar to the balancing ability of an infant learning to walk.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.8
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available