☆ 4.5 Article

Fuzzy Deep Deterministic Policy Gradient-Based Motion Controller for Humanoid Robot

INTERNATIONAL JOURNAL OF FUZZY SYSTEMS (2022)

期刊

INTERNATIONAL JOURNAL OF FUZZY SYSTEMS

卷 24, 期 5, 页码 2476-2492

出版社

SPRINGER HEIDELBERG

DOI: 10.1007/s40815-022-01293-0

关键词

DDPG; Fuzzy logic; Humanoid robot; Intelligent control

类别

Automation & Control Systems Computer Science, Artificial Intelligence Computer Science, Information Systems

资金

Ministry of Science and Technology, Taiwan [MOST 109-2221-E-194-053-MY3]

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

智能总结 New
摘要

This paper proposes a motion controller based on machine learning and fuzzy logic for humanoid robot arm control. By using the deep deterministic policy gradient (DDPG) algorithm, the humanoid robot can self-learn and autonomously plan the movement and joint angles of its arm. Experimental results show satisfactory learning outcomes and motion control performance when combining fuzzy logic with the DDPG algorithm.

In conventional robot arm control, inverse kinematics (IK) is used as the basis for computing arm joint angles. However, IK can be used to compute joint angles only after the terminal point has been reached, and it cannot optimize arm movements. Furthermore, the singularity problem is sometimes encountered when using IK. For example, if a robot arm in motion passes through a singularity, the next step's movement is incomputable, which results in errors. Therefore, this study did not use IK for computing the joint angles of humanoid robot arms. Instead, this paper proposes a motion controller based on machine learning and fuzzy logic for the aforementioned purpose. Conventional reinforcement learning can provide satisfactory results for a single state but cannot be used to perform optimized calculations for infinite states. To solve this problem, this study used the deep deterministic policy gradient (DDPG) algorithm and allowed a humanoid robot to self-learn and autonomously plan the movement of and joint angles in its arm. A state and its action can be calculated in a hyperplane by using the developed neural network. A continuous mapping relationship exists between the state and its action in this hyperplane. Thus, the humanoid robot obtained optimal learning experiences in multiple self-learning processes. Finally, the concept was incorporated into a visual feedback system to achieve object grasping by the humanoid robot. The humanoid robot exhibited satisfactory learning outcomes-as well as satisfactory motion control performance-in experiments when combining fuzzy logic with the DDPG algorithm.

Fuzzy Deep Deterministic Policy Gradient-Based Motion Controller for Humanoid Robot

期刊

INTERNATIONAL JOURNAL OF FUZZY SYSTEMS

出版社

SPRINGER HEIDELBERG

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Fuzzy Deep Deterministic Policy Gradient-Based Motion Controller for Humanoid Robot

期刊

INTERNATIONAL JOURNAL OF FUZZY SYSTEMS

出版社

SPRINGER HEIDELBERG

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文