4.7 Article

Path Planning Algorithm for Dual-Arm Robot Based on Depth Deterministic Gradient Strategy Algorithm

期刊

MATHEMATICS
卷 11, 期 20, 页码 -

出版社

MDPI
DOI: 10.3390/math11204392

关键词

path planning; dual-arm robot; multi-agent reinforcement learning

向作者/读者索取更多资源

This paper introduces a reinforcement learning algorithm for dual-arm robot path planning, which improves efficiency through the use of experience replay, shortest-path constraint, and policy gradient. The algorithm demonstrates strong performance in collaborative task execution and addresses the challenge of reward sparsity, as validated through simulations and experiments.
In recent years, the utilization of dual-arm robots has gained substantial prominence across various industries owing to their collaborative operational capabilities. In order to achieve collision avoidance and facilitate cooperative task completion, efficient path planning plays a pivotal role. The high dimensionality associated with collaborative task execution in dual-arm robots renders existing path planning methods ineffective for conducting efficient exploration. This paper introduces a multi-agent path planning reinforcement learning algorithm that integrates an experience replay strategy, a shortest-path constraint, and the policy gradient method. To foster collaboration and avoid competition between the robot arms, the proposed approach incorporates a mechanism known as reward cooperation, punishment competition during the training process. Our algorithm demonstrates strong performance in the control of dual-arm robots and exhibits the potential to mitigate the challenge of reward sparsity encountered during the training process. The effectiveness of the proposed algorithm is validated through simulations and experiments, comparing the results with existing methods and showcasing its superiority in dual-arm robot path planning.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据