3.8 Article

A path planning algorithm fusion of obstacle avoidance and memory functions

Journal

Publisher

WILEY
DOI: 10.1049/ccs2.12098

Keywords

artificial intelligence; deep reinforcement learning; intelligent robots; mobile robots; path planning

In this study, the authors modify the Deep Deterministic Policy Gradient (DDPG) algorithm to address slow convergence and poor learning efficiency in the initial stages of training. They improve DDPG's selection strategy to speed up convergence and reduce the time the mobile robot takes to reach the target point. They also restructure the DDPG neural network around a Long Short-Term Memory (LSTM) network to speed up convergence in complex dynamic scenes.
In this study, the authors improve and optimise the Deep Deterministic Policy Gradient (DDPG) algorithm to address slow convergence and poor learning efficiency in the initial stages of training. First, inspired by the Artificial Potential Field (APF) method, the selection strategy of DDPG is improved to speed up convergence during the early stages of training and reduce the time the mobile robot takes to reach the target point. Then, the neural network structure of the DDPG algorithm is optimised with a Long Short-Term Memory (LSTM) network to speed up convergence in complex dynamic scenes. Static and dynamic scene simulation experiments with mobile robots are carried out in ROS. The test results show that the resulting APF-LSTM DDPG algorithm converges significantly faster in complex dynamic scenes, and that its success rate is 7.3% and 3.6% higher than that of the DDPG and LSTM-DDPG algorithms, respectively. Finally, the method is also validated in real environments on a real mobile robot platform, laying a foundation for path planning of mobile robots in complex, changing conditions.
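
The abstract does not say how the APF idea enters the selection strategy, so the following is only a minimal sketch of one plausible reading, not the authors' implementation. It assumes that "selection" means action selection, that an action is a (linear velocity, angular velocity) pair, and that APF guidance replaces the policy's turn command with a probability that decays linearly over episodes. The names `apf_heading` and `select_action`, the gains `k_att` and `k_rep`, the influence radius `d0`, and `decay_episodes` are all hypothetical.

```python
import math
import random


def apf_heading(robot, goal, obstacles, k_att=1.0, k_rep=0.5, d0=1.0):
    """Heading (radians) of the net artificial-potential-field force at `robot`.

    Uses the textbook APF form: a linear attractive force towards the goal
    plus a repulsive force from each obstacle closer than `d0`.
    All gains are illustrative assumptions, not values from the paper.
    """
    fx = k_att * (goal[0] - robot[0])
    fy = k_att * (goal[1] - robot[1])
    for ox, oy in obstacles:
        dx, dy = robot[0] - ox, robot[1] - oy
        d = math.hypot(dx, dy)
        if 0.0 < d < d0:
            # Negative gradient of 0.5 * k_rep * (1/d - 1/d0)^2,
            # which points away from the obstacle.
            mag = k_rep * (1.0 / d - 1.0 / d0) / d ** 2
            fx += mag * dx / d
            fy += mag * dy / d
    return math.atan2(fy, fx)


def select_action(policy_action, robot, yaw, goal, obstacles, episode,
                  rng=random, decay_episodes=200, max_turn=1.0):
    """Return the actor's action, or an APF-guided one early in training.

    With probability p = max(0, 1 - episode / decay_episodes) the angular
    velocity is replaced by a clamped turn towards the APF heading; the
    linear velocity from the policy is kept. (Assumed scheme.)
    """
    p_guide = max(0.0, 1.0 - episode / decay_episodes)
    if rng.random() < p_guide:
        err = apf_heading(robot, goal, obstacles) - yaw
        err = math.atan2(math.sin(err), math.cos(err))  # wrap to [-pi, pi]
        turn = max(-max_turn, min(max_turn, err))
        return (policy_action[0], turn)
    return policy_action
```

Under these assumptions, early episodes are steered mostly by the potential field, which gives the replay buffer goal-directed transitions before the actor has learned anything useful. Once `episode` reaches `decay_episodes`, the function returns the policy action unchanged.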
