4.7 Article

Distributed and Scalable Cooperative Formation of Unmanned Ground Vehicles Using Deep Reinforcement Learning

期刊

AEROSPACE
卷 10, 期 2, 页码 -

出版社

MDPI
DOI: 10.3390/aerospace10020096

关键词

unmanned ground vehicles (UGVs); deep reinforcement learning; deep deterministic policy gradient (DDPG); multiagent systems; distributed formation control

向作者/读者索取更多资源

Cooperative formation control of unmanned ground vehicles (UGVs) is a significant research hotspot in UGV applications, attracting increasing attention in both military and civil domains. Compared to traditional algorithms, reinforcement-learning-based algorithms equipped with artificial intelligence offer a lower complexity solution for real-time formation control. This paper proposes a distributed deep-reinforcement-learning-based algorithm to address the navigation, maintenance, and obstacle avoidance tasks of UGV formations. The algorithm's effectiveness and scalability are validated through formation simulation experiments of different scales.
Cooperative formation control of unmanned ground vehicles (UGVs) has become one of the important research hotspots in the application of UGV and attracted more and more attention in the military and civil fields. Compared with traditional formation control algorithms, reinforcement-learning-based algorithms can provide a new solution with a lower complexity for real-time formation control by equipping UGVs with artificial intelligence. Therefore, in this paper, a distributed deep-reinforcement-learning-based cooperative formation control algorithm is proposed to solve the navigation, maintenance, and obstacle avoidance tasks of UGV formations. More importantly, the hierarchical triangular formation structure and the newly designed Markov decision process for UGV formations of leader and follower attributes make the control strategy learned by the algorithm reusable, so that the formation can arbitrarily increase the number of UGVs and realize a more flexible expansion. The effectiveness and scalability of the algorithm is verified by formation simulation experiments of different scales.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据