4.7 Article

Distributed and Scalable Cooperative Formation of Unmanned Ground Vehicles Using Deep Reinforcement Learning

Journal

AEROSPACE
Volume 10, Issue 2, Pages -

Publisher

MDPI
DOI: 10.3390/aerospace10020096

Keywords

unmanned ground vehicles (UGVs); deep reinforcement learning; deep deterministic policy gradient (DDPG); multiagent systems; distributed formation control

Ask authors/readers for more resources

Cooperative formation control of unmanned ground vehicles (UGVs) is a significant research hotspot in UGV applications, attracting increasing attention in both military and civil domains. Compared to traditional algorithms, reinforcement-learning-based algorithms equipped with artificial intelligence offer a lower complexity solution for real-time formation control. This paper proposes a distributed deep-reinforcement-learning-based algorithm to address the navigation, maintenance, and obstacle avoidance tasks of UGV formations. The algorithm's effectiveness and scalability are validated through formation simulation experiments of different scales.
Cooperative formation control of unmanned ground vehicles (UGVs) has become one of the important research hotspots in the application of UGV and attracted more and more attention in the military and civil fields. Compared with traditional formation control algorithms, reinforcement-learning-based algorithms can provide a new solution with a lower complexity for real-time formation control by equipping UGVs with artificial intelligence. Therefore, in this paper, a distributed deep-reinforcement-learning-based cooperative formation control algorithm is proposed to solve the navigation, maintenance, and obstacle avoidance tasks of UGV formations. More importantly, the hierarchical triangular formation structure and the newly designed Markov decision process for UGV formations of leader and follower attributes make the control strategy learned by the algorithm reusable, so that the formation can arbitrarily increase the number of UGVs and realize a more flexible expansion. The effectiveness and scalability of the algorithm is verified by formation simulation experiments of different scales.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available