4.7 Article

Deep Reinforcement Learning Based Computation Offloading and Trajectory Planning for Multi-UAV Cooperative Target Search

Journal

IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS
Volume 41, Issue 2, Pages 504-520

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/JSAC.2022.3228558

Keywords

Task analysis; Search problems; Uncertainty; Edge computing; Autonomous aerial vehicles; Trajectory; Servers; Unmanned aerial vehicle; cooperative target search; edge computing; computation offloading; deep reinforcement learning (DRL)

Ask authors/readers for more resources

In this paper, the concept of uncertainty is used to evaluate the search process, and a deep reinforcement learning technique is proposed for optimal computation offloading decisions and flying orientation choices in multi-UAV cooperative target search. The uncertainty minimization problem is formulated based on a system model and transformed into a reward maximization problem, which is further analyzed using a Markov decision process. A deep Q-network based DRL architecture is then proposed to obtain the optimal task offloading decisions and flying orientation choices. Extensive simulations validate the effectiveness of the proposed techniques, and comprehensive discussions on the impact of different parameters on search performance are provided.
Unmanned aerial vehicles (UAVs) are widely used for surveillance and monitoring to complete target search tasks. However, the short battery life and moderate computational capability hinder UAVs to process computation-intensive tasks. The emerging edge computing technologies can alleviate this problem by offloading tasks to the ground edge servers. How to evaluate the search process so as to make optimal offloading decisions and make optimal flying trajectories represent fundamental research challenges. In this paper, we propose to utilize the concept of uncertainty to evaluate the search process, which reflects the reliability of the target search results. Thereafter, we propose a deep reinforcement learning (DRL) technique to jointly make optimal computation offloading decisions and flying orientation choices for multi-UAV cooperative target search. Specifically, we first formulate an uncertainty minimization problem based on the established system model. By introducing a reward function, we prove that the uncertainty minimization problem is equivalent to a reward maximization problem, which is further analyzed by a Markov decision process (MDP). To obtain the optimal task offloading decisions and flying orientation choices, a deep Q-network (DQN) based DRL architecture with a separated Q-network is then proposed. Finally, extensive simulations validate the effectiveness of the proposed techniques, and comprehensive discussions on how different parameters affect the search performance are given.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available