4.8 Article

Learning-Based UAV Path Planning for Data Collection With Integrated Collision Avoidance

期刊

IEEE INTERNET OF THINGS JOURNAL
卷 9, 期 17, 页码 16663-16676

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/JIOT.2022.3153585

关键词

Collision avoidance; data collection; deep reinforcement learning (RL); multiunmanned aerial vehicle (UAV) scenarios; path planning

向作者/读者索取更多资源

This article discusses the challenging task of determining collision-free trajectory in multi-UAV noncooperative scenarios while collecting data from distributed IoT nodes. The problem is translated into a Markov decision process and a Dueling double deep Q-network algorithm is proposed to learn the decision-making policy. Numerical results demonstrate efficient real-time navigation in various scenarios.
Unmanned aerial vehicles (UAVs) are expected to be an integral part of wireless networks, and determining collision-free trajectory in multi-UAV noncooperative scenarios while collecting data from distributed Internet of Things (IoT) nodes is a challenging task. In this article, we consider a path-planning optimization problem to maximize the collected data from multiple IoT nodes under realistic constraints. The considered multi-UAV noncooperative scenarios involve a random number of other UAVs in addition to the typical UAV, and UAVs do not communicate or share information among each other. We translate the problem into a Markov decision process (MDP) with parameterized states, permissible actions, and detailed reward functions. Dueling double deep Q-network (D3QN) is proposed to learn the decision-making policy for the typical UAV, without any prior knowledge of the environment (e.g., channel propagation model and locations of the obstacles) and other UAVs (e.g., their missions, movements, and policies). The proposed algorithm can adapt to various missions in various scenarios, e.g., different numbers and positions of IoT nodes, different amount of data to be collected, and different numbers and positions of other UAVs. Numerical results demonstrate that real-time navigation can be efficiently performed with high success rate, high data collection rate, and low collision rate.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.8
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据