4.7 Article

Path Planning for Cellular-Connected UAV: A DRL Solution With Quantum-Inspired Experience Replay

期刊

IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS
卷 21, 期 10, 页码 7897-7912

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TWC.2022.3162749

关键词

Autonomous aerial vehicles; Wireless communication; Optimization; Navigation; Trajectory; Antenna radiation patterns; Reinforcement learning; Drone; trajectory design; deep reinforcement learning; quantum-inspired experience replay

资金

  1. China Scholarship Council [CSC201908350102]
  2. King's College London [CSC201908350102]

向作者/读者索取更多资源

This paper considers a minimization problem in a cellular-connected unmanned aerial vehicle (UAV) network. The problem involves both the time cost and expected outage duration. A UAV navigation approach is proposed to optimize these factors by utilizing the UAV's adjustable mobility. The effectiveness of the proposed approach is demonstrated through numerical results.
In cellular-connected unmanned aerial vehicle (UAV) network, a minimization problem on the weighted sum of time cost and expected outage duration is considered. Taking advantage of UAV's adjustable mobility, a UAV navigation approach is formulated to achieve the aforementioned optimization goal. Conventional offline optimization techniques suffer from inefficiency in accomplishing the formulated UAV navigation task due to the practical consideration of local building distribution and directional antenna radiation pattern. Alternatively, after mapping the navigation task into a Markov decision process (MDP), a deep reinforcement learning (DRL)-aided solution is proposed to help the UAV find the optimal flying direction within each time slot, and thus the designed trajectory towards the destination can be generated. To help the DRL agent commit a better trade-off between sampling priority and diversity, a novel quantum-inspired experience replay (QiER) framework is proposed, via relating experienced transition's importance to its associated quantum bit (qubit) and applying Grover iteration based amplitude amplification technique. Compared to several representative DRL-related and non-learning baselines, the effectiveness and supremacy of the proposed DRL-QiER solution are demonstrated and validated in numerical results.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据