4.8 Article

AoI-Energy-Aware UAV-Assisted Data Collection for IoT Networks: A Deep Reinforcement Learning Method

期刊

IEEE INTERNET OF THINGS JOURNAL
卷 8, 期 24, 页码 17275-17289

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/JIOT.2021.3078701

关键词

Age of Information (AoI); data collection; deep reinforcement learning (RL); energy efficiency; unmanned aerial vehicle (UAV) trajectory planning

资金

  1. National Key R&D Program of China [2020YFB1806905]
  2. National Natural Science Foundation of China [61871045, 61801045]
  3. BUPT Excellent Ph.D.
  4. Students Foundation [CX2020213]
  5. China Scholarship Council

向作者/读者索取更多资源

In this study, a new UAV-assisted IoT network data collection scheme is proposed to minimize energy consumption and Age of Information (AoI) by considering the temporal value of data. By optimizing UAV flight speed, hovering locations, and bandwidth allocation, a UAV trajectory planning algorithm based on deep neural network is introduced, showing superior performance in simulation results.
Thanks to the inherent characteristics of flexible mobility and autonomous operation, unmanned aerial vehicles (UAVs) will inevitably be integrated into 5G/B5G cellular networks to assist remote sensing for real-time assessment and monitoring applications. Most existing UAV-assisted data collection schemes focus on optimizing energy consumption and data collection throughput, which overlook the temporal value of collected data. In this article, we employ Age of Information (AoI) as a performance metric to quantify the temporal correlation among data packets consecutively sampled by the Internet of Things (IoT) devices, and investigate an AoI-energy-aware data collection scheme for UAV-assisted IoT networks. We aim to minimize the weighted sum of expected average AoI, propulsion energy of UAV, and the transmission energy at IoT devices, by jointly optimizing the UAV flight speed, hovering locations, and bandwidth allocation for data collection. Considering the system dynamics, the optimization problem is modeled as a Markov decision process. To cope with the multidimensional action space, we develop a twin-delayed deep deterministic (TD3) policy gradient-based UAV trajectory planning algorithm (TD3-AUTP) by introducing the deep neural network (DNN) for feature extraction. Through simulation results, we demonstrate that our proposed scheme outperforms the deep Q-network and actorcritic-based algorithms in terms of achievable AoI and energy efficiency.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.8
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据