4.8 Article

AoI-Energy-Aware UAV-Assisted Data Collection for IoT Networks: A Deep Reinforcement Learning Method

Journal

IEEE INTERNET OF THINGS JOURNAL
Volume 8, Issue 24, Pages 17275-17289

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/JIOT.2021.3078701

Keywords

Age of Information (AoI); data collection; deep reinforcement learning (RL); energy efficiency; unmanned aerial vehicle (UAV) trajectory planning

Funding

  1. National Key R&D Program of China [2020YFB1806905]
  2. National Natural Science Foundation of China [61871045, 61801045]
  3. BUPT Excellent Ph.D.
  4. Students Foundation [CX2020213]
  5. China Scholarship Council

Ask authors/readers for more resources

In this study, a new UAV-assisted IoT network data collection scheme is proposed to minimize energy consumption and Age of Information (AoI) by considering the temporal value of data. By optimizing UAV flight speed, hovering locations, and bandwidth allocation, a UAV trajectory planning algorithm based on deep neural network is introduced, showing superior performance in simulation results.
Thanks to the inherent characteristics of flexible mobility and autonomous operation, unmanned aerial vehicles (UAVs) will inevitably be integrated into 5G/B5G cellular networks to assist remote sensing for real-time assessment and monitoring applications. Most existing UAV-assisted data collection schemes focus on optimizing energy consumption and data collection throughput, which overlook the temporal value of collected data. In this article, we employ Age of Information (AoI) as a performance metric to quantify the temporal correlation among data packets consecutively sampled by the Internet of Things (IoT) devices, and investigate an AoI-energy-aware data collection scheme for UAV-assisted IoT networks. We aim to minimize the weighted sum of expected average AoI, propulsion energy of UAV, and the transmission energy at IoT devices, by jointly optimizing the UAV flight speed, hovering locations, and bandwidth allocation for data collection. Considering the system dynamics, the optimization problem is modeled as a Markov decision process. To cope with the multidimensional action space, we develop a twin-delayed deep deterministic (TD3) policy gradient-based UAV trajectory planning algorithm (TD3-AUTP) by introducing the deep neural network (DNN) for feature extraction. Through simulation results, we demonstrate that our proposed scheme outperforms the deep Q-network and actorcritic-based algorithms in terms of achievable AoI and energy efficiency.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.8
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available