4.8 Article

Average AoI Minimization in UAV-Assisted Data Collection With RF Wireless Power Transfer: A Deep Reinforcement Learning Scheme

期刊

IEEE INTERNET OF THINGS JOURNAL
卷 9, 期 7, 页码 5216-5228

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/JIOT.2021.3110138

关键词

Age of Information (AoI); deep reinforcement learning (DRL); unmanned aerial vehicle (UAV); wireless power transmission

资金

  1. National Key Research and Development Program of China [2020YFB1806903]
  2. National Natural Science Foundation of China (NSFC) [62071033, U1834210]
  3. Fundamental Research Funds for the Central Universities [2020JBZD010]
  4. NSFC [92046026]
  5. National Natural Science Foundation of China
  6. Jiangsu Provincial Key Research and Development Program [BE2020001-3]
  7. Jiangsu Provincial Policy Guidance Program [BZ2020008]
  8. China Postdoctoral Science Foundation [BX2021031]

向作者/读者索取更多资源

This article studies the wireless powered network assisted by unmanned aerial vehicles (UAVs), where the UAV charges ground nodes (GNs) wirelessly and the GNs use their harvested energy to upload information to the UAV. An optimization problem is formulated to minimize the average Age of Information (AoI) of the GNs by optimizing the trajectory of the UAV and the scheduling of information transmission and energy harvesting of GNs. A deep Q network (DQN)-based scheme is proposed to find a near-optimal solution. Simulation results demonstrate the convergence of the proposed DQN scheme and its superiority in terms of average AoI compared to other known schemes.
This article studies the unmanned aerial vehicle (UAV)-assisted wireless powered network, where a UAV is dispatched to wirelessly charge multiple ground nodes (GNs) by using radio frequency (RF) energy transfer and then the GNs use their harvested energy to upload the sensed information to the UAV. At each moment, the UAV is scheduled to charge the GNs or only one GN is scheduled to upload its data. An optimization problem is formulated to minimize the average Age of Information (AoI) of the GNs by jointly optimizing the trajectory of the UAV and the scheduling of information transmission and energy harvesting of GNs. As the problem is a combinational optimization problem with a set of binary variables, it is difficult to be solved. Thus, it is modeled as a Markov problem with large state spaces and a deep Q network (DQN)-based scheme is proposed to find its near-optimal solution on the basis of the deep reinforcement learning (DRL) framework. Two nets are structured with artificial neural network (ANN), where one is for evaluating the reward of the action performed in current state, and the other is for predicting realistic action. The corresponding state spaces, the efficient action spaces, and reward function are designed. Simulation results demonstrate the convergence of the proposed DQN scheme, which also show that the proposed DQN scheme gets much smaller average AoI than the three other known schemes. Moreover, by involving the energy punishment in the reward, the UAV may save its energy but yield higher AoI. Additionally, the effects of the packet size, the transmit power, and the distribution area of GNs on the GNs' average AoI are also discussed, which are expected to provide some useful insights.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.8
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据