4.8 Article

Joint Speed Control and Energy Replenishment Optimization for UAV-Assisted IoT Data Collection With Deep Reinforcement Transfer Learning

期刊

IEEE INTERNET OF THINGS JOURNAL
卷 10, 期 7, 页码 5778-5793

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/JIOT.2022.3151201

关键词

Data collection; Transfer learning; Batteries; Internet of Things; Q-learning; Task analysis; Velocity control; Deep reinforcement learning (DRL); Internet of Things (IoT) data collection; Markov decision process (MDP); transfer learning (TL); unmanned aerial vehicle (UAV)

向作者/读者索取更多资源

This article introduces a novel framework that jointly optimizes the flying speed and energy replenishment for each UAV to significantly improve the overall system performance.
Unmanned-aerial-vehicle (UAV)-assisted data collection has been emerging as a prominent application due to its flexibility, mobility, and low operational cost. However, under the dynamic and uncertainty of Internet of Things data collection and energy replenishment processes, optimizing the performance for UAV collectors is a very challenging task. Thus, this article intro-duces a novel framework that jointly optimizes the flying speed and energy replenishment for each UAV to significantly improve the overall system performance (e.g., data collection and energy usage efficiency). Specifically, we first develop a Markov decision process to help the UAV automatically and dynamically make optimal decisions under the dynamics and uncertainties of the environment. Although traditional reinforcement learning algo-rithms, such as Q-learning and deep Q-learning, can help the UAV to obtain the optimal policy, they often take a long time to con-verge and require high computational complexity. Therefore, it is impractical to deploy these conventional methods on UAVs with limited computing capacity and energy resource. To that end, we develop advanced transfer learning techniques that allow UAVs to share and transfer learning knowledge, thereby reducing the learning time as well as significantly improving learning quality. Extensive simulations demonstrate that our proposed solution can improve the average data collection performance of the system up to 200% and reduce the convergence time up to 50% compared with those of conventional methods.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.8
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据