4.8 Article

Deep Reinforcement Learning for Aerial Data Collection in Hybrid-Powered NOMA-IoT Networks

期刊

IEEE INTERNET OF THINGS JOURNAL
卷 10, 期 2, 页码 1761-1774

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/JIOT.2022.3209980

关键词

Autonomous aerial vehicles; NOMA; Resource management; Internet of Things; Wireless communication; Trajectory; Throughput; 3-D trajectory design; deep reinforcement learning (DRL); nonorthogonal multiple access (NOMA); time allocation; unmanned aerial vehicle (UAV); wireless-powered communication (WPC)

向作者/读者索取更多资源

With the help of UAV, remote terminals out of wireless coverage can be connected to IoT networks. The hybrid-powered networks composed of WPC terminal and solar-powered UAV can solve the energy supply problem, and NOMA technique can solve the massive access problem of IoT terminals. A DRL algorithm called FC-TDTA is developed to maximize fair network throughput by jointly considering energy limitation, QoS requirements, and flight conditions. Simulation results demonstrate the superiority of the proposed algorithm in fair throughput maximization.
With the help of unmanned aerial vehicle (UAV), remote terminals that out of wireless coverage can be connected to the Internet of Things (IoT) networks. Currently, the IoT relies on a large number of low-cost wireless sensors with limited energy supply to realize ubiquitous monitoring and intelligent control. The hybrid-powered networks composed of wireless-powered communication (WPC) terminal and solar-powered UAV can solve the energy supply problem of the IoT networks, and the nonorthogonal multiple access (NOMA) technique can solve the massive access problem of IoT terminals. Exploiting these benefits, we investigate joint UAV 3-D trajectory design and time allocation for aerial data collection in hybrid-powered NOMA-IoT networks. To maximize the total fair network throughput, we jointly consider energy limitation, Quality of Service (QoS) requirements, and flight conditions. The problem is nonconvex and time-dimension coupled which is intractable to solve by traditional optimization methods. Therefore, we develop a deep reinforcement learning (DRL) algorithm called fair communication is accomplished by trajectory design and time allocation (FC-TDTA), which uses the deep deterministic policy gradient (DDPG) as its basis. Simulation results show that our proposed algorithm performs better than benchmarks in fair throughput maximization. The proposed FC-TDTA algorithm can make the UAV: 1) fly in appropriate direction and speed, so that the UAV can arrive at the charging station before the energy runs out and 2) conduct WPC energy transmission and data collection to achieve fair communication.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.8
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据