4.8 Article

Deep Reinforcement Learning Resource Allocation in Wireless Sensor Networks With Energy Harvesting and Relay

期刊

IEEE INTERNET OF THINGS JOURNAL
卷 9, 期 3, 页码 2330-2345

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/JIOT.2021.3094465

关键词

Wireless sensor networks; Relays; Wireless communication; Resource management; Throughput; Radio frequency; Energy harvesting; Amplify-and-forward (AF); deep reinforcement learning (DRL); energy cooperation; energy harvesting; through-put maximization; wireless sensor network (WSN)

资金

  1. National Natural Science Foundation of China [61571209]

向作者/读者索取更多资源

This paper proposes a wireless sensor network composed of several local subnetworks with amplified forwarding relay and specially designed working time cycle. The authors use deep reinforcement learning to develop resource allocation policies for maximizing throughput. Simulation results show that the proposed policies can significantly improve network performance.
Green wireless communications have been extensively studied in wireless sensor networks (WSNs), including the use of new energy, renewable energy, and low-power consumption and energy-saving technologies for years. In these networks, due to channel fading, insufficient and random energy arrival, some possible bad deployment of sensors, etc., the communication among sensor nodes in a WSNs will inevitably be affected or even interrupted sometimes, which may result in unacceptable performance in the entire network. In order to solve this problem, we propose a WSN composing of several local subnetworks with amplified forwarding relay and specially designed working time cycle. In this network, we study our resource allocation policies to manage both power and time for throughput maximization. We use deep reinforcement learning (DRL) to develop our resource allocation policies under the model constructed as a Markov decision process for this optimization problem in the subnetwork. We apply an actor-critic strategy to find our optimal solution in continuous state and action space and adaptively achieve maximum throughput of this network based on energy harvesting, causal information of battery state and channel gains. The simulation results demonstrate that the proposed transmission policies can produce higher throughput in the local network and finally improve overall system performance in comparison with greedy policy, random policy, and conservative policy.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.8
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据