4.8 Article

Deep Reinforcement Learning Resource Allocation in Wireless Sensor Networks With Energy Harvesting and Relay

Journal

IEEE INTERNET OF THINGS JOURNAL
Volume 9, Issue 3, Pages 2330-2345

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/JIOT.2021.3094465

Keywords

Wireless sensor networks; Relays; Wireless communication; Resource management; Throughput; Radio frequency; Energy harvesting; Amplify-and-forward (AF); deep reinforcement learning (DRL); energy cooperation; energy harvesting; through-put maximization; wireless sensor network (WSN)

Funding

  1. National Natural Science Foundation of China [61571209]

Ask authors/readers for more resources

This paper proposes a wireless sensor network composed of several local subnetworks with amplified forwarding relay and specially designed working time cycle. The authors use deep reinforcement learning to develop resource allocation policies for maximizing throughput. Simulation results show that the proposed policies can significantly improve network performance.
Green wireless communications have been extensively studied in wireless sensor networks (WSNs), including the use of new energy, renewable energy, and low-power consumption and energy-saving technologies for years. In these networks, due to channel fading, insufficient and random energy arrival, some possible bad deployment of sensors, etc., the communication among sensor nodes in a WSNs will inevitably be affected or even interrupted sometimes, which may result in unacceptable performance in the entire network. In order to solve this problem, we propose a WSN composing of several local subnetworks with amplified forwarding relay and specially designed working time cycle. In this network, we study our resource allocation policies to manage both power and time for throughput maximization. We use deep reinforcement learning (DRL) to develop our resource allocation policies under the model constructed as a Markov decision process for this optimization problem in the subnetwork. We apply an actor-critic strategy to find our optimal solution in continuous state and action space and adaptively achieve maximum throughput of this network based on energy harvesting, causal information of battery state and channel gains. The simulation results demonstrate that the proposed transmission policies can produce higher throughput in the local network and finally improve overall system performance in comparison with greedy policy, random policy, and conservative policy.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.8
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available