Article

An Adaptive Asynchronous Wake-Up Scheme for Underwater Acoustic Sensor Networks Using Deep Reinforcement Learning

Journal

IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY
Volume 70, Issue 2, Pages 1851-1865

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TVT.2021.3055065

Keywords

Reinforcement learning; Delays; Wireless sensor networks; Protocols; Internet of Things; Synchronization; Energy consumption; The internet of underwater things (IoUT); deep reinforcement learning; asynchronous wake-up scheme; cyclic difference set (CDS)

Funding

  1. Natural Science Foundation of Jiangsu Province [BK20190733]
  2. NUPTSF [NY219166]
  3. National Natural Science Foundation of China [61872423]
  4. Natural Sciences and Engineering Research Council (NSERC) of Canada [RGPIN-2018-03792]
  5. InnovateNL SensorTECH [5404-2061-101]


The paper addresses policy selection for sensor nodes in underwater acoustic sensor networks and proposes an adaptive asynchronous wake-up scheme, based on deep reinforcement learning and LSTM networks, to improve energy efficiency and network performance.

Underwater acoustic sensor networks (UWSNs), acting as a reliable and efficient infrastructure of the Internet of underwater things (IoUT), have attracted much research interest in recent years due to the wide range of their potential marine applications. The limited energy supply of underwater sensor nodes is a significant challenge that can be mitigated by the cyclic difference set (CDS)-based coordination asynchronous wake-up scheme. However, the CDS-based asynchronous wake-up scheme also introduces long neighbor-discovery delays, which increase packet delay and shorten the network lifetime. In this paper, we formulate the problem of policy selection for idle listening as a Markov decision process and exploit the framework of deep reinforcement learning to obtain the optimal policies of underwater sensor nodes. Furthermore, long short-term memory (LSTM) networks are used to estimate the network traffic features, which improves the performance of the proposed adaptive asynchronous wake-up scheme. To verify the performance of the proposed scheme, simulations are conducted in different network scenarios, comparing it against random policies, fixed-metric policies, and the original CDS-based asynchronous wake-up scheme.
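
As background for the CDS-based wake-up idea mentioned in the abstract, the following is a minimal sketch of why a cyclic difference set guarantees that two unsynchronized nodes share an awake slot. It is not the paper's implementation. The (7, 3, 1) set {1, 2, 4}, the function names, and the assumption that slot boundaries are aligned (only the cycle offsets differ) are illustrative choices that do not come from the source.

```python
from itertools import product


def is_cyclic_difference_set(active_slots, cycle_length, lam=1):
    """Return True if every nonzero residue mod cycle_length occurs
    exactly `lam` times as a difference of two distinct active slots."""
    counts = [0] * cycle_length
    for a, b in product(active_slots, repeat=2):
        if a != b:
            counts[(a - b) % cycle_length] += 1
    return all(c == lam for c in counts[1:])


def shared_awake_slots(active_slots, cycle_length, offset_a, offset_b):
    """Slots (mod cycle_length) in which two nodes running the same
    schedule with different clock offsets are both awake.
    Assumption: slot boundaries are aligned; only the offsets differ."""
    awake_a = {(s + offset_a) % cycle_length for s in active_slots}
    awake_b = {(s + offset_b) % cycle_length for s in active_slots}
    return awake_a & awake_b


# Illustrative parameters (hypothetical, not from the paper):
# a cycle of 7 slots with the node awake in slots 1, 2 and 4.
CYCLE_LENGTH = 7
ACTIVE_SLOTS = frozenset({1, 2, 4})

if __name__ == "__main__":
    print("valid CDS:", is_cyclic_difference_set(ACTIVE_SLOTS, CYCLE_LENGTH))
    print("duty cycle:", len(ACTIVE_SLOTS) / CYCLE_LENGTH)
    for shift in range(1, CYCLE_LENGTH):
        common = shared_awake_slots(ACTIVE_SLOTS, CYCLE_LENGTH, 0, shift)
        print(f"relative shift {shift}: shared awake slots {sorted(common)}")
```

In this toy schedule each node is awake for 3 of 7 slots, and any two nodes with different offsets overlap in exactly one slot per cycle. That single guaranteed overlap per cycle is what bounds the neighbor-discovery delay, and it also shows why the delay grows as the cycle gets longer and the duty cycle gets lower.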
