4.7 Article

Adaptive switching for communication profiles in underwater acoustic modems based on reinforcement learning

期刊

APPLIED ACOUSTICS
卷 210, 期 -, 页码 -

出版社

ELSEVIER SCI LTD
DOI: 10.1016/j.apacoust.2023.109430

关键词

Underwater acoustic communications; Adaptive communications; Reinforcement learning; DDPG

向作者/读者索取更多资源

This work investigates the adaptation of communication strategies to the dynamics of the underwater acoustic (UWA) channel. Three communication strategies, including Frequency-Hopped Binary Frequency Shift Keying (FH-BFSK), Single-Carrier (SC) communication, and multicarrier communication, are considered. A reinforcement learning algorithm, the Deep Deterministic Policy Gradient (DDPG) method with a Gumbel-softmax scheme, is employed for intelligent and adaptive switching among these strategies. Simulation and experimental results demonstrate that the proposed method outperforms random selection and direct feedback methods in time-varying channels.
The underwater acoustic (UWA) channel is a complex and stochastic process with large spatial and temporal dynamics. This work studies the adaptation of the communication strategy to channel dynamics. Specifically, a set of communication strategies are considered, including Frequency-Hopped Binary Frequency Shift Keying (FH-BFSK), Single-Carrier (SC) communication, and multicarrier communication. Based on the channel condition, a reinforcement learning (RL) algorithm, the Deep Deterministic Policy Gradient (DDPG) method along with a Gumbel-softmax scheme is employed for intelligent and adaptive switching among those communication strategies. The adaptive switching is performed on a transmission block-by-block basis, with the goal of maximizing long-term system performance. The reward function is defined based on the energy efficiency (EE) and the spectral efficiency (SE) of the communication strategies. Simulation results and experimental data processing results reveal that the proposed method outperforms a random selection method and a direct feedback method in time-varying channels.(c) 2023 Elsevier Ltd. All rights reserved.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据