4.8 Article

Piecewise-DRL: Joint Beamforming Optimization for RIS-Assisted MU-MISO Communication System

期刊

IEEE INTERNET OF THINGS JOURNAL
卷 10, 期 19, 页码 17323-17337

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/JIOT.2023.3275818

关键词

Deep reinforcement learning (DRL); Internet of Things (IoT); multiuser multiple-input-single-output (MU-MISO); nonconvex optimization; reconfigurable intelligent sur-face (RIS)

向作者/读者索取更多资源

This article introduces a method of creating virtual Line-of-Sight channels in an IoT network using reconfigurable intelligent surfaces (RIS). It aims to maximize the sum-rate of communication systems by jointly optimizing the phase shift matrix of the RIS and transmit beamforming. A piecewise-deep reinforcement learning algorithm is proposed to solve the nonconvex problem, which avoids local optima and achieves faster convergence.
With the widespread connectivity of everyday devices realized by the advent of the Internet of Things (IoT), communication between users of different devices has become increasingly close. In practical scenarios, obstacles present between the transceiver may cause a deterioration in the quality of the received signals. Therefore, the reconfigurable intelligent surface (RIS) is employed to create virtual Line-of-Sight (LoS) channels in an IoT network. Specifically, this article aims at maximizing the sum-rate of the RIS-assisted multiuser multiple-input-single-output (MU-MISO) communication systems by jointly optimizing the phase shift matrix of the RIS and transmit beamforming. To solve the formulated nonconvex problem, a piecewise-deep reinforcement learning (DRL) algorithm is proposed in this article. Unlike the existing alternative optimization (AO) algorithms, the proposed algorithm avoids falling into the local optimal by using an exploration mechanism. Moreover, piecewise-DRL can reduce the action dimension, allowing the algorithm to obtain faster convergence. Simultaneously, this algorithm also ensures that the parameters of the two-part networks are updated to generate a larger system sum-rate by unsupervised joint optimization. Simulations in various circumstances reveal that the proposed approach is more robust and presents better stability and faster convergence than previous state-of-the-art algorithms while obtaining competitive performance.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.8
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据