期刊
IEEE INTERNET OF THINGS JOURNAL
卷 10, 期 19, 页码 17323-17337出版社
IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/JIOT.2023.3275818
关键词
Deep reinforcement learning (DRL); Internet of Things (IoT); multiuser multiple-input-single-output (MU-MISO); nonconvex optimization; reconfigurable intelligent sur-face (RIS)
This article introduces a method of creating virtual Line-of-Sight channels in an IoT network using reconfigurable intelligent surfaces (RIS). It aims to maximize the sum-rate of communication systems by jointly optimizing the phase shift matrix of the RIS and transmit beamforming. A piecewise-deep reinforcement learning algorithm is proposed to solve the nonconvex problem, which avoids local optima and achieves faster convergence.
With the widespread connectivity of everyday devices realized by the advent of the Internet of Things (IoT), communication between users of different devices has become increasingly close. In practical scenarios, obstacles present between the transceiver may cause a deterioration in the quality of the received signals. Therefore, the reconfigurable intelligent surface (RIS) is employed to create virtual Line-of-Sight (LoS) channels in an IoT network. Specifically, this article aims at maximizing the sum-rate of the RIS-assisted multiuser multiple-input-single-output (MU-MISO) communication systems by jointly optimizing the phase shift matrix of the RIS and transmit beamforming. To solve the formulated nonconvex problem, a piecewise-deep reinforcement learning (DRL) algorithm is proposed in this article. Unlike the existing alternative optimization (AO) algorithms, the proposed algorithm avoids falling into the local optimal by using an exploration mechanism. Moreover, piecewise-DRL can reduce the action dimension, allowing the algorithm to obtain faster convergence. Simultaneously, this algorithm also ensures that the parameters of the two-part networks are updated to generate a larger system sum-rate by unsupervised joint optimization. Simulations in various circumstances reveal that the proposed approach is more robust and presents better stability and faster convergence than previous state-of-the-art algorithms while obtaining competitive performance.
作者
我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。
推荐
暂无数据