4.7 Article

Deep Reinforcement Learning Based End-to-End Multiuser Channel Prediction and Beamforming

期刊

IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS
卷 21, 期 12, 页码 10271-10285

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TWC.2022.3183255

关键词

Deep reinforcement learning; channel prediction; beamforming; physical layer

资金

  1. National Key Research and Development Program of China [2021YFA1003300]
  2. Research Grants Council [16213119]

向作者/读者索取更多资源

This paper proposes reinforcement learning based end-to-end channel prediction and beamforming algorithms for multi-user downlink systems, achieving autonomous learning and performance optimization without perfect channel state information. Empirical simulations and complexity analysis verify the effectiveness and superiority of the algorithms.
In this paper, reinforcement learning (RL) based end-to-end channel prediction (CP) and beamforming (BF) algorithms are proposed for multi-user downlink system. Different from the previous methods which either require perfect channel state information (CSI), or estimate outdated CSI and set constraints on pilot sequences, the proposed algorithms have no such premised assumptions or constraints. Firstly, RL is considered in channel prediction and the actor-critic aided CP algorithm is proposed at the base station (BS). With the received pilot signals and partial feedback information, the actor network at BS directly outputs the predicted downlink CSI without channel reciprocity. After obtaining the CSI, BS generates the beamforming matrix using zero-forcing (ZF). Secondly, we further develop a deep RL based two-layer architecture for joint CP and BF design. The first layer predicts the downlink CSI with the similar actor network as in the CP algorithm. Then, by importing the outputs of the first layer as inputs, the second layer is the actor-critic based beamforming layer, which can autonomously learn the beamforming policy with the objective of maximizing the transmission sum rate. Since the learning state and action spaces in the considered CP and BF problems are continuous, we employ the actor-critic method to deal with the continuous outputs. Empirical numerical simulations and the complexity analysis verify that the proposed end-to-end algorithms could always converge to stable states under different channel statistics and scenarios, and can beat the existing traditional and learning based benchmarks, in terms of transmission sum rate.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据