4.7 Article

Deep Reinforcement Learning-Based Power Allocation for Rate-Splitting Multiple Access in 6G LEO Satellite Communication System

期刊

IEEE WIRELESS COMMUNICATIONS LETTERS
卷 11, 期 10, 页码 2185-2189

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/LWC.2022.3196408

关键词

Low earth orbit satellites; Resource management; Satellites; Optimization; Precoding; Interference; Approximation algorithms; Rate-splitting multiple access; LEO satellite communication; deep reinforcement learning; proximal policy optimization (PPO)

资金

  1. National Natural Science Foundation of China [61801035]
  2. CCF-Baidu Open Fund
  3. 2022 BUPT Master Education Project [2022ZY072, 2022ZY075]

向作者/读者索取更多资源

This letter investigates the power allocation problem in LEO satellite networks with rate-splitting multiple access (RSMA) mechanism based on deep reinforcement learning. A highly-effective proximal policy optimization (PPO) scheme is proposed to tackle the challenge of uncertain and limited channel distribution information. Simulation results demonstrate the superiority of the proposed scheme in terms of the sum rate metric with low computation complexity.
Rate-splitting multiple access (RSMA) softly reconciles and decodes the extreme interference by non-orthogonal transmission, which can remarkably solve the spectrum scarcity for future six-generation (6G) low earth orbits (LEO) satellite communication system. In this letter, we investigate the power allocation problem in LEO satellite networks with RSMA mechanism based on the deep reinforcement learning (DRL) technique. Specifically, in order to achieve better RSMA performance, the LEO satellite base station (SBS) has to effectively allocate transmit power to common and private streams, which is very challenging due to the uncertain and limited information of the channel distribution. To solve this problem, a highly-effective proximal policy optimization (PPO) based scheme is further proposed, which enables the LEO SBS to learn an optimal power allocation strategy to maximize the sum rate of the system without knowing any prior information. Simulation results prove that the proposed scheme significantly outperforms the other three baseline schemes in terms of the sum rate metric with low computation complexity.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据