期刊
WIRELESS NETWORKS
卷 26, 期 7, 页码 5163-5176出版社
SPRINGER
DOI: 10.1007/s11276-020-02386-0
关键词
D2D communication; Energy harvesting; Resource allocation; Reinforcement learning; Artificial network
资金
- National Natural Science Foundation of China [61971239, 61631020]
In this paper, we consider an energy harvesting device-to-device (D2D) communication system, where D2D transmitter can use modeAto directly communicate with D2D receiver or use modeBas a relay to assist cellular communication while communicating with D2D receiver by adopting non-orthogonal multiple access technology. Firstly, the outage probability expression in two modes is derived, and the communication mode is determined according to outage performance. Then, assuming that the full system information is available, the channel allocation and relay selection are completed by Kuhn-Munkres algorithm, and the offline power allocation of D2D users is realized by reinforcement learning. Next, the offline optimization results are taken as the training data set to train the neural network, and the optimal model of the transmission power is obtained. Considering the transmission power constraint, the online power allocation optimization algorithm is further proposed. Numerical results demonstrate the accuracy of derived outage probability, and the proposed resource allocation algorithm can improve the performance of hybrid system.
作者
我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。
推荐
暂无数据