3.8 Proceedings Paper

A Q-learning based Resource Allocation Algorithm for D2D-Unlicensed communications

Publisher

IEEE
DOI: 10.1109/VTC2021-Spring51267.2021.9448909

Keywords

D2D-U system; WiFi system; power allocation; duty cycle allocation; fairness

Funding

  1. Nation major special project [2018zx0301016]
  2. NSFC [62071077]
  3. ZBYY [61400020109]
  4. Chongqing BCEP [cstc2018jcyjAX0507, cstc2017jcyjBX0005]

Ask authors/readers for more resources

With the spectrum resources licensed to mobile operators becoming scarce, Device-to-Device (D2D) communication in unlicensed frequency bands is proposed. This paper introduces a Q-learning based resource allocation algorithm that maximizes total throughput and fairness while ensuring satisfactory SNR for cellular users.
The spectrum resources licensed to the mobile operators become increasingly scarce because of the explosive growth of the mobile traffic. Device-to-Device (D2D) communication is thus proposed to be deployed in unlicensed frequency bands, i.e. D2D-Unlicensed (D2D-U). The fixed duty cycle method is generally adopted in the coexistence scenario of D2D and WiFi, which may lead to unfair unlicensed spectrum usage since it cannot adapt the data traffic change. Therefore, a Q-learning (QL) based resource allocation algorithm for D2D-U is proposed in this paper. In the algorithm, the considered cellular base station acts as the agent. The actions of agent are defined as the different combinations of the transmission power and the duty cycle of D2D-U users, and the states of agent are defined as the different combinations of the total throughput, fairness and signal-to-noise ratio (SNR) of cellular users. Based on the proposed QL framework, the agent can always learn the optimal power allocation and duty cycle by interacting with the environment, which can maximize the total throughput and fairness while ensuring the satisfactory SNR of cellular users. The simulation results show that the proposed algorithm can obtain the largest throughput and the best fairness while ensuring the satisfactory SNR of LTE-U users among all traditional algorithms.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

3.8
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available