☆ 3.8 Proceedings Paper

A Q-learning based Resource Allocation Algorithm for D2D-Unlicensed communications

2021 IEEE 93RD VEHICULAR TECHNOLOGY CONFERENCE (VTC2021-SPRING) (2021)

Journal

2021 IEEE 93RD VEHICULAR TECHNOLOGY CONFERENCE (VTC2021-SPRING)

Volume -, Issue -, Pages -

Publisher

IEEE

DOI: 10.1109/VTC2021-Spring51267.2021.9448909

Keywords

D2D-U system; WiFi system; power allocation; duty cycle allocation; fairness

Funding

Nation major special project [2018zx0301016]
NSFC [62071077]
ZBYY [61400020109]
Chongqing BCEP [cstc2018jcyjAX0507, cstc2017jcyjBX0005]

Ask authors/readers for more resources

Protocol

Community support

Reagent

Community support

Automated Summary New
Abstract

With the spectrum resources licensed to mobile operators becoming scarce, Device-to-Device (D2D) communication in unlicensed frequency bands is proposed. This paper introduces a Q-learning based resource allocation algorithm that maximizes total throughput and fairness while ensuring satisfactory SNR for cellular users.

The spectrum resources licensed to the mobile operators become increasingly scarce because of the explosive growth of the mobile traffic. Device-to-Device (D2D) communication is thus proposed to be deployed in unlicensed frequency bands, i.e. D2D-Unlicensed (D2D-U). The fixed duty cycle method is generally adopted in the coexistence scenario of D2D and WiFi, which may lead to unfair unlicensed spectrum usage since it cannot adapt the data traffic change. Therefore, a Q-learning (QL) based resource allocation algorithm for D2D-U is proposed in this paper. In the algorithm, the considered cellular base station acts as the agent. The actions of agent are defined as the different combinations of the transmission power and the duty cycle of D2D-U users, and the states of agent are defined as the different combinations of the total throughput, fairness and signal-to-noise ratio (SNR) of cellular users. Based on the proposed QL framework, the agent can always learn the optimal power allocation and duty cycle by interacting with the environment, which can maximize the total throughput and fairness while ensuring the satisfactory SNR of cellular users. The simulation results show that the proposed algorithm can obtain the largest throughput and the best fairness while ensuring the satisfactory SNR of LTE-U users among all traditional algorithms.

A Q-learning based Resource Allocation Algorithm for D2D-Unlicensed communications

Journal

2021 IEEE 93RD VEHICULAR TECHNOLOGY CONFERENCE (VTC2021-SPRING)

Publisher

IEEE

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

A Q-learning based Resource Allocation Algorithm for D2D-Unlicensed communications

Journal

2021 IEEE 93RD VEHICULAR TECHNOLOGY CONFERENCE (VTC2021-SPRING)

Publisher

IEEE

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Export Citation

Share Paper