4.8 Article

Deep-Reinforcement-Learning-Based Mode Selection and Resource Allocation for Cellular V2X Communications

Journal

IEEE INTERNET OF THINGS JOURNAL
Volume 7, Issue 7, Pages 6380-6391

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/JIOT.2019.2962715

Keywords

Resource management; Vehicle-to-everything; Reliability; Clustering algorithms; Quality of service; Interference; Reinforcement learning; Cellular vehicle-to-everything (V2X) communications; deep reinforcement learning (DRL); mode selection; resource allocation

Funding

  1. National Natural Science Foundation of China [61921003, 61925101, 61831002, 61901044]
  2. State Major Science and Technology Special Project [2018ZX03001025]
  3. HUAWEI Technical Cooperative Project
  4. National Program for Special Support of Eminent Professionals

Ask authors/readers for more resources

Cellular vehicle-to-everything (V2X) communication is crucial to support future diverse vehicular applications. However, for safety-critical applications, unstable vehicle-to-vehicle (V2V) links, and high signaling overhead of centralized resource allocation approaches become bottlenecks. In this article, we investigate a joint optimization problem of transmission mode selection and resource allocation for cellular V2X communications. In particular, the problem is formulated as a Markov decision process, and a deep reinforcement learning (DRL)-based decentralized algorithm is proposed to maximize the sum capacity of vehicle-to-infrastructure users while meeting the latency and reliability requirements of V2V pairs. Moreover, considering training limitation of local DRL models, a two-timescale federated DRL algorithm is developed to help obtain robust models. Wherein, the graph theory-based vehicle clustering algorithm is executed on a large timescale and in turn, the federated learning algorithm is conducted on a small timescale. The simulation results show that the proposed DRL-based algorithm outperforms other decentralized baselines, and validate the superiority of the two-timescale federated DRL algorithm for newly activated V2V pairs.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.8
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available