4.7 Article

Age of Information Aware Radio Resource Management in Vehicular Networks: A Proactive Deep Reinforcement Learning Perspective

Journal

IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS
Volume 19, Issue 4, Pages 2268-2281

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TWC.2019.2963667

Keywords

Vehicular communications; multi-user resource scheduling; Markov decision process; long short-term memory; deep reinforcement learning; Q-function decomposition

Funding

  1. Academy of Finland [319759, 319758, 289611]
  2. National Key Research and Development Program of China [2017YFB1301003]
  3. National Natural Science Foundation of China [61701439, 61731002]
  4. Zhejiang Key Research and Development Plan [2019C01002]
  5. Japan Society for the Promotion of Science (JSPS) KAKENHI [18KK0279, 18K18036, 19H04092]
  6. Telecommunications Advanced Foundation

Ask authors/readers for more resources

In this paper, we investigate the problem of age of information (AoI)-aware radio resource management for expected long-term performance optimization in a Manhattan grid vehicle-to-vehicle network. With the observation of global network state at each scheduling slot, the roadside unit (RSU) allocates the frequency bands and schedules packet transmissions for all vehicle user equipment-pairs (VUE-pairs). We model the stochastic decision-making procedure as a discrete-time single-agent Markov decision process (MDP). The technical challenges in solving the optimal control policy originate from high spatial mobility and temporally varying traffic information arrivals of the VUE-pairs. To make the problem solving tractable, we first decompose the original MDP into a series of per-VUE-pair MDPs. Then we propose a proactive algorithm based on long short-term memory and deep reinforcement learning techniques to address the partial observability and the curse of high dimensionality in local network state space faced by each VUE-pair. With the proposed algorithm, the RSU makes the optimal frequency band allocation and packet scheduling decision at each scheduling slot in a decentralized way in accordance with the partial observations of the global network state at the VUE-pairs. Numerical experiments validate the theoretical analysis and demonstrate the significant performance improvements from the proposed algorithm.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available