4.7 Article

Multi-agent reinforcement learning for edge information sharing in vehicular networks

Journal

DIGITAL COMMUNICATIONS AND NETWORKS
Volume 8, Issue 3, Pages 267-277

Publisher

KEAI PUBLISHING LTD
DOI: 10.1016/j.dcan.2021.08.006

Keywords

Vehicular networks; Edge information sharing; Delay guarantee; Multi-agent reinforcement learning; Proximal policy optimization

Funding

  1. National Natural Science Foundation of China [61901078, 61771082, 61871062, U20A20157]
  2. Science and Technology Research Program of Chongqing Municipal Education Commission [KJQN201900609]
  3. Natural Science Foundation of Chongqing [cstc2020jcyj-zdxmX0024]
  4. University Innovation Research Group of Chongqing [CXQT20017]
  5. China University Industry-University Research Collaborative Innovation Fund (Future Network Innovation Research and Application Project) [2021FNA04008]

Ask authors/readers for more resources

This paper investigates the design of a comprehensive cooperative policy to guarantee the heterogeneous delay requirements of both Vehicle to Infrastructure (V2I) and Vehicle to Vehicle (V2V) links. It formulates mean delay minimization and maximum individual delay minimization problems to improve the global network performance and ensure fairness for individual users. A multi-agent reinforcement learning framework is used to solve these problems, and a proximal policy optimization approach is proposed. Simulation experiments validate the effectiveness of the proposed approach.
To guarantee the heterogeneous delay requirements of the diverse vehicular services, it is necessary to design a full cooperative policy for both Vehicle to Infrastructure (V2I) and Vehicle to Vehicle (V2V) links. This paper investigates the reduction of the delay in edge information sharing for V2V links while satisfying the delay requirements of the V2I links. Specifically, a mean delay minimization problem and a maximum individual delay minimization problem are formulated to improve the global network performance and ensure the fairness of a single user, respectively. A multi-agent reinforcement learning framework is designed to solve these two problems, where a new reward function is proposed to evaluate the utilities of the two optimization objectives in a unified framework. Thereafter, a proximal policy optimization approach is proposed to enable each V2V user to learn its policy using the shared global network reward. The effectiveness of the proposed approach is finally validated by comparing the obtained results with those of the other baseline approaches through extensive simulation experiments.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available