4.7 Article

Joint Control of Power, Beamwidth, and Spacing for Platoon-Based Vehicular Cyber-Physical Systems

Journal

IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY
Volume 71, Issue 8, Pages 8615-8629

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TVT.2022.3173923

Keywords

Quality of service; Autonomous vehicles; Interference; Vehicle-to-infrastructure; Fuels; Sensors; Reinforcement learning; Beamwidth control; spacing control; two-timescale reinforcement learning; vehicle platooning; vehicle-to-infrastructure communications

Funding

  1. National Natural Science Foundation of China (NSFC) [61971286]

Ask authors/readers for more resources

This paper investigates the optimization problem of minimizing IVD in VCPS to meet V2I QoS requirements, proposing a two-timescale DRL framework to control the platoon's beamforming and spacing. Research shows that the proposed algorithm can significantly improve traffic capacity, reduce air drag coefficient, while meeting V2I communication QoS requirements.
In a platoon-based vehicular cyber-physical system (VCPS), the traffic capacity and the fuel efficiency can be significantly improved by maintaining a small inter-vehicle distance (IVD). Meanwhile, the vehicle-to-infrastructure (V2I) communications provided by the roadside unit (RSU) are important for promoting the development of autonomous vehicles and reshaping the driving experience. However, as maintaining a small IVD inside the platoon may result in severe inter-beam interference, there is a tradeoff between the traffic capacity and the communication efficiency. In this paper, we formulate an optimization problem to minimize the IVD (a.k.a spacing) under the V2I quality-of-service (QoS) requirements by jointly controlling the transmit power, the beamwidth, and the IVD in a platoon-based VCPS. The traffic and channel uncertainties significantly complicate the analytical characterization of the impact of IVD, transmit power, and beamwidth on the QoS requirements. To solve the intractable challenging problem and capture the inherent timescale difference between the communication and vehicular systems, we propose a novel data-driven two-timescale reinforcement learning framework to control the beamforming and the spacing of the platoon at two timescales. With elaborately designed state, action, and reward, we develop a soft actor-critic based two-timescale deep reinforcement learning (DRL) algorithm to learn an effective policy for the control-communication co-design problem. Simulation results validate the effectiveness of spacing control and show that the proposed two-timescale DRL algorithm can significantly improve the traffic capacity and reduce the air drag coefficient while satisfying the QoS requirements of V2I communications.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available