4.7 Article

Interpretable Decision-Making for Autonomous Vehicles at Highway On-Ramps With Latent Space Reinforcement Learning

Journal

IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY
Volume 70, Issue 9, Pages 8707-8719

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TVT.2021.3098321

Keywords

Hidden Markov models; Task analysis; Road transportation; Autonomous vehicles; Merging; Entropy; Decision making; Autonomous driving; interpretability; highway on-ramps; reinforcement learning; latent states

Funding

  1. National Natural Science Foundation of China [U20A20225, U2013601]
  2. Natural Science Foundation of Hefei, China [2021032]
  3. Key Research and Development Plan of Anhui Province [202004a05020058]
  4. Fundamental Research Funds for the Central Universities
  5. Science and Technology Innovation Planning Project of Ministry of Education of China
  6. NVIDIA NVAIL program

Ask authors/readers for more resources

This paper introduces a latent space reinforcement learning method for interpretable decision-making of autonomous vehicles at highway on-ramps. By utilizing the latent model and HMM-GMR, the method provides an interpretable state that offers semantic information and environment understanding. Experimental results demonstrate a successful balance between driving safety and efficiency in challenging highway on-ramps merging scenarios.
This paper presents a latent space reinforcement learning method for interpretable decision-making of autonomous vehicles at highway on-ramps. This method is based on the latent model and the combination model of the hidden Markov model and Gaussian mixture regression (HMM-GMR). It is difficult for the traditional decision-making method to understand the environment because its input is high-dimensional and lacks an understanding of the task. By utilizing the HMM-GMR model, we can obtain the interpretable state providing semantic information and environment understanding. A framework is proposed to unify representation learning with the deep reinforcement learning (DRL) approach, in which the latent model is used to reduce the dimension of interpretable state by extracting underlying task-relevant information. Experimental results are presented and the results show the right balance between driving safety and efficiency in the challenging scenarios of highway on-ramps merging.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available