4.7 Article

Interpretable Decision-Making for Autonomous Vehicles at Highway On-Ramps With Latent Space Reinforcement Learning

期刊

IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY
卷 70, 期 9, 页码 8707-8719

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TVT.2021.3098321

关键词

Hidden Markov models; Task analysis; Road transportation; Autonomous vehicles; Merging; Entropy; Decision making; Autonomous driving; interpretability; highway on-ramps; reinforcement learning; latent states

资金

  1. National Natural Science Foundation of China [U20A20225, U2013601]
  2. Natural Science Foundation of Hefei, China [2021032]
  3. Key Research and Development Plan of Anhui Province [202004a05020058]
  4. Fundamental Research Funds for the Central Universities
  5. Science and Technology Innovation Planning Project of Ministry of Education of China
  6. NVIDIA NVAIL program

向作者/读者索取更多资源

This paper introduces a latent space reinforcement learning method for interpretable decision-making of autonomous vehicles at highway on-ramps. By utilizing the latent model and HMM-GMR, the method provides an interpretable state that offers semantic information and environment understanding. Experimental results demonstrate a successful balance between driving safety and efficiency in challenging highway on-ramps merging scenarios.
This paper presents a latent space reinforcement learning method for interpretable decision-making of autonomous vehicles at highway on-ramps. This method is based on the latent model and the combination model of the hidden Markov model and Gaussian mixture regression (HMM-GMR). It is difficult for the traditional decision-making method to understand the environment because its input is high-dimensional and lacks an understanding of the task. By utilizing the HMM-GMR model, we can obtain the interpretable state providing semantic information and environment understanding. A framework is proposed to unify representation learning with the deep reinforcement learning (DRL) approach, in which the latent model is used to reduce the dimension of interpretable state by extracting underlying task-relevant information. Experimental results are presented and the results show the right balance between driving safety and efficiency in the challenging scenarios of highway on-ramps merging.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据