4.4 Article

Modelling personalised car-following behaviour: a memory-based deep reinforcement learning approach

期刊

出版社

TAYLOR & FRANCIS LTD
DOI: 10.1080/23249935.2022.2035846

关键词

Car-following; autonomous driving; twin delayed deep deterministic policy gradients; long short-term memory; driving styles

资金

  1. National Key R&D Program of China [2020YFB1600302]
  2. National Natural Science Foundation of China [61873018, 51775016]

向作者/读者索取更多资源

This study develops a personalised car-following model using a memory-based deep reinforcement learning approach, which optimises the car-following behaviour according to driving characteristics. The results show that it can reflect the fundamental differences among different driving styles.
To adapt to human-driving habits, this study develops a personalised car-following model via a memory-based deep reinforcement learning approach. Specifically, Twin Delayed Deep Deterministic Policy Gradients (TD3) is integrated with a long short-term memory (LSTM) (abbreviated as LSTM-TD3). Using the NGSIM dataset, unsupervised learning-based clustering and data feature analyses are performed. The driving characteristics related to safety, efficiency and comfort are extracted for different driving styles, i.e. aggressive, common and conservative. Then, reward functions are constructed for different driving styles by incorporating their driving characteristics. By resorting to the TD3 policy within a recurrent actor-critic framework, LSTM-TD3 optimises the car-following behaviour via trial-and-error interactions according to the reward functions. Results show that compared with LSTM-DDPG and DDPG, LSTM-TD3 reproduces personalised car-following behaviour with desirable convergence speed and reward. It reveals that LSTM-TD3 can reflect the essential difference in safety, efficiency and comfort requirements among different driving styles.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.4
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据