4.4 Article

Navigating Electric Vehicles Along a Signalized Corridor via Reinforcement Learning: Toward Adaptive Eco-Driving Control

期刊

TRANSPORTATION RESEARCH RECORD
卷 2676, 期 8, 页码 657-669

出版社

SAGE PUBLICATIONS INC
DOI: 10.1177/03611981221084683

关键词

operations; ITS

资金

  1. National Key R&D Program of China [2021TFB1600500]
  2. Key R&D Program of Jiangsu Province in China [BE2020013]
  3. Opening Project of Key Laboratory of Intelligent Transportation Systems Technologies, Ministry of Communications, P.R. China [2020-8501]
  4. Postgraduate Research & Practice Innovation Program of Jiangsu Province [SJCX21_0062]
  5. Zhishan Young Scholar Support Program of Southeast University [2019-ZJKJ-ZDZX02-2]

向作者/读者索取更多资源

This paper proposes an adaptive eco-driving method for electric vehicles using reinforcement learning technology. It considers factors such as safety, traffic mobility, energy consumption, and comfort, and achieves balanced and stable performance in simulation studies.
One problem associated with the operation of electric vehicles (EVs) is the limited battery, which cannot guarantee their endurance. The increasing electricity consumption will also impose a burden on economy and ecology of the vehicles. To achieve energy saving, this paper proposes an adaptive eco-driving method in the environment of signalized corridors. The framework with adaptive and real-time control is implemented by the reinforcement learning technique. First, the operation of EVs in the proximity of intersections is defined as a Markov Decision Process (MDP) to apply the twin delayed deep deterministic policy gradient (TD3) algorithm, to deal with the decision process with continuous action space. Therefore, the speed of the vehicle can be adjusted continuously. Second, safety, traffic mobility, energy consumption, and comfort are all considered by designing a comprehensive reward function for the MDP. Third, the simulation study takes Aoti Street in Nanjing City with several consecutive signalized intersections as the research road network, and the state representation in MDP considers the information from consecutive downstream traffic signals. After the parameter tuning procedure, simulations are carried out for three typical eco-driving scenarios, including free flow, car following, and congestion flow. By comparing with default car-following behavior in the simulation platform SUMO and several state-of-the-art deep reinforcement learning algorithms, the proposed strategy shows a balanced and stable performance.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.4
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据