4.8 Article

Deep reinforcement learning of energy management with continuous control strategy and traffic information for a series-parallel plug-in hybrid electric bus

期刊

APPLIED ENERGY
卷 247, 期 -, 页码 454-466

出版社

ELSEVIER SCI LTD
DOI: 10.1016/j.apenergy.2019.04.021

关键词

Energy management; Hybrid electric vehicle; Deep reinforcement learning; Deep deterministic policy gradient

资金

  1. National Natural Science Foundation of China [61620106002, 51705020]

向作者/读者索取更多资源

Hybrid electric vehicles offer an immediate solution for emissions reduction and fuel displacement under the current technique level. Energy management strategies are critical for improving fuel economy of hybrid electric vehicles. In this paper we propose a energy management strategy for a series-parallel plug-in hybrid electric bus based on deep deterministic policy gradients. Specifically, deep deterministic policy gradients is an actor-critic, model-free reinforcement learning algorithm that can assign the optimal energy split of the bus over continuous spaces. We consider that the buses are driving in a fixed bus line, where driving cycle is constrained by the traffic. The traffic information and number of passengers are also incorporated into the energy management system. The deep reinforcement learning based energy management agent is trained with a large amount of driving cycles that generated from traffic simulation. Experiments on the traffic simulation driving cycles show that the proposed approach outperforms conventional reinforcement learning approach and exhibits performance close to the global optimal dynamic programming. Moreover, it also has great generality to the standard driving cycles that are significantly different with the ones that it has been trained with. We also show some interesting attributes of learned energy management strategies through visualizations of the actor and critic. The main contribution of this study is to explore the incorporation of traffic information within hybrid electric vehicle energy managment through advanced intelligent algorithms.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.8
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据