4.7 Article

An Adaptive Hierarchical Energy Management Strategy for Hybrid Electric Vehicles Combining Heuristic Domain Knowledge and Data-Driven Deep Reinforcement Learning

期刊

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TTE.2021.3132773

关键词

State of charge; Energy management; Hybrid electric vehicles; Optimization; Task analysis; Environmental management; Engines; Energy management strategy (EMS); heuristic knowledge; hierarchical; hybrid electric vehicle (HEV); reinforcement learning (RL)

资金

  1. National Natural Science Foundation of China [51905061]
  2. China Postdoctoral Science Foundation [2020M671842]
  3. Natural Science Foundation of Chongqing [cstc2019jcyj-msxmX0097]

向作者/读者索取更多资源

With the development of artificial intelligence and machine learning, reinforcement learning has opened up new possibilities for hybrid electric vehicle energy management. However, current issues limit its application in industrial energy management strategy tasks. To overcome this, an adaptive hierarchical energy management strategy combining heuristic equivalent consumption minimization strategy knowledge and deep deterministic policy gradient algorithm is proposed. Experimental results show that the proposed strategy outperforms other benchmark strategies in terms of fuel consumption.
With the development of artificial intelligence, there has been a growing interest in machine learning-based control strategy, among which reinforcement learning (RL) has opened up a new direction in the field of hybrid electric vehicle (HEV) energy management. However, the issues of the current RL setting ranging from inappropriate battery state-of-charge (SOC) constraint to ineffective and risky exploration make it inapplicable to many industrial energy management strategy (EMS) tasks. To address this, an adaptive hierarchical EMS combining heuristic equivalent consumption minimization strategy (ECMS) knowledge and deep deterministic policy gradient (DDPG), which is a state-of-the-art data-driven RL algorithm, is proposed in this work. For comparison purposes, the proposed strategy is contrasted with dynamic programming (DP), proportion integration differentiation (PID)-based adaptive ECMS, and rule-based and standard RL-based counterparts, and the results show that the fuel consumption after SOC correction for the proposed strategy is very close to that of the DP-based control and lower than that of the other three benchmark strategies. Considering that the proposed strategy can make better use of the RL techniques while realizing an effective, efficient, and safe exploration in a data-driven manner, it may become a strong foothold for future RL-based EMS to build on, especially when the controller has to be trained directly and from scratch in a real-world environment.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据