4.7 Article

An Adaptive Hierarchical Energy Management Strategy for Hybrid Electric Vehicles Combining Heuristic Domain Knowledge and Data-Driven Deep Reinforcement Learning

Journal

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TTE.2021.3132773

Keywords

State of charge; Energy management; Hybrid electric vehicles; Optimization; Task analysis; Environmental management; Engines; Energy management strategy (EMS); heuristic knowledge; hierarchical; hybrid electric vehicle (HEV); reinforcement learning (RL)

Funding

  1. National Natural Science Foundation of China [51905061]
  2. China Postdoctoral Science Foundation [2020M671842]
  3. Natural Science Foundation of Chongqing [cstc2019jcyj-msxmX0097]

Ask authors/readers for more resources

With the development of artificial intelligence and machine learning, reinforcement learning has opened up new possibilities for hybrid electric vehicle energy management. However, current issues limit its application in industrial energy management strategy tasks. To overcome this, an adaptive hierarchical energy management strategy combining heuristic equivalent consumption minimization strategy knowledge and deep deterministic policy gradient algorithm is proposed. Experimental results show that the proposed strategy outperforms other benchmark strategies in terms of fuel consumption.
With the development of artificial intelligence, there has been a growing interest in machine learning-based control strategy, among which reinforcement learning (RL) has opened up a new direction in the field of hybrid electric vehicle (HEV) energy management. However, the issues of the current RL setting ranging from inappropriate battery state-of-charge (SOC) constraint to ineffective and risky exploration make it inapplicable to many industrial energy management strategy (EMS) tasks. To address this, an adaptive hierarchical EMS combining heuristic equivalent consumption minimization strategy (ECMS) knowledge and deep deterministic policy gradient (DDPG), which is a state-of-the-art data-driven RL algorithm, is proposed in this work. For comparison purposes, the proposed strategy is contrasted with dynamic programming (DP), proportion integration differentiation (PID)-based adaptive ECMS, and rule-based and standard RL-based counterparts, and the results show that the fuel consumption after SOC correction for the proposed strategy is very close to that of the DP-based control and lower than that of the other three benchmark strategies. Considering that the proposed strategy can make better use of the RL techniques while realizing an effective, efficient, and safe exploration in a data-driven manner, it may become a strong foothold for future RL-based EMS to build on, especially when the controller has to be trained directly and from scratch in a real-world environment.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available