4.8 Article Proceedings Paper

Continuous reinforcement learning of energy management with deep Q network for a power split hybrid electric bus

Journal

APPLIED ENERGY
Volume 222, Issue -, Pages 799-811

Publisher

ELSEVIER SCI LTD
DOI: 10.1016/j.apenergy.2018.03.104

Keywords

Energy management strategy; Continuous reinforcement learning; Deep Q learning; Dynamic programming; Hybrid electric bus

Funding

  1. National Key Research and Development Program of China [2017YFB0103802]
  2. National Natural Science Foundation of China [51705020, 51675042]
  3. China Postdoctoral Science Foundation [2016M600933]

Ask authors/readers for more resources

Reinforcement learning is a new research hotspot in the artificial intelligence community. Q learning as a famous reinforcement learning algorithm can achieve satisfactory control performance without need to clarify the complex internal factors in controlled objects. However, discretization state is necessary which limits the application of Q learning in energy management for hybrid electric bus (HEB). In this paper the deep Q learning (DQL) is adopted for energy management issue and the strategy is proposed and verified. Firstly, the system modeling of bus configuration are described. Then, the energy management strategy based on deep Q learning is put forward. Deep neural network is employed and well trained to approximate the action value function (Q function). Furthermore, the Q learning strategy based on the same model is mentioned and applied to compare with deep Q learning. Finally, a part of trained decision network is analyzed separately to verify the effectiveness and rationality of the DQL-based strategy. The training results indicate that DQL-based strategy makes a better performance than that of Q learning in training time consuming and convergence rate. Results also demonstrate the fuel economy of proposed strategy under the unknown driving condition achieves 89% of dynamic programming-based method. In addition, the technique can finally learn to the target state of charge under different initial conditions. The main contribution of this study is to explore a novel reinforcement learning methodology into energy management for HEB which solve the curse of state variable dimensionality, and the techniques can be adopted to solve similar problems.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.8
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available