4.8 Article

Reinforcement learning-based real-time energy management for a hybrid tracked vehicle

期刊

APPLIED ENERGY
卷 171, 期 -, 页码 372-382

出版社

ELSEVIER SCI LTD
DOI: 10.1016/j.apenergy.2016.03.082

关键词

Hybrid tracked vehicle; Markov chain; Kullback-Leibler divergence rate; Reinforcement learning; Energy management; Control strategy

资金

  1. National Natural Science Foundation of China [51375044]
  2. University Talent Introduction 111 Project [B12022]
  3. Defense Basic Research Project [B20132010]

向作者/读者索取更多资源

To realize the optimal energy allocation between the engine-generator and battery of a hybrid tracked vehicle (HTV), a reinforcement learning-based real-time energy-management strategy was proposed. A systematic control-oriented model for the HTV was built and validated through the test bench, including the battery pack, the engine-generator set (EGS), and the power request. To use effectively the statistical information of power request online, a Markov chain-based real-time power request recursive algorithm for learning transition probabilities was derived and validated. The Kullback-Leibler (KL) divergence rate was adopted to determine when the transition probability matrix and the optimal control strategy update in real time. Reinforcement learning (RL) was applied to compare quantitatively the effects of different forgetting factors and KL divergence rates on reducing fuel consumption. RI. has also been used to optimize the control strategy for HTV, compared to preliminary and dynamic programming-based control strategies. The real-time and robust performance of the proposed online energy management strategy was verified under two driving schedules collected in the field test. The simulation results indicate the proposed RL-based energy management strategy can significantly improve fuel efficiency and can be applied in real time. (C) 2016 Elsevier Ltd. All rights reserved.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.8
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据