4.7 Article

Real-time self-adaptive Q-learning controller for energy management of conventional autonomous vehicles

期刊

EXPERT SYSTEMS WITH APPLICATIONS
卷 222, 期 -, 页码 -

出版社

PERGAMON-ELSEVIER SCIENCE LTD
DOI: 10.1016/j.eswa.2023.119770

关键词

Autonomous vehicle; Artificial intelligence; Intelligent energy management; Control strategy; Reinforcement learning controller; Machine learning control

向作者/读者索取更多资源

This paper presents an intelligent energy management system based on reinforcement learning for conventional autonomous vehicles, aiming to reduce emissions and energy consumption. A new exploration strategy is proposed to replace the traditional epsilon-greedy strategy in the Q-learning algorithm. The Q-learning and SAQ-learning controllers are shown to generate the desired engine torque and control the air/fuel ratio efficiently in real-time, improving operational time compared to standard Q-learning.
Reducing emissions and energy consumption of autonomous vehicles is critical in the modern era. This paper presents an intelligent energy management system based on Reinforcement Learning (RL) for conventional autonomous vehicles. Furthermore, in order to improve the efficiency, a new exploration strategy is proposed to replace the traditional decayed epsilon-greedy strategy in the Q-learning algorithm associated with RL. Unlike tradi-tional Q-learning algorithms, the proposed self-adaptive Q-learning (SAQ-learning) can be applied in real-time. The learning capability of the controllers can help the vehicle deal with unknown situations in real-time. Nu-merical simulations show that compared to other controllers, Q-learning and SAQ-learning controllers can generate the desired engine torque based on the vehicle road power demand and control the air/fuel ratio by changing the throttle angle efficiently in real-time. Also, the proposed real-time SAQ-learning is shown to improve the operational time by 23% compared to standard Q-learning. Our simulations reveal the effectiveness of the proposed control system compared to other methods, namely dynamic programming and fuzzy logic methods.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据