4.5 Article

A citywide TD-learning based intelligent traffic signal control for autonomous vehicles: Performance evaluation using SUMO

期刊

EXPERT SYSTEMS
卷 -, 期 -, 页码 -

出版社

WILEY
DOI: 10.1111/exsy.13301

关键词

adaptive traffic signal controller; agent-based learning; average waiting time; eligibility traces; gaussian distance function; SARSA (?)

向作者/读者索取更多资源

An autonomous vehicle can operate without human involvement and significantly reduce traffic congestion in an intelligent transportation system. The study proposes an improved SARSA model for managing autonomous vehicles by introducing a Gaussian function to regulate the weights updating mechanism effectively and suggesting the MaxAbs scaled state values instead of MinMax for efficient understanding of the traffic environment.
An autonomous vehicle can sense its environment and operate without human involvement. Its adequate management in an intelligent transportation system could significantly reduce traffic congestion and overall travel time in a network. Adaptive traffic signal controller (ATSC) based on multi-agent systems using state-action-reward-state-action (SARSA (?) are well-known state-of-the-art models to manage autonomous vehicles within urban areas. However, this study found inefficient weights updating mechanisms of the conventional SARSA (?) models. Therefore, it proposes a Gaussian function to regulate the eligibility trace vector's decay mechanism effectively. On the other hand, an efficient understanding of the state of the traffic environment is crucial for an agent to take optimal actions. The conventional models feed the state values to the agents through the MinMax normalization technique, which sometimes shows less efficiency and robustness. So, this study suggests the MaxAbs scaled state values instead of MinMax to address the problem. Furthermore, the combination of the A-star routing algorithm and proposed model demonstrated a good increase in performance relatively to the conventional SARSA (?)-based routing algorithms. The proposed model and the baselines were implemented in a microscopic traffic simulation environment using the SUMO package over a complex real-world-like 21-intersections network to evaluate their performance. The results showed a reduction of the vehicle's average total waiting time and total stops by a mean value of 59.9% and 17.55% compared to the considered baselines. Also, the A-star combined with the proposed controller outperformed the conventional approaches by increasing the vehicle's average trip speed by 3.4%.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.5
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据