Article

DCL-AIM: Decentralized coordination learning of autonomous intersection management for connected and automated vehicles

TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES (2019)

Journal

TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES

Volume 103, Pages 246-260

Publisher

PERGAMON-ELSEVIER SCIENCE LTD

DOI: 10.1016/j.trc.2019.04.012

Keywords

Multi-agent coordination; Reinforcement learning; Intersection management; Connected and automated vehicles

Category

Transportation Science & Technology

Funding

Singapore Ministry of Education Academic Research Fund Tier 2 [MOE2017-T2-1-029]

Abstract

Conventional intersection management strategies, such as signalized intersections, are not necessarily optimal in a connected and automated vehicle (CAV) environment. Autonomous intersection management (AIM) is tailored to CAVs and aims to replace conventional traffic control strategies. In this work, using the communication and computation technologies of CAVs, the sequential movements of vehicles through intersections are modelled as multi-agent Markov decision processes (MAMDPs) in which vehicle agents cooperate to minimize intersection delay subject to collision-free constraints. To handle the very large dimensionality inherent in multi-agent decision-making problems, the state space of the CAVs is decomposed into an independent part and a coordinated part by exploiting the structural properties of the AIM problem, and a decentralized coordination multi-agent learning approach (DCL-AIM) is proposed to solve the problem efficiently by exploiting both global and localized agent coordination needs in AIM. The main feature of the proposed approach is that it explicitly identifies and dynamically adapts agent coordination needs during the learning process, so that the curse of dimensionality and the environment nonstationarity problem in multi-agent learning can be alleviated. The effectiveness of the proposed method is demonstrated under a variety of traffic conditions. DCL-AIM is compared against three benchmarks: First-Come-First-Serve based AIM (FCFS-AIM), the Longest-Queue-First policy (LQF-AIM), and signal control based on Webster's method (Signal). Experimental results show that the sequential decisions from DCL-AIM outperform the other control policies.
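The abstract names three benchmark policies without describing them. The sketch below illustrates, in simplified form, the idea behind each one. It is not the paper's implementation: the `Vehicle` class, its fields, and the function names are hypothetical, and the FCFS and LQF rules are reduced to picking the next vehicle to serve from a waiting list. The Webster function uses the commonly cited optimal cycle length formula C = (1.5L + 5) / (1 - Y), where L is the total lost time per cycle in seconds and Y is the sum of the critical flow ratios.

```python
from dataclasses import dataclass


@dataclass
class Vehicle:
    vehicle_id: str      # hypothetical identifier
    approach: str        # hypothetical approach label, e.g. "N" or "E"
    arrival_time: float  # time (s) at which the vehicle reached the intersection


def fcfs_next(waiting):
    """First-Come-First-Serve: serve the vehicle that arrived earliest."""
    return min(waiting, key=lambda v: v.arrival_time)


def lqf_next(waiting):
    """Longest-Queue-First: serve the head of the approach with the most waiting vehicles."""
    queues = {}
    for v in waiting:
        queues.setdefault(v.approach, []).append(v)
    longest = max(queues.values(), key=len)
    # The head of a queue is taken to be its earliest arrival (an assumption).
    return min(longest, key=lambda v: v.arrival_time)


def webster_cycle_length(lost_time, flow_ratios):
    """Webster's optimal cycle length in seconds: (1.5 * L + 5) / (1 - Y)."""
    y = sum(flow_ratios)
    if y >= 1.0:
        raise ValueError("sum of critical flow ratios must be below 1")
    return (1.5 * lost_time + 5.0) / (1.0 - y)


waiting = [
    Vehicle("a", "N", 3.0),
    Vehicle("b", "E", 1.0),
    Vehicle("c", "N", 2.0),
]
print(fcfs_next(waiting).vehicle_id)  # earliest arrival overall
print(lqf_next(waiting).vehicle_id)   # head of the longest queue
print(webster_cycle_length(10.0, [0.3, 0.3]))
```

Here FCFS and LQF disagree: the earliest arrival is on the shorter queue, so FCFS serves it first while LQF serves the head of the longer northbound queue. A learned policy such as DCL-AIM is free to choose either order depending on which yields lower delay.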
