4.7 Article

Adaptive Traffic Signal Control for large-scale scenario with Cooperative Group-based Multi-agent reinforcement learning

出版社

PERGAMON-ELSEVIER SCIENCE LTD
DOI: 10.1016/j.trc.2021.103046

关键词

Multi-agent reinforcement learning; Adaptive traffic signal control; Regional green wave control; CVIS

资金

  1. National Natural Science Foundation, China [61102105, 51779050]
  2. National Key Research and Development Program of China [2016YFB0700100]
  3. Harbin Science Fund for Young Reserve Talents, China [2017RAQXJ036]
  4. Fundamental Research Funds for the Central Universities, China [HEUCFG201831]

向作者/读者索取更多资源

The research introduces a Cooperative Group-Based Multi-Agent reinforcement learning-ATSC framework that effectively controls large-scale road networks. The algorithm incorporates advanced techniques and achieves remarkable results in congestion alleviation and environmental protection.
Recent research reveals that reinforcement learning can potentially perform optimal decision making compared to traditional methods like Adaptive Traffic Signal Control (ATSC). With the development of knowledge through trial and error, the Deep Reinforcement Learning (DRL) technique shows its feasibility for the intelligent traffic lights control. However, the general DRL algorithms cannot meet the demands of agents for coordination within large complex road networks. In this article, we introduce a new Cooperative Group-Based Multi-Agent reinforcement learning-ATSC (CGB-MATSC) framework. It is based on Cooperative Vehicle Infrastructure System (CVIS) to realize effective control in the large-scale road network. We propose a CGB-MAQL algorithm that applies k-nearest-neighbor-based state representation, pheromone-based regional green-wave control mode, and spatial discounted reward to stabilize the learning convergence. Extensive experiments and ablation studies of the CGB-MAQL algorithm show its effectiveness and scalability in the synthetic road network, Monaco city and Harbin city scenarios. Results demonstrate that compared with a set of general control methods, our algorithm can better control multiple intersection cases on congestion alleviation and environmental protection.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据