4.5 Article

Traffic signal control using a cooperative EWMA-based multi-agent reinforcement learning

Journal

APPLIED INTELLIGENCE
Volume 53, Issue 4, Pages 4483-4498

Publisher

SPRINGER
DOI: 10.1007/s10489-022-03643-9

Keywords

Mean-field; Traffic signal control; TD3; Multi-agent reinforcement learning

Ask authors/readers for more resources

This paper proposes a new MARL algorithm called CoTD3-EWMA, which introduces mean-field theory and dynamic delay updating to effectively solve the challenges in urban traffic signal control and improve traffic efficiency.
In contemporary urban, traffic signal control is still enormously difficult. Multi-agent reinforcement learning (MARL) is a promising ways to solve this problem. However, most MARL algorithms can not effectively transfer learning strategies when the agents increase or decrease. This paper proposes a new MARL algorithm called cooperative dynamic delay updating twin delayed deep deterministic policy gradient based on the exponentially weighted moving average (CoTD3-EWMA) to solve the problem. By introducing mean-field theory, the algorithm implicitly models the interaction between agents and environment. It reduces the dimension of action space and improves the scalability of the algorithm. In addition, we propose a dynamic delay updating method based on the exponentially weighted moving average (EWMA), which improves the Q value overestimation problem of the traditional TD3 algorithm. Moreover, a joint reward allocation mechanism and state sharing mechanism are proposed to improve the global strategy learning ability and robustness of the agent. The simulation results show that the performance of the new algorithm is better than the current state-of-the-art algorithms, which effectively reduces the delay time of vehicles and improves the traffic efficiency of the traffic network.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.5
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available