Network-wide traffic signal control optimization using a multi-agent deep reinforcement learning
Abstract
Inefficient traffic control may cause numerous problems such as traffic congestion and energy waste. This paper proposes a novel multi-agent reinforcement learning method, named KS-DDPG (Knowledge Sharing Deep Deterministic Policy Gradient) to achieve optimal control by enhancing the cooperation between traffic signals. By introducing the knowledge-sharing enabled communication protocol, each agent can access to the collective representation of the traffic environment collected by all agents. The proposed method is evaluated through two experiments respectively using synthetic and real-world datasets. The comparison with state-of-the-art reinforcement learning-based and conventional transportation methods demonstrate the proposed KS-DDPG has significant efficiency in controlling large-scale transportation networks and coping with fluctuations in traffic flow. In addition, the introduced communication mechanism has also been proven to speed up the convergence of the model without significantly increasing the computational burden.
- Publication:
-
Transportation Research Part C: Emerging Technologies
- Pub Date:
- April 2021
- DOI:
- arXiv:
- arXiv:2104.09936
- Bibcode:
- 2021TRPC..12503059L
- Keywords:
-
- Multi-agent reinforcement learning;
- Knowledge sharing;
- Adaptive traffic signal control;
- Deep learning;
- Transportation network;
- Computer Science - Artificial Intelligence
- E-Print:
- Transportation Research Part C: Emerging Technologies Volume 125, April 2021, 103059