What type of study is this?

This is a Experimental Study study.

September 27, 2025Open Access

A Constrained Multi-Agent Reinforcement Learning Approach to Autonomous Traffic Signal Control

Key Points

This algorithm effectively addresses traffic congestion by optimizing signal control in real-time environments.
The Multi-Agent Proximal Policy Optimization with Lagrange Cost Estimator enhanced performance by 12.60% compared to MAPPO.
Evaluation used three real-world datasets, showcasing the algorithm's practicality in urban traffic systems.
This approach highlights the importance of incorporating constraints for creating effective traffic management solutions.

Abstract

Traffic congestion in modern cities is exacerbated by the limitations of traditional fixed-time traffic signal systems, which fail to adapt to dynamic traffic patterns. Adaptive Traffic Signal Control (ATSC) algorithms have emerged as a solution by dynamically adjusting signal timing based on real-time traffic conditions. However, the main limitation of such methods is they are not transferable to environments under real-world constraints, such as balancing efficiency, minimizing collisions, and ensuring fairness across intersections. In this paper, we view the ATSC problem as a constrained multi-agent reinforcement learning (MARL) problem and propose a novel algorithm named Multi-Agent Proximal Policy Optimization with Lagrange Cost Estimator (MAPPO-LCE) to produce effective traffic signal control policies. Our approach integrates the Lagrange multipliers method to balance rewards and constraints, with a cost estimator for stable adjustment. We also introduce three novel constraints on the traffic network: GreenTime, GreenSkip, and PhaseSkip, which penalize traffic policies that do not conform to real-world scenarios. Our experimental results on three real-world datasets demonstrate that MAPPO-LCE outperforms three baseline MARL algorithms by across all environments and traffic constraints (improving on MAPPO by \(12.60\% \) , IPPO by \(10.29\% \) , and QTRAN by \(13.10\% \) ). Our results show that constrained MARL is a valuable tool for traffic planners to deploy scalable and efficient ATSC methods in real-world traffic networks.

A Constrained Multi-Agent Reinforcement Learning Approach to Autonomous Traffic Signal Control

Key Points

Abstract

Cite This Study

Also Consider

Also Consider