What question did this study set out to answer?

This review aims to summarize the use of reinforcement learning for traffic signal control and identify prevailing challenges.

June 7, 2026Open Access

Reinforcement Learning for Adaptive Traffic Signal Control: An Overview

Key Points

This review aims to summarize the use of reinforcement learning for traffic signal control and identify prevailing challenges.
Grouped core reinforcement learning approaches for traffic signal control
Reviewed evaluation practices in simulation environments like SUMO and CityFlow
Identified gaps in addressing safety and environmental factors in studies
Most studies emphasize efficiency measures like delay and queue length
Major evaluations are conducted in simulation environments
Key challenges include managing multiple objectives and scaling to large networks

Abstract

Traffic signal control is central to urban mobility because it directly influences congestion, travel time, and emissions. Traditional methods, including fixed-time signal scheduling and actuated control strategies, can handle normal conditions but often fail when traffic changes suddenly. Reinforcement learning (RL) has gained attention as a data-driven approach which adapts policies by interacting with traffic environments. Recent studies have investigated both single-agent and multi-agent formulations, encompassing value-based, policy-based, and actor–critic learning paradigms. This paper reviews how RL has been adopted for traffic signal control, grouping the core approaches and highlighting their evaluation practices. The review shows that most studies still focus on efficiency measures such as delay and queue length, with safety and environmental factors less frequently addressed. Nearly all evaluations are done in simulation, with SUMO and CityFlow as the dominant platforms. Key challenges remain in handling multiple objectives, scaling to large networks, and addressing the challenge of transferring simulation-based results to real-world deployment. By outlining current methods, their strengths and weaknesses, and the gaps that persist, this review points to the directions needed for RL to move from research to practice.

Read Full Paperexternally

Mark Helpful

Bookmark

Relay

View Full Paper