Inefficient urban traffic management remains a critical challenge, as conventional signal controllers—built on fixed timing plans—cannot cope with the dynamic nature of modern city traffic. This study addresses this limitation by developing a decentralized MARL-based framework capable of coordinating five interconnected intersections as a unified traffic cell. Central to the proposed solution is the Strategic Anti-Blocking Phase Adjustment (SAPA) module, which enables intersections to autonomously modify phase durations in response to real-time traffic conditions. The framework is designed to handle heterogeneous demand patterns, with particular emphasis on arterial corridors connecting urban centers to peripheral zones. Integration of a Visible Light Communication (VLC) network allows continuous monitoring of key variables, including vehicle kinematics and pedestrian activity, feeding the agents with rich environmental feedback. Experimental evaluation confirms the effectiveness of the approach: the SAPA-augmented DQN achieves roughly 33% shorter vehicle queues and a ~70% reduction in pedestrian waiting counts relative to a standard DQN baseline. Remarkably, these gains bring the value-based method to a performance level comparable to MAPPO, a considerably more complex multi-agent policy optimization algorithm, establishing SAPA as an efficient and scalable enhancement for intelligent urban traffic control.
Galvão et al. (Mon,) studied this question.
Synapse has enriched 5 closely related papers on similar clinical questions. Consider them for comparative context: