What question did this study set out to answer?

This research addresses the challenges of mandatory lane changes at signalized intersections for connected and autonomous vehicles (CAVs).

February 26, 2026 | Open Access

Joint longitudinal-lateral trajectory planning for CAVs in mixed traffic at signalized intersections

Key Points

Addressed the challenges that mandatory lane changes (MLCs) at signalized intersections pose for connected and autonomous vehicles (CAVs)
Formulated the joint longitudinal-lateral trajectory planning problem as a multi-agent reinforcement learning (MARL) task
Developed the SS-MA-PPO (Simulation-Supervised Multi-Agent Proximal Policy Optimization) framework for acceleration and lane-change decisions
Implemented a Simulation-Guided Supervisory Module (SGSM) for offline trajectory rollouts and online policy arbitration
Included surrounding vehicle information in the observation for cooperation and applied transfer learning to accelerate training
SS-MA-PPO outperformed conventional and MARL baseline methods
Verified effectiveness of SGSM, vehicle cooperation, and transfer learning
Achieved faster training convergence and enhanced performance across various evaluation metrics

Abstract

Mandatory lane changes (MLCs) pose significant challenges to trajectory planning at intersections, where vehicles are required to change lanes mid-block to reach designated turn lanes before the stop bar. MLCs often generate shockwaves that induce increased vehicle delay and fuel consumption, and the presence of human-driven vehicles in mixed traffic further exacerbates this issue. To address these challenges, this study formulates the joint longitudinal-lateral trajectory planning problem in mixed traffic as a multi-agent reinforcement learning (MARL) task. We propose SS-MA-PPO, a Simulation-Supervised Multi-Agent Proximal Policy Optimization framework, which guides connected and autonomous vehicles (CAVs) in both acceleration and lane-change decisions. A Simulation-Guided Supervisory Module (SGSM) performs offline trajectory rollouts of human-driver models to assess feasibility and safety, and arbitrates online between rule-based and learned policies. The information of surrounding vehicles is incorporated in the observation to achieve vehicle cooperation, and a transfer learning mechanism is designed to accelerate training. Experiments using a real-world dataset from Langfang, China demonstrate that SS-MA-PPO outperforms both conventional and MARL baselines across various evaluation metrics. Ablation experiments verify the substantial effectiveness of the proposed SGSM module, vehicle cooperation, and transfer learning, achieving enhanced performance and faster training convergence. The source code is available at: https://github.com/Xingwei-Jiang/SS-MA-PPO.
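
The arbitration idea described in the abstract, where a supervisory check decides between a learned action and a rule-based one, can be illustrated with a minimal sketch. This is not the authors' SGSM implementation (see the linked repository for that). It assumes a single leader vehicle that holds constant speed, a simple kinematic rollout run at decision time rather than offline, and a fixed safe-gap threshold. The names `Vehicle`, `min_gap_after_rollout`, `arbitrate`, and all parameter values are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class Vehicle:
    position: float  # metres along the lane (hypothetical state layout)
    speed: float     # metres per second


def min_gap_after_rollout(ego, leader, ego_accel, horizon=3.0, dt=0.5):
    """Roll both vehicles forward and return the smallest gap observed.

    Assumptions (not from the paper): the leader holds constant speed,
    the ego applies a constant acceleration and never reverses.
    """
    ego_pos, ego_speed = ego.position, ego.speed
    leader_pos = leader.position
    min_gap = leader_pos - ego_pos
    for _ in range(int(round(horizon / dt))):
        ego_speed = max(0.0, ego_speed + ego_accel * dt)
        ego_pos += ego_speed * dt
        leader_pos += leader.speed * dt
        min_gap = min(min_gap, leader_pos - ego_pos)
    return min_gap


def arbitrate(ego, leader, learned_accel, rule_accel, safe_gap=5.0):
    """Keep the learned action if its rollout stays safe, else fall back."""
    if min_gap_after_rollout(ego, leader, learned_accel) >= safe_gap:
        return learned_accel, "learned"
    return rule_accel, "rule"


if __name__ == "__main__":
    ego = Vehicle(position=0.0, speed=10.0)
    # Leader far ahead at the same speed: the learned action is kept.
    print(arbitrate(ego, Vehicle(30.0, 10.0), learned_accel=0.0, rule_accel=-2.0))
    # Slow leader close ahead: the rollout closes the gap, so the rule wins.
    print(arbitrate(ego, Vehicle(15.0, 5.0), learned_accel=1.0, rule_accel=-2.0))
```

In this sketch the supervisor only vetoes the learned action. A fuller version would also roll out lane-change maneuvers and use a human-driver model for surrounding vehicles, as the abstract describes.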
