What question did this study set out to answer?

To enhance ride-sharing dispatching efficiency using a multi-agent reinforcement learning approach.

January 20, 2026Open Access

A multi-agent reinforcement learning scheduling algorithm integrating state graph and task graph structural modeling for ride-sharing dispatching

Key Points

To enhance ride-sharing dispatching efficiency using a multi-agent reinforcement learning approach.
Developed a scheduling algorithm integrating state and task graphs.
Employed dual-path modeling and multi-order diffusion kernels for feature extraction.
Introduced a feasibility mask with Top-K filtering for cross-graph matching.
Reduced Average Waiting Time by 0.27 to 0.35 minutes.
Increased Order Response Rate by 2.3% to 2.7%.
Improved Vehicle Utilization Rate by 3.5% to 3.9%.
Lowered Average Detour Ratio by 0.05 to 0.06.

Abstract

Urban-scale ride-hailing dispatch faces critical challenges such as heterogeneous demand density, highly dynamic state transitions, and multi-agent coordination. Traditional rule-based or heuristic matching strategies struggle to maintain efficiency under large-scale spatiotemporal distributions. This paper proposes DualG-MARL , a graph-attentive multi-agent reinforcement learning framework that employs dual-path modeling of vehicle state graphs and task graphs. The framework extracts spatial structural features via multi-order diffusion kernels and introduces a feasibility mask combined with a Top-K filtering mechanism for cross-graph matching, thereby enhancing both decision-making efficiency and assignment quality. Empirical evaluations on real-world order datasets from Manhattan and Queens demonstrate that the proposed method outperforms the current state-of-the-art approach, CoopRide, by reducing the Average Waiting Time (AWT) by 0.27 and 0.35 minutes, increasing the Order Response Rate (ORR) by 2.3% and 2.7%, improving Vehicle Utilization Rate (VUR) by 3.5% and 3.9%, and lowering the Average Detour Ratio (ADR) by 0.05 and 0.06, respectively. These results establish new benchmarks in core dispatching metrics, and show that the proposed method maintains high responsiveness while effectively reducing matching redundancy and idle travel, offering a structure-aware paradigm for large-scale urban mobility systems.

A multi-agent reinforcement learning scheduling algorithm integrating state graph and task graph structural modeling for ride-sharing dispatching

Key Points

Abstract

Cite This Study