What question did this study set out to answer?

The research aims to develop an effective method for optimal tracking control in multiplayer differential games with unknown dynamics.

April 10, 2026

Two-Stage Asynchronous Learning for Optimal Tracking in Multiplayer Differential Games

Key Points

The research aims to develop an effective method for optimal tracking control in multiplayer differential games with unknown dynamics.
Proposed a two-stage asynchronous learning scheme for Nash equilibrium solutions.
Constructed stabilizing control policies using a homotopic-based iterative process in the first stage.
Employed an asynchronous policy iteration method for players to update their policies with partial information.
Extended the algorithm to a data-driven framework to reduce reliance on explicit dynamics.
The proposed method shows improved convergence efficiency compared to synchronous approaches.
Theoretical convergence under stabilizability and detectability conditions is proven.
Simulation examples confirm the effectiveness of the method in tracking sinusoidal references, outperforming comparison algorithms.

Abstract

The optimal tracking control problem for multiplayer differential game systems (MDGS) with unknown dynamics is investigated in this article. A two-stage asynchronous learning scheme is proposed to achieve Nash equilibrium solutions without requiring initial admissible control policies. In the first stage, stabilizing control policies are constructed through a homotopic-based iterative process. In the second stage, an asynchronous policy iteration (PI) method is employed, in which players sequentially update their policies using partial real-time information, contributing to improved convergence efficiency compared to synchronous approaches. The proposed scheme is further extended to a data-driven framework, relaxing the requirement of explicit system dynamic information. Convergence under stabilizability and detectability conditions is theoretically proven. Finally, two simulation examples are conducted to demonstrate the effectiveness of the proposed method in tracking a sinusoidal reference. Additionally, comparison experiments are provided to highlight the superiority of the proposed algorithm.

Bookmark

Two-Stage Asynchronous Learning for Optimal Tracking in Multiplayer Differential Games

Key Points

Abstract

Cite This Study