Key points are not available for this paper at this time.
Abstract This paper introduces mix‐zero‐sum differential (MZSD) game theory to address multi‐player tracking systems, offering a better understanding of the coexistence of cooperation and competition among players. Within this framework, we present an optimal safety tracking control (OSTC) method, which incorporates a control barrier function (CBF) into the value function to ensure that the tracking error remains within a specified range, thus guaranteeing safety while achieving optimization. Simultaneously, to eliminate the need for system dynamics, we propose a novel approach leveraging off‐policy integral reinforcement learning (IRL) technology to obtain the Nash equilibrium solution of the MZSD games. We establish a unique critics–actors neural network (NN) structure that updates concurrently. Furthermore, we analyze stability and convergence using the Lyapunov method. We conduct two simulations to demonstrate the effectiveness of the proposed algorithm.
Building similarity graph...
Analyzing shared references across papers
Loading...
Xiaohong Cui
Taiyuan University of Technology
Wenjie Chen
Central China Normal University
Binrui Wang
Beijing University of Technology
Asian Journal of Control
China Jiliang University
Building similarity graph...
Analyzing shared references across papers
Loading...
Cui et al. (Wed,) studied this question.
synapsesocial.com/papers/68e6c5cab6db643587643dc5 — DOI: https://doi.org/10.1002/asjc.3397
Synapse has enriched 5 closely related papers on similar clinical questions. Consider them for comparative context: