What question did this study set out to answer?

This research aims to develop a control method for multi-agent systems that adhere to orbital dynamics constraints during pursuit.

May 9, 2026

An AMA-DDQN design for orbital dynamics-constrained pursuit

Key Points

This research aims to develop a control method for multi-agent systems that adhere to orbital dynamics constraints during pursuit.
Developed an adaptive multi-agent dueling deep Q-network (AMA-DDQN) for pursuit control.
Used AMA to adjust exploration policies and generate orbital viability rewards.
Compared AMA-DDQN performance to Deep Deterministic Policy Gradient (DDPG) in terms of convergence speed and control stability.
AMA-DDQN showed superior convergence speed compared to DDPG.
Significantly improved compliance with orbital constraints was observed.
Coordinated control stability of AMA-DDQN surpassed that of DDPG.

Abstract

This paper addresses the pursuit control problem of multi-agent systems with orbital dynamics constraints in high-dimensional spaces, where the targets are governed by rule-compliant Keplerian motion. A so-called adaptive multi-agent dueling deep Q-network (AMA-DDQN) is designed, where AMA is used to dynamically adjust exploration policies and generate orbital viability rewards, and DDQN is employed to decouple state-value and action-advantage estimation for dimensionality reduction. By compared to Deep Deterministic Policy Gradient (DDPG), the proposed AMA-DDQN is much superior in convergence speed, orbital constraint compliance, and coordinated control stability.

An AMA-DDQN design for orbital dynamics-constrained pursuit

Key Points

Abstract

Cite This Study