When operating in complex and obstacle-dense environments, micro UAV swarms often face severe cooperative positioning failures due to transient non-line-of-sight (NLOS) interference and cascaded inertial sensor drift. To address this, this work proposes a fault-tolerant positioning framework integrating multi-agent deep reinforcement learning with cooperative extended Kalman filtering (MADRL-CEKF). The system incorporates a link-level dynamic soft isolation mechanism that dynamically adjusts observation covariance to effectively sever paths of cooperative error contagion. An adaptive Markov smoothing constraint is mathematically embedded to mitigate high-frequency control jitter typical of AI-driven policies. Crucially, the framework implements a resource-aware multi-objective reward architecture tailored for micro UAVs. Evaluated through high-fidelity simulations and offline physical datasets, the proposed framework achieves a 96.01% reduction in average tracking error (RMSE) under extreme multi-node cascaded failures, completely preventing system divergence. Furthermore, through autonomous multi-objective trade-offs, the system reduces processing delay by 44% (to 25.1 ms) and computational energy consumption by 41% with only a marginal accuracy compromise of 0.16 m, strictly keeping the execution time within the 50 ms real-time threshold. The MADRL-CEKF framework effectively bridges the gap between sophisticated AI decision-making and strict engineering constraints, providing a highly robust and resource-efficient navigation paradigm for swarm robotics.
Yang et al. (Fri,) studied this question.