What question did this study set out to answer?

This research aims to enhance the cooperative positioning of micro UAV swarms operating in challenging environments.

June 14, 2026Open Access

Fault-Tolerant Cooperative Positioning for UAV Swarms in Degraded Environments: A Multi-Objective Deep Reinforcement Learning Approach

Key Points

This research aims to enhance the cooperative positioning of micro UAV swarms operating in challenging environments.
Developed a fault-tolerant positioning framework integrating multi-agent deep reinforcement learning with cooperative extended Kalman filtering.
Implemented a link-level dynamic soft isolation mechanism and adaptive Markov smoothing constraint.
Evaluated the framework through high-fidelity simulations and offline physical datasets.
Achieved a 96.01% reduction in average tracking error (RMSE) under extreme multi-node cascaded failures.
Reduced processing delay by 44% (to 25.1 ms) while keeping execution time within 50 ms.
Decreased computational energy consumption by 41% with a marginal accuracy compromise of 0.16 m.

Abstract

When operating in complex and obstacle-dense environments, micro UAV swarms often face severe cooperative positioning failures due to transient non-line-of-sight (NLOS) interference and cascaded inertial sensor drift. To address this, this work proposes a fault-tolerant positioning framework integrating multi-agent deep reinforcement learning with cooperative extended Kalman filtering (MADRL-CEKF). The system incorporates a link-level dynamic soft isolation mechanism that dynamically adjusts observation covariance to effectively sever paths of cooperative error contagion. An adaptive Markov smoothing constraint is mathematically embedded to mitigate high-frequency control jitter typical of AI-driven policies. Crucially, the framework implements a resource-aware multi-objective reward architecture tailored for micro UAVs. Evaluated through high-fidelity simulations and offline physical datasets, the proposed framework achieves a 96.01% reduction in average tracking error (RMSE) under extreme multi-node cascaded failures, completely preventing system divergence. Furthermore, through autonomous multi-objective trade-offs, the system reduces processing delay by 44% (to 25.1 ms) and computational energy consumption by 41% with only a marginal accuracy compromise of 0.16 m, strictly keeping the execution time within the 50 ms real-time threshold. The MADRL-CEKF framework effectively bridges the gap between sophisticated AI decision-making and strict engineering constraints, providing a highly robust and resource-efficient navigation paradigm for swarm robotics.

Read Full Paperexternally

Ask AI

Helpful

Bookmark

View Full Paper