What question did this study set out to answer?

The aim is to develop an online control algorithm for UAVs that addresses uncertainties and faults in actuation.

February 17, 2026Open Access

Soft Real-Time Asynchronous Online Learning from Input–Output Data for UAV Model Reference Control Under Uncertain Dynamics and Faulty Actuation

Key Points

The aim is to develop an online control algorithm for UAVs that addresses uncertainties and faults in actuation.
Proposed an online off-policy asynchronous real-time model reference tracking control (OOART-MRTC) algorithm.
Utilized approximate dynamic programming and reinforcement learning frameworks.
Constructed a virtual state-space representation based on input-output system data.
Analyzed learning convergence and stability under adaptive behavior.
Implemented an asynchronous mechanism for real-time controller parameter updates.
Demonstrated effective learning and adaptation of control parameters under uncertain dynamics.
Showed improved stability of UAV operations despite actuation faults.
Validations conducted on a realistic case study involving coupled double integrators for UAV attitude control.

Abstract

An online off-policy asynchronous real-time model reference tracking control (OOART-MRTC) algorithm is proposed and validated for unmanned aerial vehicles (UAVs) characterized by faulty actuation and parametric uncertainty. The optimal control problem is posed based on approximate dynamic programming (ADP) and reinforcement learning (RL) theory, using a virtual state-space representation constructed exclusively on input–output true system data, which exploits the observability theory. OOART-MRTC learns control by interacting with the system, starting from an initial stabilizing controller derived from an approximate uncertain model. Learning convergence and stability under the proposed adaptive behavior are analyzed. Since the learning iterations cannot update within a sampling period, an asynchronous mechanism is proposed for updating the controller parameters, leveraging real-time control and multi-tasking. The complexity associated with the resulting high-dimensional system is solved by efficient linear parameterization and validated on a realistic case study where three coupled double integrators describe the UAV attitude control.

Soft Real-Time Asynchronous Online Learning from Input–Output Data for UAV Model Reference Control Under Uncertain Dynamics and Faulty Actuation

Key Points

Abstract

Cite This Study