What does this research mean for the field?

Reinforcement learning significantly improves the performance of the extended Kalman filter in UAV attitude estimation compared to traditional methods. Novelty: ClaimNovelty.NOVEL_FINDING. Consensus alignment: ConsensusAlignment.NEUTRAL.

What question did this study set out to answer?

The study aims to improve UAV attitude estimation accuracy using reinforcement learning techniques.

February 24, 2026

Adaptation of Noise Covariance in Extended Kalman Filter Using Reinforcement Learning for Improved UAV Attitude Estimation

Key Points

The study aims to improve UAV attitude estimation accuracy using reinforcement learning techniques.
Utilized reinforcement learning to optimize the measurement noise covariance matrix in the EKF.
Employed a Q-learning model to autonomously adjust the covariance adjustments.
Established a reward mechanism to minimize prediction errors based on true measurements.
RL-EKF significantly outperforms traditional EKF in attitude estimation accuracy.
Marked improvements in performance metrics were observed with the RL-EKF approach.

Abstract

Accurate attitude determination of unmanned aerial vehicles (UAVs) is crucial for autonomous navigation, particularly when relying solely on gyroscope, accelerometer, and magnetometer measurements without utilizing the Global Positioning System (GPS). Reinforcement learning (RL) has emerged as a promising artificial intelligence technique applicable across various domains. This research introduces a novel approach that leverages RL to enhance the performance of the extended Kalman filter (EKF) in attitude estimation. The proposed method depends of RL which uses the Q-learning model and policy to find best solution to adjust autonomously the measurement noise covariance matrix within the EKF. By establishing a reward mechanism that incentivizes actions minimizing the prediction error relative to true measurements, the RL dynamically optimizes the measurement noise covariance matrix. This innovative integration of RL and EKF, referred to as RL-EKF, has been implemented and tested. Results demonstrate that RL-EKF significantly outperforms the traditional EKF, yielding marked improvements in attitude estimation accuracy. The improvement ratios showed that selected method is very effective in the field of attitude estimation.

Bookmark

Cite This Study

Assad et al. (Mon,) studied this question.

synapsesocial.com/papers/699d3f9ede8e28729cf643cd https://doi.org/https://doi.org/10.1134/s2075108725700269

Bookmark