What does this research mean for the field?

The proposed two-stage pose estimation algorithm improves translation and rotation accuracy for Autonomous Underwater Vehicles (AUVs) by 93.2% and 28.6% respectively compared to traditional iterative PnP methods. Novelty: ClaimNovelty.NOVEL_FINDING. Consensus alignment: ConsensusAlignment.NEUTRAL.

What question did this study set out to answer?

This research aims to improve pose estimation for AUVs using a two-stage algorithm combining PnP and binocular constraints.

February 25, 2026Open Access

Two-Stage Pose Estimation for AUV Visual Guidance Using PnP and Binocular Constraints

Key Points

This research aims to improve pose estimation for AUVs using a two-stage algorithm combining PnP and binocular constraints.
Developed a two-stage algorithm for pose estimation.
Utilized iterative PnP for reliable initial estimates.
Applied binocular constraint optimization for refinement.
Conducted simulation and experimental validation underwater and on land.
Achieved 93.2% improvement in translation accuracy compared to traditional PnP methods.
Achieved 28.6% improvement in rotation accuracy compared to traditional PnP methods.
Confirmed 32.7% reduction in average rotation error in land-based validation.
Demonstrated 76.5% reduction in average distance error in underwater experiments under real conditions.

Abstract

Accurate pose estimation is crucial for reliable docking and recovery of Autonomous Underwater Vehicles (AUVs). Traditional visual-based pose estimation methods face inherent challenges: monocular methods often struggle with depth inference, and conventional Perspective-n-Point (PnP) algorithms exhibit accuracy degradation at large viewing angles and limited noise resistance, while binocular systems involve higher computational complexity. This paper proposes a two-stage algorithm that combines iterative PnP initialization with binocular constraint optimization. By using iterative PnP to establish reliable initial estimates, the approach avoids convergence difficulties of direct binocular optimization, while the subsequent binocular refinement leverages stereo geometric constraints to enhance accuracy. Comprehensive evaluation through simulation, land-based experiments, and underwater validation demonstrates consistent performance improvements over conventional geometric methods. In simulation experiments across −60° to 60° yaw angles, the method achieves 93.2% and 28.6% improvements in translation and rotation accuracy respectively compared to iterative PnP. Land-based validation confirms 32.7% average rotation error reduction, while underwater experiments demonstrate 76.5% average distance error reduction under real optical conditions including refraction and light attenuation. The method maintains real-time processing capability (2.16 ms per frame), offering a practical solution for AUV pose estimation in docking applications.

Read Full Paperexternally

Bookmark

View Full Paper

Cite This Study

Wang et al. (Mon,) studied this question.

synapsesocial.com/papers/699e91fdf5123be5ed04fe09 https://doi.org/https://doi.org/10.3390/jmse14040405

Bookmark

View Full Paper