What question did this study set out to answer?

The aim is to develop a lightweight network for more accurate camera pose estimation in complex environments.

April 22, 2026

LMNNet : Camera Pose Regression Model Based on Lightweight Network Architecture

Key Points

The aim is to develop a lightweight network for more accurate camera pose estimation in complex environments.
Utilized lightweight DGhost convolutions to reduce network parameters and memory usage.
Integrated multi-head dynamic sparse attention mechanism for better feature extraction.
Implemented a non-local feature fusion module to enhance feature interaction.
LMNNet achieves higher accuracy in camera pose estimation in complex scenes compared to traditional methods.
Significantly reduced network parameters and memory consumption were reported in the evaluations.

Abstract

ABSTRACT Camera pose estimation is a significant application within the field of computer vision. However, many deep learning‐based camera pose estimation models feature substantial parameter scales and typically require significant computational resources during training. Many models experience a decline in accuracy when operating in complex environments such as those featuring dense textures or motion blur. To address this issue, this study proposes LMNNet, a lightweight camera pose regression network for complex scenes. To address the issues of substantial network parameters and high computational memory consumption, the model employs lightweight DGhost convolutions within its backbone network as a replacement for traditional standard convolutions, significantly reducing both the number of network parameters and memory usage. In addition, to enhance the robustness of the model in complex scene, the multi‐head dynamic sparse attention mechanism (MHDSA) is integrated into the encoder part. This mechanism improves the network's ability to focus on key areas during feature extraction by dynamically allocating feature weights. To capture more global and edge feature information, this study innovatively proposes a non‐local feature fusion (NLFF) module. This module significantly enhances the accuracy of camera pose estimation through feature interaction and a multi‐scale feature information fusion mechanism. Finally, LMNNet was evaluated on the 7Scenes indoor dataset and the Cambridge Landmarks outdoor dataset. Research findings indicate that LMNNet achieves more precise camera pose estimation in complex scene, whilst significantly reducing the number of parameters and computational memory requirements compared to other absolute pose regression networks.

Bookmark

Cite This Study

Wang et al. (Wed,) studied this question.

synapsesocial.com/papers/69e866416e0dea528ddea9c4 https://doi.org/https://doi.org/10.1002/cpe.70660

Also Consider

Synapse has enriched 5 closely related papers on similar clinical questions. Consider them for comparative context:

Bookmark