Abstract In indoor dynamic scenes, traditional visual SLAM algorithms often suffer from significant degradation in localization accuracy due to interference from moving objects. This study proposes a deep learning-based visual SLAM method. The method is based on the ORBSLAM3 framework and combines target detection with a dual dynamic feature removal mechanism based on extreme point geometry constraints to improve localization accuracy. First, the original convolutional layers in the YOLOv8m network are replaced with GhostConv modules, and the CBAM attention mechanism is added to the backbone network for dynamic object detection. Then, the extracted point features and line features are combined to preliminarily remove dynamic features, followed by further filtering and removal of residual dynamic features using polar geometry constraints. Finally, the P3P algorithm is used to perform pose-assisted localization based on the filtered static features. In validation on the TUMRGBD dataset, the improved algorithm achieves an average localization accuracy improvement of 53. 10% compared to ORBSLAM3, with a maximum of 93. 56%. It demonstrates good accuracy and robustness in physical validation, providing new research insights for dynamic scene SLAM systems.
Building similarity graph...
Analyzing shared references across papers
Loading...
J. Qin
Yousong Wang
Cong Xiong
Liaoning University of Technology
Building similarity graph...
Analyzing shared references across papers
Loading...
Qin et al. (Mon,) studied this question.
www.synapsesocial.com/papers/689a02b6e6551bb0af8cc531 — DOI: https://doi.org/10.21203/rs.3.rs-6945178/v1