Los puntos clave no están disponibles para este artículo en este momento.
Simultaneous Localization and Mapping (SLAM) is one of the key technologies in robotics navigation, augmented reality, and autonomous driving. However, existing dense SLAM methods are constrained by their reliance on external depth observers and high computational costs, limiting applications in the field of human-computer interaction, particularly in AR and VR. We propose DSOSplat, a monocular SLAM framework based on 3D Gaussian Splatting to address these challenges. We generate dense depth maps with absolute scale consistency by employing a self-calibrated adaptive multiview stereo (SC-AMVS) algorithm. Additionally, the accuracy and robustness of depth estimation are significantly improved through dynamic weighted fusion, local constraints, and a scale calibration factor. Our visual odometry module leverages composite depth maps and a keyframe selection strategy to further enhance tracking and reconstruction performance. Furthermore, we propose a depth smoothing regularization (DSR) method that optimizes local gradients and global consistency, thereby improving the geometric expressiveness of Gaussian Splatting and the quality of scene reconstruction. Experimental results demonstrate that DSOSplat achieves efficient localization and high-accuracy scene reconstruction in dynamic and complex environments, offering new possibilities for the development of monocular SLAM. In addition, we perform evaluations in real-world scenarios, where the algorithm also exhibited noteworthy performance.
Zhou et al. (Thu,) studied this question.