May 1, 2024

Self-supervised monocular visual odometry based on cross-correlation

Puntos clave

Los puntos clave no están disponibles para este artículo en este momento.

Resumen

Abstract Visual odometry constitutes a critical component in enabling autonomous navigation. However, the existing methods are limited by the feature extraction and matching accuracy, and cannot show good real-time performance while combining accuracy and robustness. In this paper, we propose a novel monocular visual odometry framework based on cross-correlation. The framework starts with a parameter-sharing Siamese network to build feature extractors that can simultaneously process multiple images as inputs. Moreover, we design cross-correlation modules and define a cross-correlation matrix to describe the strength of correlation between different parts of the input feature maps, reflecting the rotational and translational transformations of the input images. Furthermore, a novel loss function is introduced to impose constraints on the network. Additionally, a fully convolutional network is designed for pose estimation, computing poses alterations from the structure of the cross-correlation matrix. Channel attention and spatial attention mechanisms are introduced to improve the performance. More importantly, our method innovatively uses time intervals as labels, enables self-supervised training, and relies only on a monocular camera. Experimental results on the KITTI visual odometry dataset and the Oxford Robotcar Dataset show that our method produces competitive performance, demonstrating the superiority of the proposed method.

Me gusta

Guardar