What type of study is this?

This is a Quantitative Study study.

October 19, 2025Open Access

MsGf: A Lightweight Self-Supervised Monocular Depth Estimation Framework with Multi-Scale Feature Extraction

Puntos clave

The MsGf framework suppresses artifacts in monocular depth estimation, achieving better results with fewer parameters.
Using only 0.8 M parameters, MsGf outperforms advanced methods on KITTI and Make3D datasets.
The framework employs a CDMs module for efficient multi-scale feature extraction and an SEGF module for edge feature capture.
Ablation studies confirm that combining different modules leads to significant improvements in performance.

Resumen

Monocular depth estimation is an essential component in computer vision that enables 3D scene understanding, with critical applications in autonomous driving and augmented reality. This paper proposes a lightweight self-supervised framework from single RGB images for multi-scale feature extraction and artifact elimination in monocular depth estimation (MsGf). The proposed framework first designs a Cross-Dimensional Multi-scale Feature Extraction (CDMs) module. The CDMs module combines parallel multi-scale convolution with sequential feature convolutions to achieve multi-scale feature extraction with minimal parameters. Additionally, a Sobel Edge Perception-Guided Filtering (SEGF) module is proposed. The SEGF module uses the Sobel operator to decompose the features into horizontal direction features and vertical direction features, and then generates the filter kernel through two steps of filtering to effectively suppress artifacts and better capture structural and edge features. A large number of ablation experiments and comparative experiments on the KITTI and Make3D datasets demonstrate that the MsGf with only 0.8 M parameters can achieve better performance than the current most advanced methods.

Leer artículo completoexternamente

Me gusta

Guardar

Ver artículo completo