What question did this study set out to answer?

This research aims to develop a method for improving the accuracy and stability of self-supervised monocular depth estimation.

May 13, 2026

Efficient Self-supervised Monocular Depth Estimation via Knowledge Distillation and Light-weighted Attention

Key Points

This research aims to develop a method for improving the accuracy and stability of self-supervised monocular depth estimation.
Utilized knowledge distillation from a foundation model to enhance training stability.
Introduced a lightweight attention module to strengthen global spatial representation.
Reduced model parameters by 40% and FLOPs by 20%.
Achieved a better absolute relative error (abs_rel) than the baseline.
Demonstrated enhanced prediction accuracy with reduced computational costs through the proposed framework.
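
For readers unfamiliar with the metric, abs_rel is conventionally the mean of |predicted depth - true depth| / true depth over pixels that have valid ground truth. The sketch below is a minimal pure-Python illustration of that standard definition, not the study's evaluation code. The function name and the rule that treats non-positive ground-truth values as invalid are assumptions made for this example.

```python
from typing import Sequence


def abs_rel(pred: Sequence[float], gt: Sequence[float]) -> float:
    """Mean of |pred - gt| / gt over pixels with valid ground truth.

    Assumption for this sketch: a ground-truth depth <= 0 marks a pixel
    without a measurement, so it is skipped.
    """
    valid = [(p, g) for p, g in zip(pred, gt) if g > 0]
    if not valid:
        raise ValueError("no valid ground-truth depths")
    return sum(abs(p - g) / g for p, g in valid) / len(valid)


# Two pixels: one exact prediction, one that is 1 m short of a 5 m target.
error = abs_rel([2.0, 4.0], [2.0, 5.0])
```

Lower values are better, so an improved abs_rel means a smaller number than the baseline's.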

Abstract

Depth estimation is a key perception task for robots and autonomous systems, and self-supervised monocular approaches are gaining traction because they do not depend on ground-truth labels. However, these methods often train unstably because they rely on photometric consistency. This paper proposes an efficient self-supervised depth estimation framework that improves prediction accuracy while reducing computational cost. Training stability is enhanced through knowledge distillation using pseudo-labels from a foundation model, and a lightweight attention module is introduced to strengthen global spatial representation. Despite reducing model parameters by 40% and FLOPs by 20%, experiments on the KITTI Eigen split show improved abs_rel performance compared to the baseline.
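
One way to picture the distillation step is as an extra loss term that pulls the student's depth toward the foundation model's pseudo-labels, added to the usual photometric loss. The abstract does not give the actual loss, so everything in the sketch below is an assumption chosen for illustration: the function names, the median scaling that aligns the two depth maps, the log-space L1 distance, and the weight of 0.1.

```python
import math
from statistics import median
from typing import Sequence


def pseudo_label_loss(student: Sequence[float], teacher: Sequence[float]) -> float:
    """Log-space L1 distance between student depth and teacher pseudo-labels.

    Assumption: the teacher's depth is only known up to scale, so it is
    rescaled to match the student's median before the comparison.
    All depths are expected to be strictly positive.
    """
    scale = median(student) / median(teacher)
    diffs = [abs(math.log(s) - math.log(scale * t)) for s, t in zip(student, teacher)]
    return sum(diffs) / len(diffs)


def total_loss(photometric: float, student: Sequence[float],
               teacher: Sequence[float], weight: float = 0.1) -> float:
    """Photometric loss plus a weighted distillation term (hypothetical weighting)."""
    return photometric + weight * pseudo_label_loss(student, teacher)


# The teacher agrees with the student up to a factor of two, so the
# distillation term vanishes and only the photometric loss remains.
loss = total_loss(0.3, student=[1.0, 2.0, 4.0], teacher=[2.0, 4.0, 8.0])
```

The intent of such a term is that the pseudo-labels give a training signal even where photometric consistency is unreliable, which is the source of instability the abstract describes.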
