What question did this study set out to answer?

March 15, 2026Open Access

SCPM: monocular 3D object detection with spatiotemporal consistent pseudo-labels module

Key Points

The research aims to improve the performance of monocular 3D object detection models using a new framework.
Developed Spatiotemporally Consistent Pseudo-labels Module (SCPM) for reliable label generation.
Utilized spatiotemporal priors for data augmentation to combat dataset imbalance.
Introduced depth decoupling module using geometric priors for better depth estimation under occlusion.
Achieved state-of-the-art performance on the KITTI dataset.
Significantly enhanced detection robustness and reduced missed detections compared to previous approaches.

Abstract

Monocular 3D object detection models have become increasingly popular due to its low cost and ease of deployment. It remains challenging because of limited depth estimation and dataset imbalance. To tackle this challenge, we propose a Spatiotemporally Consistent Pseudo-labels Module (SCPM) that aims to enhance the performance of monocular 3D object detection models. Our proposed method leverages spatiotemporal priors and data augmentation to generate reliable and temporally consistent pseudo-labels, effectively mitigating survivorship bias. In addition, we introduce a depth decoupling module guided by geometric priors to improve depth estimation and spatial localization, particularly under occlusion. The pro-posed framework enhances detection robustness and reduces missed detections. Extensive experiments conducted on the KITTI dataset demonstrate that our method significantly outperforms existing monocular 3D detection approaches, achieving state-of-the-art performance.

KI fragen

Bookmark

View Full Paper

Cite This Study

Wang et al. (Thu,) studied this question.

synapsesocial.com/papers/69b64c67b42794e3e660dae5 https://doi.org/https://doi.org/10.1007/s40747-026-02271-x

KI fragen

Bookmark

View Full Paper