What question did this study set out to answer?

The aim is to develop a weakly supervised RGB-D camouflaged object detection method that minimizes annotation costs.

February 5, 2026

SAM-guided Depth-aware Weakly Supervised Camouflaged Object Detection with Spatial-Frequency Exploration

Key Points

Implemented a Multimodal SAM-based Label Optimization strategy.
Used scribble annotations for initial labeling.
Introduced a Spatial Frequency Exploration Module to enhance feature extraction.
Developed a Multi-Modal Cross-layer Fusion Module for better multi-scale context integration.
The proposed method outperforms most fully supervised RGB/RGB-D COD methods.
It also surpasses state-of-the-art weakly supervised RGB COD methods.
Showed that scribble supervision reduces annotation costs while keeping detection performance competitive with fully supervised methods.
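
As a rough illustration of how scribble annotations can drive a promptable segmenter such as SAM, the sketch below samples point prompts from a scribble map and then forces a candidate mask to agree with the scribbles. This is not the paper's MSLO strategy, whose details are not given here. The label encoding (FG, BG, UNK) and both function names are hypothetical, chosen only for this example.

```python
import numpy as np

# Hypothetical scribble encoding (an assumption, not taken from the paper).
FG, BG, UNK = 1, 2, 0


def scribble_point_prompts(scribble, max_points=5, seed=0):
    """Sample (row, col) prompts with labels 1 (object) / 0 (background)."""
    rng = np.random.default_rng(seed)
    points, labels = [], []
    for value, label in ((FG, 1), (BG, 0)):
        coords = np.argwhere(scribble == value)
        if len(coords) == 0:
            continue
        take = min(max_points, len(coords))
        idx = rng.choice(len(coords), size=take, replace=False)
        points.append(coords[idx])
        labels.append(np.full(take, label))
    if not points:
        return np.empty((0, 2), dtype=int), np.empty((0,), dtype=int)
    return np.concatenate(points), np.concatenate(labels)


def enforce_scribbles(mask, scribble):
    """Overwrite a candidate binary mask wherever the scribble is known."""
    refined = mask.copy()
    refined[scribble == FG] = 1
    refined[scribble == BG] = 0
    return refined
```

The sampled points are in (row, col) order; a segmenter that expects (x, y) coordinates would need the two columns swapped before use.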

Abstract

Current RGB-D Camouflaged Object Detection (COD) methods rely primarily on dense pixel-level annotations, which are expensive to produce. In this paper, we investigate weakly supervised RGB-D COD using scribble annotations to reduce annotation costs. First, we design a Multimodal SAM-based Label Optimization (MSLO) strategy. Through dual pixel-level and image-level optimization, this strategy refines the initial results generated by the Segment Anything Model (SAM), thereby producing high-quality pseudo-labels. Second, we propose a Spatial Frequency Exploration Module (SFEM), which enhances feature representation by mining important features from both the spatial and frequency domains. Furthermore, we construct a Multi-Modal Cross-layer Fusion Module (MCFM), which fuses multi-modal features and captures multi-scale contextual information. Extensive experiments demonstrate that our method outperforms most fully supervised RGB/RGB-D COD methods and surpasses state-of-the-art weakly supervised RGB COD methods.
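
The abstract does not say how SFEM works internally. To make the idea of looking at a feature map in the frequency domain concrete, here is a minimal NumPy sketch that splits a 2D map into low-frequency and high-frequency parts with a circular FFT mask. The function name and the radius_ratio parameter are assumptions made for this example; they are not the authors' module.

```python
import numpy as np


def split_frequency(feature, radius_ratio=0.25):
    """Split a 2D map into low- and high-frequency parts via an FFT mask."""
    h, w = feature.shape
    # Move the zero-frequency component to the center of the spectrum.
    spectrum = np.fft.fftshift(np.fft.fft2(feature))
    rows = np.arange(h)[:, None] - h // 2
    cols = np.arange(w)[None, :] - w // 2
    radius = radius_ratio * min(h, w)
    # Keep only frequencies inside a circle around the center.
    low_mask = (rows ** 2 + cols ** 2) <= radius ** 2
    low = np.fft.ifft2(np.fft.ifftshift(spectrum * low_mask)).real
    # The remainder holds edges and fine texture.
    high = feature - low
    return low, high
```

By construction the two parts sum back to the input, so a downstream module could weight them separately. The low part carries smooth structure, while the high part carries edges and fine texture, which is often where camouflaged boundaries are subtle.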
