What question did this study set out to answer?

The study aims to develop an effective self-supervised framework for despeckling sonar images.

May 6, 2026 · Open Access

SAME: a self-supervised sonar image despeckling framework via multi-scale mixture-of-experts and semantic guidance

Key Points

Proposes the SAME framework with two complementary modules: the Multi-scale Mixture-of-Experts Gated (MOEG) module and the Contextual Semantic Enhancement Module (CSEM).
Uses semantic guidance from a frozen self-supervised DINO backbone to restore structure lost to blind-spot masking, without clean reference images.
Employs dynamic expert routing with heterogeneous receptive fields to decouple spatially correlated speckle noise.
Reports superior speckle suppression on the DEBRIS and KLSG datasets compared with existing methods.
Improves structural fidelity in despeckled single-channel sonar images, where inter-channel redundancy is absent.
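
As a rough illustration of what gating over experts with different receptive fields can look like, here is a minimal sketch in plain Python. It is not the MOEG module from the paper: the experts are simple box (mean) filters standing in for learned convolutions, the gate is a hand-set softmax rather than a trained router, and the names `box_filter`, `moe_mix`, `radii`, and `gate_weights` are hypothetical.

```python
import math


def box_filter(image, radius):
    """Mean over a (2 * radius + 1)^2 window, clipped at the image borders."""
    height, width = len(image), len(image[0])
    out = [[0.0] * width for _ in range(height)]
    for r in range(height):
        for c in range(width):
            rows = range(max(r - radius, 0), min(r + radius + 1, height))
            cols = range(max(c - radius, 0), min(c + radius + 1, width))
            values = [image[i][j] for i in rows for j in cols]
            out[r][c] = sum(values) / len(values)
    return out


def softmax(logits):
    """Numerically stable softmax over a short list of logits."""
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def moe_mix(image, radii=(1, 2, 3), gate_weights=(2.0, 0.0, -2.0)):
    """Blend experts with different receptive fields using per-pixel soft gates.

    Assumption: the gating feature is the local contrast (distance of a pixel
    from its smallest-window mean). High contrast favours the small window,
    low contrast spreads weight evenly. A trained router would learn this.
    """
    experts = [box_filter(image, radius) for radius in radii]
    height, width = len(image), len(image[0])
    output = [[0.0] * width for _ in range(height)]
    gates = [[None] * width for _ in range(height)]
    for r in range(height):
        for c in range(width):
            contrast = abs(image[r][c] - experts[0][r][c])
            weights = softmax([gw * contrast for gw in gate_weights])
            gates[r][c] = weights
            output[r][c] = sum(wt * expert[r][c]
                               for wt, expert in zip(weights, experts))
    return output, gates
```

Calling `moe_mix(image)` on a small list-of-lists image returns the blended image together with the per-pixel gate weights, which makes it easy to inspect which receptive field dominates near edges versus flat regions. The sketch uses dense soft gating for simplicity; sparse top-k routing is a common alternative.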

Abstract

Sonar imaging is essential for underwater perception, yet its quality is often degraded by strong multiplicative speckle noise. Conventional supervised despeckling methods rely on clean reference images, which are typically unavailable in practical sonar scenarios. Although self-supervised blind-spot networks (BSNs) remove the need for paired data, their performance on sonar imagery remains limited, mainly due to two factors. First, the strong spatial correlation of speckle noise leads to implicit noise leakage. Second, blind-spot masking removes the center pixel, resulting in irreversible loss of local structural details, especially in single-channel sonar images where inter-channel redundancy is absent. To address these issues, we propose SAME, a self-supervised, semantic-guided sonar image despeckling framework with two complementary modules. The Multi-scale Mixture-of-Experts Gated (MOEG) module employs dynamic expert routing with heterogeneous receptive fields to decouple spatially correlated noise. The Contextual Semantic Enhancement Module (CSEM) introduces structural priors from a frozen self-supervised DINO backbone to compensate for the structural degradation caused by blind-spot masking. Extensive experiments on the DEBRIS and KLSG datasets show that SAME achieves superior speckle suppression and improved structural fidelity compared with existing methods, demonstrating its effectiveness without requiring clean ground truth.
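
To make the two ingredients named in the abstract concrete, the following is a minimal sketch of multiplicative speckle and of one common masking-based blind-spot training setup (in the style of Noise2Void). It is not the SAME pipeline. The unit-mean gamma noise model, the `looks` parameter, and the function names `add_speckle`, `blind_spot_mask`, and `masked_mse` are assumptions made for illustration.

```python
import random


def add_speckle(image, looks=4, rng=None):
    """Multiply each pixel by unit-mean gamma noise (an assumed speckle model).

    observed = clean * noise, where noise ~ Gamma(shape=looks, scale=1/looks).
    """
    if rng is None:
        rng = random.Random(0)
    # gammavariate(alpha, beta) has mean alpha * beta, so the mean here is 1.
    return [[pixel * rng.gammavariate(looks, 1.0 / looks) for pixel in row]
            for row in image]


def blind_spot_mask(image, fraction=0.1, rng=None):
    """Replace a random subset of pixels with the value of a random neighbour.

    Returns the masked copy and the list of masked (row, col) positions.
    The network input hides the true value at those positions, which is why
    fine detail at the center pixel cannot be recovered from it directly.
    """
    if rng is None:
        rng = random.Random(0)
    height, width = len(image), len(image[0])
    masked = [row[:] for row in image]
    positions = []
    for r in range(height):
        for c in range(width):
            if rng.random() < fraction:
                # In-bounds 8-neighbours, excluding the pixel itself.
                neighbours = [(i, j)
                              for i in range(max(r - 1, 0), min(r + 2, height))
                              for j in range(max(c - 1, 0), min(c + 2, width))
                              if (i, j) != (r, c)]
                nr, nc = rng.choice(neighbours)
                masked[r][c] = image[nr][nc]
                positions.append((r, c))
    return masked, positions


def masked_mse(prediction, target, positions):
    """Mean squared error evaluated only at the masked positions."""
    if not positions:
        return 0.0
    total = sum((prediction[r][c] - target[r][c]) ** 2 for r, c in positions)
    return total / len(positions)
```

In a training loop, a denoiser would receive the masked image and be scored with `masked_mse` against the original noisy image at the masked positions only. When the noise is spatially correlated, as the abstract describes for sonar speckle, neighbouring pixels carry information about the hidden pixel's noise, which is one way to picture the leakage problem.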

Cite This Study

Guo et al. studied this question.

synapsesocial.com/papers/69fa980604f884e66b531da2 https://doi.org/10.1007/s44443-026-00789-1