What question did this study set out to answer?

The aim is to improve underwater image quality for better visual perception and operation of underwater robots.

April 12, 2026

Multi‐Scale Underwater Image Enhancement Network Based on VM‐Unet Improvement

Key Points

Developed a multi-scale image enhancement network based on an improved VM-Unet.
Implemented an asymmetric encoder-decoder architecture with Visual Mamba layers.
Incorporated an attention mechanism for channel-level multi-scale feature fusion.
Conducted experiments on the UIEB dataset to evaluate performance.
The proposed method outperformed traditional approaches and mainstream deep learning models in both subjective visual quality and objective metrics (PSNR, SSIM, UCIQE, and UIQM).
Demonstrated significant improvements in color restoration, detail preservation, and noise suppression.
Processed a single-frame image in 0.2 seconds (about 5 frames per second), which the authors consider sufficient for real-time enhancement on underwater robots.
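
PSNR is one of the objective metrics the study reports. As a rough illustration of what that number measures, the sketch below computes PSNR between a reference image and an enhanced image. The function name `psnr` and the default peak value of 255 (8-bit images) are assumptions made for this example; the study's own evaluation code is not described on this page.

```python
import numpy as np


def psnr(reference, enhanced, max_value=255.0):
    """Peak signal-to-noise ratio in dB between two same-shaped images.

    max_value is the largest possible pixel value; 255 assumes 8-bit images.
    """
    ref = np.asarray(reference, dtype=np.float64)
    out = np.asarray(enhanced, dtype=np.float64)
    if ref.shape != out.shape:
        raise ValueError("images must have the same shape")
    mse = np.mean((ref - out) ** 2)  # mean squared error over all pixels
    if mse == 0:
        return float("inf")  # identical images
    return 10.0 * np.log10(max_value ** 2 / mse)
```

A higher PSNR means the enhanced image is numerically closer to the reference. It needs a reference image, unlike UCIQE and UIQM, which are no-reference underwater quality measures.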

Abstract

Underwater images suffer from color distortion, low contrast, and blurred details due to selective light absorption, scattering, and suspended particles in water, severely limiting the visual perception and autonomous operation capabilities of underwater robots. To address these issues, this paper proposes a multi‐scale underwater image enhancement network based on an improved VM‐Unet. Centered on the Visual Mamba model, this network employs an asymmetric encoder‐decoder architecture and incorporates parallel Visual Mamba layers to enhance long‐range dependency modeling. Additionally, it integrates an attention mechanism to construct a channel‐level multi‐scale feature fusion module, enabling dynamic integration of features across different scales and improving the model's adaptability and robustness in complex underwater environments. Experiments on the UIEB dataset demonstrate that the proposed method outperforms traditional approaches and mainstream deep learning models in both subjective visual quality and objective evaluation metrics (including PSNR, SSIM, UCIQE, and UIQM), particularly excelling in color restoration, detail preservation, and noise suppression. Furthermore, the method processes single‐frame images in just 0.2 s, offering excellent real‐time performance that meets the demands of real‐time visual enhancement for underwater robots. © 2026 Institute of Electrical Engineers of Japan. Published by Wiley Periodicals LLC.
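
To make the idea of channel-level multi-scale fusion more concrete, here is a minimal NumPy sketch, not the authors' module. It assumes the feature maps from different scales have already been resized to a common shape, and it uses globally averaged channel responses directly as attention logits, whereas the paper's module is presumably learned. The names `fuse_scales` and `softmax` are hypothetical.

```python
import numpy as np


def softmax(x, axis):
    """Numerically stable softmax along the given axis."""
    shifted = x - x.max(axis=axis, keepdims=True)
    e = np.exp(shifted)
    return e / e.sum(axis=axis, keepdims=True)


def fuse_scales(features):
    """Fuse same-shaped (C, H, W) feature maps with per-channel scale weights.

    Assumption: pooled channel descriptors stand in for learned attention logits.
    """
    stacked = np.stack(features, axis=0)       # (S, C, H, W)
    descriptors = stacked.mean(axis=(2, 3))    # (S, C): global average pooling
    weights = softmax(descriptors, axis=0)     # per channel, sums to 1 over scales
    fused = (weights[:, :, None, None] * stacked).sum(axis=0)  # (C, H, W)
    return fused, weights
```

Because the weights are computed from the input features, each channel can draw more heavily on whichever scale responds most strongly, which is the sense in which the fusion is dynamic.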

Cite This Study

Huang et al. studied this question.

synapsesocial.com/papers/69db37f94fe01fead37c6201 https://doi.org/10.1002/tee.70261
