What question did this study set out to answer?

This research aims to improve the accuracy and efficiency of medical image segmentation using EFSVMNet.

January 21, 2026

EFSVMNet: Enhanced Feature-Selective Vision Mamba Network for Medical Image Segmentation

Key Points

This research aims to improve the accuracy and efficiency of medical image segmentation using EFSVMNet.
Developed EFSVMNet and EFSVMNet-Lite for segmentation tasks.
Integrated a Spatial Feature-Selective block to filter irrelevant features.
Implemented a Gradient Reversal Layer for adversarial unlearning.
Utilized a Dilated Cross-Fusion Spatial Attention module for enhanced contextual processing.
Applied Masked Adaptive Singular Value Decomposition loss for feature regularization.
Achieved improvements up to +4.0% in mean Intersection over Union (mIoU) and +2.5% in Dice coefficient.
Demonstrated superior robustness against Gaussian and Poisson noise in EFSVMNet-Lite.
Consistent performance gains observed across seven benchmark datasets.

Abstract

Accurate medical image segmentation is essential for reliable diagnosis, treatment planning, and disease monitoring. Existing convolutional and Transformer-based models, such as U-Net and its variants, often extract redundant spatial features and learn correlations from imaging artifacts that reduce generalization and robustness in clinical environments. Vision State-Space Models (VSSMs), including VM-UNet, improve computational efficiency and long-range dependency modeling but lack explicit mechanisms to suppress irrelevant activations and maintain compact representations. To address these issues, we present EFSVMNet and EFSVMNet-Lite, an Enhanced Feature-Selective Vision Mamba Network that introduces adaptive feature suppression within a state-space framework for robust and efficient medical image segmentation. EFSVMNet integrates four complementary components: (i) a Spatial Feature-Selective (SFS) block that filters task-irrelevant activations, (ii) a Gradient Reversal Layer (GRL) that promotes adversarial feature unlearning, (iii) a Dilated Cross-Fusion Spatial Attention (DCFSA) module that enhances multi-scale contextual fusion, and (iv) a Masked Adaptive Singular Value Decomposition (SVD) loss that enforces low-rank feature regularization. Experiments on seven benchmark datasets show consistent performance gains over VM-UNet and related baselines, with up to +4.0% mIoU and +2.5% Dice improvements. Analysis shows EFSVMNet-Lite demonstrates superior robustness under Gaussian and Poisson noise. These results demonstrate that incorporating explicit feature suppression into a state-space formulation substantially enhances segmentation reliability and computational efficiency.

KI fragen

Bookmark