What question did this study set out to answer?

The aim is to develop a novel framework for improving super-resolution of remote sensing images while addressing key limitations of existing methods.

June 11, 2026Open Access

DFSMamba: A Spatial–Frequency Collaborative Modeling Framework for Remote Sensing Image Super-Resolution

Key Points

The aim is to develop a novel framework for improving super-resolution of remote sensing images while addressing key limitations of existing methods.
Developed DFSMamba framework integrating Semantic Continuous-Sparse Attention, Adaptive State-Space Module, and Discrete Fourier Transform Module.
Conducted experiments on five public datasets with a focus on multiple scaling factors (×2 to ×4).
Performed ablation studies to analyze the contributions of each component in the framework.
DFSMamba achieved a PSNR of 31.48 dB on the AID×3 task, surpassing MambaIRv2 by 1.07 dB.
The full configuration of DFSMamba improved PSNR by 0.85 dB compared to single-module setups.
Demonstrated superior performance across all tested scales when compared to mainstream CNN and Transformer methods.

Abstract

Existing single-image super-resolution methods for remote sensing images suffer from insufficient global receptive fields, weak high-frequency texture recovery, and excessive computational complexity. To address these issues, this paper proposes DFSMamba, a novel spatial–frequency collaborative modeling framework. First, Semantic Continuous-Sparse Attention enhances semantic perception through dynamic chunking and sparse connections while maintaining linear complexity, effectively alleviating the semantic truncation problem caused by fixed window partitioning. Second, the Adaptive State-Space Module employs parallel forward and backward state-space model branches to achieve bidirectional long-range dependency modeling and introduces an activation-guided feature fusion mechanism to adaptively enhance semantically relevant regions. Third, the Discrete Fourier Transform Module maps images to the frequency domain, establishes a global lossless receptive field, and explicitly enhances high-frequency details, compensating for the insufficient utilization of frequency-domain information in pure spatial-domain methods. Experiments on five public datasets demonstrate that DFSMamba outperforms mainstream CNN, Transformer, and Mamba-based methods across ×2 to ×4 scales. On the AID×3 task, it achieves a PSNR of 31.48 dB, exceeding MambaIRv2 by 1.07 dB. Ablation studies verify the positive synergistic effect of the three modules, with the full configuration achieving a PSNR improvement of 0.85 dB over the single-module setup. Fine-grained category, multi-scale input, and loss function experiments further confirm its robustness and generalization capability, particularly in edge and texture detail reconstruction.

DFSMamba: A Spatial–Frequency Collaborative Modeling Framework for Remote Sensing Image Super-Resolution

Key Points

Abstract

Cite This Study

Also Consider

Also Consider