Existing single-image super-resolution methods for remote sensing images suffer from insufficient global receptive fields, weak high-frequency texture recovery, and excessive computational complexity. To address these issues, this paper proposes DFSMamba, a novel spatial–frequency collaborative modeling framework. First, Semantic Continuous-Sparse Attention enhances semantic perception through dynamic chunking and sparse connections while maintaining linear complexity, effectively alleviating the semantic truncation problem caused by fixed window partitioning. Second, the Adaptive State-Space Module employs parallel forward and backward state-space model branches to achieve bidirectional long-range dependency modeling and introduces an activation-guided feature fusion mechanism to adaptively enhance semantically relevant regions. Third, the Discrete Fourier Transform Module maps images to the frequency domain, establishes a global lossless receptive field, and explicitly enhances high-frequency details, compensating for the insufficient utilization of frequency-domain information in pure spatial-domain methods. Experiments on five public datasets demonstrate that DFSMamba outperforms mainstream CNN, Transformer, and Mamba-based methods across ×2 to ×4 scales. On the AID×3 task, it achieves a PSNR of 31.48 dB, exceeding MambaIRv2 by 1.07 dB. Ablation studies verify the positive synergistic effect of the three modules, with the full configuration achieving a PSNR improvement of 0.85 dB over the single-module setup. Fine-grained category, multi-scale input, and loss function experiments further confirm its robustness and generalization capability, particularly in edge and texture detail reconstruction.
Yu et al. (Tue,) studied this question.
Synapse has enriched 5 closely related papers on similar clinical questions. Consider them for comparative context: