The use of multimodal data for semantic segmentation in remote sensing has attracted considerable interest, as it enables the integration of complementary information from various sensors. However, conventional multimodal fusion methods primarily operate in the spatial domain. Given the substantial divergence and inherent redundancy across modalities, direct fusion in the spatial domain often leads to the accumulation of irrelevant information and the loss of useful features. Furthermore, spatial-domain fusion alone is insufficient to fully exploit the complementary characteristics of multimodal data. To address these challenges, we introduce a wavelet transform-based multimodal frequency fusion network (MFFNet) to compensate for the limitations of spatial-domain fusion by introducing frequency-domain information. Specifically, we propose the spatial-frequency domain wavelet attention fusion module (SFWAF), which uses weight-shared spatial-domain branches to extract generic spatial features for different modalities. The SFWAF module uses the discrete wavelet transform (DWT) to map different modal features into the frequency domain for fusion and adaptively integrates the dual-domain features using a learnable weighting factor. Additionally, we propose a lightweight frequency-enhanced feature fusion (FEF) module for multiscale feature integration. This module fuses high-frequency components from various modalities using a fixed fusion strategy to preserve critical edge and detail information. Extensive experimental results on the ISPRS Vaihingen, ISPRS Potsdam, and WHU-OPT-SAR datasets demonstrate that MFFNet outperforms traditional multimodal fusion methods, achieving mIoU of 84.21% and 85.88% on the Vaihingen and Potsdam datasets, respectively, and overall accuracies of 92.26% and 91.16%.
Building similarity graph...
Analyzing shared references across papers
Loading...
Chao Li
Haitao Lyu
Weipeng Jing
GIScience & Remote Sensing
University of Liverpool
Northeast Forestry University
Building similarity graph...
Analyzing shared references across papers
Loading...
Li et al. (Sat,) studied this question.
www.synapsesocial.com/papers/68a366a20a429f797332c7bd — DOI: https://doi.org/10.1080/15481603.2025.2534740
Synapse has enriched 5 closely related papers on similar clinical questions. Consider them for comparative context: