What question did this study set out to answer?

The aim is to develop a robust method for infrared and visible image fusion that overcomes challenges posed by modality discrepancies and computational redundancy.

May 16, 2026Open Access

RWFI-Fusion: A residual prior-guided wavelet-fourier iterative network for infrared and visible image fusion

Key Points

The aim is to develop a robust method for infrared and visible image fusion that overcomes challenges posed by modality discrepancies and computational redundancy.
Proposed a residual prior-guided wavelet-Fourier iterative fusion network (RWFI-Fusion).
Used wavelet transformation for frequency decoupling and a divide-and-conquer strategy for modeling low and high frequencies.
Implemented an iterative optimization framework to regulate cross-modal information flow.
RWFI-Fusion significantly outperformed existing methods in both quantitative metrics and visual quality.
Successfully maintained high computational efficiency throughout the fusion process.
Demonstrated improvements across four datasets and in downstream object detection tasks.

Abstract

Infrared and visible image fusion (IVIF) aims to generate high-fidelity images by integrating complementary cross-modal information. However, existing methods often suffer from limited robustness when handling pronounced modality discrepancies and complex structures, and frequency-domain modeling typically incurs substantial computational redundancy. To address these challenges, we propose a residual prior-guided wavelet–fourier iterative fusion network (RWFI-Fusion). The proposed approach achieves frequency decoupling via wavelet transformation and adopts a divide-and-conquer strategy to model low-frequency and high-frequency components separately. As fourier transforms perform global modeling at full image resolution and may lead to unnecessary computational overhead, we exploit the complementary strengths of wavelet and Fourier transforms to enhance intrinsic frequency-domain representations of different modalities. In addition, a residual prior is introduced to explicitly extract complementary information from modality discrepancies, thereby improving information propagation during fusion. The fusion process is further refined within an iterative optimization framework that dynamically regulates cross-modal information flow, enabling progressive enhancement of the fused results. Extensive experiments on four datasets and downstream object detection tasks demonstrate that RWFI-Fusion consistently outperforms existing methods in both quantitative metrics and visual quality, while maintaining high computational efficiency.

Read Full Paperexternally

Bookmark

View Full Paper

Cite This Study

Wang et al. (Wed,) studied this question.

synapsesocial.com/papers/6a08093ca487c87a6a40b232 https://doi.org/https://doi.org/10.1016/j.optlastec.2026.115478

Bookmark

View Full Paper