What question did this study set out to answer?

The study set out to improve infrared-visible image fusion by addressing two weaknesses of existing methods: inadequate feature representation and imbalanced fusion.

June 17, 2026 | Open Access

DiffFuseNet: Infrared-visible image fusion via diffusion-guided feature alignment and feature consistency alignment

Key Points

Aims to improve infrared-visible image fusion by addressing inadequate feature representation and imbalanced fusion caused by cross-modal discrepancies.
Proposes DiffFuseNet, whose Dual Diffusion-based Feature Enhancement (D2FE) module strengthens cross-modal robustness through controllable noise injection and denoising.
Introduces an Explicit Decoupling and Frequency Decomposition (EDFD) module that separates features into shared, modality-specific, and frequency-aware components.
Uses a frequency-aware fusion mechanism built on the Haar wavelet and invertible neural networks, trained with a two-stage strategy.
Reports better detail preservation and thermal saliency than state-of-the-art methods.
Reports improved structural consistency in fused images on the M3FD, RoadScene, and TNO datasets.
Outperforms existing techniques in both visual quality and objective metrics.

Abstract

Infrared and visible image fusion aims to synthesize images with thermal radiation and rich texture details. Existing methods suffer from inadequate feature representation and imbalanced fusion due to cross-modal discrepancies, leading to blurred details or insufficient thermal saliency. To address this, we propose DiffFuseNet, which integrates diffusion-guided feature enhancement and explicit feature decoupling. The Dual Diffusion-based Feature Enhancement (D2FE) module enhances cross-modal robustness via controllable noise injection and denoising. The Explicit Decoupling and Frequency Decomposition (EDFD) module separates features into shared, modality-specific, and frequency-aware components. Coupled with a frequency-aware fusion mechanism using Haar wavelet and invertible neural networks and a two-stage training strategy, our approach achieves balanced information integration. Experiments on M3FD, RoadScene, and TNO datasets show that DiffFuseNet outperforms state-of-the-art methods in visual quality and objective metrics, with superior detail preservation, thermal saliency, and structural consistency.
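
To make the idea of Haar-wavelet frequency decomposition concrete, the following is a minimal NumPy sketch of a single-level 2D Haar transform applied to a pair of images. It is not the DiffFuseNet implementation: the function names (`haar_dwt2`, `haar_idwt2`, `naive_wavelet_fuse`) are hypothetical, and the fusion rule (average the low-frequency bands, keep the larger-magnitude high-frequency coefficient) is a classic hand-crafted baseline assumed here purely for illustration, whereas the paper describes a learned fusion mechanism with invertible neural networks.

```python
import numpy as np


def haar_dwt2(img):
    """Single-level 2D Haar transform (orthonormal scaling).

    Assumes a 2D float array with even height and width.
    Returns one low-frequency band and three high-frequency detail bands.
    """
    a = img[0::2, 0::2]  # top-left pixel of each 2x2 block
    b = img[0::2, 1::2]  # top-right
    c = img[1::2, 0::2]  # bottom-left
    d = img[1::2, 1::2]  # bottom-right
    low = (a + b + c + d) / 2.0       # local average (low frequency)
    det_rows = (a + b - c - d) / 2.0  # top-vs-bottom difference
    det_cols = (a - b + c - d) / 2.0  # left-vs-right difference
    det_diag = (a - b - c + d) / 2.0  # diagonal difference
    return low, det_rows, det_cols, det_diag


def haar_idwt2(low, det_rows, det_cols, det_diag):
    """Exact inverse of haar_dwt2."""
    h, w = low.shape
    out = np.empty((2 * h, 2 * w), dtype=low.dtype)
    out[0::2, 0::2] = (low + det_rows + det_cols + det_diag) / 2.0
    out[0::2, 1::2] = (low + det_rows - det_cols - det_diag) / 2.0
    out[1::2, 0::2] = (low - det_rows + det_cols - det_diag) / 2.0
    out[1::2, 1::2] = (low - det_rows - det_cols + det_diag) / 2.0
    return out


def naive_wavelet_fuse(ir, vis):
    """Hypothetical baseline rule, NOT the paper's learned fusion.

    Low band: average the two modalities.
    Detail bands: keep whichever coefficient has the larger magnitude.
    """
    ir_bands = haar_dwt2(np.asarray(ir, dtype=np.float64))
    vis_bands = haar_dwt2(np.asarray(vis, dtype=np.float64))
    low = 0.5 * (ir_bands[0] + vis_bands[0])
    details = [
        np.where(np.abs(i) >= np.abs(v), i, v)
        for i, v in zip(ir_bands[1:], vis_bands[1:])
    ]
    return haar_idwt2(low, *details)
```

Calling `naive_wavelet_fuse(ir, vis)` on two registered grayscale arrays of the same even-sized shape returns a fused array of that shape. Because the Haar transform is exactly invertible, no information is lost in the decomposition itself; any loss comes only from the fusion rule applied to the bands.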
