What question did this study set out to answer?

This research aims to develop a method that integrates degradation modeling into the image fusion process for improved performance with degraded inputs.

February 14, 2026 | Open Access

A VLM guided network coupling degradation modeling for degradation aware infrared and visible image fusion

Key Points

Proposed the VLM-Guided Degradation-Coupled Fusion network (VGDCFusion).
Introduced Specific-Prompt Degradation-Coupled Extractor (SPDCE) for modality-specific degradation awareness.
Implemented Joint-Prompt Degradation-Coupled Fusion (JPDCF) for cross-modal degradation perception.
Conducted extensive experiments to evaluate performance against existing methods.
VGDCFusion outperformed state-of-the-art techniques in degraded image fusion tasks.
Achieved average improvements of approximately 15% in the AG (average gradient) metric and 14.75% in the SF (spatial frequency) metric.
Demonstrated markedly superior results in both qualitative visual quality and quantitative evaluation.

Abstract

Existing Infrared and Visible Image Fusion (IVIF) methods typically assume high-quality inputs. However, when handling degraded images, these methods rely heavily on manually switching between different pre-processing techniques. This decoupling of degradation handling and image fusion leads to significant performance degradation. In this paper, we propose a novel VLM-Guided Degradation-Coupled Fusion network (VGDCFusion), which tightly couples degradation modeling with the fusion process and leverages vision-language models (VLMs) for degradation-aware perception and guided suppression. Specifically, the proposed Specific-Prompt Degradation-Coupled Extractor (SPDCE) enables modality-specific degradation awareness and establishes a joint modeling of degradation suppression and intra-modal feature extraction. In parallel, the Joint-Prompt Degradation-Coupled Fusion (JPDCF) facilitates cross-modal degradation perception and couples residual degradation filtering with complementary cross-modal feature fusion. Extensive experimental results indicate that the proposed VGDCFusion demonstrates marked superiority in degraded image fusion tasks, surpassing existing state-of-the-art methods in both qualitative visual quality and quantitative evaluation metrics (e.g., the AG and SF measures achieve average improvements of approximately 15% and 14.75%, respectively). Our code is available at https://github.com/Lmmh058/VGDCFusion.
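
The abstract reports gains in AG and SF without defining them. As a rough illustration, here is a minimal sketch of how these two no-reference sharpness metrics are commonly computed, assuming AG denotes average gradient and SF denotes spatial frequency, as is usual in the image fusion literature. The function names and the exact normalization are assumptions made for this example; the paper's own evaluation code may use a different convention.

```python
import numpy as np


def average_gradient(img):
    """Average gradient (AG) of a 2-D grayscale image.

    Assumed definition: mean over pixels of sqrt((dx^2 + dy^2) / 2),
    where dx and dy are forward differences.
    """
    img = np.asarray(img, dtype=np.float64)
    dx = img[:-1, 1:] - img[:-1, :-1]  # horizontal forward difference
    dy = img[1:, :-1] - img[:-1, :-1]  # vertical forward difference
    return float(np.mean(np.sqrt((dx ** 2 + dy ** 2) / 2.0)))


def spatial_frequency(img):
    """Spatial frequency (SF) of a 2-D grayscale image.

    Assumed definition: sqrt(RF^2 + CF^2), where RF and CF are the
    root-mean-square differences along rows and columns.
    """
    img = np.asarray(img, dtype=np.float64)
    rf = np.sqrt(np.mean((img[:, 1:] - img[:, :-1]) ** 2))  # row frequency
    cf = np.sqrt(np.mean((img[1:, :] - img[:-1, :]) ** 2))  # column frequency
    return float(np.sqrt(rf ** 2 + cf ** 2))


if __name__ == "__main__":
    # Synthetic stand-in for a fused image (not data from the paper).
    rng = np.random.default_rng(0)
    fused = rng.random((64, 64))
    print("AG:", average_gradient(fused))
    print("SF:", spatial_frequency(fused))
```

Under these definitions, higher AG and SF values indicate more edge and texture detail in the fused image, which is why they are used to compare fusion methods on degraded inputs.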

Cite This Study

Zhao et al. studied this question.

synapsesocial.com/papers/699010942ccff479cfe56dbb https://doi.org/10.1038/s41598-026-38181-8
