What question did this study set out to answer?

The research aims to develop a more efficient method for fusing visible and infrared images to enhance performance in various applications.

February 20, 2026Open Access

Mamba-Based Infrared and Visible Images Fusion Method

Key Points

The research aims to develop a more efficient method for fusing visible and infrared images to enhance performance in various applications.
Investigated the Mamba model for visible-infrared image fusion.
Designed a Multi-Path Mamba (MPMamba) module for extracting multi-scale features.
Created a Dual-path Mamba Attention Fusion (DMAF) module for processing shared features.
Utilized dynamic calibration with a Convolutional Block Attention Module (CBAM).
Conducted extensive experiments on the MSRS benchmark.
Achieved state-of-the-art performance in visible-infrared image fusion.
Outperformed strong baselines like U2Fusion and SwinFusion.
Showed significant improvement in Information Entropy (EN), Spatial Frequency (SF), and Mutual Information (MI).
Visual results highlighted better preservation of thermal targets and rich textures.

Abstract

Visible-infrared image fusion is crucial for applications like autonomous driving and nighttime surveillance, yet it remains challenging due to the inherent limitations of existing deep learning models. Convolutional Neural Networks (CNNs) are constrained by their local receptive fields, while Transformers suffer from quadratic computational complexity. To address these issues, this paper investigates the application of the Mamba model—a novel State Space Model (SSM) with linear-complexity global modeling and selective scanning capabilities—to the task of visible-infrared image fusion. Building upon Mamba, we propose a novel fusion framework featuring two key designs: (1) A Multi-Path Mamba (MPMamba) module that orchestrates parallel Mamba blocks with convolutional streams to extract multi-scale, modality-specific features; and (2) a Dual-path Mamba Attention Fusion (DMAF) module that explicitly decouples and processes shared and complementary features via dual Mamba paths, followed by dynamic calibration with a Convolutional Block Attention Module (CBAM). Extensive experiments on the MSRS benchmark demonstrate that our framework achieves state-of-the-art performance, outperforming strong baselines such as U2Fusion and SwinFusion across key metrics including Information Entropy (EN), Spatial Frequency (SF), Mutual Information (MI), and edge-based fusion quality (Qabf). Visual results confirm its ability to produce fused images that saliently preserve thermal targets while retaining rich texture details.

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Authors

Jinsong He

Jianghua Cheng

Tong Liu

Journals

Remote Sensing

Actions

Institutions

National University of Defense Technology

References and Citations

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Mamba-Based Infrared and Visible Images Fusion Method

Key Points

Abstract

Citation Network

Connected Papers

Discussion

Authors

Journals

Actions

Institutions

References and Citations

Citation Network

Connected Papers

Discussion

Cite this study