What does this research mean for the field?

The Federated and Explainable Multimodal Medical Image Fusion (FEMMIF) framework outperforms existing methods in structural similarity, inference speed, and interpretability across various imaging modalities while preserving data privacy. Novelty: ClaimNovelty.METHODOLOGICAL. Consensus alignment: ConsensusAlignment.NEUTRAL.

What question did this study set out to answer?

This research aims to develop a framework that enhances multimodal medical image fusion for better detection of Alzheimer's disease.

June 1, 2026

FEMMIF —Federated and Explainable Multimodal Medical Image Fusion Framework for Alzheimer’s Disease Detection

Key Points

This research aims to develop a framework that enhances multimodal medical image fusion for better detection of Alzheimer's disease.
Introduced a novel framework called FEMMIF for medical image fusion.
Employed a modality-agnostic dual-branch encoder using MobileNetV3 for feature extraction.
Utilized a federated training approach to maintain privacy and achieve convergence.
Achieved a structural similarity index (SSIM) greater than 0.91 with improved entropy metrics.
Demonstrated faster inference speeds compared to leading methods.
Showed strong generalization across different imaging modalities and reduced sensitivity to misalignment.

Abstract

ABSTRACT Multimodal medical image fusion (MMIF) is essential for improving diagnostic accuracy by integrating useful details from various imaging modalities. However, current fusion methods have many challenges, such as modality‐specific drawbacks, restricted generalizability, high processing costs, and limited explainability. This paper introduces a novel framework called Federated and Explainable Multimodal Medical Image Fusion (FEMMIF) designed to address several challenges in medical image fusion through a hybrid approach. FEMMIF employs a modality‐agnostic dual‐branch encoder based on MobileNetV3 to extract both anatomical and functional features. These features are integrated using a cross‐attention technique and then processed through a Feature Importance Learning (FIL) module, which dynamically assigns weights to the contributions of each modality. The combined image is subsequently decoded with a decoder that utilizes residual refinement. Evaluations on various multimodal datasets, including MRI, PET, CT, and SPECT, show that FEMMIF consistently outperforms leading MMIF methodologies, achieving a structural similarity index (SSIM) greater than 0.91, improved entropy metrics, and faster inference speeds. The model demonstrates strong generalization across different modalities, reduced sensitivity to misalignment, and produces interpretable outputs suitable for clinical validation. The federated training approach also maintains privacy while achieving convergence comparable to centralized techniques. Overall, the FEMMIF framework demonstrates strong experimental performance but requires further prospective and multicenter validation before clinical deployment.

Bookmark

FEMMIF —Federated and Explainable Multimodal Medical Image Fusion Framework for Alzheimer’s Disease Detection

Key Points

Abstract

Cite This Study