What question did this study set out to answer?

The aim is to develop an explainable framework for classifying medical images using a dual-stream approach that combines various feature types.

January 18, 2026Open Access

Explainable Multi-Modal Medical Image Analysis Through Dual-Stream Multi-Feature Fusion and Class-Specific Selection

Key Points

The aim is to develop an explainable framework for classifying medical images using a dual-stream approach that combines various feature types.
Utilized a dual-stream architecture for fusing handcrafted and deep features.
Implemented decision-level integration using calibrated soft voting and optimized classifiers.
Employed a class-specific feature selection strategy to identify key features for each class.
Applied Local Interpretable Model-Agnostic Explanations for model interpretability.
Validated the framework on benchmark datasets for MRI, ultrasound, and retinal fundus images.
Demonstrated effective integration of multi-modal data for robust classification.
Achieved improved computational efficiency through hybrid feature selection.
Ensured transparent predictions linked to clinically relevant image characteristics across diverse disease categories.

Abstract

Effective and transparent medical diagnosis relies on accurate and interpretable classification of medical images across multiple modalities. This paper introduces an explainable multi-modal image analysis framework based on a dual-stream architecture that fuses handcrafted descriptors with deep features extracted from a custom MobileNet. Handcrafted descriptors include frequency-domain and texture features, while deep features are summarized using 26 statistical metrics to enhance interpretability. In the fusion stage, complementary features are combined at both the feature and decision levels. Decision-level integration combines calibrated soft voting, weighted voting, and stacking ensembles with optimized classifiers, including decision trees, random forests, gradient boosting, and logistic regression. To further refine performance, a hybrid class-specific feature selection strategy is proposed, combining mutual information, recursive elimination, and random forest importance to select the most discriminative features for each class. This hybrid selection approach eliminates redundancy, improves computational efficiency, and ensures robust classification. Explainability is provided through Local Interpretable Model-Agnostic Explanations, which offer transparent details about the ensemble model’s predictions and link influential handcrafted features to clinically meaningful image characteristics. The framework is validated on three benchmark datasets, i.e., BTTypes (brain MRI), Ultrasound Breast Images, and ACRIMA Retinal Fundus Images, demonstrating generalizability across modalities (MRI, ultrasound, retinal fundus) and disease categories (brain tumor, breast cancer, glaucoma).

Read Full Paperexternally

Bookmark

View Full Paper

Cite This Study

Ullah et al. (Fri,) studied this question.

synapsesocial.com/papers/696c7817eb60fb80d1396424 https://doi.org/https://doi.org/10.3390/ai7010030

Bookmark

View Full Paper