What question did this study set out to answer?

The research aims to develop a hybrid model, MDeiT, for accurate and interpretable cancer classification in histopathological images.

May 20, 2026 | Open Access

MDeiT: A lightweight and explainable hybrid model for cancer classification in histopathology images

Key Points

Developed a sequential hybrid framework in which MobileNetV2 (CNN) features pass through an adaptation layer into a DeiT Tiny (ViT) backbone.
Implemented Gradient-weighted Class Activation Mapping for visual explanation of model predictions.
Conducted expert-driven validation with pathologists to ensure model alignment with clinical diagnostics.
MDeiT outperforms state-of-the-art models on skin and lung cancer datasets across multiple metrics.
Achieves high-quality interpretability through expert validation of model saliency maps.
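
The Grad-CAM step named in the key points can be sketched on toy data. This is a minimal NumPy illustration of the standard Grad-CAM computation, not the authors' code: the function name `grad_cam` and the toy arrays are assumptions, and in practice the activations and gradients would come from a chosen layer of the trained model.

```python
import numpy as np

def grad_cam(activations, gradients):
    """Standard Grad-CAM heatmap from one layer's feature maps.

    activations: array (K, H, W), the K feature maps of the chosen layer.
    gradients:   array (K, H, W), gradient of the class score w.r.t. them.
    """
    # Channel weights: global average of the gradients over each map.
    weights = gradients.mean(axis=(1, 2))                 # shape (K,)
    # Weighted sum of the feature maps, then ReLU to keep positive evidence.
    cam = np.tensordot(weights, activations, axes=1)      # shape (H, W)
    cam = np.maximum(cam, 0.0)
    # Scale to [0, 1] for display; skip if the map is all zeros.
    peak = cam.max()
    return cam / peak if peak > 0 else cam

# Toy input (hypothetical values): channel 0 supports the class, channel 1 opposes it.
acts = np.array([[[1.0, 0.0], [0.0, 0.0]],
                 [[0.0, 0.0], [0.0, 2.0]]])
grads = np.array([[[1.0, 1.0], [1.0, 1.0]],
                  [[-1.0, -1.0], [-1.0, -1.0]]])
cam = grad_cam(acts, grads)
```

In this toy case only the top-left location stays highlighted, because the region activated by the negatively weighted channel is suppressed by the ReLU.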

Abstract

Accurate and interpretable cancer classification in histopathological images remains a significant challenge due to the complex structural variations in tissue samples. In this paper, we propose MDeiT, a lightweight and interpretable sequential hybrid model that effectively integrates Convolutional Neural Networks (CNNs) and Vision Transformers (ViTs) to enhance both classification accuracy and efficiency. Unlike traditional ensemble-based hybrid models, our framework adopts a streamlined design, leveraging MobileNetV2 and DeiT Tiny as backbone architectures, with an adaptation layer facilitating the transition from CNN-extracted local features to Transformer processing. To improve interpretability, we incorporate Gradient-weighted Class Activation Mapping (Grad-CAM) for visual explanations of model predictions. Furthermore, we introduce expert-driven qualitative validation, where pathologists annotate ground truth to systematically assess the alignment between model-generated saliency maps and clinically relevant diagnostic regions, establishing a high-quality benchmark for interpretability evaluation. Extensive experiments on skin and lung cancer datasets demonstrate that MDeiT consistently outperforms state-of-the-art models across multiple metrics while maintaining computational efficiency. The results demonstrate its effectiveness in capturing both fine-grained tissue details and broader contextual patterns, making it a robust and scalable solution for real-world histopathological image analysis.
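
The abstract does not specify how the adaptation layer is built, so the following is only one plausible sketch of the CNN-to-Transformer hand-off. It assumes the final MobileNetV2 feature map (1280 channels, 7 x 7 for a 224 x 224 input) is flattened into one token per spatial location and linearly projected to the 192-dimensional DeiT Tiny token width. The function name `adapt`, the random weights, and the choice of a plain linear projection are all assumptions for illustration.

```python
import numpy as np

CNN_CHANNELS = 1280  # width of MobileNetV2's final feature map
EMBED_DIM = 192      # token width used by DeiT Tiny

def adapt(feature_map, weight, bias):
    """Turn a (C, H, W) CNN feature map into (H*W, D) Transformer tokens."""
    c, h, w = feature_map.shape
    # One token per spatial location, in row-major order.
    tokens = feature_map.reshape(c, h * w).T      # shape (H*W, C)
    # Assumed adaptation: a single linear projection to the token width.
    return tokens @ weight + bias                 # shape (H*W, D)

# Random stand-ins for real CNN features and learned projection weights.
rng = np.random.default_rng(0)
feature_map = rng.standard_normal((CNN_CHANNELS, 7, 7))
weight = rng.standard_normal((CNN_CHANNELS, EMBED_DIM)) * 0.02
bias = np.zeros(EMBED_DIM)

tokens = adapt(feature_map, weight, bias)
print(tokens.shape)
```

Under these assumptions the Transformer receives 49 tokens per image, far fewer than the 196 patches DeiT Tiny processes from a raw 224 x 224 input with 16 x 16 patches, which is one way a sequential design could save computation.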

Cite This Study

Dagnaw et al. studied this question.

synapsesocial.com/papers/6a0d4e9df03e14405aa99dac https://doi.org/10.1016/j.bspc.2026.110620
