Key points are not available for this paper at this time.
Early and accurate diagnosis of cancer is crucial for improving patient outcomes. This study proposes a novel hybrid deep learning architecture integrating Convolutional Neural Networks (CNNs) and Vision Transformers (ViTs) to leverage their complementary strengths for advanced imaging data analysis. The proposed model was evaluated on the HAM10000 and Melanoma Skin Cancer datasets, achieving state-of-the-art performance across multiple metrics. On the HAM10000 dataset, the hybrid CNN-ViT model achieved an accuracy of 94.23%, a precision of 93.89%, a recall of 93.56%, an F1-score of 93.72%, and a ROC-AUC of 95.18%. Similarly, on the Melanoma dataset, it demonstrated an accuracy of 93.78%, a precision of 93.45%, a recall of 93.12%, an F1-score of 93.28%, and a ROC-AUC of 94.56%. These results outperform traditional CNNs, such as ResNet-50, and standalone ViTs, showcasing the hybrid model’s capability to ensure both local spatial features and global contextual relationships effectively.
Building similarity graph...
Analyzing shared references across papers
Loading...
Md Rokibul Hasan
Mohammad Balayet Hossain Sakil
Md Amit Hasan
Southeast Missouri State University
American International University-Bangladesh
Trine University
Building similarity graph...
Analyzing shared references across papers
Loading...
Hasan et al. (Thu,) studied this question.
www.synapsesocial.com/papers/69dcc3367873f5f05b133b9e — DOI: https://doi.org/10.1109/ecce64574.2025.11013837
Synapse has enriched 5 closely related papers on similar clinical questions. Consider them for comparative context: