What question did this study set out to answer?

The research aims to evaluate the diagnostic performance of various deep learning architectures for classifying pulmonary CT scans.

February 2, 2026Open Access

Evaluation of Deep Learning Architectures for Pulmonary CT Lesion Classification Highlighting Diagnostic Performance

Key Points

The research aims to evaluate the diagnostic performance of various deep learning architectures for classifying pulmonary CT scans.
Evaluated five deep learning models: ConvNeXt Base, ResNet50, EfficientNetV2 Small, InceptionV4, Xception.
Utilized the IQ OTH NCCD dataset for model training and testing.
Employed a transfer learning approach with ImageNet weights and dynamic data augmentation.
Assessed model performance using accuracy, precision, recall, and F1-score metrics.
InceptionV4 model achieved an overall accuracy of 0.988 and a macro-averaged F1-score of 0.976.
Perfect sensitivity for malignant lesions with a recall rate of 1.00 was observed.
InceptionV4 outperformed other models, indicating it as a dependable tool for clinical use.

Abstract

Early identification of pulmonary lesions is a critical factor in enhancing patient prognosis and survival rates. This study systematically evaluates the diagnostic performance of five deep learning architectures, ConvNeXt Base, ResNet50, EfficientNetV2 Small, InceptionV4, and Xception, for the three-class categorization of Computed Tomography (CT) scans into Benign, Malignant, and Normal categories. Utilizing the public IQ OTH NCCD dataset, we applied a transfer learning approach with ImageNet weights, complemented by a robust training pipeline incorporating dynamic data augmentation and early stopping to mitigate overfitting and ensure model generalization. Model efficacy was rigorously assessed on an independent test set using accuracy, precision, recall, and F1-score metrics. Experimental results indicate that InceptionV4 emerged as the most reliable architecture, achieving an overall accuracy of 0.988 and a macro-averaged F1-score of 0.976. Notably, this model demonstrated perfect sensitivity for the pathologically critical malignant class, achieving a recall rate of 1.00, thereby prioritizing clinical safety. These findings confirm that advanced neural networks can serve as dependable secondary opinion systems for clinicians. Given its superior sensitivity and balanced diagnostic profile, InceptionV4 represents a promising candidate for integration into automated lung cancer screening workflows to improve diagnostic precision.

Evaluation of Deep Learning Architectures for Pulmonary CT Lesion Classification Highlighting Diagnostic Performance

Key Points

Abstract

Cite This Study