What question did this study set out to answer?

The central aim is to develop a hybrid framework for accurate lung nodule segmentation and cancer detection using chest CT scans.

May 16, 2026

Hybrid CNN–Vision Transformer Framework with Self-Attention for Lung Nodule Segmentation and Cancer Detection from CT Scans

Key Points

The central aim is to develop a hybrid framework for accurate lung nodule segmentation and cancer detection using chest CT scans.
Utilized a hybrid ViT-Mini + CNN framework with 3D convolutional feature extraction and self-attention.
Evaluated on the LIDC-IDRI dataset using mixed-precision training on an NVIDIA T4 GPU.
Employed composite Dice + cross-entropy loss and OneCycle scheduling for training.
Achieved an 86.8% Dice score for segmentation effectiveness.
Demonstrated 88.9% sensitivity in cancer detection.
Attained 89.3% classification accuracy with an area under the curve (AUC) of 0.93.

Abstract

Lung cancer remains an important cause of cancer-related mortality worldwide due to late-stage diagnosis and subtle early lesions in chest CT scans, where manual interpretation is labor-intensive and prone to errors. This system proves an efficient Hybrid ViT-Mini + CNN framework that synergizes 3D convolutional local feature extraction with transformer-based self-attention for global contextual modeling across CT slices, enabling precise lung nodule segmentation and malignancy classification. Evaluated on the LIDC-IDRI dataset using a single NVIDIA T4 GPU with mixed-precision training, composite Dice + cross-entropy loss, and OneCycle scheduling, the proposed model achieves superior performance—86.8% Dice score, 88.9% sensitivity, 89.3% classification accuracy, and 0.93 AU. Key contributions include volumetric 3D self-attention for enhanced interpretability of low-contrast nodules, lightweight hybrid fusion for clinical deployability, and a unified dual-task pipeline advancing computer-aided diagnosis systems for early lung cancer screening.

Bookmark

View Full Paper

Bookmark

View Full Paper

Hybrid CNN–Vision Transformer Framework with Self-Attention for Lung Nodule Segmentation and Cancer Detection from CT Scans

Key Points

Abstract

Cite This Study