Non-small cell lung cancer (NSCLC) poses a major threat to human health due to its high morbidity and mortality. Early accurate diagnosis and differential diagnosis from other respiratory diseases, are pivotal to improving patient prognosis. This study aimed to construct an NSCLC diagnostic model based on multidimensional datasets by leveraging machine learning (ML) driven feature selection and classification algorithms, clarifying the diagnostic value of blood protein metabolic profiles, and provide a novel non-invasive diagnostic scheme for clinical practice. A total of 144 lung cancer patients (LC), 132 healthy controls (HCs), and 130 patients with other respiratory diseases (ORDs) were recruited from three medical centers. A panel of 26 serum protein metabolism indicators and demographic variables was included as candidate features. Five feature-selection algorithms were used to identify core variables from the high-dimensional data. Ten ML classifiers were subsequently constructed, and their performance was comprehensively evaluated using metrics such as the area under the receiver operating characteristic curve (AUC), accuracy, positive precision, negative precision, positive recall, negative recall, F1 score, and Cohen’s kappa. Moreover, SHapley Additive exPlanations (SHAP) analysis was performed to decipher key predictive factors. Significant differences were observed in 21 serum protein metabolic indicators across the LC, HC, and ORD cohorts (p 0.840 across all datasets). Furthermore, SHAP analysis identified age as the top predictor of NSCLC, followed by Glu, His, ApoB, FN, ApoA2, and Arg. In this study, a stable, high-performance LightGBM model for NSCLC diagnosis was developed, and key clinical metrics and probability-based outputs were optimized. This model is expected to enhance early detection and differential diagnosis, thereby helping reduce lung cancer morbidity and mortality.
Building similarity graph...
Analyzing shared references across papers
Loading...
Huiying Shi
JiaMing Sun
HuiMin Wei
Respiratory Research
Shanghai Jiao Tong University
Fudan University
Yangzhou University
Building similarity graph...
Analyzing shared references across papers
Loading...
Shi et al. (Mon,) studied this question.
www.synapsesocial.com/papers/69f1a015edf4b46824806c65 — DOI: https://doi.org/10.1186/s12931-026-03686-3
Synapse has enriched 5 closely related papers on similar clinical questions. Consider them for comparative context: