What does this research mean for the field?

A Random Forest machine learning model using pre-bronchodilator spirometry Z-scores and smoking status can successfully screen for chronic obstructive pulmonary disease with an accuracy of 83.1%. Novelty: ClaimNovelty.METHODOLOGICAL. Consensus alignment: ConsensusAlignment.NEUTRAL.

What question did this study set out to answer?

This study aims to develop and evaluate a machine learning model for screening chronic obstructive pulmonary disease (COPD) using pre-bronchodilator spirometry indices.

May 20, 2026

B78-04 Machine Learning-based Screening of Chronic Obstructive Pulmonary Disease Using Pre-bronchodilator Spirometry Indices

Q: What is the clinical evidence from this study?

Study design: Cross-Sectional. Population: Chronic obstructive pulmonary disease (COPD) (n=399). Intervention: Random Forest classifier using pre-bronchodilator spirometry Z-scores and smoking status vs. No Information Rate (NIR). Primary outcome: Model accuracy for COPD screening (95% CI 71.0%-91.6%, p=0.027).

Key Result

A Random Forest model using pre-bronchodilator spirometry Z-scores and smoking status successfully screened for COPD with an accuracy of 83.1% (95% CI: 71.0%-91.6%; p=0.027 vs No Information Rate).

Key Points

This study aims to develop and evaluate a machine learning model for screening chronic obstructive pulmonary disease (COPD) using pre-bronchodilator spirometry indices.
Analyzed data from 399 subjects in the National Health and Nutrition Examination Survey (NHANES).
Calculated Z-scores for spirometry indices using Global Lung Initiative (GLI) equations.
Trained a Random Forest classifier with pre-bronchodilator Z-scores and smoking status, using an 80/20 train-test split.
The Random Forest model achieved an accuracy of 83.1% (95% CI: 71.0%-91.6%).
It demonstrated a sensitivity of 64.7%, identifying most actual COPD cases, and specificity of 90.5% for those without COPD.
The model showed a moderate agreement with actual diagnoses (Kappa = 0.572).

Study Design

Type

Cross-Sectional (n=399)

Structured PICO

Does a machine learning model based on pre-bronchodilator spirometry Z-scores and smoking status accurately screen for COPD?

Population

399 subjects with complete pre- and post-bronchodilator spirometry measures from the National Health and Nutrition Examination Survey (NHANES, cycle G) data.

Intervention

Random Forest classifier trained on pre-bronchodilator spirometry Z-scores (FEV1, FVC, FEV1/FVC, and FEF25-75%) and smoking status

Outcome

Accuracy, sensitivity, specificity, positive predictive value (PPV), negative predictive value (NPV), Cohen's Kappa, and No Information Rate (NIR) for COPD screening

A machine learning model using pre-bronchodilator spirometry Z-scores and smoking status demonstrated strong accuracy and high specificity for noninvasive COPD screening.

Main Result

p-value: p=0.027

Limitations

Results are based on a small dataset
Requires validation using larger, high-quality, and more varied datasets, especially from primary care and community settings

Abstract

Abstract Rationale Chronic obstructive pulmonary disease (COPD) is a major worldwide health challenge, and it is frequently underdiagnosed, especially in areas where post-bronchodilator spirometry is not routinely conducted. Machine learning (ML) presents a promising way to improving COPD screening with pre-bronchodilator data. The study aims to construct and assess an ML model for COPD screening based on pre-bronchodilator spirometry Z-scores generated from the Global Lung Initiative (GLI) equations, with FEF25–75% and smoking status as predictive factors. Methods 399 subjects with complete pre- and post-bronchodilator spirometry measures were included in the National Health and Nutrition Examination Survey (NHANES, cycle G) data. Z-scores were produced by standardizing the spirometric indices (FEV1, FVC, FEV1/FVC, and FEF25–75%) using GLI reference equations. A post-bronchodilator FEV1/FVC Z-score below -1.645, which represents the lower limit of normal, was used to define COPD. Z-scores from pre-bronchodilator spirometry and smoking status were used to train a Random Forest classifier with an 80/20 train-test split. Accuracy, sensitivity, specificity, positive predictive value (PPV), negative predictive value (NPV), Cohen’s Kappa, and No Information Rate (NIR) were used to assess the model’s performance. Results The Random Forest model greatly outperformed the NIR (p = 0.027) with an accuracy of 83.1% (95% CI: 71.0%-91.6%). The model showed a sensitivity of 64.7%, capturing the majority of actual COPD patients, and a specificity of 90.5%, successfully identifying people without COPD. The NPV was 86.4% and the PPV was 73.3%. The degree of agreement between the actual and anticipated diagnoses was moderate (Kappa = 0.572). Conclusion This study shows that machine learning models trained on GLI-based pre-bronchodilator spirometry Z-scores may successfully screen for COPD using the LLN threshold (Z-score -1.645) in conjunction with FEF25–75% and smoking status. Strong performance and high specificity were demonstrated by the Random Forest model, underscoring the potential of ML-based techniques for noninvasive COPD screening. Nevertheless, the results are based on a small dataset, and in order to improve model generalizability and therapeutic utility, validation using larger, high quality datasets, and more varied data—especially from primary care and community settings—is crucial. This abstract is funded by: None

Bookmark

Cite This Study

Almeshari et al. (Fri,) conducted a cross-sectional in Chronic obstructive pulmonary disease (COPD) (n=399). Random Forest classifier using pre-bronchodilator spirometry Z-scores and smoking status vs. No Information Rate (NIR) was evaluated on Model accuracy for COPD screening (95% CI 71.0%-91.6%, p=0.027). A Random Forest model using pre-bronchodilator spirometry Z-scores and smoking status successfully screened for COPD with an accuracy of 83.1% (95% CI: 71.0%-91.6%; p=0.027 vs No Information Rate).

synapsesocial.com/papers/6a0d4f4cf03e14405aa9a8b4 https://doi.org/https://doi.org/10.1093/ajrccm/aamag162.1875

Bookmark