What type of study is this?

This is a Validation Study study.

What question did this study set out to answer?

This research aims to assess artificial intelligence's ability to predict prolonged air leak after lung surgery using comprehensive pulmonary function test data.

May 7, 2026Open Access

Can Artificial Intelligence Interpret Pulmonary Function Tests and Predict Prolonged Air Leaks After Lung Resection

Key Points

This research aims to assess artificial intelligence's ability to predict prolonged air leak after lung surgery using comprehensive pulmonary function test data.
Utilized optical character recognition to digitize pulmonary function tests reports.
Combined PFT data with clinical and demographic features from the Society of Thoracic Surgeons General Thoracic Surgery Database.
Employed a feature selection algorithm to identify predictive features and trained a neural network for PAL prediction.
Involved 410 lung resection patients with successful digitization of PFTs.
Extracted 76 PFT features per patient and identified 10 key variables for the AI model.
Achieved specificity of 73%, sensitivity of 60%, overall accuracy of 72%, and an area under the curve of 0.74, surpassing existing PAL prediction models.

Abstract

Background/Objectives: Preoperative pulmonary function tests (PFTs) contain numerous physiologic parameters, yet surgeons typically rely on forced expiratory volume in one second (FEV1) and diffusing capacity of the lung for carbon monoxide (DLCO) to assess surgical risk. This study aimed to evaluate whether artificial intelligence (AI) could utilize more PFT data to predict the occurrence of prolonged air leak (PAL) following lung resection. Methods: An optical character recognition (OCR) model was used to extract structured data from PFT reports. These data were combined with clinical and demographic features from our institutional Society of Thoracic Surgeons General Thoracic Surgery Database (STS-GTSD) between 2016 and 2023. A feature selection algorithm was used to select the most predictive features, and a neural network was trained and tested on an internal validation cohort to predict PAL. Model performance was compared to previously published models. Results: There were 410 patients undergoing lung resection who had PFTs successfully digitized by the OCR system. A total of 76 available PFT features were extracted per patient. The final AI model included 10 key input variables, including three PFTs and seven clinical variables. On validation, the model achieved a specificity of 73%, sensitivity of 60%, overall accuracy of 72%, and an area under the curve of 0.74. This performance exceeded most existing PAL prediction models. Conclusions: AI-driven models using structured PFT and clinical data can enhance prediction of prolonged air leak after lung resection and outperform conventional regression-based models. Further research may focus on external validation and integration into clinical workflows.

Can Artificial Intelligence Interpret Pulmonary Function Tests and Predict Prolonged Air Leaks After Lung Resection

Key Points

Abstract

Cite This Study