What is the clinical evidence from this study?

Study design: Cross-Sectional. Population: Chronic Obstructive Pulmonary Disease (COPD) (n=609). Intervention: Computational phenotype (ICD-10 codes, medications, and lung function testing history) vs. Manual chart review (post-bronchodilator FEV1/FVC<0.70). Primary outcome: Airflow obstruction with a post-bronchodilator FEV1/FVC<0.70.

What does this research mean for the field?

A computational phenotype combining ICD-10 codes, long-acting bronchodilator prescriptions, and lung function testing history identifies true airflow obstruction in only 62% of patients. Novelty: ClaimNovelty.INCREMENTAL. Consensus alignment: ConsensusAlignment.NEUTRAL.

What question did this study set out to answer?

This study aimed to assess the effectiveness of a computational phenotype that combines various data elements to identify COPD patients fulfilling spirometry criteria.

May 20, 2026

A23-24 Performance of a Multi-component Computational Phenotype for Identifying Patients With Chronic Obstructive Pulmonary Disease

Q: What are the key findings of this study?

A computational phenotype combining ICD-10 codes, long-acting bronchodilator prescriptions, and lung function testing history identified true airflow obstruction in only 62% of patients.

Q: What does this research mean for the field?

A computational phenotype combining ICD-10 codes, long-acting bronchodilator prescriptions, and lung function testing history identifies true airflow obstruction in only 62% of patients. Novelty: ClaimNovelty.INCREMENTAL. Consensus alignment: ConsensusAlignment.NEUTRAL.

Q: What question did this study set out to answer?

This study aimed to assess the effectiveness of a computational phenotype that combines various data elements to identify COPD patients fulfilling spirometry criteria.

Key Result

A computational phenotype combining ICD-10 codes, long-acting bronchodilator prescriptions, and lung function testing history identified true airflow obstruction in only 62% of patients.

Key Points

This study aimed to assess the effectiveness of a computational phenotype that combines various data elements to identify COPD patients fulfilling spirometry criteria.
Conducted a cross-sectional study using EHR data from a single academic hospital in Chicago.
Included patients aged over 40 with an ICD-10 diagnosis of COPD and a history of lung function testing.
Performed manual chart reviews of pulmonary function testing to determine airflow obstruction.
Of 609 patients analyzed, 376 (62%) had airflow obstruction defined by FEV1/FVC < 0.70.
Those without airflow obstruction were significantly younger (mean age 63 years vs 66 years).
Non-White and Hispanic patients exhibited lower rates of airflow obstruction, though differences were not statistically significant.

Study Design

Type

Cross-Sectional (n=609)

Multicenter

Structured PICO

Does a computational phenotype using structured EHR data accurately identify patients with COPD who meet spirometry criteria for airflow obstruction?

Population

609 patients aged >40 years with an ICD-10 diagnosis of COPD, an active long-acting bronchodilator prescription, and a previous encounter for lung function testing from a single academic hospital in Chicago. Mean age 65, 58% female, 71% Black.

Intervention

Computational phenotype using a combination of structured data including ICD-10 codes, long-acting bronchodilator prescription, and history of lung function testing

Comparator

Manual chart review of pulmonary function testing to determine if post-bronchodilator spirometry met criteria for airflow obstruction (FEV1/FVC<0.70)

Outcome

Proportion of patients identified by the computational phenotype who actually met spirometry criteria for airflow obstruction (FEV1/FVC<0.70)

A computational phenotype using structured EHR data (ICD codes, medications, lung function testing history) had low accuracy (62%) in identifying true COPD patients with airflow obstruction.

Abstract

Abstract Rationale Computational phenotypes are data algorithms used in clinical and epidemiologic research to identify patient cohorts with a disease based on structured or unstructured electronic health record (EHR) data elements. Computational phenotypes to identify patients with chronic obstructive pulmonary disease (COPD) commonly utilize ICD diagnosis codes. However, previous research suggests that just over half of patients with an ICD diagnosis of COPD meet guideline-based diagnostic criteria including airflow obstruction on post-bronchodilator spirometry. This study aimed to evaluate the performance of a computational phenotype that utilized a combination of structured data elements including ICD codes, medications, and a history of lung function testing for identifying patients with COPD who meet spirometry criteria for airflow obstruction. Methods We performed a cross-sectional study using EHR data from a single academic hospital in Chicago. We included patients aged 40 years with an ICD-10 diagnosis of COPD (J41.*, J42, J43.*, or J44.*) during an encounter in the previous two years, an active long-acting bronchodilator prescription, and a previous encounter for lung function testing. Demographic data was collected as part of the data query. We performed manual chart review of pulmonary function testing to determine if post-bronchodilator spirometry met criteria for airflow obstruction based on FEV1/FVC0.70. Patients with and without airflow obstruction were compared using chi-square and student t-tests as appropriate. Results We found 617 patients who met inclusion criteria, though 8 patients did not have post-bronchodilator spirometry available. Of the remaining 609 patients, the mean age was 65 years old, 354 (58%) were female, 432 (71%) were Black, 102 (17%) were White, and 75 (12%) were another race. Only 376 (62%) of the study sample had airflow obstruction with a post-bronchodilator FEV1/FVC0.70. Compared to those with airflow obstruction, those without airflow obstruction were significantly younger (mean age 66 years vs 63 years, respectively) and more likely to be female. Although non-White and Hispanic patients had lower rates of airflow obstruction, the differences were not statistically significant. There were no significant between-group differences in insurance, language, and smoking history. Conclusions A computational phenotype using a combination of structured data including ICD-10 codes, long-acting bronchodilator prescription, and history of lung function testing did not identify patients with COPD and airflow obstruction with high accuracy. Utilizing natural language processing models to extract unstructured spirometry data may enhance the accuracy of computational phenotypes for identifying patients with COPD who meet guideline-based diagnostic criteria. This abstract is funded by: The National Center for Advancing Translational Sciences and National Institutes of Health through Grant Award Numbers KL2TR002002 and UL1TR002003

Bookmark