What question did this study set out to answer?

This research aims to develop and evaluate an algorithm that automates data extraction for stroke registry variables, reducing reliance on manual processes.

May 8, 2026 | Open Access

Abstract Number: Esoc2026a2214

Combining a Large Language Model With Traditional Software Engineering Tools for Automated Res-Q Registry Variable Extraction

Key Points

Created a reference dataset with 198 variables from 100 ischemic stroke cases, totaling 13,872 data points.
Extracted 126 variables using software engineering and 72 with a large language model.
Assessed accuracy with Cohen’s κ for categorical variables and Spearman’s correlation for numeric variables.
The algorithm achieved an accuracy of 89.2% compared to the reference dataset.
Discrepancies included 457 missing values and 1,042 incorrect values.
Sixty variables showed perfect agreement with the reference dataset, with a Cohen’s κ or Spearman’s coefficient of 1.0.
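The reported percentages can be checked against the reported counts. The sketch below assumes that accuracy means the share of data points that were neither missing nor incorrect; the abstract does not state the formula, but the published figures are consistent with that reading. The constant and function names are illustrative only.

```python
# Counts as reported in the abstract.
TOTAL_POINTS = 13_872   # data points in the reference dataset
MISSING = 457           # values the algorithm failed to extract
INCORRECT = 1_042       # values the algorithm extracted wrongly


def share_pct(count: int, total: int) -> float:
    """Return count as a percentage of total, rounded to one decimal."""
    return round(100 * count / total, 1)


# Assumption: accuracy = points that are neither missing nor incorrect.
missing_pct = share_pct(MISSING, TOTAL_POINTS)
incorrect_pct = share_pct(INCORRECT, TOTAL_POINTS)
accuracy_pct = share_pct(TOTAL_POINTS - MISSING - INCORRECT, TOTAL_POINTS)

print(f"missing {missing_pct}%, incorrect {incorrect_pct}%, accurate {accuracy_pct}%")
```

Under this assumption the three shares reproduce the 3.3%, 7.5% and 89.2% given in the abstract.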

Abstract

Background and aims

Despite substantial advances in digital health data infrastructure, quality improvement efforts still rely on resource-intensive manual data collection. Using a combination of a large language model (LLM) and conventional programming tools, we developed a pilot algorithm that automates the extraction of variables for the international Registry of Stroke Care Quality (RES-Q), and evaluated its accuracy.

Methods

We created a reference dataset containing 198 variables for 100 ischemic stroke cases (13,872 data points overall), manually extracted from the electronic health records at an average of 20 minutes per patient. In the automated extraction, 126 variables were extracted using conventional software engineering and 72 using an LLM. Agreement with the reference dataset was assessed using Cohen’s κ for categorical variables and Spearman’s correlation for numeric variables; free-text variables were assessed descriptively.

Results

The accuracy of the current version of the pilot algorithm was 89.2% compared with the ground-truth dataset. Discrepancies included 457 (3.3%) missing and 1,042 (7.5%) incorrect values. Sixty variables, including age, sex, prestroke modified Rankin scale score, index thrombectomy, mTICI score, and puncture and reperfusion timestamps, showed a Cohen’s κ of 1.0 or a Spearman’s coefficient of 1.0 (Figure 1). Overall, the estimated programming time was 416 hours, and 19 algorithm iterations were performed, each using data from 5 to 30 patients. In addition, 1.2% of values in the reference dataset were found to have been entered incorrectly during manual extraction.

Conclusions

Automated tools can potentially improve the efficiency of stroke registry data collection by reducing data extraction time, with accuracy for some variables comparable to or better than that of trained staff. Further improvements in the algorithm are anticipated.

Conflict of interest

All authors: nothing to disclose.
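For readers unfamiliar with the two agreement statistics named in the Methods, the following is a minimal pure-Python sketch of how each can be computed for one variable. It is not the authors' code: the function names, the handling of degenerate cases, and the sample values are all assumptions made for illustration.

```python
from collections import Counter
from math import sqrt


def cohen_kappa(manual, automated):
    """Cohen's kappa for two equally long lists of categorical labels."""
    n = len(manual)
    observed = sum(m == a for m, a in zip(manual, automated)) / n
    counts_m, counts_a = Counter(manual), Counter(automated)
    # Chance agreement: product of the two marginal proportions per category.
    expected = sum(counts_m[c] * counts_a[c] for c in counts_m) / (n * n)
    if expected == 1.0:
        return float("nan")  # undefined when both sources use a single category
    return (observed - expected) / (1 - expected)


def average_ranks(values):
    """Ranks starting at 1; tied values share the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def spearman_rho(x, y):
    """Spearman's coefficient: Pearson correlation of the rank vectors."""
    rx, ry = average_ranks(x), average_ranks(y)
    n = len(rx)
    mx, my = sum(rx) / n, sum(ry) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(rx, ry))
    var_x = sum((a - mx) ** 2 for a in rx)
    var_y = sum((b - my) ** 2 for b in ry)
    if var_x == 0 or var_y == 0:
        return float("nan")  # undefined when one variable is constant
    return cov / sqrt(var_x * var_y)


# Hypothetical values for five patients (not taken from the study).
sex_manual = ["F", "M", "M", "F", "M"]
sex_auto = ["F", "M", "M", "F", "M"]
age_manual = [71, 64, 80, 58, 77]
age_auto = [71, 64, 80, 58, 77]

kappa_sex = cohen_kappa(sex_manual, sex_auto)
rho_age = spearman_rho(age_manual, age_auto)
print(kappa_sex, rho_age)
```

When the automated values match the manual ones for every patient, both statistics reach their maximum of 1.0, which is the situation the Results describe for 60 variables. Note that Spearman's coefficient measures agreement in rank order only, so a value of 1.0 alone does not guarantee that every numeric value is identical.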


Cite This Study

Kaubrytė et al. studied this question.

synapsesocial.com/papers/69fd7f0dbfa21ec5bbf07709
https://doi.org/10.1093/esj/aakag023.1020
