What question did this study set out to answer?

This review assesses the performance and quality of dementia prediction models using electronic health records (EHR).

April 14, 2026

Electronic health record-based prediction models for dementia detection: a systematic review of model performance and quality

Key Points

This review assesses the performance and quality of dementia prediction models using electronic health records (EHR).
Systematic search of electronic databases including Medline and EMBASE until July 2024.
Inclusion of studies developing or validating prediction models for dementia using EHR data.
Assessment of risk of bias using the PROBAST tool.
Included 56 studies with 434 prediction models and 155 external validations.
Most models were prognostic (66%) and predominantly used data from the US (71%).
Only 4% used gold-standard clinical criteria for outcomes, with many relying on diagnostic codes.
82% of models reported discriminative metrics, but only 16% assessed model calibration.
All models were deemed high risk of bias due to poor definitions and handling of data.

Abstract

Abstract Objectives Leveraging routine electronic health records (EHR) for dementia detection is a growing field, but quality and clinical utility of existing models are unclear. This systematic review aimed to evaluate performance, methodological quality, and risk of bias of EHR-based dementia prediction models. Materials and Methods We systematically searched Medline, EMBASE, Scopus, IEEE Xplore, and ACM from conception until July 2024. All studies and grey literature describing development or validation of probabilistic prediction models using EHR data for dementia detection were included. Risk of bias was assessed using PROBAST. Results Fifty-six studies (434 prediction models, 155 external validations) were included. Most models were prognostic (66%), used US data (71%), relied solely on structured data, and 47 (11%) were externally validated. Modeled outcomes were extremely heterogeneous: gold-standard clinical criteria were used in 17 models (4%), with others reliant on diagnostic codes for case ascertainment. Discriminative metrics were frequently reported (82% of models), but calibration was rarely assessed (16%). All models were judged high risk of bias, driven by poor outcome definition, inadequate handling of missing data, and potential overfitting. Discussion Our review highlights significant issues with methodological rigor and reporting transparency in existing EHR dementia prediction models. Ambiguous outcomes, flawed case ascertainment, and incomplete performance reporting, all limit clinical usefulness. Overall, model performance was difficult to assess and compare across studies due to incomplete reporting. Conclusion Electronic health record-based dementia prediction is still in its infancy. Methodological rigor and interdisciplinary collaboration are essential to meet clinical needs and achieve real-world impact.

Bookmark

Cite This Study

Lu et al. (Thu,) studied this question.

synapsesocial.com/papers/69ddda22e195c95cdefd7ae5 https://doi.org/https://doi.org/10.1093/jamia/ocag048

Bookmark