What is the clinical evidence from this study?

Study design: Cohort. Population: Cardiovascular disease risk prediction (n=222998). Intervention: Machine learning-based algorithms vs. Pooled cohort equation (PCE) and other contemporary models. Primary outcome: First hard atherosclerotic cardiovascular disease event (C-statistic 0.751, 95% CI 0.740-0.761, p=<0.001).

April 26, 2021Open Access

Pre-existing and machine learning-based models for cardiovascular risk prediction

Q: What are the key findings of this study?

Machine learning-based algorithms improved cardiovascular risk prediction with a C-statistic of 0.751 compared to Pooled Cohort Equations showing 0.738 in statin-naïve healthy Korean adults without cardiovascular disease.

Q: What does this research mean for the field?

Machine learning-based algorithms improve cardiovascular risk prediction with a C-statistic of 0.751 compared to Pooled Cohort Equations showing 0.738 in statin-naïve healthy Korean adults without cardiovascular disease. Novelty: ClaimNovelty.NOVEL_FINDING. Consensus alignment: ConsensusAlignment.SUPPORTS_CONSENSUS.

Key Result

Machine learning-based algorithms improved cardiovascular risk prediction with a C-statistic of 0.751 compared to Pooled Cohort Equations showing 0.738 in statin-naïve healthy Korean adults without cardiovascular disease.

Study Design

Type

Cohort (n=222,998)

Multicenter

Structured PICO

Do machine learning-based algorithms improve cardiovascular risk prediction compared to pre-existing models in statin-naïve healthy adults without cardiovascular disease?

Population

222,998 Korean adults aged 40–79 years, naïve to lipid-lowering therapy, with no previous history of atherosclerotic cardiovascular disease.

Intervention

Machine learning-based cardiovascular risk prediction algorithms (neural network, logistic regression, AdaBoost, TreeBag, random forest)

Comparator

Pre-existing cardiovascular risk prediction models (Pooled Cohort Equation [PCE], Framingham Risk Score [FRS], SCORE, QRISK3)

Outcome

First hard atherosclerotic cardiovascular disease (composite of cardiac death, non-fatal myocardial infarction, and fatal or nonfatal stroke) at 5 yearscomposite

Machine learning-based algorithms, particularly neural networks, provide modest but significant improvements in discrimination and calibration for predicting 5-year cardiovascular risk compared to traditional models like the Pooled Cohort Equations.

Main Result

Effect estimate: C-statistic 0.751 (95% CI 0.740-0.761)

p-value: p=<0.001

Limitations

Only 5-year follow-up data available
Potential selection bias from health screening program
Risk of misclassification bias using claims information

Abstract

Abstract Predicting the risk of cardiovascular disease is the key to primary prevention. Machine learning has attracted attention in analyzing increasingly large, complex healthcare data. We assessed discrimination and calibration of pre-existing cardiovascular risk prediction models and developed machine learning-based prediction algorithms. This study included 222,998 Korean adults aged 40–79 years, naïve to lipid-lowering therapy, had no history of cardiovascular disease. Pre-existing models showed moderate to good discrimination in predicting future cardiovascular events (C-statistics 0.70–0.80). Pooled cohort equation (PCE) specifically showed C-statistics of 0.738. Among other machine learning models such as logistic regression, treebag, random forest, and adaboost, the neural network model showed the greatest C-statistic (0.751), which was significantly higher than that for PCE. It also showed improved agreement between the predicted risk and observed outcomes (Hosmer–Lemeshow χ 2 = 86.1, P < 0.001) than PCE for whites did (Hosmer–Lemeshow χ 2 = 171.1, P < 0.001). Similar improvements were observed for Framingham risk score, systematic coronary risk evaluation, and QRISK3. This study demonstrated that machine learning-based algorithms could improve performance in cardiovascular risk prediction over contemporary cardiovascular risk models in statin-naïve healthy Korean adults without cardiovascular disease. The model can be easily adopted for risk assessment and clinical decision making.

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Authors

Sang-Yeong Cho

Changwon National University

Sun–Hwa Kim

Seoul National University

Si‐Hyuck Kang

Seoul National University

Journals

Scientific Reports

Actions

Institutions

Seoul National University

Gyeongsang National University

Seoul National University Bundang Hospital

References and Citations

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Pre-existing and machine learning-based models for cardiovascular risk prediction

Key Result

Study Design

Structured PICO

Main Result

Limitations

Abstract

Citation Network

Connected Papers

Discussion

Authors

Journals

Actions

Institutions

References and Citations

Citation Network

Connected Papers

Discussion

Cite this study