Machine learning-based algorithms improved cardiovascular risk prediction with a C-statistic of 0.751 compared to Pooled Cohort Equations showing 0.738 in statin-naïve healthy Korean adults without cardiovascular disease.
Cohort (n=222,998)
No
Do machine learning-based algorithms improve cardiovascular risk prediction compared to pre-existing models in statin-naïve healthy adults without cardiovascular disease?
222,998 Korean adults aged 40–79 years, naïve to lipid-lowering therapy, with no previous history of atherosclerotic cardiovascular disease.
Machine learning-based cardiovascular risk prediction algorithms (neural network, logistic regression, AdaBoost, TreeBag, random forest)
Pre-existing cardiovascular risk prediction models (Pooled Cohort Equation [PCE], Framingham Risk Score [FRS], SCORE, QRISK3)
First hard atherosclerotic cardiovascular disease (composite of cardiac death, non-fatal myocardial infarction, and fatal or nonfatal stroke) at 5 yearscomposite
Machine learning-based algorithms, particularly neural networks, provide modest but significant improvements in discrimination and calibration for predicting 5-year cardiovascular risk compared to traditional models like the Pooled Cohort Equations.
Effect estimate: C-statistic 0.751 (95% CI 0.740-0.761)
p-value: p=<0.001
Abstract Predicting the risk of cardiovascular disease is the key to primary prevention. Machine learning has attracted attention in analyzing increasingly large, complex healthcare data. We assessed discrimination and calibration of pre-existing cardiovascular risk prediction models and developed machine learning-based prediction algorithms. This study included 222,998 Korean adults aged 40–79 years, naïve to lipid-lowering therapy, had no history of cardiovascular disease. Pre-existing models showed moderate to good discrimination in predicting future cardiovascular events (C-statistics 0.70–0.80). Pooled cohort equation (PCE) specifically showed C-statistics of 0.738. Among other machine learning models such as logistic regression, treebag, random forest, and adaboost, the neural network model showed the greatest C-statistic (0.751), which was significantly higher than that for PCE. It also showed improved agreement between the predicted risk and observed outcomes (Hosmer–Lemeshow χ 2 = 86.1, P < 0.001) than PCE for whites did (Hosmer–Lemeshow χ 2 = 171.1, P < 0.001). Similar improvements were observed for Framingham risk score, systematic coronary risk evaluation, and QRISK3. This study demonstrated that machine learning-based algorithms could improve performance in cardiovascular risk prediction over contemporary cardiovascular risk models in statin-naïve healthy Korean adults without cardiovascular disease. The model can be easily adopted for risk assessment and clinical decision making.
Building similarity graph...
Analyzing shared references across papers
Loading...
Sang-Yeong Cho
Changwon National University
Sun–Hwa Kim
Seoul National University
Si‐Hyuck Kang
Seoul National University
Scientific Reports
Seoul National University
Gyeongsang National University
Seoul National University Bundang Hospital
Building similarity graph...
Analyzing shared references across papers
Loading...
Cho et al. (Mon,) conducted a cohort in Cardiovascular disease risk prediction (n=222,998). Machine learning-based algorithms vs. Pooled cohort equation (PCE) and other contemporary models was evaluated on First hard atherosclerotic cardiovascular disease event (C-statistic 0.751, 95% CI 0.740-0.761, p=<0.001). Machine learning-based algorithms improved cardiovascular risk prediction with a C-statistic of 0.751 compared to Pooled Cohort Equations showing 0.738 in statin-naïve healthy Korean adults without cardiovascular disease.
synapsesocial.com/papers/6963fcde24efc61310138b63 — DOI: https://doi.org/10.1038/s41598-021-88257-w