An XGBoost machine learning model predicted ejection fraction scores from structured electronic health record data with an RMSE of 12.63 and R2 of 0.26, identifying gender as the most important feature.
Observational (n=130,727)
No
Can an XGBoost machine learning model accurately predict ejection fraction scores and identify clinical subtypes in heart failure patients using structured EHR data?
A tree-based machine learning model using structured EHR data can predict ejection fraction scores with moderate accuracy and identify distinct clinical subtypes of heart failure when combined with SHAP interpretation.
Effect estimate: R2 0.2619 (95% CI 12.62829-12.63231)
p-value: p=<10^-32
Heart failure (HF) is a major cause of mortality. Accurately monitoring HF progress and adjusting therapies are critical for improving patient outcomes. An experienced cardiologist can make accurate HF stage diagnoses based on combination of symptoms, signs, and lab results from the electronic health records (EHR) of a patient, without directly measuring heart function. We examined whether machine learning models, more specifically the XGBoost model, can accurately predict patient stage based on EHR, and we further applied the SHapley Additive exPlanations (SHAP) framework to identify informative features and their interpretations. Our results indicate that based on structured data from EHR, our models could predict patients' ejection fraction (EF) scores with moderate accuracy. SHAP analyses identified informative features and revealed potential clinical subtypes of HF. Our findings provide insights on how to design computing systems to accurately monitor disease progression of HF patients through continuously mining patients' EHR data.
Lu et al. (Sat,) conducted a observational in Heart Failure (n=130,727). XGBoost machine learning model was evaluated on Prediction of ejection fraction (EF) score (RMSE) (R2 0.2619, 95% CI 12.62829-12.63231, p=<10^-32). An XGBoost machine learning model predicted ejection fraction scores from structured electronic health record data with an RMSE of 12.63 and R2 of 0.26, identifying gender as the most important feature.
Synapse has enriched 5 closely related papers on similar clinical questions. Consider them for comparative context: