What does this research mean for the field?

The soft voting ensemble model achieved an area under the curve (AUC) of 0.906 and an accuracy of 88.8%, outperforming individual classifiers in predicting hypertension complicated by coronary heart disease. Novelty: novel finding. Consensus alignment: neutral.

What question did this study set out to answer?

The study aims to create a robust ensemble learning model for predicting coronary heart disease in patients with hypertension.

February 11, 2026. Open Access.

Machine Learning for Predicting Coronary Heart Disease Risk in Patients with Hypertension: An Ensemble Modeling Approach

What is the clinical evidence from this study?

Study design: Other. Population: Patients with essential hypertension (n=6,391; 2,487 complicated by coronary heart disease and 3,904 controls). Intervention: Soft voting ensemble machine learning model combining random forest, XGBoost, CatBoost, CART, and logistic regression vs. individual machine learning classifiers (random forest, XGBoost, CatBoost, CART, and logistic regression) and a hard voting ensemble. Primary outcome: Prediction of essential hypertension complicated by coronary heart disease, measured by area under the curve (AUC) and accuracy (ACC). The soft voting model reached an AUC of 0.906 (95% CI 0.895-0.918) vs. 0.783 for logistic regression.

Key result

The soft voting ensemble machine learning model predicted essential hypertension complicated by coronary heart disease with an AUC of 0.906 and accuracy of 88.8%, outperforming individual classifiers including logistic regression (AUC 0.783, accuracy 70.6%).

Key points

The study aims to create a robust ensemble learning model for predicting coronary heart disease in patients with hypertension.
Developed an ensemble-based predictive model using voting fusion.
Data were collected from 2,487 patients with essential hypertension complicated by CHD and 3,904 non-CHD controls.
Conducted univariate and multivariate feature selection to refine the model.
Trained five machine learning algorithms independently before integrating them into a voting ensemble.
Validated model performance using area under the curve and accuracy metrics.
The ensemble model achieved an area under the curve of 0.906.
Achieved an accuracy of 0.888 in predicting hypertension complicated by CHD.
Outperformed all individual classifiers in predictive performance.

Structured PICO

Does a soft voting ensemble machine learning model improve the prediction of coronary heart disease risk in patients with essential hypertension compared to individual classifiers?

Population

6,391 patients with essential hypertension, including 2,487 complicated by first-time coronary heart disease (confirmed by angiography) and 3,904 controls without cardiovascular, cerebrovascular, or renal disease, derived from electronic medical records across 7 hospitals.
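The cohort counts above fix the class balance, which helps put the reported accuracy in context. The short calculation below uses only the two counts given in the summary. The "majority baseline" is an illustrative reference point (the accuracy of always predicting "no CHD"), not a figure reported by the study.

```python
# Cohort counts reported in the summary.
n_chd = 2487        # essential hypertension complicated by CHD
n_controls = 3904   # hypertensive controls without CHD
n_total = n_chd + n_controls

# Share of CHD cases in the dataset.
prevalence = n_chd / n_total

# Accuracy of a trivial classifier that always predicts the larger class.
majority_baseline = max(n_chd, n_controls) / n_total

print(f"total={n_total}, CHD share={prevalence:.1%}, "
      f"majority baseline={majority_baseline:.1%}")
```

On these counts, roughly 39% of the cohort has CHD, so a trivial classifier would already reach about 61% accuracy. The reported 88.8% should be read against that reference rather than against 50%.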

Intervention

Soft voting ensemble machine learning model integrating five algorithms (random forest, XGBoost, CatBoost, CART, and logistic regression), with weights optimized using a deep neural network.
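As a rough illustration of soft voting, the sketch below averages the CHD probabilities from five base models using per-model weights. The function name `soft_vote`, the example probabilities, and both weight vectors are hypothetical. The study reports that its weights were optimized with a deep neural network, which this sketch does not attempt to reproduce.

```python
from typing import Sequence


def soft_vote(probs: Sequence[float], weights: Sequence[float]) -> float:
    """Return the weighted average of per-model probabilities for one patient."""
    if len(probs) != len(weights):
        raise ValueError("probs and weights must have the same length")
    total = sum(weights)
    if total <= 0:
        raise ValueError("weights must sum to a positive value")
    return sum(p * w for p, w in zip(probs, weights)) / total


# Hypothetical CHD probabilities for one patient, in the order:
# random forest, XGBoost, CatBoost, CART, logistic regression.
model_probs = [0.70, 0.80, 0.75, 0.40, 0.55]

# Equal weights give a plain average of the five probabilities.
p_equal = soft_vote(model_probs, [1.0] * 5)

# Hypothetical unequal weights that trust the three tree ensembles more.
p_weighted = soft_vote(model_probs, [2.0, 2.0, 2.0, 1.0, 1.0])

# Assumed decision threshold of 0.5 (not stated in the summary).
label = int(p_weighted >= 0.5)
print(p_equal, p_weighted, label)
```

Because the ensemble output is itself a probability, it can be ranked across patients, which is what makes an AUC for the fused model meaningful.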

Comparator

Individual machine learning classifiers (random forest, XGBoost, CatBoost, CART, logistic regression) and a hard voting ensemble model.
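To show how the hard voting comparator differs from soft voting, the sketch below applies both rules to the same set of base-model probabilities. The helper names, the 0.5 threshold, and the example values are assumptions chosen for illustration. They are not taken from the study.

```python
from typing import Sequence


def hard_vote(probs: Sequence[float], threshold: float = 0.5) -> int:
    """Majority vote over the thresholded labels of each base model."""
    votes = [int(p >= threshold) for p in probs]
    return int(sum(votes) > len(votes) / 2)


def soft_vote_label(probs: Sequence[float], threshold: float = 0.5) -> int:
    """Threshold the unweighted mean of the base-model probabilities."""
    return int(sum(probs) / len(probs) >= threshold)


# Hypothetical case: two models are very confident, three lean slightly negative.
probs = [0.95, 0.90, 0.45, 0.40, 0.45]

hard_label = hard_vote(probs)        # three of five models vote "no CHD"
soft_label = soft_vote_label(probs)  # the mean probability is above 0.5
print(hard_label, soft_label)
```

In this constructed case the two rules disagree: hard voting discards how confident each model is, while soft voting lets two confident models outweigh three marginal ones.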

Outcome

Prediction of essential hypertension complicated by coronary heart disease (evaluated by area under the curve [AUC] and accuracy).
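For readers less familiar with the two metrics, the sketch below computes AUC as the probability that a randomly chosen CHD case is scored higher than a randomly chosen control (ties count half), and accuracy at a fixed threshold. The toy labels and scores are invented for illustration, and the 0.5 threshold is an assumption.

```python
from typing import Sequence


def auc(y_true: Sequence[int], scores: Sequence[float]) -> float:
    """AUC as the fraction of (case, control) pairs ranked correctly."""
    pos = [s for y, s in zip(y_true, scores) if y == 1]
    neg = [s for y, s in zip(y_true, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("both classes must be present")
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0
               for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def accuracy(y_true: Sequence[int], scores: Sequence[float],
             threshold: float = 0.5) -> float:
    """Share of patients whose thresholded prediction matches the label."""
    preds = [int(s >= threshold) for s in scores]
    return sum(int(p == y) for p, y in zip(preds, y_true)) / len(y_true)


# Toy data: 1 = CHD, 0 = control.
y = [1, 1, 1, 0, 0, 0]
s = [0.9, 0.8, 0.4, 0.6, 0.3, 0.2]

toy_auc = auc(y, s)       # 8 of 9 pairs ranked correctly
toy_acc = accuracy(y, s)  # 4 of 6 patients classified correctly
print(round(toy_auc, 3), round(toy_acc, 3))
```

AUC depends only on how patients are ranked, whereas accuracy depends on the chosen threshold, so the two numbers can move independently.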

A soft voting ensemble machine learning model using routine electronic medical record data provides high discrimination (AUC 0.906) for early risk stratification of coronary heart disease in hypertensive patients.

Numerical result

Effect estimate: AUC 0.906 (95% CI 0.895-0.918) for the soft voting model vs. 0.783 for logistic regression

Absolute event rate: not applicable (0.906 and 0.783 are AUC values, not event rates)
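The summary does not say how the 95% confidence interval for the AUC was obtained. One common approach is a percentile bootstrap, sketched below under that assumption. The function names, the number of resamples, and the toy data are all hypothetical.

```python
import random
from typing import List, Sequence, Tuple


def auc(y_true: Sequence[int], scores: Sequence[float]) -> float:
    """AUC as the fraction of (case, control) pairs ranked correctly."""
    pos = [s for y, s in zip(y_true, scores) if y == 1]
    neg = [s for y, s in zip(y_true, scores) if y == 0]
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0
               for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def bootstrap_auc_ci(y_true: Sequence[int], scores: Sequence[float],
                     n_boot: int = 1000, alpha: float = 0.05,
                     seed: int = 0) -> Tuple[float, float]:
    """Percentile bootstrap interval for the AUC (assumed method)."""
    if len(set(y_true)) < 2:
        raise ValueError("both classes must be present")
    rng = random.Random(seed)
    n = len(y_true)
    stats: List[float] = []
    while len(stats) < n_boot:
        idx = [rng.randrange(n) for _ in range(n)]  # resample with replacement
        ys = [y_true[i] for i in idx]
        if len(set(ys)) < 2:
            continue  # skip resamples that lost one of the classes
        stats.append(auc(ys, [scores[i] for i in idx]))
    stats.sort()
    lower = stats[int((alpha / 2) * n_boot)]
    upper = stats[min(n_boot - 1, int((1 - alpha / 2) * n_boot))]
    return lower, upper


# Toy data: 1 = CHD, 0 = control.
y = [1, 1, 1, 1, 0, 0, 0, 0]
s = [0.9, 0.8, 0.6, 0.4, 0.7, 0.3, 0.2, 0.1]
ci_low, ci_high = bootstrap_auc_ci(y, s, n_boot=500)
print(ci_low, ci_high)
```

With only eight toy patients the interval is wide. On a cohort of several thousand patients the same procedure would be expected to give a much narrower interval, consistent in spirit with the tight range reported here.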

Limitations

Study data were derived from a single healthcare network in China, limiting generalizability.
The model relies on cross-sectional, single-timepoint EMR data.
No external validation in more diverse populations was performed.
Complex ensemble models have reduced interpretability.
No threshold optimization or decision-curve analysis was performed.

Abstract

Objectives: This study aimed to develop an optimized ensemble learning model to improve the prediction of hypertension complicated by coronary heart disease (CHD) through advanced feature selection and classifier fusion, thereby enhancing both accuracy and stability in risk assessment.

Methods: We constructed an ensemble-based predictive model using voting fusion to enhance early detection of hypertension complicated by CHD. The dataset comprised 2,487 patients with essential hypertension (EH) complicated by CHD and 3,904 non-CHD controls. Following data preprocessing procedures, including data cleaning and univariate and multivariate feature selection, an 18-dimensional feature set was derived. Five machine learning algorithms (logistic regression, random forest, XGBoost, CatBoost, and CART) were trained independently and subsequently integrated through a voting ensemble to optimize predictive performance.

Results: The voting fusion model outperformed all individual classifiers, achieving an area under the curve of 0.906 and an accuracy of 0.888 in predicting EH complicated by CHD.

Conclusions: The proposed ensemble model improves classification accuracy and robustness, offering a clinically useful tool for early risk stratification of hypertension-associated CHD. Although the model demonstrates strong predictive performance using cross-sectional data, its reliance on single-timepoint measurements and selected control populations necessitates further validation. Pending additional studies, this framework may serve as a supplementary decision-support tool within clinical informatics systems.


Cite This Study

Hassan et al. (2026) conducted a study in patients with essential hypertension, with and without coronary heart disease (n=6,391). A soft voting ensemble machine learning model combining random forest, XGBoost, CatBoost, CART, and logistic regression was compared with the individual classifiers and a hard voting ensemble on prediction of essential hypertension complicated by coronary heart disease, measured by area under the curve (AUC) and accuracy (ACC). The soft voting model achieved an AUC of 0.906 (95% CI 0.895-0.918) and an accuracy of 88.8%, outperforming individual classifiers including logistic regression (AUC 0.783, accuracy 70.6%).

synapsesocial.com/papers/698c1c53267fb587c655ea93 https://doi.org/10.4258/hir.2026.32.1.28
