
August 17, 2025 | Open Access

Optimizing Heart Disease Prediction with Random Forest and Ensemble Methods

Key Points

Random Forest achieved an accuracy of 87.04%, outperforming the other classifiers evaluated for heart disease prediction.
The study compares several ensemble classifiers and finds that Random Forest also leads on metrics such as precision and recall.
The analysis used 10-fold cross-validation on the Heart Disease Prediction Dataset from Kaggle.
The findings suggest Random Forest has strong potential for early heart disease risk prediction, but the results still need clinical validation.
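
For readers less familiar with the metrics named in the key points, the following is a minimal sketch of how accuracy, precision, recall, and F1 score are computed from confusion-matrix counts. The function name `classification_metrics` and the counts passed to it are hypothetical and chosen only for illustration; the source does not publish a confusion matrix.

```python
def classification_metrics(tp, fp, fn, tn):
    """Compute standard binary-classification metrics from confusion-matrix counts."""
    total = tp + fp + fn + tn
    accuracy = (tp + tn) / total      # share of all predictions that are correct
    precision = tp / (tp + fp)        # share of predicted positives that are real
    recall = tp / (tp + fn)           # share of real positives that are found
    # F1 is the harmonic mean of precision and recall.
    f1 = 2 * precision * recall / (precision + recall)
    return {"accuracy": accuracy, "precision": precision, "recall": recall, "f1": f1}


# Hypothetical counts, not taken from the study.
metrics = classification_metrics(tp=40, fp=10, fn=5, tn=45)
for name, value in metrics.items():
    print(f"{name}: {value:.4f}")
```

In a screening setting, recall is often watched most closely, because a false negative corresponds to a patient with heart disease who is not flagged.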

Abstract

This study evaluates ensemble learning techniques for optimizing heart disease prediction, with a focus on Random Forest because of its robustness in handling complex medical data. The dataset used, the "Heart Disease Prediction Dataset" from Kaggle, consists of 270 instances and 13 features such as age, cholesterol, and family history. Data preprocessing involved mean imputation for missing values and min-max normalization. The study compares Random Forest with other ensemble classifiers (AdaBoost, Gradient Boosting, and XGBoost) using 10-fold cross-validation and the evaluation metrics accuracy, precision, recall, and F1 score. Results show that Random Forest outperforms the other models, with an accuracy of 87.04%, precision of 85.00%, recall of 80.95%, and F1 score of 82.93%. These findings emphasize Random Forest's ability to maintain stable predictions across varied medical attributes and imbalanced data. Although the study highlights Random Forest as a promising method for early heart disease risk prediction, it remains a computational evaluation and requires clinical validation. The results aim to inform the development of predictive tools for enhancing early diagnosis and preventive strategies in healthcare systems.
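
The workflow described in the abstract (mean imputation, min-max normalization, then 10-fold cross-validation over several ensemble classifiers) could be sketched roughly as follows with scikit-learn. This is not the authors' code. Several choices here are assumptions: the data is a synthetic stand-in generated with `make_classification` to match the stated shape (270 rows, 13 features), the folds are stratified and shuffled, all hyperparameters are library defaults, and XGBoost is left out so the sketch depends on scikit-learn only.

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.ensemble import (
    AdaBoostClassifier,
    GradientBoostingClassifier,
    RandomForestClassifier,
)
from sklearn.impute import SimpleImputer
from sklearn.model_selection import StratifiedKFold, cross_validate
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import MinMaxScaler

# Synthetic stand-in with the same shape as the dataset (assumption, not the Kaggle data).
X, y = make_classification(
    n_samples=270, n_features=13, n_informative=6, random_state=0
)

# Blank out a few values so the imputation step has something to fill.
rng = np.random.default_rng(0)
X[rng.random(X.shape) < 0.02] = np.nan

# Classifiers to compare; hyperparameters are defaults, not the study's settings.
models = {
    "Random Forest": RandomForestClassifier(n_estimators=100, random_state=0),
    "AdaBoost": AdaBoostClassifier(random_state=0),
    "Gradient Boosting": GradientBoostingClassifier(random_state=0),
}

# 10-fold cross-validation; stratification and shuffling are assumptions.
cv = StratifiedKFold(n_splits=10, shuffle=True, random_state=0)
scoring = ["accuracy", "precision", "recall", "f1"]

results = {}
for name, model in models.items():
    # Preprocessing lives inside the pipeline so it is refit on each training fold.
    pipe = make_pipeline(SimpleImputer(strategy="mean"), MinMaxScaler(), model)
    scores = cross_validate(pipe, X, y, cv=cv, scoring=scoring)
    results[name] = {m: scores[f"test_{m}"].mean() for m in scoring}

for name, metrics in results.items():
    print(name, {m: round(float(v), 4) for m, v in metrics.items()})
```

Keeping the imputer and scaler inside the pipeline avoids leaking statistics from the held-out fold into training. Because the data is synthetic, the printed scores will not match the figures reported in the study.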

Authors

Imran Amin

Setyawan Wibisono

Endang Lestariningsih

Journals

CogITo Smart Journal
