Objective Mother’s health risks should be identified early so that the outcome of the pregnancy can be enhanced and the complications experienced during pregnancy can be minimized. This paper will design and test a leakage-regulated hybrid machine learning model to predict maternal health risk using the optimized ensemble models. Methods It used the publicly available UCI Maternal Health Risk dataset (n = 1015). A fixed random seed was used to stratify the dataset to 80% training and 20% independent testing subsets (42). SMOTENN resampling was only done to the training data to avoid data leakage. Internal cross-validation was resorted to as a means of hyperparameter tuning. We came up with the optimized XGBoost, blending, and hybrid stacking models. The performance of a model was measured in terms of accuracy, precision, recall, F1 score, ROC-AUC, confusion analysis, and probability mean squared error (Brier score). Results The hybrid stacking model had a ROC-AUC of 0.911 and general accuracy of 80 percent over the independent test set. The model proved to be very sensitive with high-risk cases (recall = 0.85). The probability mean squared error (Brier score) was 0.07, which is good probability calibration. The hybrid framework proposed performed better in terms of discriminative capability as compared to baseline models (logistic regression, random forest, and SVM). Conclusions The suggested leakage-sensitive hybrid ensemble framework offers strong and clinically significant working outcomes on maternal health risk forecasting. The results show the significance of effective validation techniques and probabilistic evaluation measures in healthcare machine learning systems.
Altaie et al. (Sun,) studied this question.