What question did this study set out to answer?

This study aims to identify significant predictors of cervical cancer among women living with HIV and determine the best machine learning model for prediction.

April 27, 2026Open Access

Predicting cervical cancer among women living with HIV/AIDS at public health facilities in a resource-limited setting in Ethiopia using machine learning analysis

Key Points

This study aims to identify significant predictors of cervical cancer among women living with HIV and determine the best machine learning model for prediction.
Multi-center, cross-sectional design using secondary datasets from four antiretroviral therapy clinics in central Debre Markos.
Implemented seven machine learning models including Logistic Regression, Random Forest, K-Nearest Neighbors, and others to evaluate performance.
Model performance assessed using confusion matrix and area under the receiver operating characteristic curve.
K-Nearest Neighbors model achieved the highest accuracy of 98% and area under the curve of 0.68.
Key predictors included adherence at enrollment, screening visit type, nutritional status, months on antiretroviral therapy, follow-up status, and weight.
Strengthening nutritional support, improving follow-up, and enhancing adherence counseling may reduce cervical cancer risk.

Abstract

Cervical cancer is a malignancy associated with human immunodeficiency virus, characterized by abnormal cervical cell mutations. Machine learning techniques offer valuable support for early detection and prediction of cervical cancer, potentially lowering screening and treatment costs. This study specifically targeted women living with human immunodeficiency virus, aiming to identify the most significant predictors of cervical cancer and to determine the most effective supervised machine learning model for its prediction within this population. This study employed a multi-center, cross-sectional design using a secondary dataset from the smart care systems of four antiretroviral therapy clinics in central Debre Markos town. To determine the most relevant predictors, seven machine learning models, Logistic Regression, Random Forest, K-Nearest Neighbors, Support Vector Machine, Decision Tree, Extreme Gradient Boosting, and AdaBoost were implemented to identify the top-performing model. Model performance was assessed using the confusion matrix and the Area under the Receiver Operating Characteristic Curve. The findings indicated that adherence at enrollment, screening visit type, and nutritional status, months on anti-retroviral therapy, follow-up status, and weight were highly important predictors of cervical cancer. Among the evaluated models, the K-Nearest Neighbors model outperformed the others, achieving the highest accuracy of 98% and an Area under the Receiver Operating Characteristic Curve of 0.68. As demonstrated in this study, the K-Nearest Neighbors model showed the best performance in effectively predicting cervical cancer among women living with human immunodeficiency virus. Strengthening nutritional support interventions, improving follow-up mechanisms, and enhancing anti-retroviral therapy adherence counseling programs may collectively contribute to reducing the risk of cervical cancer among women living with human immunodeficiency virus. Future research should focus on validating the predictive model across diverse geographic regions and healthcare contexts to enhance its generalizability, robustness, and practical applicability.

Bookmark

View Full Paper

Bookmark

View Full Paper

Predicting cervical cancer among women living with HIV/AIDS at public health facilities in a resource-limited setting in Ethiopia using machine learning analysis

Key Points

Abstract

Cite This Study