This study develops a diabetes prediction model utilizing machine learning (ML) algorithms in conjunction with feature selection techniques to enhance predictive accuracy and computational efficiency. The dataset, sourced from 2023 diabetes screening records provided by the Yala Provincial Public Health Office in Thailand, was preprocessed using Min-Max and Z-Score normalization methods to ensure consistency. Nine ML algorithms were evaluated: Logistic Regression (LR), Decision Trees (DT), Classification and Regression Trees (CART), Bayesian Classifiers, K-Nearest Neighbors (KNN), Random Forest (RF), Multilayer Perceptron (MLP), XGBoost, CatBoost, and Gradient Boosting Machines (GBM). Model performance was assessed using 5-fold cross-validation and evaluated based on Accuracy, Precision, Recall, F1-Score, and ROC-AUC. Logistic Regression outperformed all other models, achieving an accuracy of 97.53%, a precision of 97.53%, a recall of 100%, and an F1-Score of 98.75%. Additionally, the study explored feature selection techniques, specifically Principal Component Analysis (PCA) and Random Forest, to reduce computational complexity while maintaining performance. Random Forest demonstrated greater efficiency than PCA in feature selection. These findings highlight the effectiveness of machine learning (ML) models in developing scalable, real-time diabetes screening systems. The proposed approach is particularly beneficial in resource-limited healthcare settings where traditional diagnostic tools may be inaccessible. By enhancing clinical decision-support systems, this research makes a practical contribution to improving diabetes care, enabling earlier interventions, and facilitating more accurate risk stratification for both patients and healthcare providers.
Building similarity graph...
Analyzing shared references across papers
Loading...
Thanakorn Pamutha
Wanchana Promthong
Sofwan Pahlawan
International Journal on Advanced Science Engineering and Information Technology
Building similarity graph...
Analyzing shared references across papers
Loading...
Pamutha et al. (Thu,) studied this question.
www.synapsesocial.com/papers/68af5bb6ad7bf08b1eadf74a — DOI: https://doi.org/10.18517/ijaseit.15.4.20457