Introduction Sarcopenia, a progressive age-related loss of skeletal muscle mass and strength, represents a growing public health challenge amid global population aging. Early detection remains difficult with conventional diagnostic approaches. Methods This study aimed to develop and validate reliable machine learning (ML) models to identify key risk factors for sarcopenia in community hospital settings. Using retrospective data from 1, 650 patients at a community health center, we collected comprehensive demographic, clinical, and lifestyle variables. Twelve ML models—including Random Forest, Support Vector Machine, XGBoost, and Logistic Regression—were constructed and evaluated using 5-fold cross validation. Results The CatBoost, LightGBM, and Gradient Boosting Decision Tree models demonstrated superior predictive performance, with area under the receiver operating characteristic curve (AUROC) values of 0. 999, 0. 996, and 0. 995, respectively. SHapley Additive exPlanations (SHAP) analysis revealed that SARCCalₛcore, body mass index (BMI), and age belong to the most influential predictors, while a greater chronic disease burden was positively associated with sarcopenia risk. Conclusion In conclusion, ML models show substantial potential for clinical application in identifying sarcopenia risk, thereby supporting early intervention strategies. This approach enhances detection capabilities and provides a practical tool for individualized treatment planning in community-based elderly care. Future research should integrate additional biomarkers and environmental factors to further improve model accuracy and facilitate integration into clinical workflows.
Zhao et al. (Thu,) studied this question.