What does this research mean for the field?

Machine learning models can accurately predict the risk of sarcopenia in community hospital patients, with AUROC values exceeding 0.99. Novelty: ClaimNovelty.NOVEL_FINDING. Consensus alignment: ConsensusAlignment.NEUTRAL.

What question did this study set out to answer?

The aim is to develop and validate machine learning models for identifying risk factors of sarcopenia in community hospital settings.

March 1, 2026Open Access

Development and validation of a machine learning-based risk prediction model for sarcopenia in community hospital patients: a retrospective cohort study

Key Points

The aim is to develop and validate machine learning models for identifying risk factors of sarcopenia in community hospital settings.
Used data from 1,650 patients in a community health center
Collected demographic, clinical, and lifestyle variables
Constructed and evaluated twelve machine learning models including Random Forest and XGBoost
Used 5-fold cross validation for model evaluation
CatBoost, LightGBM, and Gradient Boosting models showed high predictive performance with AUROC values of 0.999, 0.996, and 0.995
SARC_Cal_score, BMI, and age identified as influential predictors
Greater chronic disease burden positively associated with sarcopenia risk

Abstract

Introduction Sarcopenia, a progressive age-related loss of skeletal muscle mass and strength, represents a growing public health challenge amid global population aging. Early detection remains difficult with conventional diagnostic approaches. Methods This study aimed to develop and validate reliable machine learning (ML) models to identify key risk factors for sarcopenia in community hospital settings. Using retrospective data from 1, 650 patients at a community health center, we collected comprehensive demographic, clinical, and lifestyle variables. Twelve ML models—including Random Forest, Support Vector Machine, XGBoost, and Logistic Regression—were constructed and evaluated using 5-fold cross validation. Results The CatBoost, LightGBM, and Gradient Boosting Decision Tree models demonstrated superior predictive performance, with area under the receiver operating characteristic curve (AUROC) values of 0. 999, 0. 996, and 0. 995, respectively. SHapley Additive exPlanations (SHAP) analysis revealed that SARCCalₛcore, body mass index (BMI), and age belong to the most influential predictors, while a greater chronic disease burden was positively associated with sarcopenia risk. Conclusion In conclusion, ML models show substantial potential for clinical application in identifying sarcopenia risk, thereby supporting early intervention strategies. This approach enhances detection capabilities and provides a practical tool for individualized treatment planning in community-based elderly care. Future research should integrate additional biomarkers and environmental factors to further improve model accuracy and facilitate integration into clinical workflows.

Bookmark

View Full Paper

Cite This Study

Zhao et al. (Thu,) studied this question.

synapsesocial.com/papers/69a3d79dec16d51705d2dd70 https://doi.org/https://doi.org/10.3389/fragi.2026.1772792

Bookmark

View Full Paper