Los puntos clave no están disponibles para este artículo en este momento.
Background Childhood obesity represents a significant global public health challenge. Accurate and rapid prediction models for identifying obesity risk in children and adolescents are essential for facilitating early prevention and enabling timely interventions. However, interpretable and user-friendly obesity risk prediction models based on nationally representative data remain limited. This study aimed to develop and validate a model to predict current obesity risk among children and adolescents in China. Methods The models were developed using cross-sectional data from the 2017–2018 Physical Activity and Fitness in China—The Youth Study (PAFCTYS; n = 35,016) and were temporally validated with 2020 data ( n = 3,495). Candidate predictors ( n = 38), primarily encompassing physical activity, sedentary behavior, and sociodemographic variables, were measured concurrently with the outcome. The participants included individuals from 31 administrative regions across China. Features were identified using the Least Absolute Shrinkage and Selection Operator (LASSO) with recursive feature elimination (RFE), and predictive models were developed using eight different machine learning algorithms. The model’s performance was evaluated using metrics such as AUC, sensitivity, specificity, Matthews Correlation Coefficient (MCC), and Brier score. The model was interpreted using the SHapley Additive exPlanation (SHAP) method. Results The random forest (RF) model outperformed all other models, achieving an AUC of 1.000 on the training set and 0.946 on the testing set. It also maintained good discriminative performance on the temporal validation dataset, with an AUC of 0.810. The RF model also shows the highest accuracy, specificity, and MCC, and the lowest Brier score. SHAP analysis indicates that parental BMI, using mobile electronic devices on weekdays, MVPA (moderate-to-vigorous physical activity), watching TV on weekdays, and sex are the top 5 most important features in the obesity risk prediction model. A web-based risk calculator based on the RF model has been successfully deployed. Conclusion Using a large, nationally representative sample and readily available variables, we successfully developed and validated an obesity risk prediction model for Chinese children and adolescents using explainable machine learning techniques. The model can quickly and accurately assess an individual’s current obesity risk, enabling healthcare agencies, schools, and parents to conduct large-scale obesity risk screening.
Chen et al. (Tue,) studied this question.