What question did this study set out to answer?

The study aims to create and validate a machine learning model that predicts depression risk in elderly patients with gastrointestinal or chronic liver diseases.

February 27, 2026 | Open Access

A longitudinal cohort study: developing an interpretable machine learning model to predict incident depression risk in elderly Chinese patients with gastrointestinal or chronic liver diseases

Key points

Conducted a longitudinal cohort study using data from the China Health and Retirement Longitudinal Study (CHARLS).
Selected potential predictors at baseline using Least Absolute Shrinkage and Selection Operator (LASSO) regression.
Employed ten machine learning algorithms to develop prediction models.
Evaluated model performance using metrics including AUC, sensitivity, specificity, and F1-score.
Applied the SHAP framework to interpret feature contributions.
Identified ten key predictors for depression risk among 1,353 participants.
Achieved optimal model performance with a Logistic Regression model, showing an AUC of 0.723 (95% CI: 0.674–0.772).
Top predictors included self-reported health, life satisfaction, gender, education, and memory scores.
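The discrimination and threshold metrics listed above can be illustrated with a minimal, standard-library sketch. The function names (`auc_score`, `threshold_metrics`), the 0.5 decision threshold, and the toy labels and scores are assumptions made for illustration; they are not taken from the study, which does not state its implementation here.

```python
from typing import Dict, Sequence


def auc_score(y_true: Sequence[int], y_score: Sequence[float]) -> float:
    """AUC as the probability that a random positive case outranks a random negative one."""
    pos = [s for y, s in zip(y_true, y_score) if y == 1]
    neg = [s for y, s in zip(y_true, y_score) if y == 0]
    if not pos or not neg:
        raise ValueError("AUC needs at least one positive and one negative case")
    # A tie between a positive and a negative score counts as half a win.
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def threshold_metrics(
    y_true: Sequence[int], y_score: Sequence[float], threshold: float = 0.5
) -> Dict[str, float]:
    """Sensitivity, specificity, precision and F1 at a fixed threshold (0.5 is an assumption)."""
    y_pred = [1 if s >= threshold else 0 for s in y_score]
    tp = sum(1 for y, p in zip(y_true, y_pred) if y == 1 and p == 1)
    tn = sum(1 for y, p in zip(y_true, y_pred) if y == 0 and p == 0)
    fp = sum(1 for y, p in zip(y_true, y_pred) if y == 0 and p == 1)
    fn = sum(1 for y, p in zip(y_true, y_pred) if y == 1 and p == 0)

    def ratio(num: float, den: float) -> float:
        # Return 0.0 instead of dividing by zero when a class is absent.
        return num / den if den else 0.0

    sensitivity = ratio(tp, tp + fn)
    specificity = ratio(tn, tn + fp)
    precision = ratio(tp, tp + fp)
    f1 = ratio(2 * precision * sensitivity, precision + sensitivity)
    return {
        "sensitivity": sensitivity,
        "specificity": specificity,
        "precision": precision,
        "f1": f1,
    }


# Toy data, invented for illustration only (1 = depression at follow-up).
toy_labels = [1, 0, 1, 0, 0, 1]
toy_scores = [0.9, 0.2, 0.6, 0.7, 0.1, 0.4]
print(auc_score(toy_labels, toy_scores))
print(threshold_metrics(toy_labels, toy_scores))
```

AUC does not depend on a threshold, whereas sensitivity, specificity, precision, and F1 all change with the chosen cut-off, which is one reason the two kinds of metric are typically reported together.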

Abstract

Depression is highly prevalent in elderly patients with gastrointestinal diseases (GID) or chronic liver diseases (CLD), significantly impairing quality of life and treatment outcomes. This study aimed to develop and validate an interpretable machine learning (ML) model to identify depression risk in this population, overcoming the “black box” limitation of conventional ML. This prospective analysis used data from the baseline (2018) and follow-up (2020) waves of the China Health and Retirement Longitudinal Study (CHARLS). Potential predictors measured at baseline were selected via Least Absolute Shrinkage and Selection Operator (LASSO) regression. The outcome was incident depression at the 2020 follow-up, defined by a CES-D-10 score ≥ 10 among participants free of depression at baseline. Ten ML algorithms were employed to construct models. Performance was evaluated using the area under the receiver operating characteristic curve (AUC), sensitivity, specificity, precision, F1-score, calibration curves, and decision curve analysis. The SHapley Additive exPlanations (SHAP) framework was used to interpret feature contributions. Among 1,353 participants (424 with depression), LASSO identified 10 key predictors. The Logistic Regression (LR) model demonstrated the best discriminative performance, with an AUC of 0.723 (95% CI: 0.674–0.772). SHAP analysis identified the top five predictors as self-reported health, life satisfaction, gender, education, and memory scores. We developed an interpretable ML model for predicting depression risk in elderly patients with GID or CLD. This tool may aid early detection and intervention, potentially improving clinical outcomes in this vulnerable population.
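The outcome definition in the abstract (CES-D-10 score ≥ 10 at follow-up among participants below that cut-off at baseline) can be sketched as a small labeling step. The `Participant` record, its field names, and the decision to drop participants with a missing follow-up score are hypothetical choices made for this illustration; the abstract does not describe how missing data were handled.

```python
from dataclasses import dataclass
from typing import Dict, Iterable, Optional

CESD10_CUTOFF = 10  # depression threshold stated in the abstract


@dataclass
class Participant:
    pid: str                         # hypothetical participant identifier
    cesd10_baseline: int             # CES-D-10 score at the 2018 wave
    cesd10_followup: Optional[int]   # CES-D-10 score at the 2020 wave, None if missing


def incident_depression_labels(cohort: Iterable[Participant]) -> Dict[str, int]:
    """Map participant id to 1 (incident depression) or 0 for the at-risk cohort."""
    labels: Dict[str, int] = {}
    for p in cohort:
        if p.cesd10_baseline >= CESD10_CUTOFF:
            # Already depressed at baseline, so not at risk of *incident* depression.
            continue
        if p.cesd10_followup is None:
            # Assumption: participants without a follow-up score are excluded.
            continue
        labels[p.pid] = int(p.cesd10_followup >= CESD10_CUTOFF)
    return labels


# Invented example records, for illustration only.
example_cohort = [
    Participant("a", cesd10_baseline=4, cesd10_followup=7),
    Participant("b", cesd10_baseline=9, cesd10_followup=10),
    Participant("c", cesd10_baseline=12, cesd10_followup=15),
    Participant("d", cesd10_baseline=3, cesd10_followup=None),
]
print(incident_depression_labels(example_cohort))
```

Restricting the cohort to participants below the cut-off at baseline is what makes the predicted outcome incident rather than prevalent depression.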


Cite This Study

Chen et al. studied this question.

synapsesocial.com/papers/69a134dded1d949a99abe5e9
https://doi.org/10.1186/s12877-026-07239-7
