A LightGBM model predicted US county-level diabetes prevalence (variance = 0.952), identifying physical inactivity, racial/ethnic minority status, physical distress, and obesity as top predictors.
Cross-Sectional
Yes
What are the most important predictors of US county-level diabetes prevalence using a LightGBM model?
US county-level datasets incorporating 27 predictor variables
Light Gradient Boosting Machine (LightGBM) model
County-level diabetes prevalence
Physical inactivity, racial and ethnic minority status, frequent physical distress, and obesity are the strongest predictors of US county-level diabetes prevalence according to a machine learning analysis.
BACKGROUND: Diabetes and its risk factors are embedded in a complex multilevel ecology. Upstream factors (i.e., 'forcing factors' that refer to fundamental population-based social, economic, and political structures shaping health outcomes long before disease develops) and downstream risk factors (i.e., the individual-level characteristics and outcomes that result from upstream forcing factors) are to be considered when predicting diabetes. This study aims to predict diabetes prevalence at the United States' (US) county level using analytical methods that account for the contextual complexity of diabetes. METHODS: US county-level datasets incorporating 27 predictor variables were analysed cross-sectionally. A Light Gradient Boosting Machine (LightGBM) model was trained to predict county-level diabetes prevalence, after which model performance and feature importance were evaluated. FINDINGS: = 0.952) of the variance in county-level diabetes prevalence. Physical inactivity, racial and ethnic minority status, frequent physical distress, and obesity showed the highest Shapley (SHAP) importance values, which exceeded all remaining SHAP values. INTERPRETATION: This study delineates the most important predictors of diabetes prevalence based on a complex multilevel ecology. Efforts to reduce diabetes prevalence should address both upstream and downstream risk factors emphasising physical activity, obesity, equity, and cultural considerations.
Building similarity graph...
Analyzing shared references across papers
Loading...
Nicolaas P. Pronk
University of Minnesota
S Wang
Illinois Department of Natural Resources
RM Bergenstal
International Diabetes Federation
Diabetes/Metabolism Research and Reviews
University of Minnesota
University of Illinois Chicago
University of Minnesota System
Building similarity graph...
Analyzing shared references across papers
Loading...
Pronk et al. (Mon,) conducted a cross-sectional in Diabetes. Upstream and downstream risk factors was evaluated on County-level diabetes prevalence. A LightGBM model predicted US county-level diabetes prevalence (variance = 0.952), identifying physical inactivity, racial/ethnic minority status, physical distress, and obesity as top predictors.
synapsesocial.com/papers/6a2117dfd499ed480b170a5f — DOI: https://doi.org/10.1002/dmrr.70184