What question did this study set out to answer?

The aim is to benchmark various machine learning methods for predicting blood glucose levels in individuals with type 1 diabetes.

April 19, 2026Open Access

A Comprehensive Benchmark of Machine Learning Methods for Blood Glucose Prediction in Type 1 Diabetes: A Multi-Dataset Evaluation

Key Points

The aim is to benchmark various machine learning methods for predicting blood glucose levels in individuals with type 1 diabetes.
Evaluated ten machine learning models on two multi-patient datasets with a total of 34 subjects.
Used consistent preprocessing and temporal splitting for a fair comparison.
Included a novel hybrid model combining LightGBM and stochastic differential equation-based simulations.
The Hybrid LightGBM-SDE model outperformed all other models across multiple prediction horizons.
Recorded RMSE values decreasing from 22.42 mg/dL at 15 min to 37.22 mg/dL at 120 min.
99.7% of predictions at the 30 min horizon were within the acceptable Clarke Error Grid zones A and B.

Abstract

Managing blood glucose in type 1 diabetes (T1D) remains a daily clinical challenge, and accurate short-term prediction of glucose levels can meaningfully improve insulin dosing decisions while reducing the risk of dangerous hypoglycaemic episodes. Although numerous machine learning approaches have been proposed for this task, comparing their relative merits is difficult because published studies differ widely in datasets, preprocessing choices, and evaluation criteria. In this work, we address this research gap by benchmarking ten machine learning methods—from a naïve persistence baseline through classical linear regressors, gradient-boosted ensembles, and recurrent neural networks to a novel hybrid that couples LightGBM with stochastic differential equation (SDE)-based glucose–insulin simulation—on two multi-patient datasets comprising 34 T1D subjects, across prediction horizons of 15, 30, 60, and 120 min. Every method is trained and tested under identical preprocessing and temporal splitting conditions to ensure a fair comparison. The proposed Hybrid LightGBM-SDE model consistently outperforms all alternatives, recording RMSE values of 22.42 mg/dL at 15 min, 28.74 mg/dL at 30 min, 33.89 mg/dL at 60 min, and 37.22 mg/dL at 120 min—an improvement of between 13.6% and 27.0% relative to standalone LightGBM. At the clinically important 30 min horizon, 99.7% of predictions lie within the acceptable A and B zones of the Clarke Error Grid. Wilcoxon signed-rank tests confirm that performance differences are statistically significant (p < 10−10), and SHAP-based analysis shows that the SDE-derived simulation features are among the most influential predictors, especially at longer horizons. All source code and evaluation scripts are publicly released to support reproducibility. Due to temporary data access constraints, all experiments reported here use physics-based synthetic datasets generated from the Bergman minimal model, replicating the structural properties of the D1NAMO and HUPA-UCM collections; validation on the original clinical recordings is planned. Among the two synthetic datasets, the D1NAMO-equivalent cohort (nine patients) proves more challenging, with systematically higher per-patient RMSE variance. The clinically acceptable prediction accuracy at the 30 min horizon (99.7% in Clarke zones A + B) suggests potential for integration into insulin dosing decision-support systems.

Read Full Paperexternally

AI से पूछें

Bookmark

View Full Paper

Cite This Study

Kolev et al. (Fri,) studied this question.

synapsesocial.com/papers/69e47321010ef96374d8f032 https://doi.org/https://doi.org/10.3390/app16083928

Also Consider

Synapse has enriched 5 closely related papers on similar clinical questions. Consider them for comparative context:

AI से पूछें

Bookmark

View Full Paper