From raw data to actionable insights: preprocessing real-world data for machine learning in diabetes care | Synapse