TLDR: Glucose-ML is a new collection of 10 publicly available, longitudinal diabetes datasets, offering over 300,000 days of CGM data from 2500+ individuals. It aims to accelerate robust AI development by providing diverse data and includes a comparative analysis and a case study demonstrating how dataset choice significantly impacts AI model performance for blood glucose prediction.
Artificial intelligence (AI) is becoming increasingly vital in managing diabetes, offering advanced solutions for screening, decision support, and overall management. However, a significant challenge in this field has been the limited access to large, high-quality datasets, which hinders the development of reliable and robust AI algorithms.
To address this critical gap, researchers have introduced Glucose-ML, a comprehensive collection of 10 publicly available diabetes datasets. These datasets, released between 2018 and 2025, are designed to accelerate the creation of transparent, reproducible, and robust AI solutions for diabetes care. The Glucose-ML collection is impressive in its scale, encompassing over 300,000 days of continuous glucose monitor (CGM) data, with a staggering total of 38 million glucose samples. This rich data comes from more than 2500 individuals across four different countries, including people living with type 1 diabetes, type 2 diabetes, prediabetes, and even those without diabetes.
The creators of Glucose-ML have gone beyond just compiling data; they also provide a detailed comparative analysis of each dataset within the collection. This analysis is invaluable for algorithm developers, guiding them in selecting the most appropriate data for their specific AI solutions. The datasets are diverse, including not only glucose readings but also other relevant information such as insulin delivery data, activity tracker metrics, user-generated logs, and clinical measurements.
Understanding Data Impact: A Case Study in Blood Glucose Prediction
To demonstrate the practical utility of Glucose-ML and highlight the importance of data selection, the researchers conducted a case study focusing on blood glucose prediction, one of the most common AI tasks in diabetes management. They used two simple baseline algorithms: a ‘zero-order hold’ predictor (which assumes the future glucose value will be the same as the current one) and a ‘simple linear regression’ predictor. The goal was to predict blood glucose levels 30 minutes in advance across all 10 datasets.
The findings from this case study were significant. It was observed that the same AI algorithm could produce substantially different prediction results when developed and evaluated using different datasets. For instance, the zero-order hold predictor achieved its lowest error (RMSE of 16.1 mg/dL) on the BIG IDEAs dataset, which primarily includes individuals with prediabetes and no diabetes, indicating more stable glucose levels. Conversely, the same method showed its highest error (RMSE of 28.14 mg/dL) on the DiaTrend dataset, which features more dynamic and challenging glucose profiles from individuals with type 1 diabetes. This stark difference underscores how the characteristics of the dataset directly influence an AI model’s performance.
Also Read:
- Automated GPU Code Optimization: Introducing CUDA-L1’s Reinforcement Learning Approach
- AI Models Offer New Insights into Bridge Health Through Non-Destructive Evaluation
Recommendations for Robust AI Development
Based on their extensive work with Glucose-ML, the researchers offer key recommendations for developing robust AI solutions in diabetes and broader health domains:
- Data Selection: It is crucial to use multiple datasets that represent diverse subgroups and variations within the target population. This includes considering differences in glycemic control, age, geographical location, and ethnicity.
- Model Design: New AI models should always be benchmarked against simple, ‘naïve’ baselines, such as the zero-order hold predictor, to provide a clear reference for performance.
- Model Evaluation: AI solutions should be evaluated on publicly available datasets in addition to any private data. It’s also vital to avoid smoothing or interpolating missing data in test sets, as this can lead to inaccurate performance estimates.
The Glucose-ML project represents a significant step forward in making high-quality, real-world diabetes data accessible to the research community. By providing a diverse and comprehensive collection, along with insights into data characteristics and their impact on AI performance, Glucose-ML aims to foster more transparent, reproducible, and ultimately, more effective AI solutions for diabetes management. You can find more details about this valuable resource in the full research paper: Glucose-ML: A collection of longitudinal diabetes datasets for development of robust AI solutions.


