4.7 Article

Predicting adverse outcomes due to diabetes complications with machine learning using administrative health data

Journal

NPJ DIGITAL MEDICINE
Volume 4, Issue 1, Pages -

Publisher

NATURE PORTFOLIO
DOI: 10.1038/s41746-021-00394-8

Keywords

-

Funding

  1. ICES
  2. MOHLTC
  3. Canadian Institutes for Health Information (CIHI), Immigration, Refugees, and Citizenship Canada (IRCC)
  4. Office of the Registrar General (ORG) - Connaught Global Challenge Award [2018/19]
  5. Canada Research Chair in Population Health Analytics
  6. Ontario Graduate Scholarship
  7. Canadian Institutes of Health Research Banting and Best Canada Graduate Scholarship-Master's awards
  8. MITACS Accelerate Scholarship

Ask authors/readers for more resources

This study aimed to develop a machine learning-based model for predicting adverse outcomes due to diabetes complications using administrative health data in Ontario, Canada. The model had strong discrimination with an average test AUC of 77.7 and demonstrated the potential of machine learning and administrative health data for informing health planning and healthcare resource allocation for diabetes management at the population level.
Across jurisdictions, government and health insurance providers hold a large amount of data from patient interactions with the healthcare system. We aimed to develop a machine learning-based model for predicting adverse outcomes due to diabetes complications using administrative health data from the single-payer health system in Ontario, Canada. A Gradient Boosting Decision Tree model was trained on data from 1,029,366 patients, validated on 272,864 patients, and tested on 265,406 patients. Discrimination was assessed using the AUC statistic and calibration was assessed visually using calibration plots overall and across population subgroups. Our model predicting three-year risk of adverse outcomes due to diabetes complications (hyper/hypoglycemia, tissue infection, retinopathy, cardiovascular events, amputation) included 700 features from multiple diverse data sources and had strong discrimination (average test AUC = 77.7, range 77.7-77.9). Through the design and validation of a high-performance model to predict diabetes complications adverse outcomes at the population level, we demonstrate the potential of machine learning and administrative health data to inform health planning and healthcare resource allocation for diabetes management.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available