4.7 Article

Early Prediction of Gestational Diabetes Mellitus in the Chinese Population via Advanced Machine Learning

Journal

JOURNAL OF CLINICAL ENDOCRINOLOGY & METABOLISM
Volume 106, Issue 3, Pages E1191-E1205

Publisher

ENDOCRINE SOC
DOI: 10.1210/clinem/dgaa899

Keywords

GDM; early prediction; machine learning models; early pregnancy; BMI; thyroxine

Funding

  1. National Key Research and Development Program of China [2018YFC1002804, 2016YFC1000203]
  2. National Natural Science Foundation of China [81671412, 81661128010]
  3. Program of Shanghai Academic Research Leader [20XD1424100]
  4. Outstanding Youth Medical Talents of Shanghai Rising Stars of Medical Talent Youth Development Program, Chinese Academy of Medical Sciences (CAMS) Innovation Fund for Medical Sciences [2019-12M-5-064]
  5. Foundation of Shanghai Municipal Commission of Health and Family Planning [20144Y0110]
  6. Natural Science Foundation of Shanghai [20511101900, 20ZR1427200]
  7. Shanghai Shenkang Hospital Development Center
  8. Clinical Technology Innovation Project [SHDC12019107]
  9. Clinical Skills Improvement Foundation of Shanghai Jiaotong University School of Medicine [JQ201717]

Ask authors/readers for more resources

This study developed effective models for predicting early gestational diabetes mellitus in the Chinese population using machine learning approaches and investigated the relationship between GDM, thyroxine, and BMI.
Context: Accurate methods for early gestational diabetes mellitus (GDM) (during the first trimester of pregnancy) prediction in Chinese and other populations are lacking. Objectives: This work aimed to establish effective models to predict early GDM. Methods: Pregnancy data for 73 variables during the first trimester were extracted from the electronic medical record system. Based on a machine learning (ML)-driven feature selection method, 17 variables were selected for early GDM prediction. To facilitate clinical application, 7 variables were selected from the 17-variable panel. Advanced ML approaches were then employed using the 7-variable data set and the 73-variable data set to build models predicting early GDM for different situations, respectively. Results: A total of 16 819 and 14 992 cases were included in the training and testing sets, respectively. Using 73 variables, the deep neural network model achieved high discriminative power, with area under the curve (AUC) values of 0.80. The 7-variable logistic regression (LR) model also achieved effective discriminate power (AUC = 0.77). Low body mass index (BMI) (<= 17) was related to an increased risk of GDM, compared to a BMI in the range of 17 to 18 (minimum risk interval) (11.8% vs 8.7%, P = .09). Total 3,3,5-triiodothyronine (T3) and total thyroxin (T4) were superior to free T3 and free T4 in predicting GDM. Lipoprotein(a) was demonstrated a promising predictive value (AUC = 0.66). Conclusions: We employed ML models that achieved high accuracy in predicting GDM in early pregnancy. A clinically cost-effective 7-variable LR model was simultaneously developed. The relationship of GDM with thyroxine and BMI was investigated in the Chinese population.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available