4.7 Article

Quantifying the Impacts of Pre- and Post-Conception TSH Levels on Birth Outcomes: An Examination of Different Machine Learning Models

期刊

FRONTIERS IN ENDOCRINOLOGY
卷 12, 期 -, 页码 -

出版社

FRONTIERS MEDIA SA
DOI: 10.3389/fendo.2021.755364

关键词

machine learning; thyroid-stimulating hormone (TSH); birth outcomes; preconception; post-conception

资金

  1. National Natural Science Foundation of China [81773379, 82122058, 81202165]
  2. Shanghai Municipal Commission of Health and the Family Foundation for Fifth Round of the Three-Year Public Health Action Plan of Shanghai [GWV-1.1, GWV-10.1XK08, 202040186, 2017YQ023]
  3. Harvard Data Science Fellowship
  4. Natural Science Foundation of Shanghai [21 ZR 1438400]
  5. Shanghai Municipal Commission of Health and the Family Foundation for Young Talents [2017YQ023]

向作者/读者索取更多资源

This study utilized machine learning models to analyze predictive features such as pre- and post-conception serum TSH levels in pregnant women. The XGBoost model showed superior performance in predicting characteristics and polytomous variables, while abnormal preconception TSH or not-well-controlled TSH had strong predictive ability for birth outcomes.
Background While previous studies identified risk factors for diverse pregnancy outcomes, traditional statistical methods had limited ability to quantify their impacts on birth outcomes precisely. We aimed to use a novel approach that applied different machine learning models to not only predict birth outcomes but systematically quantify the impacts of pre- and post-conception serum thyroid-stimulating hormone (TSH) levels and other predictive characteristics on birth outcomes. Methods We used data from women who gave birth in Shanghai First Maternal and Infant Hospital from 2014 to 2015. We included 14,110 women with the measurement of preconception TSH in the first analysis and 3,428 out of 14,110 women with both pre- and post-conception TSH measurement in the second analysis. Synthetic Minority Over-sampling Technique (SMOTE) was applied to adjust the imbalance of outcomes. We randomly split (7:3) the data into a training set and a test set in both analyses. We compared Area Under Curve (AUC) for dichotomous outcomes and macro F1 score for categorical outcomes among four machine learning models, including logistic model, random forest model, XGBoost model, and multilayer neural network models to assess model performance. The model with the highest AUC or macro F1 score was used to quantify the importance of predictive features for adverse birth outcomes with the loss function algorithm. Results The XGBoost model provided prominent advantages in terms of improved performance and prediction of polytomous variables. Predictive models with abnormal preconception TSH or not-well-controlled TSH, a novel indicator with pre- and post-conception TSH levels combined, provided the similar robust prediction for birth outcomes. The highest AUC of 98.7% happened in XGBoost model for predicting low Apgar score with not-well-controlled TSH adjusted. By loss function algorithm, we found that not-well-controlled TSH ranked 4(th), 6(th), and 7(th) among 14 features, respectively, in predicting birthweight, induction, and preterm birth, and 3(rd) among 19 features in predicting low Apgar score. Conclusions Our four machine learning models offered valid predictions of birth outcomes in women during pre- and post-conception. The predictive features panel suggested the combined TSH indicator (not-well-controlled TSH) could be a potentially competitive biomarker to predict adverse birth outcomes.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据