4.1 Article

Risk prediction for delayed clearance of high-dose methotrexate in pediatric hematological malignancies by machine learning

期刊

INTERNATIONAL JOURNAL OF HEMATOLOGY
卷 114, 期 4, 页码 483-493

出版社

SPRINGER JAPAN KK
DOI: 10.1007/s12185-021-03184-w

关键词

Methotrexate clearance; Pediatric hematological malignancies; Pharmacogenomics; SNP genotyping; Machine learning

资金

  1. National Natural Scientific Foundation of China [81503166]
  2. Natural Scientific Foundation of Hunan province in China [2018JJ3846]

向作者/读者索取更多资源

This study aimed to establish a predictive model using machine learning to identify children with hematologic malignancy at high risk for delayed clearance of high-dose methotrexate (HD-MTX). Five and three variables were found to be statistically significant in univariable and multivariate analysis. The C5.0 decision tree combined with SMOTE using five variables showed the highest AUC value (0.807).
This study aimed to establish a predictive model to identify children with hematologic malignancy at high risk for delayed clearance of high-dose methotrexate (HD-MTX) based on machine learning. A total of 205 patients were recruited. Five variables (hematocrit, risk classification, dose, SLC19A1 rs2838958, sex) and three variables (SLC19A1 rs2838958, sex, dose) were statistically significant in univariable analysis and, separately, multivariate logistic regression. The data was randomly split into a training cohort and a validation cohort. A nomogram for prediction of delayed HD-MTX clearance was constructed using the three variables in the training dataset and validated in the validation dataset. Five machine learning algorithms (cart classification and regression trees, naive Bayes, support vector machine, random forest, C5.0 decision tree) combined with different resampling methods were used for model building with five or three variables. When developed machine learning models were evaluated in the validation dataset, the C5.0 decision tree combined with the synthetic minority oversampling technique (SMOTE) using five variables had the highest area under the receiver operating characteristic curve (AUC 0.807 [95% CI 0.724-0.889]), a better performance than the nomogram (AUC 0.69 [95% CI 0.594-0.787]). The results support potential clinical application of machine learning for patient risk classification.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.1
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据