4.7 Article

XGBLC: an improved survival prediction model based on XGBoost

期刊

BIOINFORMATICS
卷 38, 期 2, 页码 410-418

出版社

OXFORD UNIV PRESS
DOI: 10.1093/bioinformatics/btab675

关键词

-

资金

  1. National Natural Science Foundation of China [61471078]
  2. Dalian Science and Technology Innovation Fund [2020JJ27SN066]
  3. Fundamental Research Funds for the Central Universities [3132014306, 3132015213, 3132017075]

向作者/读者索取更多资源

This study proposed an improved survival prediction model XGBLC based on the XGBoost framework, using Lasso-Cox to enhance the ability to analyze high-dimensional genomic data. Tested on 20 cancer datasets, XGBLC outperforms five state-of-the-art survival methods in terms of C-index, Brier score, and AUC.
Motivation: Survival analysis using gene expression profiles plays a crucial role in the interpretation of clinical research and assessment of disease therapy programs. Several prediction models have been developed to explore the relationship between patients' covariates and survival. However, the high-dimensional genomic features limit the prediction performance of the survival model. Thus, an accurate and reliable prediction model is necessary for survival analysis using high-dimensional genomic data. Results: In this study, we proposed an improved survival prediction model based on XGBoost framework called XGBLC, which used Lasso-Cox to enhance the ability to analyze high-dimensional genomic data. The novel first- and second-order gradient statistics of Lasso-Cox were defined to construct the loss function of XGBLC. We extensively tested our XGBLC algorithm on both simulated and real-world datasets, and estimated the performance of models with 5-fold cross-validation. Based on 20 cancer datasets from The Cancer Genome Atlas (TCGA), XGBLC outperforms five state-of-the-art survival methods in terms of C-index, Brier score and AUC. The results show that XGBLC still keeps good accuracy and robustness by comparing the performance on the simulated datasets with different scales. The developed prediction model would be beneficial for physicians to understand the effects of patient's genomic characteristics on survival and make personalized treatment decisions.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据