期刊
COMPUTATIONAL STATISTICS & DATA ANALYSIS
卷 53, 期 5, 页码 1590-1603出版社
ELSEVIER
DOI: 10.1016/j.csda.2008.05.021
关键词
-
资金
- Center for Medical Systems Biology (CMSB)
- Porticus Foundation
Knowledge of transcription of the human genome might greatly enhance our understanding of cancer. In particular, gene expression may be used to predict the survival of cancer patients. Microarray data are characterized by their high-dimensionality: the number of covariates (p similar to 1000) greatly exceeds the number of samples (n similar to 100), which is a considerable challenge in the context of survival prediction. An inventory of methods that have been used to model survival using gene expression is given. These methods are critically reviewed and compared in a qualitative way. Next, these methods are applied to three real-life data sets for a quantitative comparison. The choice of the evaluation measure of predictive performance is crucial for the selection of the best method. Depending on the evaluation measure, either the L-2-penalized Cox regression or the random forest ensemble method yields the best survival time prediction using the considered gene expression data sets. Consensus on the best evaluation measure of predictive performance is needed. (C) 2008 Elsevier B.V. All rights reserved.
作者
我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。
推荐
暂无数据