Article

STABILITY INVESTIGATIONS OF MULTIVARIABLE REGRESSION MODELS DERIVED FROM LOW- AND HIGH-DIMENSIONAL DATA

Journal

JOURNAL OF BIOPHARMACEUTICAL STATISTICS
Volume 21, Issue 6, Pages 1206-1231

Publisher

TAYLOR & FRANCIS INC
DOI: 10.1080/10543406.2011.629890

Keywords

Complexity; High-dimensional data; Resampling; Stability; Variable selection

Multivariable regression models can link a potentially large number of variables to various kinds of outcomes, such as continuous, binary, or time-to-event endpoints. Selecting the important variables and the functional form for continuous covariates are key parts of building such models, but both are notoriously difficult for several reasons. Because of multicollinearity between predictors and the limited amount of information in the data, instability of the selected models can be a serious issue. For applications with a moderate number of variables, resampling-based techniques have been developed for diagnosing and improving multivariable regression models. Deriving models for high-dimensional molecular data has created the need to adapt these techniques to settings where the number of variables is much larger than the number of observations. Three studies with a time-to-event outcome, one of which has high-dimensional data, are used to illustrate several techniques. Investigations at the covariate level and at the predictor level are seen to provide considerable insight into model stability and performance. Although some areas are indicated where resampling techniques for model building still need further refinement, our case studies illustrate that these techniques can already be recommended for wider use.
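To make the idea of a resampling-based stability check concrete, the following is a minimal sketch of bootstrap inclusion frequencies, one common diagnostic at the covariate level. It is not the authors' procedure: it assumes a continuous outcome fitted by ordinary least squares instead of the time-to-event models used in the case studies, it uses simulated data, and the helper names (`backward_elimination`, `bootstrap_inclusion_frequencies`, `simulate`) and the threshold `t_threshold=2.0` are hypothetical choices made for illustration.

```python
import numpy as np


def backward_elimination(X, y, t_threshold=2.0):
    """Drop the weakest covariate until all |t| >= t_threshold; return kept column indices."""
    n = X.shape[0]
    kept = list(range(X.shape[1]))
    while kept:
        # Design matrix with intercept for the currently kept covariates.
        A = np.column_stack([np.ones(n), X[:, kept]])
        beta, *_ = np.linalg.lstsq(A, y, rcond=None)
        resid = y - A @ beta
        dof = n - A.shape[1]
        sigma2 = resid @ resid / dof
        cov = sigma2 * np.linalg.pinv(A.T @ A)
        # Absolute t-statistics of the covariates (intercept excluded).
        t = np.abs(beta[1:]) / np.sqrt(np.diag(cov)[1:])
        weakest = int(np.argmin(t))
        if t[weakest] >= t_threshold:
            break
        kept.pop(weakest)
    return kept


def bootstrap_inclusion_frequencies(X, y, n_boot=200, seed=0):
    """Fraction of bootstrap samples in which each covariate is selected."""
    rng = np.random.default_rng(seed)
    n, p = X.shape
    counts = np.zeros(p)
    for _ in range(n_boot):
        idx = rng.integers(0, n, size=n)  # resample observations with replacement
        for j in backward_elimination(X[idx], y[idx]):
            counts[j] += 1
    return counts / n_boot


def simulate(n=150, seed=1):
    """Simulated data (an assumption): columns 0 and 2 have true effects."""
    rng = np.random.default_rng(seed)
    X = rng.normal(size=(n, 6))
    X[:, 1] = 0.8 * X[:, 0] + 0.6 * X[:, 1]  # column 1 is correlated with column 0
    y = 1.0 * X[:, 0] + 0.5 * X[:, 2] + rng.normal(size=n)
    return X, y


X, y = simulate()
freq = bootstrap_inclusion_frequencies(X, y)
for j, f in enumerate(freq):
    print(f"covariate {j}: selected in {f:.0%} of bootstrap samples")
```

Covariates that are selected in nearly every bootstrap sample can be regarded as stable, while those selected only occasionally, or whose selection depends on whether a correlated covariate is also in the model, indicate instability. In a high-dimensional setting the backward-elimination step would have to be replaced by a selection method suited to more variables than observations.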
