☆ 4.3 Article

The estimation and use of predictions for the assessment of model performance using large samples with multiply imputed data

BIOMETRICAL JOURNAL (2015)

期刊

BIOMETRICAL JOURNAL

卷 57, 期 4, 页码 614-632

出版社

WILEY

DOI: 10.1002/bimj.201400004

关键词

Measures of model performance; Missing data; Model validation; Multiple imputation; Prediction models; Rubin's rules

类别

Mathematical & Computational Biology Statistics & Probability

资金

MRC [MC_UU_12023/21, MC_U105260558, MR/L003120/1, MR/K014811/1, G0701619] Funding Source: UKRI
British Heart Foundation [RG/08/014/24067] Funding Source: researchfish
Medical Research Council [MR/K014811/1, G0701619, MC_U105260558, MC_UU_12023/21, MR/L003120/1] Funding Source: researchfish
National Institute for Health Research [NF-SI-0512-10165] Funding Source: researchfish

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

摘要

Multiple imputation can be used as a tool in the process of constructing prediction models in medical and epidemiological studies with missing covariate values. Such models can be used to make predictions for model performance assessment, but the task is made more complicated by the multiple imputation structure. We summarize various predictions constructed from covariates, including multiply imputed covariates, and either the set of imputation-specific prediction model coefficients or the pooled prediction model coefficients. We further describe approaches for using the predictions to assess model performance. We distinguish between ideal model performance and pragmatic model performance, where the former refers to the model's performance in an ideal clinical setting where all individuals have fully observed predictors and the latter refers to the model's performance in a real-world clinical setting where some individuals have missing predictors. The approaches are compared through an extensive simulation study based on the UK700 trial. We determine that measures of ideal model performance can be estimated within imputed datasets and subsequently pooled to give an overall measure of model performance. Alternative methods to evaluate pragmatic model performance are required and we propose constructing predictions either from a second set of covariate imputations which make no use of observed outcomes, or from a set of partial prediction models constructed for each potential observed pattern of covariate. Pragmatic model performance is generally lower than ideal model performance. We focus on model performance within the derivation data, but describe how to extend all the methods to a validation dataset.

The estimation and use of predictions for the assessment of model performance using large samples with multiply imputed data

期刊

BIOMETRICAL JOURNAL

出版社

WILEY

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

The estimation and use of predictions for the assessment of model performance using large samples with multiply imputed data

期刊

BIOMETRICAL JOURNAL

出版社

WILEY

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文