☆ 4.6 Article

Does ignoring clustering in multicenter data influence the performance of prediction models? A simulation study

STATISTICAL METHODS IN MEDICAL RESEARCH (2018)

Journal

STATISTICAL METHODS IN MEDICAL RESEARCH

Volume 27, Issue 6, Pages 1723-1736

Publisher

SAGE PUBLICATIONS LTD

DOI: 10.1177/0962280216668555

Keywords

Mixed model; logistic regression; clinical prediction model; calibration; discrimination; predictive performance; bias

Ask authors/readers for more resources

Protocol

Community support

Reagent

Community support

Abstract

Clinical risk prediction models are increasingly being developed and validated on multicenter datasets. In this article, we present a comprehensive framework for the evaluation of the predictive performance of prediction models at the center level and the population level, considering population-averaged predictions, center-specific predictions, and predictions assuming an average random center effect. We demonstrated in a simulation study that calibration slopes do not only deviate from one because of over- or underfitting of patterns in the development dataset, but also as a result of the choice of the model (standard versus mixed effects logistic regression), the type of predictions (marginal versus conditional versus assuming an average random effect), and the level of model validation (center versus population). In particular, when data is heavily clustered (ICC 20%), center-specific predictions offer the best predictive performance at the population level and the center level. We recommend that models should reflect the data structure, while the level of model validation should reflect the research question.

Does ignoring clustering in multicenter data influence the performance of prediction models? A simulation study

Journal

STATISTICAL METHODS IN MEDICAL RESEARCH

Publisher

SAGE PUBLICATIONS LTD

Keywords

Categories

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Does ignoring clustering in multicenter data influence the performance of prediction models? A simulation study

Journal

STATISTICAL METHODS IN MEDICAL RESEARCH

Publisher

SAGE PUBLICATIONS LTD

Keywords

Categories

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Export Citation

Share Paper