4.6 Article

External validation of clinical prediction models: simulation-based sample size calculations were more reliable than rules-of-thumb

Journal

JOURNAL OF CLINICAL EPIDEMIOLOGY
Volume 135, Issue -, Pages 79-89

Publisher

ELSEVIER SCIENCE INC
DOI: 10.1016/j.jclinepi.2021.02.011

Keywords

Sample size; External validation; Clinical prediction model; Calibration and discrimination; Net benefit; Simulation

Funding

  1. National Institute for Health Research School for Primary Care Research(NIHR SPCR)
  2. Evidence Synthesis Working Group - National Institute for Health Research School for Primary Care Research(NIHR SPCR) [390]
  3. Netherlands Organisation for Health Research and Development [91617050]
  4. National Institute for Health Research [PDF-2015-08-044]
  5. Cancer Research UK [C49297/A27294]
  6. NIHR Biomedical Research Centre, Oxford
  7. NIHR [PDF2014-10872]
  8. National Institute for Health Research(NIHR)
  9. National Institute for Health Research [PDF-2015-08-044] Funding Source: researchfish

Ask authors/readers for more resources

Rules-of-thumb for sample size in external validation of clinical prediction models may not be precise, with factors like LP distribution affecting precision of performance estimates. A tailored simulation-based approach can offer more flexibility and reliability in determining sample size requirements for validation.
Introduction: Sample size rules-of-thumb for external validation of clinical prediction models suggest at least 100 events and 100 non-events. Such blanket guidance is imprecise, and not specific to the model or validation setting. We investigate factors affecting precision of model performance estimates upon external validation, and propose a more tailored sample size approach. Methods: Simulation of logistic regression prediction models to investigate factors associated with precision of performance estimates. Then, explanation and illustration of a simulation-based approach to calculate the minimum sample size required to precisely estimate a model's calibration, discrimination and clinical utility. Results: Precision is affected by the model's linear predictor (LP) distribution, in addition to number of events and total sample size. Sample sizes of 100 (or even 200) events and non-events can give imprecise estimates, especially for calibration. The simulationbased calculation accounts for the LP distribution and (mis)calibration in the validation sample. Application identifies 2430 required participants (531 events) for external validation of a deep vein thrombosis diagnostic model. Conclusion: Where researchers can anticipate the distribution of the model's LP (eg, based on development sample, or a pilot study), a simulation-based approach for calculating sample size for external validation offers more flexibility and reliability than rules-of-thumb. (c) 2021 The Authors. Published by Elsevier Inc. This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/)

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available