4.5 Article

Minimum sample size for external validation of a clinical prediction model with a continuous outcome

Journal

STATISTICS IN MEDICINE
Volume 40, Issue 1, Pages 133-146

Publisher

WILEY
DOI: 10.1002/sim.8766

Keywords

calibration; continuous outcomes; external validation; prediction model; sample size; R‐ squared

Funding

  1. British Heart Foundation [FS/17/76/33286]
  2. Cancer Research UK [C49297/A27294]
  3. European Horizon 2020 research and innovation programme [777090]
  4. Medical Research Council
  5. NIHR Biomedical Research Centre, Oxford
  6. NIHR Clinical Trials Unit Support Funding
  7. NIHR SPCR
  8. NIHR SPCR Evidence Synthesis Working Group [390]
  9. Wellcome [102215/2/13/2]
  10. MRC [MC_PC_19009] Funding Source: UKRI
  11. H2020 Societal Challenges Programme [777090] Funding Source: H2020 Societal Challenges Programme

Ask authors/readers for more resources

Clinical prediction models offer personalized outcome predictions for patient counseling and decision making, with external validation crucial for assessing model performance. Proposed criteria aim to determine minimum sample size needed for external validation of a clinical prediction model, considering factors like proportion of variance explained and agreement between predicted and observed values. The recommendations provide a framework for estimating precision and ensuring adequate sample sizes in future validation studies.
Clinical prediction models provide individualized outcome predictions to inform patient counseling and clinical decision making. External validation is the process of examining a prediction model's performance in data independent to that used for model development. Current external validation studies often suffer from small sample sizes, and subsequently imprecise estimates of a model's predictive performance. To address this, we propose how to determine the minimum sample size needed for external validation of a clinical prediction model with a continuous outcome. Four criteria are proposed, that target precise estimates of (i) R-2 (the proportion of variance explained), (ii) calibration-in-the-large (agreement between predicted and observed outcome values on average), (iii) calibration slope (agreement between predicted and observed values across the range of predicted values), and (iv) the variance of observed outcome values. Closed-form sample size solutions are derived for each criterion, which require the user to specify anticipated values of the model's performance (in particular R-2) and the outcome variance in the external validation dataset. A sensible starting point is to base values on those for the model development study, as obtained from the publication or study authors. The largest sample size required to meet all four criteria is the recommended minimum sample size needed in the external validation dataset. The calculations can also be applied to estimate expected precision when an existing dataset with a fixed sample size is available, to help gauge if it is adequate. We illustrate the proposed methods on a case-study predicting fat-free mass in children.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.5
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available