☆ 4.5 Article

Minimum sample size calculations for external validation of a clinical prediction model with a time-to-event outcome

STATISTICS IN MEDICINE (2022)

Journal

STATISTICS IN MEDICINE

Volume 41, Issue 7, Pages 1280-1295

Publisher

WILEY

DOI: 10.1002/sim.9275

Keywords

calibration; external validation; prediction model; sample size; time-to-event & survival data

Funding

Cancer Research UK [C14183/A29739, C41379/A27583, C49297/A27294]
NIHR Biomedical Research Centre, Oxford
Research Trainees Coordinating Centre [NIHR300100]
National Institutes of Health Research (NIHR) [NIHR300100] Funding Source: National Institutes of Health Research (NIHR)

Ask authors/readers for more resources

Protocol

Community support

Reagent

Community support

Automated Summary New
Abstract

This article introduces how to calculate the sample size required for external validation of prediction models, extending guidelines from continuous and binary outcomes to time-to-event outcomes. A simulation-based framework is proposed to calculate the sample size needed for precise estimation of calibration, discrimination, and net-benefit in datasets containing censoring. Assumptions about the validation population and distribution of the model's linear predictor are essential for this process.

Previous articles in Statistics in Medicine describe how to calculate the sample size required for external validation of prediction models with continuous and binary outcomes. The minimum sample size criteria aim to ensure precise estimation of key measures of a model's predictive performance, including measures of calibration, discrimination, and net benefit. Here, we extend the sample size guidance to prediction models with a time-to-event (survival) outcome, to cover external validation in datasets containing censoring. A simulation-based framework is proposed, which calculates the sample size required to target a particular confidence interval width for the calibration slope measuring the agreement between predicted risks (from the model) and observed risks (derived using pseudo-observations to account for censoring) on the log cumulative hazard scale. Precise estimation of calibration curves, discrimination, and net-benefit can also be checked in this framework. The process requires assumptions about the validation population in terms of the (i) distribution of the model's linear predictor and (ii) event and censoring distributions. Existing information can inform this; in particular, the linear predictor distribution can be approximated using the C-index or Royston's D statistic from the model development article, together with the overall event risk. We demonstrate how the approach can be used to calculate the sample size required to validate a prediction model for recurrent venous thromboembolism. Ideally the sample size should ensure precise calibration across the entire range of predicted risks, but must at least ensure adequate precision in regions important for clinical decision-making. Stata and R code are provided.

Minimum sample size calculations for external validation of a clinical prediction model with a time-to-event outcome

Journal

STATISTICS IN MEDICINE

Publisher

WILEY

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Minimum sample size calculations for external validation of a clinical prediction model with a time-to-event outcome

Journal

STATISTICS IN MEDICINE

Publisher

WILEY

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Export Citation

Share Paper