4.7 Article

Modelling relative survival in the presence of incomplete data: a tutorial

Journal

INTERNATIONAL JOURNAL OF EPIDEMIOLOGY
Volume 39, Issue 1, Pages 118-128

Publisher

OXFORD UNIV PRESS
DOI: 10.1093/ije/dyp309

Keywords

Cancer registry; colorectal cancer; missing data; multiple imputation; stage; relative survival

Funding

  1. Cancer Research UK [C1336/A5735]
  2. Cancer Survival Group in the London School of Hygiene and Tropical Medicine
  3. North West Cancer Intelligence Service
  4. Economic and Social Research Council [RES-063-270257]
  5. Cancer Research UK [11700] Funding Source: researchfish

Ask authors/readers for more resources

Methods We estimated relative survival for 29 563 colorectal cancer patients who were diagnosed between 1997 and 2004 and registered in the North West Cancer Intelligence Service. The method of multiple imputation (MI) was applied to account for the common example of incomplete stage at diagnosis, under the missing at random (MAR) assumption. Multivariable regression with a generalized linear model and Poisson error structure was then used to estimate the excess hazard of death of the colorectal cancer patients, over and above the background mortality, adjusting for significant predictors of mortality. Results Incomplete information on stage, morphology and grade meant that only 55% of the data could be included in the 'complete-case' analysis. All cases could be included after indicator method (IM) or MI method. Handling missing data by MI produced a significantly lower estimate of the excess mortality for stage, morphology and grade, with the largest reductions occurring for late-stage and high-grade tumours, when compared with the results of complete-case analysis. Conclusion In complete-case analysis, almost 50% of the information could not be included, and with the IM, all records with missing values for stage were combined into a single 'missing' category. We show that MI methods greatly improved the results by exploiting all the information in the incomplete records. This method also helped to ensure efficient inferences about survival were made from the multivariate regression analyses.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available