4.6 Article Proceedings Paper

A comparison of imputation techniques for handling missing predictor values in a risk model with a binary outcome

Journal

STATISTICAL METHODS IN MEDICAL RESEARCH
Volume 16, Issue 3, Pages 277-298

Publisher

SAGE PUBLICATIONS LTD
DOI: 10.1177/0962280206074466

Keywords

-

Funding

  1. Medical Research Council [MC_U122861386] Funding Source: Medline
  2. MRC [MC_U122861386] Funding Source: UKRI
  3. Medical Research Council [MC_U122861386] Funding Source: researchfish

Ask authors/readers for more resources

Risk models that aim to predict the future course and outcome of disease processes are increasingly used in health research, and it is important that they are accurate and reliable. Most of these risk models are fitted using routinely collected data in hospitals or general practices. Clinical outcomes such as short-term mortality will be near-complete, but many of the predictors may have missing values. A common approach to dealing with this is to perform a complete-case analysis. However, this may lead to overfitted models and biased estimates if entire patient subgroups are excluded. The aim of this paper is to investigate a number of methods for imputing missing data to evaluate their effect on risk model estimation and the reliability of the predictions. Multiple imputation methods, including hotdecking and multiple imputation by chained equations (MICE), were investigated along with several single imputation methods. A large national cardiac surgery database was used to create simulated yet realistic datasets. The results suggest that complete case analysis may produce unreliable risk predictions and should be avoided. Conditional mean imputation performed well in our scenario, but may not be appropriate if using variable selection methods. MICE was amongst the best performing multiple imputation methods with regards to the quality of the predictions. Additionally, it produced the least biased estimates, with good coverage, and hence is recommended for use in practice.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available