4.5 Article

Model averaging, missing data and multiple imputation: a case study for behavioural ecology

Journal

BEHAVIORAL ECOLOGY AND SOCIOBIOLOGY
Volume 65, Issue 1, Pages 103-116

Publisher

SPRINGER
DOI: 10.1007/s00265-010-1044-7

Keywords

Data augmentation; Data deletion; Estimation bias; The rate of missing information; Expectation maximization; QAIC; EPP; MCMC; House sparrows

Funding

  1. Marsden Fund [UOO0812]
  2. Royal Society
  3. Natural Environment Research Council [NE/F006071/1] Funding Source: researchfish
  4. NERC [NE/F006071/1] Funding Source: UKRI

Ask authors/readers for more resources

Model averaging, specifically information theoretic approaches based on Akaike's information criterion (IT-AIC approaches), has had a major influence on statistical practices in the field of ecology and evolution. However, a neglected issue is that in common with most other model fitting approaches, IT-AIC methods are sensitive to the presence of missing observations. The commonest way of handling missing data is the complete-case analysis (the complete deletion from the dataset of cases containing any missing values). It is well-known that this results in reduced estimation precision (or reduced statistical power), biased parameter estimates; however, the implications for model selection have not been explored. Here we employ an example from behavioural ecology to illustrate how missing data can affect the conclusions drawn from model selection or based on hypothesis testing. We show how missing observations can be recovered to give accurate estimates for IT-related indices (e.g. AIC and Akaike weight) as well as parameters (and their standard errors) by utilizing 'multiple imputation'. We use this paper to illustrate key concepts from missing data theory and as a basis for discussing available methods for handling missing data. The example is intended to serve as a practically oriented case study for behavioural ecologists deciding on how to handle missing data in their own datasets and also as a first attempt to consider the problems of conducting model selection and averaging in the presence of missing observations.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.5
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available