4.7 Article

Imputation of missing data in time series for air pollutants

Journal

ATMOSPHERIC ENVIRONMENT
Volume 102, Issue -, Pages 96-104

Publisher

PERGAMON-ELSEVIER SCIENCE LTD
DOI: 10.1016/j.atmosenv.2014.11.049

Keywords

Air pollution; Data imputation; EM algorithm; Environmental epidemiology; Missing data; Particulate matter; Time series

Funding

  1. Brazilian Research Council - CNPq

Ask authors/readers for more resources

Missing data are major concerns in epidemiological studies of the health effects of environmental air pollutants. This article presents an imputation-based method that is suitable for multivariate time series data, which uses the EM algorithm under the assumption of normal distribution. Different approaches are considered for filtering the temporal component. A simulation study was performed to assess validity and performance of proposed method in comparison with some frequently used methods. Simulations showed that when the amount of missing data was as low as 5%, the complete data analysis yielded satisfactory results regardless of the generating mechanism of the missing data, whereas the validity began to degenerate when the proportion of missing values exceeded 10%. The proposed imputation method exhibited good accuracy and precision in different settings with respect to the patterns of missing observations. Most of the imputations obtained valid results, even under missing not at random. The methods proposed in this study are implemented as a package called mtsdi for the statistical software system R. (C) 2014 Elsevier Ltd. All rights reserved.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available