☆ 4.6 Article

Multiple Imputation for Missing Data via Sequential Regression Trees

AMERICAN JOURNAL OF EPIDEMIOLOGY (2010)

期刊

AMERICAN JOURNAL OF EPIDEMIOLOGY

卷 172, 期 9, 页码 1070-1076

出版社

OXFORD UNIV PRESS INC

DOI: 10.1093/aje/kwq260

关键词

diagnostic check; imputation; missing data; pregnancy outcome; regression tree

类别

Public, Environmental & Occupational Health

资金

Environmental Protection Agency [R833293]
EPA [909223, R833293] Funding Source: Federal RePORTER

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

摘要

Multiple imputation is particularly well suited to deal with missing data in large epidemiologic studies, because typically these studies support a wide range of analyses by many data users. Some of these analyses may involve complex modeling, including interactions and nonlinear relations. Identifying such relations and encoding them in imputation models, for example, in the conditional regressions for multiple imputation via chained equations, can be daunting tasks with large numbers of categorical and continuous variables. The authors present a nonparametric approach for implementing multiple imputation via chained equations by using sequential regression trees as the conditional models. This has the potential to capture complex relations with minimal tuning by the data imputer. Using simulations, the authors demonstrate that the method can result in more plausible imputations, and hence more reliable inferences, in complex settings than the naive application of standard sequential regression imputation techniques. They apply the approach to impute missing values in data on adverse birth outcomes with more than 100 clinical and survey variables. They evaluate the imputations using posterior predictive checks with several epidemiologic analyses of interest.

Multiple Imputation for Missing Data via Sequential Regression Trees

期刊

AMERICAN JOURNAL OF EPIDEMIOLOGY

出版社

OXFORD UNIV PRESS INC

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Multiple Imputation for Missing Data via Sequential Regression Trees

期刊

AMERICAN JOURNAL OF EPIDEMIOLOGY

出版社

OXFORD UNIV PRESS INC

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文