4.6 Article

Thirteen Questions About Using Machine Learning in Causal Research (You Won't Believe the Answer to Number 10!)

Journal

AMERICAN JOURNAL OF EPIDEMIOLOGY
Volume 190, Issue 8, Pages 1476-1482

Publisher

OXFORD UNIV PRESS INC
DOI: 10.1093/aje/kwab047

Keywords

causal inference; double-robustness; epidemiologic methods; inverse probability weighting; machine learning; propensity score; targeted maximum likelihood estimation

Funding

  1. National Library of Medicine [K99012868]
  2. National Institute of Environmental Health Sciences [R01ES029531]
  3. Office of the Director of the National Institutes of Health [DP2HD084070]
  4. Eunice Kennedy Shriver National Institute of Child Health and Human Development

Ask authors/readers for more resources

Machine learning is increasingly used in health sciences, not only for data-driven prediction but also for embedding within causal analyses to reduce biases. An introduction and orientation provided in a question-and-answer format can help epidemiologists interested in using machine learning techniques to overcome potential bias and maintain rigor. Sample software code is also available to facilitate entry into using these techniques.
Machine learning is gaining prominence in the health sciences, where much of its use has focused on data-driven prediction. However, machine learning can also be embedded within causal analyses, potentially reducing biases arising from model misspecification. Using a question-and-answer format, we provide an introduction and orientation for epidemiologists interested in using machine learning but concerned about potential bias or loss of rigor due to use of black box models. We conclude with sample software code that may lower the barrier to entry to using these techniques.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available