Journal
JOURNAL OF EXPERIMENTAL PSYCHOLOGY-GENERAL
Volume 150, Issue 4, Pages 700-709Publisher
AMER PSYCHOLOGICAL ASSOC
DOI: 10.1037/xge0000920
Keywords
binary outcomes; logistic regression; linear regression; average treatment effects; causal effects
Categories
Ask authors/readers for more resources
When estimating treatment effects on binary outcomes, linear regression is generally the best strategy. Linear regression coefficients are directly interpretable in terms of probabilities, and it is safer when interaction terms or fixed effects are included.
When the outcome is binary, psychologists often use nonlinear modeling strategies such as logit or probit. These strategies are often neither optimal nor justified when the objective is to estimate causal effects of experimental treatments. Researchers need to take extra steps to convert logit and probit coefficients into interpretable quantities, and when they do, these quantities often remain difficult to understand. Odds ratios, for instance, are described as obscure in many textbooks (e.g., Gelman & Hill, 2006, p. 83). I draw on econometric theory and established statistical findings to demonstrate that linear regression is generally the best strategy to estimate causal effects of treatments on binary outcomes. Linear regression coefficients are directly interpretable in terms of probabilities and, when interaction terms or fixed effects are included, linear regression is safer. I review the Neyman-Rubin causal model, which I use to prove analytically that linear regression yields unbiased estimates of treatment effects on binary outcomes. Then, I run simulations and analyze existing data on 24,191 students from 56 middle schools (Paluck, Shepherd, & Aronow, 2013) to illustrate the effectiveness of linear regression. Based on these grounds, I recommend that psychologists use linear regression to estimate treatment effects on binary outcomes.
Authors
I am an author on this paper
Click your name to claim this paper and add it to your profile.
Reviews
Recommended
No Data Available