4.6 Article

Bias and High-Dimensional Adjustment in Observational Studies of Peer Effects

Journal

JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION
Volume 116, Issue 534, Pages 507-517

Publisher

TAYLOR & FRANCIS INC
DOI: 10.1080/01621459.2020.1796393

Keywords

Causal inference; Observational study; Propensity scores; Randomized experiment; Social interactions

Ask authors/readers for more resources

Peer effects refer to the influence of peers on an individual's behavior. Research shows that adjusting for high-dimensional variables using propensity score models can effectively estimate peer effects, producing results statistically indistinguishable from those of large randomized experiments.
Peer effects, in which an individual's behavior is affected by peers' behavior, are posited by multiple theories in the social sciences. Randomized field experiments that identify peer effects, however, are often expensive or infeasible, so many studies of peer effects use observational data, which is expected to suffer from confounding. Here we show, in the context of information and media diffusion, that high-dimensional adjustment of a nonexperimental control group (660 million observations) using propensity score models produces estimates of peer effects statistically indistinguishable from those using a large randomized experiment (215 million observations). Compared with the experiment, naive observational estimators overstate peer effects by over 300% and commonly available variables (e.g., demographics) offer little bias reduction. Adjusting for a measure of prior behaviors closely related to the focal behavior reduces this bias by 91%, while models adjusting for over 3700 past behaviors provide additional bias reduction, reducing bias by over 97%, which is statistically indistinguishable from unbiasedness. This demonstrates how detailed records of behavior can improve studies of social influence, information diffusion, and imitation; these results are encouraging for the credibility of some studies but also cautionary for studies of peer effects in rare or new behaviors. More generally, these results show how large, high-dimensional datasets and statistical learning can be used to improve causal inference.for this article are available online.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available