4.7 Article

Causal Classification: Treatment Effect Estimation vs. Outcome Prediction

Journal

JOURNAL OF MACHINE LEARNING RESEARCH
Volume 23, Issue -, Pages -

Publisher

MICROTOME PUBL

Keywords

Bias-variance tradeoff; causal inference; treatment assignment

Ask authors/readers for more resources

The goal of causal classification is to identify individuals whose outcome would be positively changed by a treatment. This study explores the feasibility of using outcome prediction instead of treatment effect estimation for causal classification, and finds that in certain situations, outcome prediction may actually perform better. The theoretical analysis and simulations demonstrate when outcome prediction is preferable, such as when data on counterfactuals are limited or when there is correlation between outcomes and treatment effects.
The goal of causal classification is to identify individuals whose outcome would be posi-tively changed by a treatment. Examples include targeting advertisements and targeting retention incentives to reduce churn. Causal classification is challenging because we ob-serve individuals under only one condition (treated or untreated), so we do not know who was influenced by the treatment, but we may estimate the potential outcomes under each condition to decide whom to treat by estimating treatment effects. Curiously, we often see practitioners using simple outcome prediction instead, for example, predicting if someone will purchase if shown the ad. Rather than disregarding this as naive behavior, we present a theoretical analysis comparing treatment effect estimation and outcome prediction when addressing causal classification. We focus on the key question: When (if ever) is simple outcome prediction preferable to treatment effect estimation for causal classification? The analysis reveals a causal bias-variance tradeoff. First, when the treatment effect estimation depends on two outcome predictions, larger sampling variance may lead to more errors than the (biased) outcome prediction approach. Second, a stronger signal-to-noise ratio in out-come prediction implies that the bias can help with intervention decisions when outcomes are informative of effects. The theoretical results, as well as simulations, illustrate settings where outcome prediction should actually be better, including cases where (1) the bias may be partially corrected by choosing a different threshold, (2) outcomes and treatment effects are correlated, and (3) data to estimate counterfactuals are limited. A major practical implication is that, for some applications, it might be feasible to make good intervention decisions without any data on how individuals actually behave when intervened. Finally, we show that for a real online advertising application, outcome prediction models indeed excel at causal classification.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available