4.6 Article

Predicting counterfactual risks under hypothetical treatment strategies: an application to HIV

Journal

EUROPEAN JOURNAL OF EPIDEMIOLOGY
Volume 37, Issue 4, Pages 367-376

Publisher

SPRINGER
DOI: 10.1007/s10654-022-00855-8

Keywords

Counterfactual prediction; Causal inference; Dataset shift; Machine learning; Parametric g-formula; Transportability

Funding

  1. National Institutes of Health [K99 CA248335, R37 AI02634]
  2. Providence/Boston Center for AIDS Research [P30 AI042853]

Ask authors/readers for more resources

The article proposes a approach to counterfactual prediction for risk assessment in populations with different distribution of treatment strategies. It evaluates the performance robustness of a prediction algorithm through implementing contrast algorithms and generating counterfactual data, and discusses the challenges of estimating counterfactual risks under specific treatment strategies.
The accuracy of a prediction algorithm depends on contextual factors that may vary across deployment settings. To address this inherent limitation of prediction, we propose an approach to counterfactual prediction based on the g-formula to predict risk across populations that differ in their distribution of treatment strategies. We apply this to predict 5-year risk of mortality among persons receiving care for HIV in the U.S. Veterans Health Administration under different hypothetical treatment strategies. First, we implement a conventional approach to develop a prediction algorithm in the observed data and show how the algorithm may fail when transported to new populations with different treatment strategies. Second, we generate counterfactual data under different treatment strategies and use it to assess the robustness of the original algorithm's performance to these differences and to develop counterfactual prediction algorithms. We discuss how estimating counterfactual risks under a particular treatment strategy is more challenging than conventional prediction as it requires the same data, methods, and unverifiable assumptions as causal inference. However, this may be required when the alternative assumption of constant treatment patterns across deployment settings is unlikely to hold and new data is not yet available to retrain the algorithm.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available