4.6 Article

The Role of Prediction Modeling in Propensity Score Estimation: An Evaluation of Logistic Regression, bCART, and the Covariate-Balancing Propensity Score

Journal

AMERICAN JOURNAL OF EPIDEMIOLOGY
Volume 180, Issue 6, Pages 645-655

Publisher

OXFORD UNIV PRESS INC
DOI: 10.1093/aje/kwu181

Keywords

cardiovascular disease; covariate balance; diabetes; epidemiologic methods; propensity score; regression; simulation

Funding

  1. Merck Co., Inc. [EP09001.037]
  2. National Institute on Aging [R01 AG023178]
  3. Agency for Healthcare Research and Quality [K02HS017950]
  4. Pharmacoepidemiology Gillings Innovation Lab [GIL200811.0010]
  5. Center for Pharmacoepidemiology, Department of Epidemiology, University of North Carolina Gillings School of Global Public Health
  6. Center for Pharmacoepidemiology at the University of North Carolina Gillings School of Global Public Health
  7. Amgen, Inc.
  8. Genentech, Inc.
  9. GlaxoSmithKline, PLC
  10. Merck Co., Inc.
  11. Sanofi, SA

Ask authors/readers for more resources

The covariate-balancing propensity score (CBPS) extends logistic regression to simultaneously optimize covariate balance and treatment prediction. Although the CBPS has been shown to perform well in certain settings, its performance has not been evaluated in settings specific to pharmacoepidemiology and large database research. In this study, we use both simulations and empirical data to compare the performance of the CBPS with logistic regression and boosted classification and regression trees. We simulated various degrees of model misspecification to evaluate the robustness of each propensity score (PS) estimation method. We then applied these methods to compare the effect of initiating glucagonlike peptide-1 agonists versus sulfonylureas on cardiovascular events and all-cause mortality in the US Medicare population in 2007-2009. In simulations, the CBPS was generally more robust in terms of balancing covariates and reducing bias compared with misspecified logistic PS models and boosted classification and regression trees. All PS estimation methods performed similarly in the empirical example. For settings common to pharmacoepidemiology, logistic regression with balance checks to assess model specification is a valid method for PS estimation, but it can require refitting multiple models until covariate balance is achieved. The CBPS is a promising method to improve the robustness of PS models.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available