☆ 4.7 Article

Causal Modeling to Mitigate Selection Bias and Unmeasured Confounding in Internet-Based Epidemiology of COVID-19: Model Development and Validation

JMIR PUBLIC HEALTH AND SURVEILLANCE (2022)

Journal

JMIR PUBLIC HEALTH AND SURVEILLANCE

Volume 8, Issue 7, Pages -

Publisher

JMIR PUBLICATIONS, INC

DOI: 10.2196/31306

Keywords

selection bias; COVID-19; epidemiology; causality; sensitivity analysis; public health; surveillance; method; epidemiologic; research design; model; bias; development; validation; utility; implementation; sensitivity; design; research

Funding

National Institutes of Health [1R01EB025025-01, 1R01LM013364-01, 1R21HD091500-01, 1R01LM013083]
National Science Foundation [2014232]
Hartwell Foundation
Bill and Melinda Gates Foundation
National Science Foundation
Hartwell Foundation
Bill and Melinda Gates Foundation
Coulter Foundation
Lucile Packard Foundation
Auxiliaries Endowment [2014232]
Islamic Development Bank (ISDB) Transform Fund
Weston Havens Foundation
Stanford University\s Human Centered Artificial Intelligence Program
Precision Health and Integrated Diagnostics Center
Beckman Center
Bio-X Center
Predictives and Diagnostics Accelerator, Spectrum
Spark Program in Translational Research
MediaX
Wu Tsai Neurosciences Institute's Neuroscience:Translate Program
Stanford Interdisciplinary Graduate Fellowship (SIGF)

Ask authors/readers for more resources

Protocol

Community support

Reagent

Community support

Automated Summary New
Abstract

This study demonstrates the practical utility of causal modeling in addressing selection bias and unmeasured confounding in epidemiological research. By constructing various causal models and comparing them with collected data, the study identifies the most compatible causal model and successfully estimates the infection rate.

Background: Selection bias and unmeasured confounding are fundamental problems in epidemiology that threaten study internal and external validity. These phenomena are particularly dangerous in internet-based public health surveillance, where traditional mitigation and adjustment methods are inapplicable, unavailable, or out of date. Recent theoretical advances in causal modeling can mitigate these threats, but these innovations have not been widely deployed in the epidemiological community.Objective: The purpose of our paper is to demonstrate the practical utility of causal modeling to both detect unmeasured confounding and selection bias and guide model selection to minimize bias. We implemented this approach in an applied epidemiological study of the COVID-19 cumulative infection rate in the New York City (NYC) spring 2020 epidemic.Methods: We collected primary data from Qualtrics surveys of Amazon Mechanical Turk (MTurk) crowd workers residing in New Jersey and New York State across 2 sampling periods: April 11-14 and May 8-11, 2020. The surveys queried the subjects on household health status and demographic characteristics. We constructed a set of possible causal models of household infection and survey selection mechanisms and ranked them by compatibility with the collected survey data. The most compatible causal model was then used to estimate the cumulative infection rate in each survey period.Results: There were 527 and 513 responses collected for the 2 periods, respectively. Response demographics were highly skewed toward a younger age in both survey periods. Despite the extremely strong relationship between age and COVID-19 symptoms, we recovered minimally biased estimates of the cumulative infection rate using only primary data and the most compatible causal model, with a relative bias of +3.8% and -1.9% from the reported cumulative infection rate for the first andConclusions: We successfully recovered accurate estimates of the cumulative infection rate from an internet-based crowdsourced sample despite considerable selection bias and unmeasured confounding in the primary data. This implementation demonstrates how simple applications of structural causal modeling can be effectively used to determine falsifiable model conditions, detect selection bias and confounding factors, and minimize estimate bias through model selection in a novel epidemiological context. As the disease and social dynamics of COVID-19 continue to evolve, public health surveillance protocols must continue to adapt; the emergence of Omicron variants and shift to at-home testing as recent challenges. Rigorous and transparent methods to develop, deploy, and diagnosis adapted surveillance protocols will be critical to their success.

Causal Modeling to Mitigate Selection Bias and Unmeasured Confounding in Internet-Based Epidemiology of COVID-19: Model Development and Validation

Journal

JMIR PUBLIC HEALTH AND SURVEILLANCE

Publisher

JMIR PUBLICATIONS, INC

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Causal Modeling to Mitigate Selection Bias and Unmeasured Confounding in Internet-Based Epidemiology of COVID-19: Model Development and Validation

Journal

JMIR PUBLIC HEALTH AND SURVEILLANCE

Publisher

JMIR PUBLICATIONS, INC

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Export Citation

Share Paper