4.4 Article

A two-stage latent factor regression method to model the common and unique effects of multiple highly correlated exposure variables

Journal

JOURNAL OF APPLIED STATISTICS
Volume -, Issue -, Pages -

Publisher

TAYLOR & FRANCIS LTD
DOI: 10.1080/02664763.2022.2138838

Keywords

Multicollinearity; variance inflation factor; factor analysis; principal component analysis

Funding

  1. Natural Sciences and Engineering Research Council of Canada (NSERC) [RGPIN/70212 -2019]

Ask authors/readers for more resources

In epidemiological and environmental health studies, accurately assessing the impact of multiple exposures on health outcomes is challenging. This study proposes a two-stage latent factor regression method that considers both latent factors and residual terms to address multicollinearity.
In many epidemiological and environmental health studies, developing an accurate exposure assessment of multiple exposures on a health outcome is often of interest. However, the problem is challenging in the presence of multicollinearity, which can lead to biased estimates of regression coefficients and inflated variance estimators. Selecting one exposure variable as a surrogate of multiple highly correlated exposure variables is often suggested in the literature as a solution to handle the multicollinearity problem. However, this may lead to loss of information, since the exposure variables that are highly correlated tend to have not only common but also additional effects on the outcome variable. In this study, a two-stage latent factor regression method is proposed. The key idea is to regress the dependent variable not only on the common latent factor(s) of the explanatory variables, but also on the residuals terms from the factor analysis as the explanatory variables. The proposed method is compared to the traditional latent factor regression and principal component regression for their performance of handling multicollinearity. Two case studies are presented. Simulation studies are performed to assess their performances in terms of the epidemiological interpretation and stability of parameter estimates.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.4
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available