4.4 Article

An automatic robust Bayesian approach to principal component regression

Journal

JOURNAL OF APPLIED STATISTICS
Volume 48, Issue 1, Pages 84-104

Publisher

TAYLOR & FRANCIS LTD
DOI: 10.1080/02664763.2019.1710478

Keywords

Dimension reduction; linear regression; outliers; principal component analysis; reversible jump algorithms; whole robustness

Funding

  1. NSERC (Natural Sciences and Engineering Research Council of Canada)
  2. FRQNT (Le Fonds de recherche du Quebec - Nature et technologies)
  3. SOA (Society of Actuaries)

Ask authors/readers for more resources

Principal component regression uses PCs as regressors and is useful for prediction in high-dimensional covariate settings. A Bayesian approach is introduced for robust handling of outliers and efficient identification of significant components.
Principal component regression uses principal components (PCs) as regressors. It is particularly useful in prediction settings with high-dimensional covariates. The existing literature treating of Bayesian approaches is relatively sparse. We introduce a Bayesian approach that is robust to outliers in both the dependent variable and the covariates. Outliers can be thought of as observations that are not in line with the general trend. The proposed approach automatically penalises these observations so that their impact on the posterior gradually vanishes as they move further and further away from the general trend, corresponding to a concept in Bayesian statistics called whole robustness. The predictions produced are thus consistent with the bulk of the data. The approach also exploits the geometry of PCs to efficiently identify those that are significant. Individual predictions obtained from the resulting models are consolidated according to model-averaging mechanisms to account for model uncertainty. The approach is evaluated on real data and compared to its nonrobust Bayesian counterpart, the traditional frequentist approach and a commonly employed robust frequentist method. Detailed guidelines to automate the entire statistical procedure are provided. All required code is made available, see .

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.4
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available