4.7 Article

Robust PAC(m & nbsp;): Training Ensemble Models Under Misspecification and Outliers

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TNNLS.2023.3295168

Keywords

Bayesian learning; ensemble models; machine learning; misspecification; outliers; robustness

Ask authors/readers for more resources

Standard Bayesian learning is suboptimal in generalization under misspecification and outliers. PAC-Bayes theory shows that the free energy criterion of Bayesian learning bounds the generalization error for Gibbs predictors under uncontaminated sampling distributions. This justifies the limitations of Bayesian learning in misspecified models and outliers. Recent work introduces PAC(m) bounds to enhance performance under misspecification, and this work proposes a robust free energy criterion combining the generalized logarithm score function with PAC(m) ensemble bounds, counteracting the effects of misspecification and outliers.
Standard Bayesian learning is known to have suboptimal generalization capabilities under misspecification and in the presence of outliers. Probably approximately correct (PAC)-Bayes theory demonstrates that the free energy criterion minimized by Bayesian learning is a bound on the generalization error for Gibbs predictors (i.e., for single models drawn at random from the posterior) under the assumption of sampling distributions uncontaminated by outliers. This viewpoint provides a justification for the limitations of Bayesian learning when the model is misspecified, requiring ensembling, and when data are affected by outliers. In recent work, PAC-Bayes bounds-referred to as PAC(m)-were derived to introduce free energy metrics that account for the performance of ensemble predictors, obtaining enhanced performance under misspecification. This work presents a novel robust free energy criterion that combines the generalized logarithm score function with PAC(m) ensemble bounds. The proposed free energy training criterion produces predictive distributions that are able to concurrently counteract the detrimental effects of misspecification-with respect to both likelihood and prior distribution-and outliers.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available