4.5 Article

On the marginal likelihood and cross-validation

Journal

BIOMETRIKA
Volume 107, Issue 2, Pages 489-496

Publisher

OXFORD UNIV PRESS
DOI: 10.1093/biomet/asz077

Keywords

Cross-validation; Marginal likelihood; Prequential scoring

Funding

  1. Alan Turing Institute
  2. Health Data Research UK
  3. Li Ka Shing Foundation
  4. Medical Research Council
  5. U.K. Engineering and Physical Sciences Research Council
  6. EPSRC [EP/R018561/1] Funding Source: UKRI

Ask authors/readers for more resources

In Bayesian statistics, the marginal likelihood, also known as the evidence, is used to evaluate model fit as it quantifies the joint probability of the data under the prior. In contrast, non-Bayesian models are typically compared using cross-validation on held-out data, either through k-fold partitioning or leave-p-out subsampling. We show that the marginal likelihood is formally equivalent to exhaustive leave-p-out cross-validation averaged over all values of p and all held-out test sets when using the log posterior predictive probability as the scoring rule. Moreover, the log posterior predictive score is the only coherent scoring rule under data exchangeability. This offers new insight into the marginal likelihood and cross-validation, and highlights the potential sensitivity of the marginal likelihood to the choice of the prior. We suggest an alternative approach using cumulative cross-validation following a preparatory training phase. Our work has connections to prequential analysis and intrinsic Bayes factors, but is motivated in a different way.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.5
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available