4.8 Article

Calibration after bootstrap for accurate uncertainty quantification in regression models

Journal

NPJ COMPUTATIONAL MATERIALS
Volume 8, Issue 1, Pages -

Publisher

NATURE PORTFOLIO
DOI: 10.1038/s41524-022-00794-8

Keywords

-

Funding

  1. National Science Foundation [1545481, 1636950, 1636910, 1931298]
  2. University of Wisconsin Harvey D. Spangler Professorship
  3. Direct For Computer & Info Scie & Enginr
  4. Div Of Information & Intelligent Systems [1636910] Funding Source: National Science Foundation
  5. Direct For Computer & Info Scie & Enginr
  6. Office of Advanced Cyberinfrastructure (OAC) [1636950] Funding Source: National Science Foundation
  7. Direct For Education and Human Resources
  8. Division Of Graduate Education [1545481] Funding Source: National Science Foundation
  9. Office of Advanced Cyberinfrastructure (OAC)
  10. Direct For Computer & Info Scie & Enginr [1931298] Funding Source: National Science Foundation

Ask authors/readers for more resources

This paper presents a calibration method to improve the accuracy of uncertainty estimation in machine learning models. It demonstrates that the direct bootstrap ensemble standard deviation is not an accurate estimate of uncertainty, but can be improved through calibration.
Obtaining accurate estimates of machine learning model uncertainties on newly predicted data is essential for understanding the accuracy of the model and whether its predictions can be trusted. A common approach to such uncertainty quantification is to estimate the variance from an ensemble of models, which are often generated by the generally applicable bootstrap method. In this work, we demonstrate that the direct bootstrap ensemble standard deviation is not an accurate estimate of uncertainty but that it can be simply calibrated to dramatically improve its accuracy. We demonstrate the effectiveness of this calibration method for both synthetic data and numerous physical datasets from the field of Materials Science and Engineering. The approach is motivated by applications in physical and biological science but is quite general and should be applicable for uncertainty quantification in a wide range of machine learning regression models.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.8
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available