☆ 4.6 Review

Strictly proper scoring rules, prediction, and estimation

JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION (2007)

期刊

JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION

卷 102, 期 477, 页码 359-378

出版社

AMER STATISTICAL ASSOC

DOI: 10.1198/016214506000001437

关键词

Bayes factor; Bregman divergence; brier score; coherent; continuous ranked probability score; cross-validation; entropy; kernel score; loss function; minimum contrast estimation; negative definite function; prediction interval; predictive distribution; quantile forecast; scoring rule; skill score; strictly proper; utility function

类别

Statistics & Probability

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

摘要

Scoring rules assess the quality of probabilistic forecasts, by assigning a numerical score based on the predictive distribution and on the event or value that materializes. A scoring rule is proper. if the forecaster maximizes the expected score for an observation drawn from the distribution F if he or she issues the probabilistic forecast F, rather than G 4 F. It is strictly proper if the maximum is unique. In prediction problems, proper scoring rules encourage the forecaster to make careful assessments and to be honest. In estimation problems, strictly proper scoring rules provide attractive loss and utility functions that can be tailored to the problem at hand. This article reviews and develops the theory of proper scoring rules on general probability spaces, and proposes and discusses examples thereof. Proper scoring rules derive from convex functions and relate to information measures, entropy functions, and Bregman divergences. In the case of categorical variables, we prove a rigorous version of the Savage representation. Examples of scoring rules for probabilistic forecasts in the form of predictive densities include the logarithmic, spherical, pseudospherical, and quadratic scores. The continuous ranked probability score applies to probabilistic forecasts that take the form of predictive cumulative distribution functions. It generalizes the absolute error and forms a special case of a new and very general type of score, the energy score. Like many other scoring rules, the energy score admits a kernel representation in terms of negative definite functions, with links to inequalities of Hoeffding type, in both univariate and multivariate settings. Proper scoring rules for quantile and interval forecasts are also discussed. We relate proper scoring rules to Bayes factors and to cross-validation, and propose a novel form of cross-validation known as random-fold cross-validation. A case study on probabilistic weather forecasts in the North American Pacific Northwest illustrates the importance of propriety. We note optimum score approaches to point and quantile estimation, and propose the intuitively appealing interval score as a utility function in interval estimation that addresses width as well as coverage.

Strictly proper scoring rules, prediction, and estimation

期刊

JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION

出版社

AMER STATISTICAL ASSOC

关键词

类别

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Strictly proper scoring rules, prediction, and estimation

期刊

JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION

出版社

AMER STATISTICAL ASSOC

关键词

类别

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文