Why scoring functions cannot assess tail properties

19Citations
Citations of this article
18Readers
Mendeley users who have this article in their library.

Abstract

Motivated by the growing interest in sound forecast evaluation techniques with an emphasis on distribution tails rather than average behaviour, we investigate a fundamental question arising in this context: Can statistical features of distribution tails be elicitable, i.e. be the unique minimizer of an expected score? We demonstrate that expected scores are not suitable to distinguish genuine tail properties in a very strong sense. Specifically, we introduce the class of max-functionals, which contains key characteristics from extreme value theory, for instance the extreme value index. We show that its members fail to be elicitable and that their elicitation complexity is in fact infinite under mild regularity assumptions. Further we prove that, even if the information of a max-functional is reported via the entire distribution function, a proper scoring rule cannot separate maxfunctional values. These findings highlight the caution needed in forecast evaluation and statistical inference if relevant information is encoded by such functionals.

References Powered by Scopus

Strictly proper scoring rules, prediction, and estimation

3500Citations
N/AReaders
Get full text

Making and evaluating point forecasts

717Citations
N/AReaders
Get full text

SCORING RULES FOR CONTINUOUS PROBABILITY DISTRIBUTIONS.

645Citations
N/AReaders
Get full text

Cited by Powered by Scopus

Precipitation nowcasting with orographic enhanced stacked generalization: Improving deep learning predictions on extreme events

60Citations
N/AReaders
Get full text

A review of predictive uncertainty estimation with machine learning

20Citations
N/AReaders
Get full text

A review of machine learning concepts and methods for addressing challenges in probabilistic hydrological post-processing and forecasting

19Citations
N/AReaders
Get full text

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Cite

CITATION STYLE

APA

Brehmer, J. R., & Kirstinstrokorb. (2019). Why scoring functions cannot assess tail properties. Electronic Journal of Statistics, 13(2), 4015–4034. https://doi.org/10.1214/19-EJS1622

Readers' Seniority

Tooltip

PhD / Post grad / Masters / Doc 11

79%

Researcher 2

14%

Professor / Associate Prof. 1

7%

Readers' Discipline

Tooltip

Mathematics 5

42%

Economics, Econometrics and Finance 3

25%

Computer Science 2

17%

Physics and Astronomy 2

17%

Save time finding and organizing research with Mendeley

Sign up for free