Model diagnostics and forecast evaluation are closely related tasks, with the former concerning in-sample goodness (or lack) of fit and the latter addressing predictive performance out-of-sample. We review the ubiquitous setting in which forecasts are cast in the form of quantiles or quantile-bounded prediction intervals. We distinguish unconditional calibration, which corresponds to classical coverage criteria, from the stronger notion of conditional calibration, as can be visualized in quantile reliability diagrams. Consistent scoring functionsincluding, but not limited to, the widely used asymmetricpiecewise linear score or pinball lossprovide for comparative assessment and ranking, and link to the coefficient of determination and skill scores. We illustrate the use of these tools on Engel's food expenditure data, the Global Energy Forecasting Competition 2014, and the US COVID-19 Forecast Hub.
CITATION STYLE
Gneiting, T., Wolffram, D., Resin, J., Kraus, K., Bracher, J., Dimitriadis, T., … Schienle, M. (2023, March 10). Model Diagnostics and Forecast Evaluation for Quantiles. Annual Review of Statistics and Its Application. Annual Reviews Inc. https://doi.org/10.1146/annurev-statistics-032921-020240
Mendeley helps you to discover research relevant for your work.