A variety of statistics have been proposed as tools to help investigators assess the value of diagnostic tests or prediction models. The Brier score has been recommended on the grounds that it is a proper scoring rule that is affected by both discrimination and calibration. However, the Brier score is prevalence dependent in such a way that the rank ordering of tests or models may inappropriately vary with prevalence. We explored four common clinical scenarios: comparison of a highly accurate binary test with a continuous prediction model of moderate predictiveness; comparison of two binary tests where the relative importance of sensitivity versus specificity is inversely associated with prevalence; comparison of models and tests against the default strategies of assuming that all or no patients are positive; and comparison of two models miscalibrated in opposite directions. In each case, we found that the Brier score gave an inappropriate rank ordering of the tests and models. Conversely, net benefit, a decision-analytic measure, always favored the preferable test or model. The Brier score does not evaluate the clinical value of diagnostic tests or prediction models. We advocate, as an alternative, the use of decision-analytic measures such as net benefit.
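The two statistics the abstract contrasts are straightforward to compute. As a minimal sketch (the function names and the example data below are illustrative, not taken from the paper): the Brier score is the mean squared difference between predicted probabilities and observed binary outcomes, while net benefit at a risk threshold p_t weighs true positives against false positives by the odds p_t/(1 − p_t).

```python
def brier_score(y_true, y_prob):
    """Mean squared error of predicted probabilities against 0/1 outcomes.
    Lower is better; a proper scoring rule."""
    return sum((p - y) ** 2 for y, p in zip(y_true, y_prob)) / len(y_true)

def net_benefit(y_true, y_prob, threshold):
    """Decision-analytic net benefit at a risk threshold.
    Patients with predicted risk >= threshold are treated as positive;
    false positives are penalized by the odds threshold/(1 - threshold)."""
    n = len(y_true)
    tp = sum(1 for y, p in zip(y_true, y_prob) if p >= threshold and y == 1)
    fp = sum(1 for y, p in zip(y_true, y_prob) if p >= threshold and y == 0)
    return (tp - fp * threshold / (1 - threshold)) / n

# Hypothetical toy data for illustration only
outcomes = [1, 0, 1, 0]
predictions = [0.9, 0.1, 0.8, 0.2]
print(brier_score(outcomes, predictions))       # 0.025
print(net_benefit(outcomes, predictions, 0.5))  # 0.5
```

Note that net benefit is evaluated at a clinically chosen threshold, which is how it incorporates the relative costs of false positives and false negatives that the Brier score averages away.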
CITATION STYLE
Assel, M., Sjoberg, D. D., & Vickers, A. J. (2017). The Brier score does not evaluate the clinical utility of diagnostic tests or prediction models. Diagnostic and Prognostic Research, 1(1). https://doi.org/10.1186/s41512-017-0020-3