On the Pluses and Minuses of Risk

Abstract

Evaluating the effectiveness of retrieval models has been a mainstay in the IR community since its inception. Generally speaking, the goal is to provide a rigorous framework to compare the quality of two or more models, and determine which of them is "better". However, defining "better" or "best" in this context is not a simple task. Computing the average effectiveness over many queries is the most common approach used in Cranfield-style evaluations. But averages can hide subtle trade-offs in retrieval models – a percentage of the queries may well perform worse than in a previous iteration of the model as a result of an optimization that improves some other subset. A growing body of work, referred to as risk-sensitive evaluation, seeks to incorporate these effects. We scrutinize current approaches to risk-sensitive evaluation, and consider how risk and reward might be recast to better account for human expectations of result quality on a query-by-query basis.
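To make the trade-off concrete, the following sketch contrasts a plain mean difference with a risk-sensitive measure in the style of URisk (Wang et al.), in which per-query losses against a baseline are penalised by a factor of (1 + α). The scores and α value below are hypothetical, and this is an illustrative sketch rather than the exact formulation discussed in the paper.

```python
def urisk(model_scores, baseline_scores, alpha=1.0):
    """Mean per-query gain over a baseline, with losses amplified by (1 + alpha).

    With alpha = 0 this reduces to the ordinary mean difference; larger
    alpha values punish queries the new model makes worse.
    """
    total = 0.0
    for m, b in zip(model_scores, baseline_scores):
        delta = m - b
        if delta < 0:
            delta *= (1 + alpha)  # penalise losses relative to the baseline
        total += delta
    return total / len(model_scores)

# Hypothetical per-query effectiveness (e.g. AP) for a baseline and a new model.
baseline = [0.30, 0.40, 0.50, 0.60]
model    = [0.50, 0.60, 0.55, 0.30]  # better on average, but one query degrades

mean_gain = urisk(model, baseline, alpha=0.0)   # ordinary mean difference
risk_gain = urisk(model, baseline, alpha=1.0)   # losses count double
```

Here the new model wins on average (`mean_gain` is positive), yet once the degraded query is weighted more heavily, `risk_gain` turns negative: exactly the kind of trade-off a simple average conceals.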

Citation (APA)

Benham, R., Moffat, A., & Culpepper, J. S. (2020). On the Pluses and Minuses of Risk. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 12004 LNCS, pp. 81–93). Springer. https://doi.org/10.1007/978-3-030-42835-8_8
