Exploring evaluation metrics: GMAP versus MAP

Sri Devi Ravana; Alistair Moffat

Conference Proceedings

ACM SIGIR 2008 - 31st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Proceedings (2008) 687-688

DOI: 10.1145/1390334.1390452

3 Citations

21 Readers

Abstract

In retrieval experiments, an effectiveness metric is used to generate a score for each system-topic pair being tested. It is then usual to average the system-topic scores to obtain a system score, which is used for the purpose of system comparison. In this paper we explore the ramifications of using the geometric mean (GMAP), rather than the arithmetic mean (MAP), when computing an aggregate system score from a set of system-topic scores. We find that GMAP does indeed handle variability in topic difficulty more consistently than does the usual MAP aggregation method.
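To make the two aggregation methods concrete, the following is a minimal sketch in Python, not the authors' code. The function names, the per-topic scores, and the small constant `eps` (added so that a zero score does not collapse the geometric mean to zero) are all assumptions introduced for illustration.

```python
import math


def mean_ap(scores):
    """Arithmetic mean of per-topic average precision scores (MAP)."""
    return sum(scores) / len(scores)


def geometric_mean_ap(scores, eps=1e-5):
    """Geometric mean of per-topic average precision scores (GMAP).

    eps is an assumed smoothing constant: it is added to every score
    before taking logarithms and subtracted again from the result.
    """
    log_sum = sum(math.log(s + eps) for s in scores)
    return math.exp(log_sum / len(scores)) - eps


# Hypothetical per-topic scores for two systems (illustrative values only).
system_a = [0.50, 0.50, 0.50, 0.50]  # uniform across topics
system_b = [0.90, 0.90, 0.19, 0.01]  # strong on two topics, weak on two

for name, scores in (("A", system_a), ("B", system_b)):
    print(name, round(mean_ap(scores), 4), round(geometric_mean_ap(scores), 4))
```

With these made-up scores the two systems have the same MAP, while GMAP is noticeably lower for system B because the geometric mean penalizes very low scores on individual topics.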

Citation (APA)

Ravana, S. D., & Moffat, A. (2008). Exploring evaluation metrics: GMAP versus MAP. In ACM SIGIR 2008 - 31st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Proceedings (pp. 687–688). https://doi.org/10.1145/1390334.1390452
