Measurement of information retrieval techniques involves two stages: gathering documents for relevance assessment, and using those assessments to numerically evaluate effectiveness. We consider both stages in the context of the TREC experiments, asking whether they lead to measurements that are trustworthy and fair. Our detailed empirical investigation of the TREC results shows that the measured relative performance of systems appears to be reliable, but that recall is overestimated: it is likely that many relevant documents have not been found. We propose a new pooling strategy that can significantly increase the number of relevant documents found for a given assessment effort, without compromising fairness.
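The pooling idea in the abstract lends itself to a short illustration. The Python sketch below contrasts classic depth-k pooling (the union of the top-k documents from every run) with one plausible reading of an adaptive strategy that spends more judging effort on runs whose judged documents have so far yielded more relevant ones. The names `depth_k_pool` and `adaptive_pool` are hypothetical, and `qrels` stands in for a human assessor; this is a sketch under those assumptions, not the paper's exact algorithm.

```python
from collections import defaultdict

def depth_k_pool(runs, k=100):
    """Classic TREC-style pool: union of the top-k documents
    from every submitted run for one topic.

    runs: mapping from run id to a ranked list of document ids.
    """
    pool = set()
    for ranking in runs.values():
        pool.update(ranking[:k])
    return pool

def adaptive_pool(runs, qrels, budget, step=10):
    """Hypothetical adaptive pooling: repeatedly draw the next
    `step` unjudged documents from the run whose judged documents
    so far have the highest proportion of relevant ones, until
    `budget` judgments are spent.
    """
    depth = defaultdict(int)          # how far we have descended each run
    judged, found = set(), set()

    def yield_rate(rid):
        # judged documents within this run's explored prefix
        contributed = [d for d in runs[rid][:depth[rid]] if d in judged]
        if not contributed:
            return 1.0                # unexplored runs get the benefit of the doubt
        rel = sum(1 for d in contributed if d in found)
        return rel / len(contributed)

    while len(judged) < budget:
        candidates = [r for r in runs if depth[r] < len(runs[r])]
        if not candidates:
            break                     # every run exhausted before budget spent
        best = max(candidates, key=yield_rate)
        for doc in runs[best][depth[best]:depth[best] + step]:
            if doc not in judged:
                judged.add(doc)
                if doc in qrels:      # simulated relevance judgment
                    found.add(doc)
                if len(judged) >= budget:
                    break
        depth[best] += step
    return judged, found

# Toy usage: two runs over six documents, two of them relevant.
runs = {
    "sys_a": ["d1", "d2", "d3", "d4"],
    "sys_b": ["d3", "d5", "d1", "d6"],
}
qrels = {"d1", "d5"}
pool = depth_k_pool(runs, k=2)               # {"d1", "d2", "d3", "d5"}
judged, found = adaptive_pool(runs, qrels, budget=5)
```

The design intuition matches the abstract's claim: if judging effort is reallocated toward runs that keep surfacing relevant documents, more relevant documents are found for the same budget, while still drawing from every run so no system is unfairly excluded.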
CITATION STYLE
Zobel, J. (1998). How Reliable are the Results of Large-Scale Information Retrieval Experiments? In SIGIR 1998 - Proceedings of the 21st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (pp. 307–314). Association for Computing Machinery, Inc. https://doi.org/10.1145/290941.291014