Sampling precision to depth 10000 at CLEF 2009

Stephen Tomlinson

Conference Proceedings

Sampling precision to depth 10000 at CLEF 2009

Tomlinson S

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2010) 6241 LNCS 78-85

DOI: 10.1007/978-3-642-15754-7_8

2Citations

8Readers

Get full text

Abstract

We conducted an experiment to test the completeness of the relevance judgments for the monolingual German, French, English and Persian (Farsi) information retrieval tasks of the Ad Hoc Track of the Cross-Language Evaluation Forum (CLEF) 2009. In the ad hoc retrieval tasks, the system was given 50 natural language queries, and the goal was to find all of the relevant documents (with high precision) in a particular document set. For each language, we submitted a sample of the first 10000 retrieved items to investigate the frequency of relevant items at deeper ranks than the official judging depth of 60 for German, French and English and 80 for Persian. The results suggest that, on average, the percentage of relevant items assessed was less than 62% for German, 27% for French, 35% for English and 22% for Persian. © 2010 Springer-Verlag Berlin Heidelberg.

Cite

CITATION STYLE

APA

Tomlinson, S. (2010). Sampling precision to depth 10000 at CLEF 2009. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 6241 LNCS, pp. 78–85). https://doi.org/10.1007/978-3-642-15754-7_8

Sampling precision to depth 10000 at CLEF 2009

Abstract

Cite

Register to see more suggestions