The paper investigates the measures of retrieval effectiveness employed in the ad-hoc track of INEX 2006. In particular, it looks at how the evaluation of the Focused task is affected when different methodology is employed in generating the so-called ideal recall-base, which forms the ground-truth of the evaluation. The results show that the choice of methodology can impact on the obtained performance scores and the relative ranking of systems in relation to each other, especially when the effectiveness scores are uniformly low across all systems. Most XCG measures show very similar levels of sensitivity to changes in the ideal recall-base. © Springer-Verlag Berlin Heidelberg 2007.
CITATION STYLE
Kazai, G. (2007). Choosing an ideal recall-base for the evaluation of the focused task: sensitivity analysis of the XCG evaluation measures. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 4518 LNCS, pp. 35–44). Springer Verlag. https://doi.org/10.1007/978-3-540-73888-6_4
Mendeley helps you to discover research relevant for your work.