Investigating per topic upper bound for session search evaluation

Abstract

Session search is a complex Information Retrieval (IR) task, and as a result its evaluation is also complex. A great number of factors need to be considered, including document relevance, document novelty, aspect-related novelty discounting, and the user's effort in examining the documents. Due to this complexity, most existing session search evaluation metrics are NP-hard. Consequently, the optimal value of a metric, i.e. its upper bound, varies greatly with the actual search topics. In Cranfield-like settings such as the Text REtrieval Conference (TREC), system scores are usually averaged across all search topics; with undetermined upper bounds, however, it can be unfair to compare IR systems across different topics. This paper addresses the problem by investigating the actual per-topic upper bounds of existing session search metrics. By decomposing the metrics, we derive their upper bounds via mathematical optimization. We show that, after being normalized by these bounds, the NP-hard session search metrics provide robust comparisons across various search topics. The new normalized metrics are evaluated on official runs submitted to the TREC 2016 Dynamic Domain (DD) Track.
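
To make the normalization idea concrete, the following minimal Python sketch divides each topic's raw metric score by that topic's upper bound before averaging across topics. The topic IDs, raw scores, and upper-bound values are hypothetical placeholders, not values from the paper; obtaining the real upper bounds requires solving the optimization problem the paper derives.

```python
# Minimal sketch (not the authors' code): per-topic normalization of a
# session search metric by its topic-specific upper bound.
# All topic IDs and numbers below are hypothetical placeholders.

# Raw metric scores for one system, keyed by a TREC DD topic ID.
raw_scores = {"DD16-1": 0.42, "DD16-2": 0.15, "DD16-3": 0.58}

# Per-topic upper bounds of the metric, e.g. obtained by solving the
# optimization problem derived from decomposing the metric.
upper_bounds = {"DD16-1": 0.70, "DD16-2": 0.25, "DD16-3": 0.90}

# Normalize each topic's score by its upper bound so scores become
# comparable across topics, then average over topics.
normalized = {
    topic: raw_scores[topic] / upper_bounds[topic]
    for topic in raw_scores
}
mean_normalized = sum(normalized.values()) / len(normalized)

print(normalized)       # per-topic normalized scores in [0, 1]
print(mean_normalized)  # system score averaged over topics
```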

Citation (APA)

Tang, Z., & Yang, G. H. (2017). Investigating per topic upper bound for session search evaluation. In ICTIR 2017 - Proceedings of the 2017 ACM SIGIR International Conference on the Theory of Information Retrieval (pp. 185–192). Association for Computing Machinery, Inc. https://doi.org/10.1145/3121050.3121069
