Efficiency Implications of Term Weighting for Passage Retrieval

Joel MacKenzie; Zhuyun Dai; Luke Gallagher; Jamie Callan

Conference Proceedings, Open Access

SIGIR 2020 - Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval (2020) 1821-1824

DOI: 10.1145/3397271.3401263

23 Citations

24 Readers

Abstract

Language model pre-training has spurred a great deal of attention for tasks involving natural language understanding, and has been successfully applied to many downstream tasks with impressive results. Within information retrieval, many of these solutions are too costly to stand on their own, requiring multi-stage ranking architectures. Recent work has begun to consider how to "backport" salient aspects of these computationally expensive models to previous stages of the retrieval pipeline. One such instance is DeepCT, which uses BERT to re-weight term importance in a given context at the passage level. This process, which is computed offline, results in an augmented inverted index with re-weighted term frequency values. In this work, we conduct an investigation of query processing efficiency over DeepCT indexes. Using a number of candidate generation algorithms, we reveal how term re-weighting can impact query processing latency, and explore how DeepCT can be used as a static index pruning technique to accelerate query processing without harming search effectiveness.

Citation (APA)

MacKenzie, J., Dai, Z., Gallagher, L., & Callan, J. (2020). Efficiency Implications of Term Weighting for Passage Retrieval. In SIGIR 2020 - Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval (pp. 1821–1824). Association for Computing Machinery, Inc. https://doi.org/10.1145/3397271.3401263
