Necessary and frequent terms in queries

Jiepu Jiang; James Allan

Conference Proceedings

Necessary and frequent terms in queries

SIGIR 2014 - Proceedings of the 37th International ACM SIGIR Conference on Research and Development in Information Retrieval (2014) 1167-1170

DOI: 10.1145/2600428.2609536

1Citations

20Readers

Get full text

Abstract

Vocabulary mismatch has long been recognized as one of the major issues affecting search effectiveness. Ineffective queries usually fail to incorporate important terms and/or incorrectly include inappropriate keywords. However, in this paper we show another cause of reduced search performance: sometimes users issue reasonable query terms, but systems cannot identify the correct properties of those terms and take advantages of the properties. Specifically, we study two distinct types of terms that exist in all search queries: (1) necessary terms, for which term occurrence alone is indicative of document relevance; and (2) frequent terms, for which the relative term frequency is indicative of document relevance within the set of documents where the term appears. We evaluate these two properties of query terms in a dataset. Results show that only 1/3 of the terms are both necessary and frequent, while another 1/3 only hold one of the properties and the final third do not hold any of the properties. However, existing retrieval models do not clearly distinguish terms with the two properties and consider them differently. We further show the great potential of improving retrieval models by treating terms with distinct properties differently. Copyright 2014 ACM.

Author supplied keywords

Cite

CITATION STYLE

APA

Jiang, J., & Allan, J. (2014). Necessary and frequent terms in queries. In SIGIR 2014 - Proceedings of the 37th International ACM SIGIR Conference on Research and Development in Information Retrieval (pp. 1167–1170). Association for Computing Machinery. https://doi.org/10.1145/2600428.2609536

Necessary and frequent terms in queries

Abstract

Author supplied keywords

Cite

Register to see more suggestions