Representing document lengths with identifiers

0Citations
Citations of this article
5Readers
Mendeley users who have this article in their library.
Get full text

Abstract

The length of each indexed document is needed by most common text retrieval scoring functions to rank it with respect to the current query. For efficiency purposes information retrieval systems maintain this information in the main memory. This paper proposes a novel strategy to encode the length of each document directly in the document identifier, thus reducing main memory demand. The technique is based on a simple document identifier assignment method and a function allowing the approximate length of each indexed document to be computed analytically.

Cite

CITATION STYLE

APA

Perego, R., Silvestri, F., & Tonellotto, N. (2011). Representing document lengths with identifiers. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 6611 LNCS, pp. 665–669). Springer Verlag. https://doi.org/10.1007/978-3-642-20161-5_66

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free