Structured index organizations for high-throughput text querying

Vo Ngoc Anh; Alistair Moffat

Conference Proceedings

Structured index organizations for high-throughput text querying

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2006) 4209 LNCS 304-315

DOI: 10.1007/11880561_25

16Citations

38Readers

Get full text

Abstract

Inverted indexes are the preferred mechanism for supporting content-based queries in text retrieval systems, with the various data items usually stored compressed in some way. But different query modalities require that different information be held in the index. For example, phrase querying requires that word offsets be held as well as document numbers. In this study we describe an inverted index organization that provides efficient support for all of conjunctive Boolean queries, ranked queries, and phrase queries. Experimental results on a 426 GB document collection show that the methods we describe provide fast evaluation of all three querying modes. © Springer-Verlag Berlin Heidelberg 2006.

Cite

CITATION STYLE

APA

Anh, V. N., & Moffat, A. (2006). Structured index organizations for high-throughput text querying. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 4209 LNCS, pp. 304–315). Springer Verlag. https://doi.org/10.1007/11880561_25

Structured index organizations for high-throughput text querying

Abstract

Cite

Register to see more suggestions