Effect of inverted index partitioning schemes on performance of query processing in parallel text retrieval systems

B. Barla Cambazoglu; Aytul Catal; Cevdet Aykanat

Conference Proceedings

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2006) 4263 LNCS 717-725

DOI: 10.1007/11902140_75

14 Citations

19 Readers

Abstract

Shared-nothing parallel text retrieval systems require the inverted index that represents a document collection to be partitioned among a number of processors. In general, the index can be partitioned based on either the terms or the documents in the collection, and the way the partitioning is done greatly affects the query processing performance of the parallel system. In this work, we investigate the effect of these two index partitioning schemes on query processing. We conduct experiments on a 32-node PC cluster, considering the case where the index is stored entirely on disk. Performance results are reported for a large (30 GB) document collection using an MPI-based parallel query processing implementation. © Springer-Verlag Berlin Heidelberg 2006.
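
The two partitioning schemes named in the abstract can be illustrated with a toy in-memory sketch. This is not the authors' MPI implementation: the function names, the round-robin assignment of terms and documents to processors, and the four-document collection are all assumptions chosen for illustration only.

```python
from collections import defaultdict


def build_index(docs):
    """Map each term to a list of (doc_id, term_frequency) postings sorted by doc_id."""
    index = defaultdict(dict)
    for doc_id, text in docs.items():
        for term in text.lower().split():
            index[term][doc_id] = index[term].get(doc_id, 0) + 1
    return {term: sorted(postings.items()) for term, postings in index.items()}


def partition_by_term(index, k):
    """Term-based: each processor stores the complete posting lists of some terms.

    Terms are assigned round-robin in sorted order (an assumption of this sketch).
    """
    parts = [dict() for _ in range(k)]
    for i, term in enumerate(sorted(index)):
        parts[i % k][term] = index[term]
    return parts


def partition_by_document(index, k):
    """Document-based: each processor stores a local index over some documents.

    Documents are assigned by doc_id modulo k (an assumption of this sketch).
    """
    parts = [dict() for _ in range(k)]
    for term, postings in index.items():
        for doc_id, tf in postings:
            parts[doc_id % k].setdefault(term, []).append((doc_id, tf))
    return parts


def processors_for_query(parts, query_terms):
    """Return the processors that hold at least one posting for the query."""
    return [p for p, part in enumerate(parts)
            if any(term in part for term in query_terms)]


# Hypothetical four-document collection split across two processors.
docs = {
    0: "parallel text retrieval",
    1: "inverted index partitioning",
    2: "parallel query processing",
    3: "text index",
}
index = build_index(docs)
term_parts = partition_by_term(index, 2)
doc_parts = partition_by_document(index, 2)

print(processors_for_query(term_parts, ["text"]))
print(processors_for_query(doc_parts, ["text"]))
```

In this sketch, a single-term query touches one processor under term-based partitioning, because that processor holds the whole posting list. Under document-based partitioning the same posting list is split, so every processor holding a matching document takes part in answering the query.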

Cite

APA

Cambazoglu, B. B., Catal, A., & Aykanat, C. (2006). Effect of inverted index partitioning schemes on performance of query processing in parallel text retrieval systems. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 4263 LNCS, pp. 717–725). Springer Verlag. https://doi.org/10.1007/11902140_75
