A Lempel-Ziv compressed structure for document listing

Héctor Ferrada; Gonzalo Navarro

Conference Proceedings

A Lempel-Ziv compressed structure for document listing

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2013) 8214 LNCS 116-128

DOI: 10.1007/978-3-319-02432-5_16

9Citations

4Readers

Get full text

Abstract

Document listing is the problem of preprocessing a set of sequences, called documents, so that later, given a short string called the pattern, we retrieve the documents where the pattern appears. While optimal-time and linear-space solutions exist, the current emphasis is in reducing the space requirements. Current document listing solutions build on compressed suffix arrays. This paper is the first attempt to solve the problem using a Lempel-Ziv compressed index of the text collections. We show that the resulting solution is very fast to output most of the resulting documents, taking more time for the final ones. This makes this index particularly useful for interactive scenarios or when listing some documents is sufficient. Yet, it also offers a competitive space/time tradeoff when returning the full answers. © Springer International Publishing 2013.

Cite

CITATION STYLE

APA

Ferrada, H., & Navarro, G. (2013). A Lempel-Ziv compressed structure for document listing. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 8214 LNCS, pp. 116–128). Springer Verlag. https://doi.org/10.1007/978-3-319-02432-5_16

A Lempel-Ziv compressed structure for document listing

Abstract

Cite

Register to see more suggestions