Space-efficient construction of LZ-index

13Citations
Citations of this article
6Readers
Mendeley users who have this article in their library.
Get full text

Abstract

A compressed full-text self-index is a data structure that replaces a text and in addition gives indexed access to it, while taking space proportional to the compressed text size. The LZ-index, in particular, requires 4uH k(1 + o(1)) bits of space, where u is the text length in characters and Hk is its k-th order empirical entropy. Although in practice the LZ-index needs 1.0-1.5 times the text size, its construction requires much more main memory (around 5 times the text size), which limits its applicability to large texts. In this paper we present a practical space-efficient algorithm to construct LZ-index, requiring (4 + ε)uHk +o(u) bits of space, for any constant 0 < ε < 1, and O(σu) time, being a the alphabet size. Our experimental results show that our method is efficient in practice, needing an amount of memory close to that of the final index. © Springer-Verlag Berlin Heidelberg 2005.

Cite

CITATION STYLE

APA

Arroyuelo, D., & Navarro, G. (2005). Space-efficient construction of LZ-index. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 3827 LNCS, pp. 1143–1152). https://doi.org/10.1007/11602613_113

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free