A compressed full-text self-index is a data structure that replaces a text and in addition gives indexed access to it, while taking space proportional to the compressed text size. The LZ-index, in particular, requires 4uH k(1 + o(1)) bits of space, where u is the text length in characters and Hk is its k-th order empirical entropy. Although in practice the LZ-index needs 1.0-1.5 times the text size, its construction requires much more main memory (around 5 times the text size), which limits its applicability to large texts. In this paper we present a practical space-efficient algorithm to construct LZ-index, requiring (4 + ε)uHk +o(u) bits of space, for any constant 0 < ε < 1, and O(σu) time, being a the alphabet size. Our experimental results show that our method is efficient in practice, needing an amount of memory close to that of the final index. © Springer-Verlag Berlin Heidelberg 2005.
CITATION STYLE
Arroyuelo, D., & Navarro, G. (2005). Space-efficient construction of LZ-index. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 3827 LNCS, pp. 1143–1152). https://doi.org/10.1007/11602613_113
Mendeley helps you to discover research relevant for your work.