Simple compression code supporting random access and fast string matching

22Citations
Citations of this article
10Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Given a sequence S of n symbols over some alphabet ∑, we develop a new compression method that is (i) very simple to implement; (ii) provides O(1) time random access to any symbol of the original sequence; (iii) allows efficient pattern matching over the compressed sequence. Our simplest solution uses at most 2h + o(h) bits of space, where h = n(H0(S) + 1), and H 0(S) is the zeroth-order empirical entropy of S. We discuss a number of improvements and trade-offs over the basic method. The new method is applied to text compression. We also propose average case optimal string matching algorithms. © Springer-Verlag Berlin Heidelberg 2007.

Cite

CITATION STYLE

APA

Fredriksson, K., & Nikitin, F. (2007). Simple compression code supporting random access and fast string matching. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 4525 LNCS, pp. 203–216). Springer Verlag. https://doi.org/10.1007/978-3-540-72845-0_16

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free