Compressed string dictionaries

Abstract

The problem of storing a set of strings (a string dictionary) in compact form appears naturally in many cases. While classically it has represented a small part of the whole data to be processed (e.g., for natural language processing or for indexing text collections), recent applications in Web engines, RDF graphs, bioinformatics, and many others handle very large string dictionaries, whose size is a significant fraction of the whole data. Thus, efficient approaches to compress them are necessary. In this paper we empirically compare the time and space performance of some existing alternatives, as well as new ones we propose. We show that space reductions of up to 20% of the original size of the strings are possible while supporting dictionary searches within a few microseconds, and up to 10% within a few tens or hundreds of microseconds. © 2011 Springer-Verlag.
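To make the notion of a compact string dictionary concrete, the following is a minimal sketch of front coding over sorted buckets, a classical technique for this problem. The abstract does not name the techniques the paper compares, so this is not presented as the authors' method. The class name `FrontCodedDictionary`, the methods `locate` and `extract`, the 1-based IDs, and the `bucket_size` parameter are all assumptions made for illustration.

```python
from bisect import bisect_right
from os.path import commonprefix


class FrontCodedDictionary:
    """Illustrative sorted string dictionary using bucketed front coding.

    Hypothetical sketch: names and ID convention are assumptions,
    not taken from the paper.
    """

    def __init__(self, strings, bucket_size=4):
        words = sorted(set(strings))
        self.bucket_size = bucket_size
        self.size = len(words)
        self.headers = []  # first string of each bucket, stored in full
        self.buckets = []  # per bucket: (shared_prefix_length, suffix) pairs
        for start in range(0, len(words), bucket_size):
            chunk = words[start:start + bucket_size]
            self.headers.append(chunk[0])
            encoded = []
            for prev, cur in zip(chunk, chunk[1:]):
                # Store only what differs from the previous string.
                lcp = len(commonprefix([prev, cur]))
                encoded.append((lcp, cur[lcp:]))
            self.buckets.append(encoded)

    def _decode(self, b):
        """Yield the strings of bucket b in sorted order."""
        current = self.headers[b]
        yield current
        for lcp, suffix in self.buckets[b]:
            current = current[:lcp] + suffix
            yield current

    def locate(self, word):
        """Return the 1-based ID of word, or 0 if it is absent."""
        # Binary search on bucket headers, then a sequential scan.
        b = bisect_right(self.headers, word) - 1
        if b < 0:
            return 0
        for offset, candidate in enumerate(self._decode(b)):
            if candidate == word:
                return b * self.bucket_size + offset + 1
        return 0

    def extract(self, ident):
        """Return the string with the given 1-based ID."""
        if not 1 <= ident <= self.size:
            raise IndexError("ID out of range")
        b, offset = divmod(ident - 1, self.bucket_size)
        for i, candidate in enumerate(self._decode(b)):
            if i == offset:
                return candidate
```

In this sketch a larger `bucket_size` stores fewer full strings and so saves space, but each lookup decodes more entries sequentially. This is the same kind of space/time trade-off the abstract reports, though the figures there come from the paper's own experiments, not from this code.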

Cite

Brisaboa, N. R., Cánovas, R., Claude, F., Martínez-Prieto, M. A., & Navarro, G. (2011). Compressed string dictionaries. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 6630 LNCS, pp. 136–147). https://doi.org/10.1007/978-3-642-20662-7_12
