Position heaps: A simple and dynamic text indexing data structure

Abstract

We address the problem of finding the locations of all instances of a string P in a text T, where preprocessing of T is allowed in order to facilitate the queries. Previous data structures for this problem include the suffix tree, the suffix array, and the compact DAWG. We modify a data structure called a sequence tree, which was proposed by Coffman and Eve (1970) [3] for hashing, and adapt it to the new problem. We can then produce a list of k occurrences of any string P in T in O(|P| + k) time. Because of properties shared by suffixes of a text that are not shared by arbitrary hash keys, we can build the structure in O(|T|) time, which is much faster than Coffman and Eve's algorithm. These bounds are as good as those for the suffix tree, suffix array, and the compact DAWG. The advantages are the elementary nature of some of the algorithms for constructing and using the data structure and the asymptotic bounds we can give for updating the data structure when the text is edited. © 2010 Elsevier B.V. All rights reserved.
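
To make the idea of the abstract concrete, here is a minimal Python sketch of a position-heap-style trie. It is an illustration under stated assumptions, not the authors' algorithm: it inserts suffixes from shortest to longest (an assumption of this sketch), it builds the trie naively rather than in O(|T|) time, and it verifies candidates on the search path by direct comparison, so it does not achieve the O(|P| + k) query bound. The names `Node`, `build_position_heap`, and `find_occurrences` are hypothetical.

```python
class Node:
    """Trie node labeled with one text position (None for the root)."""

    def __init__(self, pos=None):
        self.pos = pos
        self.children = {}


def build_position_heap(text):
    """Naive construction: one new node per suffix, shortest suffix first."""
    root = Node()
    n = len(text)
    for i in range(n - 1, -1, -1):
        node, j = root, i
        # Follow the longest prefix of text[i:] that is already a path.
        while j < n and text[j] in node.children:
            node = node.children[text[j]]
            j += 1
        # Each earlier suffix is shorter than text[i:], so the walk
        # always stops before the suffix is exhausted (j < n here).
        node.children[text[j]] = Node(i)
    return root


def find_occurrences(root, text, pattern):
    """Return the sorted start positions of pattern in text."""
    if not pattern:
        raise ValueError("this sketch assumes a non-empty pattern")
    m = len(pattern)
    found = []
    node = root
    for depth, ch in enumerate(pattern, start=1):
        node = node.children.get(ch)
        if node is None:
            # Fell off the trie: only verified path nodes can match.
            return sorted(found)
        # Nodes above depth m are only candidates, so verify them.
        if depth < m and text.startswith(pattern, node.pos):
            found.append(node.pos)
    # The whole pattern labels a path: every node in this subtree
    # (including the node reached) is an occurrence.
    stack = [node]
    while stack:
        current = stack.pop()
        found.append(current.pos)
        stack.extend(current.children.values())
    return sorted(found)


if __name__ == "__main__":
    text = "banana"
    heap = build_position_heap(text)
    print(find_occurrences(heap, text, "ana"))  # [1, 3]
```

The structure holds exactly one node per text position, which is what distinguishes it from a suffix trie. The price in this naive version is the verification step for nodes on the search path; the paper's bounds rely on additional machinery that this sketch leaves out.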

Ehrenfeucht, A., McConnell, R. M., Osheim, N., & Woo, S. W. (2011). Position heaps: A simple and dynamic text indexing data structure. Journal of Discrete Algorithms, 9, 100–121. https://doi.org/10.1016/j.jda.2010.12.001
