Text indexing with errors

17Citations
Citations of this article
4Readers
Mendeley users who have this article in their library.
Get full text

Abstract

In this paper we address the problem of constructing an index for a text document or a collection of documents to answer various questions about the occurrences of a pattern when allowing a constant number of errors. In particular, our index can be built to report all occurrences, all positions, or all documents where a pattern occurs in time linear in the size of the query string and the number of results. This improves over previous work where the lookup time is not linear or depends upon the size of the document corpus. Our data structure has size O(n logk n) on average and with high probability for input size n and queries with up to k errors. Additionally, we present a trade-off between query time and index complexity that achieves worst-case bounded index size and preprocessing time with linear lookup time on average. © Springer-Verlag Berlin Heidelberg 2005.

Cite

CITATION STYLE

APA

Maaß, M. G., & Nowak, J. (2005). Text indexing with errors. In Lecture Notes in Computer Science (Vol. 3537, pp. 21–32). Springer Verlag. https://doi.org/10.1007/11496656_3

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free