Text indexing with errors

Moritz G. Maaß; Johannes Nowak

Conference Proceedings

Lecture Notes in Computer Science (2005) 3537 21-32

DOI: 10.1007/11496656_3

Abstract

In this paper we address the problem of constructing an index for a text document or a collection of documents to answer various questions about the occurrences of a pattern when allowing a constant number of errors. In particular, our index can be built to report all occurrences, all positions, or all documents where a pattern occurs in time linear in the size of the query string and the number of results. This improves over previous work where the lookup time is not linear or depends upon the size of the document corpus. Our data structure has size O(n log^k n) on average and with high probability for input size n and queries with up to k errors. Additionally, we present a trade-off between query time and index complexity that achieves worst-case bounded index size and preprocessing time with linear lookup time on average. © Springer-Verlag Berlin Heidelberg 2005.
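To make the query semantics concrete, the following is a minimal sketch of what "occurrences of a pattern with up to k errors" can mean, assuming errors are counted as edit operations (insertion, deletion, substitution). It is not the index described in the abstract: it is the classical dynamic-programming scan over the whole text, which takes time proportional to text length times pattern length per query, which is the dependence on corpus size that an index is meant to avoid. The function name `approx_end_positions` is hypothetical and chosen only for illustration.

```python
def approx_end_positions(text: str, pattern: str, k: int) -> list[int]:
    """Return every end position j (1-based) such that some substring of
    text ending at j is within edit distance k of pattern.

    Baseline scan for illustration only; NOT the paper's index structure.
    """
    m = len(pattern)
    # col[i] = minimum edit distance between pattern[:i] and any substring
    # of the text ending at the current position (initially: before text).
    col = list(range(m + 1))
    hits = []
    for j, ch in enumerate(text, start=1):
        prev_diag = col[0]  # value of row i-1 in the previous column
        col[0] = 0          # a match may start at any text position
        for i in range(1, m + 1):
            cost = 0 if pattern[i - 1] == ch else 1
            new = min(
                prev_diag + cost,  # match or substitution
                col[i] + 1,        # extra text character
                col[i - 1] + 1,    # missing pattern character
            )
            prev_diag = col[i]
            col[i] = new
        if col[m] <= k:
            hits.append(j)
    return hits


if __name__ == "__main__":
    print(approx_end_positions("abcdef", "bcd", 0))
    print(approx_end_positions("abcdef", "bcd", 1))
```

With k = 0 this reduces to exact matching; raising k admits additional end positions whose best-aligned substring differs from the pattern by at most k edits.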

Cite

Maaß, M. G., & Nowak, J. (2005). Text indexing with errors. In Lecture Notes in Computer Science (Vol. 3537, pp. 21–32). Springer Verlag. https://doi.org/10.1007/11496656_3
