Approximate string matching using compressed suffix arrays

16Citations
Citations of this article
15Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Let T be a text of length n and P be a pattern of length m, both strings over a fixed finite alphabet A. The k-difference (k-mismatch, respectively) problem is to find all occurrences of P in T that have edit distance (Hamming distance, respectively) at most k from P. In this paper we investigate a well-studied case in which k = 1 and T is fixed and preprocessed into an indexing data structure so that any pattern query can be answered faster [16-19]. This paper gives a solution using O(n) bits indexing data structure with O(m log2 n) query time. To the best of our knowledge, this is the first result which requires linear indexing space. The results can be extended for the fc-difference problem with k ≥ 1. © Springer-Verlag 2004.

Cite

CITATION STYLE

APA

Huynh, T. N. D., Hon, W. K., Lam, T. W., & Sung, W. K. (2004). Approximate string matching using compressed suffix arrays. Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 3109, 434–444. https://doi.org/10.1007/978-3-540-27801-6_33

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free