A parallel algorithm for fixed-length approximate string-matching with k-mismatches

8Citations
Citations of this article
5Readers
Mendeley users who have this article in their library.
Get full text

Abstract

This paper deals with the approximate string-matching problem with Hamming distance. The approximate string-matching with k-mismatches problem is to find all locations at which a query of length m matches a factor of a text of length n with k or fewer mismatches. The approximate string-matching algorithms have both pleasing theoretical features, as well as direct applications, especially in computational biology.We consider a generalisation of this problem, the fixed-length approximate string-matching with k-mismatches problem: given a text t, a pattern x and an integer ℓ, search for all the occurrences in t of all factors of x of length ℓ with k or fewer mismatches with a factor of t. We present a practical parallel algorithm of comparable simplicity that requires only O(nm[ℓ/w]/p) time, where w is the word size of the machine (e.g. 32 or 64 in practice) and p the number of processors. Thus the algorithm's performance is independent of k and the alphabet size |Σ|. The proposed parallel algorithm makes use ofmessage-passing parallelism model, and word-level parallelism for efficient approximate string-matching. © Springer-Verlag Berlin Heidelberg 2010.

Cite

CITATION STYLE

APA

Crochemore, M., Iliopoulos, C. S., & Pissis, S. P. (2010). A parallel algorithm for fixed-length approximate string-matching with k-mismatches. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 6060 LNCS, pp. 92–101). https://doi.org/10.1007/978-3-642-12476-1_6

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free