A parallel algorithm for fixed-length approximate string-matching with k-mismatches

Maxime Crochemore; Costas S. Iliopoulos; Solon P. Pissis

Conference Proceedings

A parallel algorithm for fixed-length approximate string-matching with k-mismatches

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2010) 6060 LNCS 92-101

DOI: 10.1007/978-3-642-12476-1_6

8Citations

5Readers

Get full text

Abstract

This paper deals with the approximate string-matching problem with Hamming distance. The approximate string-matching with k-mismatches problem is to find all locations at which a query of length m matches a factor of a text of length n with k or fewer mismatches. The approximate string-matching algorithms have both pleasing theoretical features, as well as direct applications, especially in computational biology.We consider a generalisation of this problem, the fixed-length approximate string-matching with k-mismatches problem: given a text t, a pattern x and an integer ℓ, search for all the occurrences in t of all factors of x of length ℓ with k or fewer mismatches with a factor of t. We present a practical parallel algorithm of comparable simplicity that requires only O(nm[ℓ/w]/p) time, where w is the word size of the machine (e.g. 32 or 64 in practice) and p the number of processors. Thus the algorithm's performance is independent of k and the alphabet size |Σ|. The proposed parallel algorithm makes use ofmessage-passing parallelism model, and word-level parallelism for efficient approximate string-matching. © Springer-Verlag Berlin Heidelberg 2010.

Author supplied keywords

Cite

CITATION STYLE

APA

Crochemore, M., Iliopoulos, C. S., & Pissis, S. P. (2010). A parallel algorithm for fixed-length approximate string-matching with k-mismatches. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 6060 LNCS, pp. 92–101). https://doi.org/10.1007/978-3-642-12476-1_6

A parallel algorithm for fixed-length approximate string-matching with k-mismatches

Abstract

Author supplied keywords

Cite

Register to see more suggestions