Two algorithms for approximate string matching in static texts

Petteri Jokinen; Esko Ukkonen

Conference Proceedings

Two algorithms for approximate string matching in static texts

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (1991) 520 LNCS 240-248

DOI: 10.1007/3-540-54345-7_67

N/ACitations

44Readers

Get full text

Abstract

The problem of finding all approximate occurrences P′ of a pattern string P in a text string T such that the edit distance between P and P′ is ≤k is considered. We concentrate on a scheme in which T is first preprocessed to make the subsequent searches with different P fast. Two preprocessing methods and the corresponding search algorithms are described. The first is based suffix automata and is applicable for edit distances with general edit operation costs. The second is a special design for unit cost edit distance and is based on q-gram lists. The preprocessing needs in both cases time and space O(|T|). The search algorithms run in the worst case in time O(|P‖T|) or O(k|T|), and in the best case in time O(|P|).

Cite

CITATION STYLE

APA

Jokinen, P., & Ukkonen, E. (1991). Two algorithms for approximate string matching in static texts. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 520 LNCS, pp. 240–248). Springer Verlag. https://doi.org/10.1007/3-540-54345-7_67

Two algorithms for approximate string matching in static texts

Abstract

Cite

Register to see more suggestions