Edit distance with duplications and contractions revisited

8Citations
Citations of this article
5Readers
Mendeley users who have this article in their library.
Get full text

Abstract

In this paper, we propose three algorithms for the problem of string edit distance with duplication and contraction operations, which improve the time complexity of previous algorithms for this problem. These include a faster algorithm for the general case of the problem, and two improvements which apply under certain assumptions on the cost function. The general algorithm is based on fast min-plus multiplication of square matrices, and obtains the running time of O( |∑|n3 log3 log n/log2 n), where n is the length of the input strings and |∑| is the alphabet size. This algorithm is further accelerated, under some assumption on the cost function, to O( |∑|(n2 + nn′2 log3 log n′/log2 n′)), time, where n′ is the length of the run-length encoding of the input. Another improvement is based on a new fast matrix-vector min-plus multiplication under a certain discreteness assumption, and yields an O( |∑|n3/log2 n) time algorithm. Furthermore, this algorithm is online, in the sense that one of the strings may be given letter by letter. As part of this algorithm we present the currently fastest online algorithm for weighted CFG parsing for discrete weighted grammars. This result is useful on its own. © 2011 Springer-Verlag.

Cite

CITATION STYLE

APA

Pinhas, T., Tsur, D., Zakov, S., & Ziv-Ukelson, M. (2011). Edit distance with duplications and contractions revisited. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 6661 LNCS, pp. 441–454). https://doi.org/10.1007/978-3-642-21458-5_37

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free