A linear time approximation algorithm for the DCJ distance for genomes with bounded number of duplicates

2Citations
Citations of this article
4Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Rearrangements are large-scale mutations in genomes, responsible for complex changes and structural variations. Most rearrangements that modify the organization of a genome can be represented by the double cut and join (DCJ) operation. Given two genomes with the same content, so that we have exactly the same number of copies of each gene in each genome, we are interested in the problem of computing the rearrangement distance between them, i.e., finding the minimum number of DCJ operations that transform one genome into the other. We propose a linear time approximation algorithm with approximation factor O(k) for the DCJ distance problem, where k is the maximum number of duplicates of any gene in the input genomes. Our algorithm uses as an intermediate step an O(k)-approximation for the minimum common string partition problem, which is closely related to the DCJ distance problem. Experiments on simulated data sets show that the algorithm is very competitive both in efficiency and quality of the solutions.

Cite

CITATION STYLE

APA

Rubert, D. P., Feijão, P., Braga, M. D. V., Stoye, J., & Martinez, F. V. (2016). A linear time approximation algorithm for the DCJ distance for genomes with bounded number of duplicates. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 9838 LNCS, pp. 293–306). Springer Verlag. https://doi.org/10.1007/978-3-319-43681-4_24

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free