Characterization and extraction of irredundant tandem motifs

6Citations
Citations of this article
3Readers
Mendeley users who have this article in their library.
Get full text

Abstract

We address the problem of extracting pairs of subwords (m 1,m 2) from a text string s of length n, such that, given also an integer constant d in input, m 1 and m 2 occur in tandem within a maximum distance of d symbols in s. The main effort of this work is to eliminate the possible redundancy from the candidate set of the so found tandem motifs. To this aim, we first introduce the concept of maximality, characterized by four specific conditions, that we show to be not deducible by the corresponding notion of maximality already defined for simple (i.e., non tandem) motifs. Then, we further eliminate the remaining redundancy by defining the concept of irredundancy for tandem motifs. We prove that the number of non-overlapping irredundant tandems is O(d 2 n) which, considering d as a constant, leads to a linear number of tandems in the length of the input string. This is an order of magnitude less than previously developed compact indexes for tandem extraction. As a further contribution we show an algorithm to extract this compact irredundant index. © 2012 Springer-Verlag Berlin Heidelberg.

Cite

CITATION STYLE

APA

Parida, L., Pizzi, C., & Rombo, S. E. (2012). Characterization and extraction of irredundant tandem motifs. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 7608 LNCS, pp. 385–397). Springer Verlag. https://doi.org/10.1007/978-3-642-34109-0_41

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free