A niched Pareto genetic algorithm for finding variable length regulatory motifs in DNA sequences

2Citations
Citations of this article
5Readers
Mendeley users who have this article in their library.

Abstract

The transcription factor binding sites also called as motifs are short, recurring patterns in DNA sequences that are presumed to have a biological function. Identification of the motifs from the promoter region of the genes is an important and unsolved problem specifically in the eukaryotic genomes. In this paper, we present a niched Pareto genetic algorithm to identify the regulatory motifs. This approach is based on the maximization of two objectives of the problem that is the motif length and the consensus similarity score. A long motif means it is less likely to be a false motif. The similarity score represents a motifs probability of conservation in a given set of sequences. Proposed method can find multiple, variable length motifs. In this method, we represented a candidate motif as a combination of length and starting position of the motif in each sequence of the co-regulated genes. This enables the algorithm to identify multiple motifs of variable length. We applied this approach on various data sets and the results show that it can find multiple motifs of variable length in co-regulated genes. © The Author(s) 2011.

Cite

CITATION STYLE

APA

Vijayvargiya, S., & Shukla, P. (2012). A niched Pareto genetic algorithm for finding variable length regulatory motifs in DNA sequences. 3 Biotech, 2(2), 141–148. https://doi.org/10.1007/s13205-011-0040-6

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free