Finding regulatory DNA motifs using alignment-free evolutionary conservation information

34Citations
Citations of this article
107Readers
Mendeley users who have this article in their library.

This article is free to access.

Abstract

As an increasing number of eukaryotic genomes are being sequenced, comparative studies aimed at detecting regulatory elements in intergenic sequences are becoming more prevalent. Most comparative methods for transcription factor (TF) binding site discovery make use of global or local alignments of orthologous regulatory regions to assess whether a particular DNA site is conserved across related organisms, and thus more likely to be functional. Since binding sites are usually short, sometimes degenerate, and often independent of orientation, alignment algorithms may not align them correctly. Here, we present a novel, alignment-free approach for using conservation information for TF binding site discovery. We relax the definition of conserved sites: we consider a DNA site within a regulatory region to be conserved in an orthologous sequence if it occurs anywhere in that sequence, irrespective of orientation. We use this definition to derive informative priors over DNA sequence positions, and incorporate these priors into a Gibbs sampling algorithm for motif discovery. Our approach is simple and fast. It requires neither sequence alignments nor the phylogenetic relationships between the orthologous sequences, yet it is more effective on real biological data than methods that do. © The Author(s) 2010. Published by Oxford University Press.

Cite

CITATION STYLE

APA

Gordân, R., Narlikar, L., & Hartemink, A. J. (2010). Finding regulatory DNA motifs using alignment-free evolutionary conservation information. Nucleic Acids Research, 38(6). https://doi.org/10.1093/nar/gkp1166

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free