Finding transcription factor binding motifs for coregulated genes by combining sequence overrepresentation with cross-species conservation

Hui Jia; Jinming Li

Journal Article, Open Access

Journal of Probability and Statistics (2012)

DOI: 10.1155/2012/830575

3 Citations

12 Readers

Abstract

Computational methods for finding transcription factor binding motifs have long been sought because identifying these motifs experimentally is tedious. However, the prevailing methods yield a large number of false positive predictions because transcription factor binding sites (TFBSs) are short and variable. We propose a method that combines sequence overrepresentation with cross-species sequence conservation to detect TFBSs in the upstream regions of a given set of coregulated genes. We applied the method to 35 S. cerevisiae transcription factors with known DNA binding motifs, using orthologous sequences from the genomes of S. mikatae, S. bayanus, and S. paradoxus. For the majority of these transcription factors, the proposed method outperformed the single-genome motif finding methods MEME and AlignACE as well as the multiple-genome methods PhyME and FootPrinter. Compared with the prevailing motif finding software, our method has some advantages in finding transcription factor binding motifs for potentially coregulated genes when the upstream sequences of those genes are available for multiple closely related species. Although we used yeast genomes to assess the method in this study, it might also be applied to other organisms, provided that suitable related species exist and the upstream sequences of the coregulated genes can be obtained for them. Copyright © 2012 Hui Jia and Jinming Li.
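The abstract does not spell out the scoring scheme, so the following is only a toy sketch of the general idea of combining the two signals, not the authors' algorithm. It assumes a simple alignment-free k-mer model: overrepresentation is a log ratio of how often a k-mer occurs in target versus background upstream sequences, and conservation is the fraction of genes in which the k-mer occurs in every species. All function names, the pseudocounts, the product used to combine the scores, and the sequences are hypothetical.

```python
from collections import Counter
from math import log


def kmers(seq, k):
    """Return the set of distinct length-k substrings of seq."""
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}


def overrepresentation(target_seqs, background_seqs, k):
    """Log ratio of the fraction of target vs. background sequences
    that contain each k-mer (add-one smoothing; an assumed scheme)."""
    t_counts, b_counts = Counter(), Counter()
    for s in target_seqs:
        t_counts.update(kmers(s, k))
    for s in background_seqs:
        b_counts.update(kmers(s, k))
    scores = {}
    for kmer, c in t_counts.items():
        t_frac = (c + 1) / (len(target_seqs) + 2)
        b_frac = (b_counts[kmer] + 1) / (len(background_seqs) + 2)
        scores[kmer] = log(t_frac / b_frac)
    return scores


def conservation(kmer, ortholog_groups):
    """Fraction of genes whose upstream sequence contains kmer
    in every species of the group (no alignment is performed)."""
    hits = sum(all(kmer in seq for seq in group) for group in ortholog_groups)
    return hits / len(ortholog_groups)


def rank_motifs(ortholog_groups, background_seqs, k):
    """Rank k-mers by overrepresentation weighted by conservation.
    The first sequence of each group is treated as the reference species."""
    reference = [group[0] for group in ortholog_groups]
    over = overrepresentation(reference, background_seqs, k)
    combined = {
        kmer: score * conservation(kmer, ortholog_groups)
        for kmer, score in over.items()
        if score > 0  # keep only k-mers enriched in the target set
    }
    return sorted(combined.items(), key=lambda item: item[1], reverse=True)


# Toy data: two "coregulated genes", each with three species' upstream regions.
toy_groups = [
    ["AATGACTCAA", "CCTGACTCGG", "GTTGACTCTT"],
    ["GGTGACTCCC", "ATTGACTCAT", "CATGACTCGA"],
]
toy_background = ["AAAAAAAAAA", "CCCCCCCCCC", "GGGGGGGGGG", "ACGTACGTAC"]

ranked = rank_motifs(toy_groups, toy_background, k=6)
```

In this toy input the only 6-mer shared by every sequence is the planted one, so it ranks first, while k-mers that are enriched but not conserved receive a combined score of zero. A realistic implementation would need position weight matrices, a proper background model, and alignment-aware conservation, none of which is attempted here.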

Cite

Jia, H., & Li, J. (2012). Finding transcription factor binding motifs for coregulated genes by combining sequence overrepresentation with cross-species conservation. Journal of Probability and Statistics. https://doi.org/10.1155/2012/830575
