Maximally efficient modeling of DNA sequence motifs at all levels of complexity

22Citations
Citations of this article
55Readers
Mendeley users who have this article in their library.

This article is free to access.

Abstract

Identification of transcription factor binding sites is necessary for deciphering gene regulatory networks. Several new methods provide extensive data about the specificity of transcription factors but most methods for analyzing these data to obtain specificity models are limited in scope by, for example, assuming additive interactions or are inefficient in their exploration of more complex models. This article describes an approach-encoding of DNA sequences as the vertices of a regular simplex-that allows simultaneous direct comparison of simple and complex models, with higher-order parameters fit to the residuals of lower-order models. In addition to providing an efficient assessment of all model parameters, this approach can yield valuable insight into the mechanism of binding by highlighting features that are critical to accurate models. © 2011 by the Genetics Society of America.

Cite

CITATION STYLE

APA

Stormo, G. D. (2011). Maximally efficient modeling of DNA sequence motifs at all levels of complexity. Genetics, 187(4), 1219–1224. https://doi.org/10.1534/genetics.110.126052

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free