Why transcription factor binding sites are ten nucleotides long


Abstract

Gene expression is controlled primarily by transcription factors, whose DNA binding sites are typically 10 nt long. We develop a population-genetic model to understand how the length and information content of such binding sites evolve. Our analysis is based on an inherent trade-off between specificity, which is greater in long binding sites, and robustness to mutation, which is greater in short binding sites. The evolutionarily stable distribution of binding site lengths predicted by the model agrees with the empirical distribution (5–31 nt, with mean 9.9 nt for eukaryotes), and it is remarkably robust to variation in the underlying parameters of population size, mutation rate, number of transcription factor targets, and strength of selection for proper binding and selection against improper binding. In a systematic data set of eukaryotic and prokaryotic transcription factors, we also uncover strong relationships between the length of a binding site and its information content per nucleotide, as well as between the number of targets a transcription factor regulates and the information content in its binding sites. Our analysis explains these features as well as the remarkable conservation of binding site characteristics across diverse taxa. © 2012 by the Genetics Society of America.
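
The abstract refers to the information content of a binding site and to its information content per nucleotide. As a rough illustration only, the sketch below computes these quantities from a set of aligned sites using the common Shannon-entropy definition with a uniform background over A, C, G and T. That definition is an assumption here, since this page does not state the exact measure used in the article. The example sequences and the function name `information_content` are hypothetical, and no small-sample correction is applied.

```python
import math
from collections import Counter


def information_content(sites, alphabet="ACGT"):
    """Per-position information (in bits) for aligned, equal-length sites.

    Assumes a uniform background, so each position carries
    log2(len(alphabet)) bits minus the Shannon entropy of its base frequencies.
    """
    length = len(sites[0])
    if any(len(site) != length for site in sites):
        raise ValueError("all sites must have the same length")

    max_bits = math.log2(len(alphabet))
    per_position = []
    for i in range(length):
        counts = Counter(site[i] for site in sites)
        entropy = 0.0
        for base in alphabet:
            p = counts[base] / len(sites)
            if p > 0:
                entropy -= p * math.log2(p)
        per_position.append(max_bits - entropy)
    return per_position


# Hypothetical aligned binding sites, for illustration only.
sites = ["TGACTCA", "TGACTCA", "TGAGTCA", "TGACTAA"]

bits = information_content(sites)
total_bits = sum(bits)                 # information content of the whole site
bits_per_nt = total_bits / len(bits)   # information content per nucleotide
```

A fully conserved position contributes the maximum of 2 bits, while a position where all four bases are equally frequent contributes 0 bits, so under this measure a long site with degenerate positions can carry fewer bits per nucleotide than a short, strictly conserved one.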


Citation (APA)

Stewart, A. J., & Plotkin, J. B. (2012). Why transcription factor binding sites are ten nucleotides long. Genetics, 192(3), 973–985. https://doi.org/10.1534/genetics.112.143370
