Natural selection for operons depends on genome size

17Citations
Citations of this article
97Readers
Mendeley users who have this article in their library.

This article is free to access.

Abstract

In prokaryotes, genome size is associated with metabolic versatility, regulatory complexity, effective population size, and horizontal transfer rates. We therefore analyzed the covariation of genome size and operon conservation to assess the evolutionary models of operon formation and maintenance. In agreement with previous results, intraoperonic pairs of essential and of highly expressed genes are more conserved. Interestingly, intraoperonic pairs of genes are also more conserved when they encode proteins at similar cell concentrations, suggesting a role of cotranscription in diminishing the cost of waste and shortfall in gene expression. Larger genomes have fewer and smaller operons that are also less conserved. Importantly, lower conservation in larger genomes was observed for all classes of operons in terms of gene expression, essentiality, and balanced protein concentration. We reached very similar conclusions in independent analyses of three major bacterial clades (α- and β-Proteobacteria and Firmicutes). Operon conservation is inversely correlated to the abundance of transcription factors in the genome when controlled for genome size. This suggests a negative association between the complexity of genetic networks and operon conservation. These results show that genome size and/or its proxies are key determinants of the intensity of natural selection for operon organization. Our data fit better the evolutionary models based on the advantage of coregulation than those based on genetic linkage or stochastic gene expression. We suggest that larger genomes with highly complex genetic networks and many transcription factors endure weaker selection for operons than smaller genomes with fewer alternative tools for genetic regulation. © The Author(s) 2013. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

Figures

  • FIG. 1.—Association between genome size and the fraction of genes in operons, the length of operons and the density of coding sequences. Association between the fraction of genes in operons (A–C) and the length of operons as a function of genome size (D–F). Association between the fraction of genes in operons and the density of coding sequences (G–I). Results for a-Proteobacteria (A, D, G), b-Proteobacteria (B, E, H), and Firmicutes (C, F, I). Operon length was calculated as the average number of genes per operon for the whole genome operon predictions. Association between the variables was computed using the nonparametric Spearman rho.
  • FIG. 2.—Association of the OCI with genome size. (A) a-Proteobacteria; (B) b-Proteobacteria; and (C) Firmicutes. Association between the variables was computed using the nonparametric Spearman rho.
  • FIG. 3.—Association of the OCI with gene essentiality. Box plots indicate the distribution of OCI values per genome for pairs of intraoperonic essential (EE) and nonessential genes (NN). The box plots indicate the median (central line), the 25% and 75% percentiles (edges of boxes), the 1.5 interquartile ranges (whiskers), and the outliers (dots). Box plot associated P values correspond to the paired t-tests that means between the groups differ. Association between OCI and genome size for pairs of EE (black dots) and NN (gray dots) genes was computed using the nonparametric Spearman rho. For the a-Proteobacteria, the obligate intracellular cluster organisms were excluded of the correlation analysis (supplementary fig. S5, Supplementary Material online). Clades: (A) a-Proteobacteria, (B) b-Proteobacteria, and (C) Firmicutes.
  • FIG. 4.—Association of the OCI with expression levels. Box plots indicate the distribution of OCI values per genome for intra-operonic pairs of HE and LE genes. The box plots indicate the median (central line), the 25% and 75% percentiles (edges of boxes), the 1.5 interquartile ranges (whiskers) and the outliers (dots). Box plot associated P values correspond to paired-t statistical tests that means differ. Association between OCI and genome size for pairs of HE (black dots) and LE (gray dots) genes was computed using the nonparametric Spearman rho. For the a-Proteobacteria, the obligate intracellular cluster organisms were excluded of the correlation analysis (supplementary fig. S5, Supplementary Material online). Clades: (A) a-Proteobacteria, (B) b-Proteobacteria, and (C) Firmicutes. (*) The associations for b-Proteobacteria become nonsignificant after control for phylogenetic contrasts (supplementary table S4, Supplementary Material online).
  • FIG. 5.—Association of the OCI with balanced protein concentration levels. Box plots indicate the distribution of OCI values per genome for pairs of genes encoding proteins with BAL and UNB concentrations in the cell. The box plots indicate the median (central line), the 25% and 75% percentiles (edges of boxes), the 1.5 interquartile ranges (whiskers) and the outliers (dots). Box plot associated P values correspond to paired t-tests that means differ. Association between OCI and genome size for pairs of genes encoding proteins with balanced (black dots) and unbalanced (gray dots) concentrations in the cell was computed using the nonparametric Spearman rho. For the a-Proteobacteria, the obligate intracellular cluster organisms were excluded of the correlation analysis (supplementary fig. S5, Supplementary Material online). Clades: (A) a-Proteobacteria, (B) b-Proteobacteria, and (C) Firmicutes. (*) The associations for b-Proteobacteria become nonsignificant after control for phylogenetic contrasts (supplementary table S4, Supplementary Material online).
  • FIG. 6.—Association between the number of transcription factors, genome size, and the OCI. (A–C) For the three clades (a-Proteobacteria, b-Proteobacteria, and Firmicutes), the TF total number was inversely correlated with the OCI. Spearman correlation test (rho and P value) are indicated.
  • Table 1 Summary of the Statistical Spearman Association Tests (Rho) between Genome Size and the OCI in Function of Different Traits
  • Table 2 Summary of the Statistical t-Paired Tests and Associated P Values When Testing the Difference in the OCI between Genes Pairs with Different Traits (EE vs. NN; HE vs. LE; BAL vs. UNB)

Author supplied keywords

References Powered by Scopus

Search and clustering orders of magnitude faster than BLAST

16499Citations
N/AReaders
Get full text

A Simple, Fast, and Accurate Algorithm to Estimate Large Phylogenies by Maximum Likelihood

14987Citations
N/AReaders
Get full text

APE: Analyses of phylogenetics and evolution in R language

9374Citations
N/AReaders
Get full text

Cited by Powered by Scopus

Theory of prokaryotic genome evolution

109Citations
N/AReaders
Get full text

A Minimal Model of Ribosome Allocation Dynamics Captures Trade-offs in Expression between Endogenous and Synthetic Genes

92Citations
N/AReaders
Get full text

A new genome of Acidithiobacillus thiooxidans provides insights into adaptation to a bioleaching environment

50Citations
N/AReaders
Get full text

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Cite

CITATION STYLE

APA

Nuñez, P. A., Romero, H., Farber, M. D., & Rocha, E. P. C. (2013). Natural selection for operons depends on genome size. Genome Biology and Evolution, 5(11), 2242–2254. https://doi.org/10.1093/gbe/evt174

Readers over time

‘13‘14‘15‘16‘17‘18‘19‘20‘21‘22‘23‘24‘2506121824

Readers' Seniority

Tooltip

PhD / Post grad / Masters / Doc 40

54%

Researcher 25

34%

Professor / Associate Prof. 9

12%

Readers' Discipline

Tooltip

Agricultural and Biological Sciences 46

64%

Biochemistry, Genetics and Molecular Bi... 16

22%

Computer Science 8

11%

Chemistry 2

3%

Save time finding and organizing research with Mendeley

Sign up for free
0