A simple model based on mutation and selection explains trends in codon and amino-acid usage and GC composition within and across genomes


Abstract

Background: Correlations between genome composition (in terms of GC content) and usage of particular codons and amino acids have been widely reported, but poorly explained. We show here that a simple model of processes acting at the nucleotide level explains codon usage across a large sample of species (311 bacteria, 28 archaea and 257 eukaryotes). The model quantitatively predicts responses (slope and intercept of the regression line on genome GC content) of individual codons and amino acids to genome composition.

Results: Codons respond to genome composition on the basis of their GC content relative to their synonyms (explaining 71-87% of the variance in response among the different codons, depending on measure). Amino-acid responses are determined by the mean GC content of their codons (explaining 71-79% of the variance). Similar trends hold for genes within a genome. Position-dependent selection for error minimization explains why individual bases respond differently to directional mutation pressure.

Conclusions: Our model suggests that GC content drives codon usage (rather than the converse). It unifies a large body of empirical evidence concerning relationships between GC content and amino-acid or codon usage in disparate systems. The relationship between GC content and codon and amino-acid usage is ahistorical; it is replicated independently in the three domains of living organisms, reinforcing the idea that genes and genomes at mutation/selection equilibrium reproduce a unique relationship between nucleic acid and protein composition. Thus, the model may be useful in predicting amino-acid or nucleotide sequences in poorly characterized taxa.
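The quantities named in the abstract can be illustrated with a minimal Python sketch. It is not the authors' code. It assumes the standard genetic code, reads "GC content relative to synonyms" as a codon's G+C count minus the mean G+C count of its synonymous codons (one plausible interpretation), and uses an ordinary least-squares fit for the "response" (slope and intercept) of a usage frequency to genome GC content. All function names (`gc_count`, `relative_gc`, `mean_codon_gc`, `fit_line`) are hypothetical.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def gc_count(codon):
    """Number of G or C bases in a codon (0 to 3)."""
    return sum(base in "GC" for base in codon)


def relative_gc(codon):
    """G+C count of a codon minus the mean G+C count of its synonyms.

    Assumption: this is one reading of 'GC content relative to synonyms';
    the paper may define the measure differently.
    """
    amino_acid = CODON_TABLE[codon]
    synonyms = [c for c, a in CODON_TABLE.items() if a == amino_acid]
    return gc_count(codon) - sum(map(gc_count, synonyms)) / len(synonyms)


def mean_codon_gc(amino_acid):
    """Mean GC fraction (0 to 1) over all codons for a one-letter amino acid."""
    codons = [c for c, a in CODON_TABLE.items() if a == amino_acid]
    return sum(map(gc_count, codons)) / (3 * len(codons))


def fit_line(x, y):
    """Ordinary least-squares slope and intercept of y regressed on x."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


if __name__ == "__main__":
    # Synthetic, made-up numbers for illustration only (not data from the paper):
    # genome GC fractions and the usage frequency of one codon in each genome.
    genome_gc = [0.3, 0.4, 0.5, 0.6, 0.7]
    codon_freq = [0.10, 0.20, 0.30, 0.40, 0.50]
    slope, intercept = fit_line(genome_gc, codon_freq)
    print("response slope:", round(slope, 3), "intercept:", round(intercept, 3))
    print("relative GC of AAG:", relative_gc("AAG"))
    print("mean codon GC of Gly:", round(mean_codon_gc("G"), 3))
```

Under this reading, a codon with positive `relative_gc` would be expected to become more frequent as genome GC content rises, and an amino acid with high `mean_codon_gc` likewise; comparing fitted slopes against these values is the kind of test the abstract describes.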

Cite

Knight, R. D., Freeland, S. J., & Landweber, L. F. (2001). A simple model based on mutation and selection explains trends in codon and amino-acid usage and GC composition within and across genomes. Genome Biology, 2(4). https://doi.org/10.1186/gb-2001-2-4-research0010
