Background: Although the sequence requirements for translation initiation regions have been frequently analysed, usually the highly expressed genes are not treated as a separate dataset. Results: To investigate this, we analysed the mRNA regions downstream of initiation codons in nine bacteria, three archaea and three unicellular eukaryotes, comparing the dataset of highly expressed genes to the dataset of all genes. In addition to the detailed analysis of the nucleotide and codon frequencies we compared the N-termini of highly expressed proteins to the N-termini of all proteins coded in the genome. Conclusion: The most conserved pattern was observed at the amino acid level: strong alanine over-representation was observed at the second amino acid position of highly expressed proteins. This pattern is well conserved in all three domains of life. © 2006 Tats et al; licensee BioMed Central Ltd.
CITATION STYLE
Tats, A., Remm, M., & Tenson, T. (2006). Highly expressed proteins have an increased frequency of alanine in the second amino acid position. BMC Genomics, 7. https://doi.org/10.1186/1471-2164-7-28
Mendeley helps you to discover research relevant for your work.