Tandem and cryptic amino acid repeats accumulate in disordered regions of proteins

Abstract

Background: Amino acid repeats (AARs) are common features of protein sequences. They often evolve rapidly and are involved in a number of human diseases. They also show significant associations with particular Gene Ontology (GO) functional categories, particularly transcription, suggesting they play some role in protein function. It has recently been suggested that AARs play a significant role in the evolution of intrinsically unstructured regions (IURs) of proteins. We investigate the relationship between AAR frequency, AAR evolution and AAR localization within proteins, based on a set of 5,815 orthologous proteins from four mammalian genomes (human, chimpanzee, mouse and rat) and one bird genome (chicken). We consider two classes of AAR: tandem repeats, and cryptic repeats (regions of proteins containing overrepresentations of short amino acid repeats).

Results: Mammals show very similar repeat frequencies, but chicken shows lower frequencies of many of the cryptic repeats common in mammals. Regions flanking tandem AARs evolve more rapidly than the rest of the protein containing the repeat, and this phenomenon is more pronounced for non-conserved repeats than for conserved ones. GO associations are similar to those previously described for the mammals, but chicken cryptic repeats show fewer significant associations. Comparing the overlaps of AARs with IURs and protein domains showed that up to 96% of some AAR types are associated preferentially with IURs. However, no more than 15% of IURs contained an AAR.

Conclusions: The location of AARs within IURs explains many of their evolutionary properties. Further study is needed on the types of IURs containing AARs.

© 2009 Simon and Hancock; licensee BioMed Central Ltd.
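As an illustration of the kind of overlap comparison the abstract describes, the following is a minimal sketch and not the authors' pipeline. It finds tandem single-residue repeats in a protein sequence and reports what fraction of them overlap a set of disordered regions. The function names, the minimum run length of 5, the example sequence and the region coordinates are all assumptions made for illustration; cryptic repeats and the prediction of IURs themselves are not covered.

```python
import re
from typing import List, Tuple

Interval = Tuple[int, int]  # half-open [start, end), 0-based residue positions


def find_tandem_repeats(seq: str, min_len: int = 5) -> List[Tuple[str, int, int]]:
    """Return (residue, start, end) for each run of a single residue.

    min_len is an assumed threshold, not a value taken from the paper.
    """
    # One captured residue followed by at least (min_len - 1) copies of itself.
    pattern = re.compile(r"([A-Z])\1{%d,}" % (min_len - 1))
    return [(m.group(1), m.start(), m.end()) for m in pattern.finditer(seq)]


def overlaps(a: Interval, b: Interval) -> bool:
    """True if two half-open intervals share at least one position."""
    return a[0] < b[1] and b[0] < a[1]


def fraction_in_regions(repeats: List[Tuple[str, int, int]],
                        regions: List[Interval]) -> float:
    """Fraction of repeats that overlap any of the given regions."""
    if not repeats:
        return 0.0
    hits = sum(
        1 for _, start, end in repeats
        if any(overlaps((start, end), region) for region in regions)
    )
    return hits / len(repeats)


if __name__ == "__main__":
    toy_seq = "MKQQQQQQAVLSSSSSGT"   # hypothetical sequence
    toy_iurs = [(0, 9)]              # hypothetical disordered region
    reps = find_tandem_repeats(toy_seq)
    print(reps, fraction_in_regions(reps, toy_iurs))
```

In practice the disordered-region intervals would come from a disorder predictor, and the same overlap test could be run against annotated protein domains to compare the two proportions.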



Simon, M., & Hancock, J. M. (2009). Tandem and cryptic amino acid repeats accumulate in disordered regions of proteins. Genome Biology, 10(6). https://doi.org/10.1186/gb-2009-10-6-r59