The increasing availability of prokaryotic genome sequences has shown that simple sequence repeats (SSRs) are widespread in prokaryotes and that there is extensive variation in their length, number and distribution. Considering their potential importance in generating genomic diversity, we determined the distribution of a specific group of SSRs, mononucleotide repeats of size between 5 and 13 nt, in 157 sequenced prokaryotic genomes. The data obtained in the present study show that (i) a large number of mononucleotide SSRs is present in all prokaryotic genomes investigated, (ii) shorter repeats are much more abundant than longer repeats, and (iii) in the majority of the genomes, longer mononucleotide SSRs are excluded from coding regions although we identified several organisms where mononucleotide SSRs are not excluded from the coding regions. We also observed that some genomes contain more mononucleotide SSRs than expected, while others contain significantly less. Bacterial genomes that contain much less mononucleotide SSRs than expected are generally larger and more GC-rich, while bacterial genomes that contain much more mononucleotide SSRs than expected are in general smaller and more AT-rich. Finally, we also noted that genomes that contain a high fraction of horizontally transferred genes have a lower mononucleotide SSR density and that A and T are generally overrepresented in mononucleotide SSRs. © The Author 2006. Kazusa DNA Research Institute.
CITATION STYLE
Coenye, T., & Vandamme, P. (2005). Characterization of mononucleotide repeats in sequenced prokaryotic genomes. DNA Research, 12(4), 221–233. https://doi.org/10.1093/dnares/dsi009
Mendeley helps you to discover research relevant for your work.