There is growing demand for statistical methods to analyse the rapidly increasing volume of biological sequence data. We present statistical methods that are useful when a protein alignment can be divided into two groups based on known features or traits. The approach is based on stratification of the data, and to show the applicability of the methods we present an analysis of genomic data from proteobacterial orders. A dataset of 25 periplasmic/extracellular bacterial endonuclease I proteins was compiled to identify genotypic characteristics that separate the cold-adapted proteins from orthologous sequences from organisms with a higher optimal growth temperature. Our results reveal that the cold-adapted proteins have a significantly more positively charged exterior. Life in a cold climate seems to be enabled by many minor structural modifications rather than by any particular amino acid substitution. Redistribution of charge might be one of the most important signatures of cold adaptation. © Springer-Verlag Berlin Heidelberg 2007.
Thorvaldsen, S., Ytterstad, E., & Flå, T. (2007). Inferring weak adaptations and selection biases in proteins from composition and substitution matrices. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 4463 LNBI, pp. 382–393). Springer Verlag. https://doi.org/10.1007/978-3-540-72031-7_35