On reduced amino acid alphabets for phylogenetic inference

Edward Susko; Andrew J. Roger

Journal ArticleOPEN ACCESS

On reduced amino acid alphabets for phylogenetic inference

Molecular Biology and Evolution (2007) 24(9) 2139-2150

DOI: 10.1093/molbev/msm144

131Citations

117Readers

Abstract

We investigate the use of Markov models of evolution for reduced amino acid alphabets or bins of amino acids. The use of reduced amino acid alphabets can ameliorate effects of model misspecification and saturation. We present algorithms for 2 different ways of automating the construction of bins: minimizing criteria based on properties of rate matrices and minimizing criteria based on properties of alignments. By simulation, we show that in the absence of model misspecification, the loss of information due to binning is found to be insubstantial, and the use of Markov models at the binned level is found to be almost as effective as the more appropriate missing data approach. By applying these approaches to real data sets where compositional heterogeneity and/or saturation appear to be causing biased tree estimation, we find that binning can improve topological estimation in practice. © The Author 2007. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. All rights reserved.

Author supplied keywords

Cite

CITATION STYLE

APA

Susko, E., & Roger, A. J. (2007). On reduced amino acid alphabets for phylogenetic inference. Molecular Biology and Evolution, 24(9), 2139–2150. https://doi.org/10.1093/molbev/msm144

On reduced amino acid alphabets for phylogenetic inference

Abstract

Author supplied keywords

Cite

Register to see more suggestions