ASPEN, a methodology for reconstructing protein evolution with improved accuracy using ensemble models

4Citations
Citations of this article
28Readers
Mendeley users who have this article in their library.

Abstract

Evolutionary reconstruction algorithms produce models of the evolutionary history of proteins or species. Such algorithms are highly sensitive to their inputs: the sequences used and their alignments. Here, we asked whether the variance introduced by selecting different input sequences could be used to better identify accurate evolutionary models. We subsampled from available ortholog sequences and measured the distribution of observed relationships between paralogs produced across hundreds of models inferred from the subsamples. We observed two important phenomena. First, the reproducibility of an all-sequence, single-alignment reconstruction, measured by comparing topologies inferred from 90% subsamples, directly correlates with the accuracy of that single-alignment reconstruction, producing a measurable value for something that has been traditionally unknowable. Second, topologies that are most consistent with the observations made in the ensemble are more accurate and we present a meta algorithm that exploits this property to improve model accuracy.

Cite

CITATION STYLE

APA

Sloutsky, R., & Naegle, K. M. (2019). ASPEN, a methodology for reconstructing protein evolution with improved accuracy using ensemble models. ELife, 8. https://doi.org/10.7554/eLife.47676

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free