RIO: Analyzing proteomes by automated phylogenomics using resampled inference of orthologs

147Citations
Citations of this article
147Readers
Mendeley users who have this article in their library.

Abstract

Background: When analyzing protein sequences using sequence similarity searches, orthologous sequences (that diverged by speciation) are more reliable predictors of a new protein's function than paralogous sequences (that diverged by gene duplication). The utility of phylogenetic information in high-throughput genome annotation ("phylogenomics") is widely recognized, but existing approaches are either manual or not explicitly based on phylogenetic trees. Results: Here we present RIO (Resampled Inference of Orthologs), a procedure for automated phylogenomics using explicit phylogenetic inference. RIO analyses are performed over bootstrap resampled phylogenetic trees to estimate the reliability of orthology assignments. We also introduce supplementary concepts that are helpful for functional inference. RIO has been implemented as Perl pipeline connecting several C and Java programs. It is available at [http:// www.genetics.wustl.edu/eddy/forester/] . A web server is at [http://www.rio.wustl.edu/] . RIO was tested on the Arabidopsis thaliana and Caenorhabditis elegans proteomes. Conclusion: The RIO procedure is particularly useful for the automated detection of first representatives of novel protein subfamilies. We also describe how some orthologies can be misleading for functional inference. © 2002 Zmasek and Eddy; licensee BioMed Central Ltd.

Cite

CITATION STYLE

APA

Zmasek, C. M., & Eddy, S. R. (2002). RIO: Analyzing proteomes by automated phylogenomics using resampled inference of orthologs. BMC Bioinformatics, 3. https://doi.org/10.1186/1471-2105-3-14

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free