Phylogenetics is concerned with inferring the genealogical relationships between a group of organisms (or taxa), and this relationship is usually expressed as an evolutionary tree. However, inferring the phylogenetic tree is not a trivial task since it is impossible to know the true evolutionary history for a set of organisms. As a result, most phylogenetic analyses rely on effective heuristics for obtaining accurate trees. These heuristics use tree score as a basis for establishing an accurate depiction of evolutionary tree relationships. Relatively little work has been done to analyze the relationship between improving tree scores (fitness) and topological accuracy (distance). In this paper, we present a new fitness-distance correlation coefficient called rFD to quantify the relationship between evolutionary trees. By applying this measure to three biological datasets consisting of 44, 60, and 174 taxa, our results show that improvements in fitness are strongly correlated (rFD > 0.8) with topological accuracy to the best-tree-overall. Moreover, we investigated the use of the rFD coefficient if the best overall tree is not available and found similar results. Thus, our results show that rFD is a robust measure with several potential applications such as the development of stopping criteria for phylogenetic search.
CITATION STYLE
Park, H. J., & Williams, T. L. (2009). A fitness distance correlation measure for evolutionary trees. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 5462 LNBI, pp. 331–342). https://doi.org/10.1007/978-3-642-00727-9_31
Mendeley helps you to discover research relevant for your work.