Robust Analysis of Phylogenetic Tree Space

26Citations
Citations of this article
53Readers
Mendeley users who have this article in their library.

This article is free to access.

Abstract

Phylogenetic analyses often produce large numbers of trees. Mapping trees' distribution in "tree space"can illuminate the behavior and performance of search strategies, reveal distinct clusters of optimal trees, and expose differences between different data sources or phylogenetic methods - but the high-dimensional spaces defined by metric distances are necessarily distorted when represented in fewer dimensions. Here, I explore the consequences of this transformation in phylogenetic search results from 128 morphological data sets, using stratigraphic congruence - a complementary aspect of tree similarity - to evaluate the utility of low-dimensional mappings. I find that phylogenetic similarities between cladograms are most accurately depicted in tree spaces derived from information-theoretic tree distances or the quartet distance. Robinson-Foulds tree spaces exhibit prominent distortions and often fail to group trees according to phylogenetic similarity, whereas the strong influence of tree shape on the Kendall-Colijn distance makes its tree space unsuitable for many purposes. Distances mapped into two or even three dimensions often display little correspondence with true distances, which can lead to profound misrepresentation of clustering structure. Without explicit testing, one cannot be confident that a tree space mapping faithfully represents the true distribution of trees, nor that visually evident structure is valid. My recommendations for tree space validation and visualization are implemented in a new graphical user interface in the "TreeDist"R package. [Multidimensional scaling; phylogenetic software; tree distance metrics; treespace projections.]

References Powered by Scopus

Laplacian eigenmaps for dimensionality reduction and data representation

6374Citations
N/AReaders
Get full text

Multidimensional scaling by optimizing goodness of fit to a nonmetric hypothesis

5299Citations
N/AReaders
Get full text

A Nonlinear Mapping for Data Structure Analysis

2831Citations
N/AReaders
Get full text

Cited by Powered by Scopus

Mito-nuclear discordance within Anthozoa, with notes on unique properties of their mitochondrial genomes

21Citations
N/AReaders
Get full text

Using Information Theory to Detect Rogue Taxa and Improve Consensus Trees

12Citations
N/AReaders
Get full text

Confusion will be my epitaph: genome-scale discordance stifles phylogenetic resolution of Holothuroidea

11Citations
N/AReaders
Get full text

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Cite

CITATION STYLE

APA

Smith, M. R. (2022). Robust Analysis of Phylogenetic Tree Space. Systematic Biology, 71(5), 1255–1270. https://doi.org/10.1093/sysbio/syab100

Readers' Seniority

Tooltip

PhD / Post grad / Masters / Doc 15

65%

Researcher 6

26%

Professor / Associate Prof. 2

9%

Readers' Discipline

Tooltip

Agricultural and Biological Sciences 12

44%

Biochemistry, Genetics and Molecular Bi... 8

30%

Computer Science 5

19%

Arts and Humanities 2

7%

Article Metrics

Tooltip
Mentions
News Mentions: 2

Save time finding and organizing research with Mendeley

Sign up for free