Skip to main content

Research Repository

Advanced Search

Robust analysis of phylogenetic tree space

Smith, M.R.

Robust analysis of phylogenetic tree space Thumbnail


Authors



Abstract

Phylogenetic analyses often produce large numbers of trees. Mapping trees’ distribution in ‘tree space’ can illuminate the behaviour and performance of search strategies, reveal distinct clusters of optimal trees, and expose differences between different data sources or phylogenetic methods – but the high-dimensional spaces defined by metric distances are necessarily distorted when represented in fewer dimensions. Here, I explore the consequences of this transformation in phylogenetic search results from 128 morphological datasets, using stratigraphic congruence – a complementary aspect of tree similarity – to evaluate the utility of low-dimensional mappings. I find that phylogenetic similarities between cladograms are most accurately depicted in tree spaces derived from information-theoretic tree distances or the quartet distance. Robinson–Foulds tree spaces exhibit prominent distortions and often fail to group trees according to phylogenetic similarity, whereas the strong influence of tree shape on the Kendall–Colijn distance makes its tree space unsuitable for many purposes. Distances mapped into two or even three dimensions often display little correspondence with true distances, which can lead to profound misrepresentation of clustering structure. Without explicit testing, one cannot be confident that a tree space mapping faithfully represents the true distribution of trees, nor that visually evident structure is valid. My recommendations for tree space validation and visualization are implemented in a new graphical user interface in the ‘TreeDist’ R package.

Citation

Smith, M. (2022). Robust analysis of phylogenetic tree space. Systematic Biology, 71(5), 1255-1270. https://doi.org/10.1093/sysbio/syab100

Journal Article Type Article
Acceptance Date Dec 23, 2021
Online Publication Date Dec 28, 2021
Publication Date 2022-09
Deposit Date Dec 1, 2021
Publicly Available Date Dec 2, 2021
Journal Systematic Biology
Print ISSN 1063-5157
Electronic ISSN 1076-836X
Publisher Oxford University Press
Peer Reviewed Peer Reviewed
Volume 71
Issue 5
Pages 1255-1270
DOI https://doi.org/10.1093/sysbio/syab100

Files


Accepted Journal Article (408 Kb)
PDF

Publisher Licence URL
http://creativecommons.org/licenses/by/4.0/

Copyright Statement
© The Author(s) 2021. Published by Oxford University Press, on behalf of the Society of Systematic Biologists.
This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.





You might also like



Downloadable Citations