Robust analysis of phylogenetic tree space
Smith, M.R.
Authors
Dr Martin Smith martin.smith@durham.ac.uk
Associate Professor
Abstract
Phylogenetic analyses often produce large numbers of trees. Mapping trees’ distribution in ‘tree space’ can illuminate the behaviour and performance of search strategies, reveal distinct clusters of optimal trees, and expose differences between different data sources or phylogenetic methods – but the high-dimensional spaces defined by metric distances are necessarily distorted when represented in fewer dimensions. Here, I explore the consequences of this transformation in phylogenetic search results from 128 morphological datasets, using stratigraphic congruence – a complementary aspect of tree similarity – to evaluate the utility of low-dimensional mappings. I find that phylogenetic similarities between cladograms are most accurately depicted in tree spaces derived from information-theoretic tree distances or the quartet distance. Robinson–Foulds tree spaces exhibit prominent distortions and often fail to group trees according to phylogenetic similarity, whereas the strong influence of tree shape on the Kendall–Colijn distance makes its tree space unsuitable for many purposes. Distances mapped into two or even three dimensions often display little correspondence with true distances, which can lead to profound misrepresentation of clustering structure. Without explicit testing, one cannot be confident that a tree space mapping faithfully represents the true distribution of trees, nor that visually evident structure is valid. My recommendations for tree space validation and visualization are implemented in a new graphical user interface in the ‘TreeDist’ R package.
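One simple way to test how faithfully a low-dimensional mapping reproduces the original tree distances is to compare the two sets of pairwise distances directly. The sketch below is an illustration only: the abstract does not state which quality measure the paper uses, so the normalised stress computed here, the function name `normalised_stress`, and the toy distances are all assumptions rather than the author's method or the TreeDist API.

```python
from itertools import combinations
from math import dist, sqrt


def normalised_stress(original, points):
    """Return sqrt(sum((d - m)^2) / sum(d^2)) over all pairs of trees.

    original: dict mapping an index pair (i, j), with i < j, to the
              original distance d between tree i and tree j
    points:   list of coordinates, one low-dimensional point per tree;
              m is the Euclidean distance between two mapped points
    """
    squared_error = 0.0
    squared_total = 0.0
    for i, j in combinations(range(len(points)), 2):
        d = original[(i, j)]
        m = dist(points[i], points[j])
        squared_error += (d - m) ** 2
        squared_total += d ** 2
    return sqrt(squared_error / squared_total)


# Hypothetical data: four trees that are all equally distant from one another,
# mapped to the corners of a unit square.
equidistant = {pair: 1.0 for pair in combinations(range(4), 2)}
square = [(0.0, 0.0), (1.0, 0.0), (1.0, 1.0), (0.0, 1.0)]
print(normalised_stress(equidistant, square))

# Hypothetical data: three trees whose distances fit on a straight line.
line = {(0, 1): 1.0, (1, 2): 1.0, (0, 2): 2.0}
line_points = [(0.0, 0.0), (1.0, 0.0), (2.0, 0.0)]
print(normalised_stress(line, line_points))
```

Four mutually equidistant points cannot be placed in a plane without distortion, so the first mapping has non-zero stress, whereas the second reproduces its distances exactly. A value well above zero is a warning that structure visible in the plot may not reflect the true distribution of trees.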
Citation
Smith, M. (2022). Robust analysis of phylogenetic tree space. Systematic Biology, 71(5), 1255-1270. https://doi.org/10.1093/sysbio/syab100
Field | Value |
---|---|
Journal Article Type | Article |
Acceptance Date | Dec 23, 2021 |
Online Publication Date | Dec 28, 2021 |
Publication Date | Sep 2022 |
Deposit Date | Dec 1, 2021 |
Publicly Available Date | Dec 2, 2021 |
Journal | Systematic Biology |
Print ISSN | 1063-5157 |
Electronic ISSN | 1076-836X |
Publisher | Oxford University Press |
Peer Reviewed | Peer Reviewed |
Volume | 71 |
Issue | 5 |
Pages | 1255-1270 |
DOI | https://doi.org/10.1093/sysbio/syab100 |
Public URL | https://durham-repository.worktribe.com/output/1221723 |
Files
Published Journal Article (PDF, 4.5 Mb)
Publisher Licence URL: http://creativecommons.org/licenses/by/4.0/
Accepted Journal Article (PDF, 408 Kb)
Publisher Licence URL: http://creativecommons.org/licenses/by/4.0/
Copyright Statement
© The Author(s) 2021. Published by Oxford University Press, on behalf of the Society of Systematic Biologists.
This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.