‘Multi-SpaM’: A maximum-likelihood approach to phylogeny reconstruction using multiple spaced-word matches and quartet trees

Thomas Dencker; Chris André Leimeister; Michael Gerth; Christoph Bleidorn; Sagi Snir; Burkhard Morgenstern

Journal ArticleOPEN ACCESS

‘Multi-SpaM’: A maximum-likelihood approach to phylogeny reconstruction using multiple spaced-word matches and quartet trees

NAR Genomics and Bioinformatics (2020) 2(1)

DOI: 10.1093/nargab/lqz013

9Citations

15Readers

Abstract

Word-based or ‘alignment-free’ methods for phylogeny inference have become popular in recent years. These methods are much faster than traditional, alignment-based approaches, but they are generally less accurate. Most alignment-free methods calculate ‘pairwise’ distances between nucleic-acid or protein sequences; these distance values can then be used as input for tree-reconstruction programs such as neighbor-joining. In this paper, we propose the first word-based phylogeny approach that is based on ‘multiple’ sequence comparison and ‘maximum likelihood’. Our algorithm first samples small, gap-free alignments involving four taxa each. For each of these alignments, it then calculates a quartet tree and, finally, the program ‘Quartet MaxCut’ is used to infer a super tree for the full set of input taxa from the calculated quartet trees. Experimental results show that trees produced with our approach are of high quality.

Cite

CITATION STYLE

APA

Dencker, T., Leimeister, C. A., Gerth, M., Bleidorn, C., Snir, S., & Morgenstern, B. (2020). ‘Multi-SpaM’: A maximum-likelihood approach to phylogeny reconstruction using multiple spaced-word matches and quartet trees. NAR Genomics and Bioinformatics, 2(1). https://doi.org/10.1093/nargab/lqz013

‘Multi-SpaM’: A maximum-likelihood approach to phylogeny reconstruction using multiple spaced-word matches and quartet trees

Abstract

Cite

Register to see more suggestions