Tail paradox, partial identifiability, and influential priors in bayesian branch length inference

76Citations
Citations of this article
127Readers
Mendeley users who have this article in their library.

This article is free to access.

Abstract

Recent studies have observed that Bayesian analyses of sequence data sets using the program MrBayes sometimes generate extremely large branch lengths, with posterior credibility intervals for the tree length (sum of branch lengths) excluding the maximum likelihood estimates. Suggested explanations for this phenomenon include the existence of multiple local peaks in the posterior, lack of convergence of the chain in the tail of the posterior,mixing problems, andmisspecified priors on branch lengths. Here, we analyze the behavior of Bayesian Markov chain Monte Carlo algorithms when the chain is in the tail of the posterior distribution and note that all these phenomena can occur. In Bayesian phylogenetics, the likelihood function approaches a constant instead of zero when the branch lengths increase to infinity. The flat tail of the likelihood can cause poor mixing and undue influence of the prior.We suggest that the main cause of the extreme branch length estimates produced in many Bayesian analyses is the poor choice of a default prior on branch lengths in current Bayesian phylogenetic programs. The default prior in MrBayes assigns independent and identical distributions to branch lengths, imposing strong (and unreasonable) assumptions about the tree length. The problem is exacerbated by the strong correlation between the branch lengths and parameters in models of variable rates among sites or among site partitions. To resolve the problem, we suggest twomultivariate priors for the branch lengths (called compound Dirichlet priors) that are fairly diffuse and demonstrate their utility in the special case of branch length estimation on a star phylogeny.Our analysis highlights the need for careful thought in the specification of high-dimensional priors in Bayesian analyses. © The Author 2011.

References Powered by Scopus

MrBayes 3: Bayesian phylogenetic inference under mixed models

27287Citations
N/AReaders
Get full text

Relaxed phylogenetics and dating with confidence

5160Citations
N/AReaders
Get full text

Maximum likelihood phylogenetic estimation from DNA sequences with variable rates over sites: Approximate methods

2285Citations
N/AReaders
Get full text

Cited by Powered by Scopus

Molecular phylogenetics: Principles and practice

503Citations
N/AReaders
Get full text

Bayesian species delimitation combining multiple genes and traits in a unified framework

150Citations
N/AReaders
Get full text

Innovative bayesian and parsimony phylogeny of dung beetles (coleoptera, scarabaeidae, scarabaeinae) enhanced by ontology-based partitioning of morphological characters

142Citations
N/AReaders
Get full text

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Cite

CITATION STYLE

APA

Rannala, B., Zhu, T., & Yang, Z. (2012). Tail paradox, partial identifiability, and influential priors in bayesian branch length inference. Molecular Biology and Evolution, 29(1), 325–335. https://doi.org/10.1093/molbev/msr210

Readers' Seniority

Tooltip

PhD / Post grad / Masters / Doc 63

60%

Researcher 27

26%

Professor / Associate Prof. 13

12%

Lecturer / Post doc 2

2%

Readers' Discipline

Tooltip

Agricultural and Biological Sciences 76

78%

Biochemistry, Genetics and Molecular Bi... 12

12%

Computer Science 6

6%

Environmental Science 4

4%

Article Metrics

Tooltip
Mentions
News Mentions: 1

Save time finding and organizing research with Mendeley

Sign up for free