Spatial and temporal heterogeneity in nucleotide sequence evolution

32Citations
Citations of this article
75Readers
Mendeley users who have this article in their library.

Abstract

Models of nucleotide substitution make many simplifying assumptions about the evolutionary process, including that the same process acts on all sites in an alignment and on all branches on the phylogenetic tree. Many studies have shown that in reality the substitution process is heterogeneous and that this variability can introduce systematic errors into many forms of phylogenetic analyses. I propose a new rigorous approach for describing heterogeneity called a temporal hidden Markov model (THMM), which can distinguish between among site (spatial) heterogeneity and among lineage (temporal) heterogeneity. Several versions of the THMM are applied to 16 sets of aligned sequences to quantitatively assess the different forms of heterogeneity acting within them. The most general THMM provides the best fit in all the data sets examined, providing strong evidence of pervasive heterogeneity during evolution. Investigating individual forms of heterogeneity provides further insights. In agreement with previous studies, spatial rate heterogeneity (rates across sites [RAS]) is inferred to be the single most prevalent form of heterogeneity. Interestingly, RAS appears so dominant that failure to independently include it in the THMM masks other forms of heterogeneity, particularly temporal heterogeneity. Incorporating RAS into the THMM reveals substantial temporal and spatial heterogeneity in nucleotide composition and bias toward transition substitution in all alignments examined, although the relative importance of different forms of heterogeneity varies between data sets. Furthermore, the improvements in model fit observed by adding complexity to the model suggest that the THMMs used in this study do not capture all the evolutionary heterogeneity occurring in the data. These observations all indicate that current tests may consistently underestimate the degree of temporal heterogeneity occurring in data. Finally, there is a weak link between the amount of heterogeneity detected and the level of divergence between the sequences, suggesting that variability in the evolutionary process will be a particular problem for deep phylogeny. © The Author 2008. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. All rights reserved.

References Powered by Scopus

A New Look at the Statistical Model Identification

41320Citations
N/AReaders
Get full text

Evolutionary trees from DNA sequences: A maximum likelihood approach

12287Citations
N/AReaders
Get full text

Dating of the human-ape splitting by a molecular clock of mitochondrial DNA

7305Citations
N/AReaders
Get full text

Cited by Powered by Scopus

TreeGraph 2: Combining and visualizing evidence from different phylogenetic analyses

1357Citations
N/AReaders
Get full text

Phylogenomics provides robust support for a two-domains tree of life

162Citations
N/AReaders
Get full text

The effects of alignment error and alignment filtering on the sitewise detection of positive selection

124Citations
N/AReaders
Get full text

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Cite

CITATION STYLE

APA

Whelan, S. (2008). Spatial and temporal heterogeneity in nucleotide sequence evolution. Molecular Biology and Evolution, 25(8), 1683–1694. https://doi.org/10.1093/molbev/msn119

Readers' Seniority

Tooltip

PhD / Post grad / Masters / Doc 30

42%

Researcher 25

35%

Professor / Associate Prof. 13

18%

Lecturer / Post doc 3

4%

Readers' Discipline

Tooltip

Agricultural and Biological Sciences 49

73%

Biochemistry, Genetics and Molecular Bi... 10

15%

Mathematics 5

7%

Computer Science 3

4%

Save time finding and organizing research with Mendeley

Sign up for free