Efficient and practical algorithms for deducing the history of recombination in populations

0Citations
Citations of this article
1Readers
Mendeley users who have this article in their library.

This article is free to access.

Abstract

A phylogenetic network (or Ancestral Recombination Graph) is a generalization of a tree, allowing structural properties that are not tree-like. With the growth of genomic and population data, much of which does not fit ideal tree models, and the increasing appreciation of the genomic role of such phenomena as recombination (crossing-over and gene-conversion), recurrent and back mutation, horizontal gene transfer, and mobile genetic elements, there is greater need to understand the algorithmics and combinatorics of phylogenetic networks. In this talk I will survey a range of our recent results on phylogenetic networks with recombination, and show applications of these results to association mapping; finding recombination hotspots in genotype sequences; imputing the values of missing haplotype data; determining the extent of recombination in the evolution of LPL sequences; distinguishing the role of crossing-over from geneconversion in Arabidopsis; and characterizing some aspects of the haplotypes produced by the program PHASE. I will discuss the fundamental problem of constructing a phylogenetic network for a given set of binary (SNP) sequences derived from a known or unknown ancestral sequence, when each site in the sequence can mutate at most once in the network (the infinite sites model in population genetics), but recombination between sequences is allowed. The goal is to find a phylogenetic network that generates the given set of sequences, minimizing the number of recombination events used in the network. When all the "recombination cycles" are disjoint (which is likely with low recombination rates), we have developed efficient provably correct algorithms that find a network minimizing the number of recombinations, and have proven that the optimal solution is "essentially unique". I will also mention a network decomposition theory that shows the extent that these results can be generalized to arbitrary networks. For general phylogenetic networks (when the cycles are not constrained) we have developed algorithms that are efficient in practice and that empirically obtain close upper and lower bounds on the number of recombinations needed. In real data and simulations, these practical (heuristic) computations often produce bounds that match, demonstrating that a minimum recombination solution has been found. In small-size data we can guarantee that an optimal solution will been found by running an exponential-time algorithm to completion. For small-size data, we can also (provably) sample uniformly from the set of optimal solutions, and can also determine the phase of genotypic data so as to minimize the number of recombinations needed to derive the resulting haplotypes. More recently, we have extended the lower and upper bound algorithms to incorporate gene-conversion as an allowed operation. Those new algorithms allow us to investigate and distinguish the role of gene-conversion from single-crossover recombination. Various parts of this work are joint work with Satish Eddhu, Chuck Langley, Dean Hickerson, Yun Song, Yufeng Wu, V. Bansal, and Zhihong Ding. Support for this work was provided by NSF grant US-0513910. All the papers and associated software can be accesses at wwwcsif/cs.ucdavis.edu/~gusfield. © Springer-Verlag Berlin Heidelberg 2006.

Cite

CITATION STYLE

APA

Gusfield, D. (2006). Efficient and practical algorithms for deducing the history of recombination in populations. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 3992 LNCS-II, pp. 618–619). Springer Verlag. https://doi.org/10.1007/11758525_83

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free