A graph extension of the positional burrows-wheeler transform and its applications

9Citations
Citations of this article
61Readers
Mendeley users who have this article in their library.
Get full text

Abstract

We present a generalization of the Positional Burrows- Wheeler Transform, or PBWT, to genome graphs, which we call the gPBWT. A genome graph is a collapsed representation of a set of genomes described as a graph. In a genome graph, a haplotype corresponds to a restricted form of walk. The gPBWT is a compressible representation of a set of these graph-encoded haplotypes that allows for efficient subhaplotype match queries.We give efficient algorithms for gPBWT construction and query operations.We describe our implementation, showing the compression and search of 1000 Genomes data. As a demonstration, we use the gPBWT to quickly count the number of haplotypes consistent with random walks in a genome graph, and with the paths taken by mapped reads; results suggest that haplotype consistency information can be practically incorporated into graph-based read mappers.

Cite

CITATION STYLE

APA

Novak, A. M., Garrison, E., & Paten, B. (2016). A graph extension of the positional burrows-wheeler transform and its applications. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 9838 LNCS, pp. 246–256). Springer Verlag. https://doi.org/10.1007/978-3-319-43681-4_20

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free