Structural variation detection with read pair information—an improved null-hypothesis reduces bias

0Citations
Citations of this article
2Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Reads from paired-end and mate-pair libraries are often utilized to find structural variation in genomes, and one common approach is to use their fragment length for detection. After aligning readpairs to the reference, read-pair distances are analyzed for statistically significant deviations. However, previously proposed methods are based on a simplified model of observed fragment lengths that does not agree with data. We show how this model limits statistical analysis of identifying variants and propose a new model, by adapting a model we have previously introduced for contig scaffolding, which agrees with data. From this model we derive an improved null hypothesis that, when applied in the variant caller CLEVER, reduces the number of false positives and corrects a bias that contributes to more deletion calls than insertion calls. A reference implementation is freely available at https://github.com/ksahlin/GetDistr.

Cite

CITATION STYLE

APA

Sahlin, K., Frånberg, M., & Arvestad, L. (2016). Structural variation detection with read pair information—an improved null-hypothesis reduces bias. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 9649, pp. 176–188). Springer Verlag. https://doi.org/10.1007/978-3-319-31957-5_13

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free