Linkage disequilibrium based genotype calling from low-coverage shotgun sequencing reads


Abstract

Background: Recent technological advances have enabled sequencing of individual genomes, promising to revolutionize biomedical research. However, deep sequencing remains more expensive than microarrays for performing whole-genome SNP genotyping.

Results: In this paper we introduce a new multi-locus statistical model and computationally efficient genotype calling algorithms that integrate shotgun sequencing data with linkage disequilibrium (LD) information extracted from reference population panels such as HapMap or the 1000 Genomes Project. Experiments on publicly available 454, Illumina, and ABI SOLiD sequencing datasets suggest that integrating LD information yields genotype calling accuracy from low-coverage sequencing data that is comparable to that of microarray platforms. A software package implementing our algorithm, released under the GNU General Public License, is available at http://dna.engr.uconn.edu/software/GeneSeq/.

Conclusions: Integration of LD information leads to significant improvements in genotype calling accuracy compared to prior LD-oblivious methods, making low-coverage sequencing a viable alternative to microarrays for conducting large-scale genome-wide association studies.

© 2011 Duitama et al; licensee BioMed Central Ltd.
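To illustrate why prior information can matter at low coverage, here is a minimal single-SNP sketch in Python. It is not the multi-locus model from the paper or the GeneSeq implementation: it combines per-read allele likelihoods with a genotype prior through Bayes' rule, and the prior stands in for whatever an LD-aware model might supply at that site. The function names, the two-allele encoding ("A" for reference, "B" for alternate), the error rates, and the prior values are all assumptions made for illustration.

```python
import math

GENOTYPES = ("AA", "AB", "BB")  # A = reference allele, B = alternate allele


def read_likelihoods(alleles, error_rates):
    """Return P(reads | genotype) for AA, AB, BB, assuming independent reads.

    alleles     -- observed allele per read, each "A" or "B"
    error_rates -- per-read probability that the observed allele is wrong
    """
    likelihoods = []
    for alt_copies in (0, 1, 2):
        # Probability that a read was sampled from a chromosome carrying B.
        p_from_alt = alt_copies / 2
        log_lik = 0.0
        for allele, err in zip(alleles, error_rates):
            # Probability of observing B, allowing for a sequencing error.
            p_obs_alt = p_from_alt * (1 - err) + (1 - p_from_alt) * err
            log_lik += math.log(p_obs_alt if allele == "B" else 1 - p_obs_alt)
        likelihoods.append(math.exp(log_lik))
    return likelihoods


def genotype_posterior(alleles, error_rates, prior):
    """Combine read likelihoods with a genotype prior via Bayes' rule."""
    joint = [lik * p for lik, p in zip(read_likelihoods(alleles, error_rates), prior)]
    total = sum(joint)
    return [j / total for j in joint]


def call_genotype(alleles, error_rates, prior):
    """Return the maximum a posteriori genotype and the full posterior."""
    post = genotype_posterior(alleles, error_rates, prior)
    best = max(range(len(GENOTYPES)), key=post.__getitem__)
    return GENOTYPES[best], post


# One read showing the alternate allele, with an assumed 1% error rate.
flat_prior = (1 / 3, 1 / 3, 1 / 3)        # no prior information
informed_prior = (0.90, 0.09, 0.01)       # hypothetical prior, e.g. from a panel
print(call_genotype(["B"], [0.01], flat_prior)[0])
print(call_genotype(["B"], [0.01], informed_prior)[0])
```

With a single read, the likelihood alone cannot separate a heterozygous site from a homozygous alternate one with any confidence, so the choice of prior changes the call. This is the kind of gap that information from neighboring loci is meant to fill in the paper's setting.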

Citation

Duitama, J., Kennedy, J., Dinakar, S., Hernández, Y., Wu, Y., & Măndoiu, I. I. (2011). Linkage disequilibrium based genotype calling from low-coverage shotgun sequencing reads. BMC Bioinformatics, 12(Suppl. 1), S53. https://doi.org/10.1186/1471-2105-12-S1-S53
