CSHAP: Efficient haplotype frequency estimation based on sparse representation

1Citations
Citations of this article
11Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Motivation: Estimating haplotype frequencies from genotype data plays an important role in genetic analysis. In silico methods are usually computationally involved since phase information is not available. Due to tight linkage disequilibrium and low recombination rates, the number of haplotypes observed in human populations is far less than all the possibilities. This motivates us to solve the estimation problem by maximizing the sparsity of existing haplotypes. Here, we propose a new algorithm by applying the compressive sensing (CS) theory in the field of signal processing, compressive sensing haplotype inference (CSHAP), to solve the sparse representation of haplotype frequencies based on allele frequencies and between-allele co-variances. Results: Our proposed approach can handle both individual genotype data and pooled DNA data with hundreds of loci. The CSHAP exhibits the same accuracy compared with the state-of-the-art methods, but runs several orders of magnitude faster. CSHAP can also handle with missing genotype data imputations efficiently.

Cite

CITATION STYLE

APA

Zhou, Y., Zhang, H., & Yang, Y. (2019). CSHAP: Efficient haplotype frequency estimation based on sparse representation. Bioinformatics, 35(16), 2827–2833. https://doi.org/10.1093/bioinformatics/bty1040

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free