CSHAP: Efficient haplotype frequency estimation based on sparse representation

Yinsheng Zhou; Han Zhang; Yaning Yang

Journal Article

CSHAP: Efficient haplotype frequency estimation based on sparse representation

Bioinformatics (2019) 35(16) 2827-2833

DOI: 10.1093/bioinformatics/bty1040

1Citations

11Readers

Get full text

Abstract

Motivation: Estimating haplotype frequencies from genotype data plays an important role in genetic analysis. In silico methods are usually computationally involved since phase information is not available. Due to tight linkage disequilibrium and low recombination rates, the number of haplotypes observed in human populations is far less than all the possibilities. This motivates us to solve the estimation problem by maximizing the sparsity of existing haplotypes. Here, we propose a new algorithm by applying the compressive sensing (CS) theory in the field of signal processing, compressive sensing haplotype inference (CSHAP), to solve the sparse representation of haplotype frequencies based on allele frequencies and between-allele co-variances. Results: Our proposed approach can handle both individual genotype data and pooled DNA data with hundreds of loci. The CSHAP exhibits the same accuracy compared with the state-of-the-art methods, but runs several orders of magnitude faster. CSHAP can also handle with missing genotype data imputations efficiently.

Cite

CITATION STYLE

APA

Zhou, Y., Zhang, H., & Yang, Y. (2019). CSHAP: Efficient haplotype frequency estimation based on sparse representation. Bioinformatics, 35(16), 2827–2833. https://doi.org/10.1093/bioinformatics/bty1040

CSHAP: Efficient haplotype frequency estimation based on sparse representation

Abstract

Cite

Register to see more suggestions