ParaHaplo 2.0: A program package for haplotype-estimation and haplotype-based whole-genome association study using parallel computing

Kazuharu Misawa; Naoyuki Kamatani

Journal Article (Open Access)

Source Code for Biology and Medicine (2010) 5

DOI: 10.1186/1751-0473-5-5


Abstract

Background: The use of haplotype-based association tests can improve the power of genome-wide association studies. Since the observed genotypes are unordered pairs of alleles, haplotype phase must be inferred. However, estimating haplotype phase is time-consuming. When millions of single-nucleotide polymorphisms (SNPs) are analyzed in a genome-wide association study, faster methods for haplotype estimation are required.

Methods: We developed a program package for parallel computation of haplotype estimation. Our program package, ParaHaplo 2.0, is intended for use on workstation clusters using the Intel Message Passing Interface (MPI). We compared the performance of our algorithm with that of the regular permutation test on the Japanese in Tokyo, Japan, and Han Chinese in Beijing, China, populations of the HapMap dataset.

Results: The parallel version of ParaHaplo 2.0 can estimate haplotypes 100 times faster than a non-parallel version of ParaHaplo.

Conclusion: ParaHaplo 2.0 is an invaluable tool for conducting haplotype-based genome-wide association studies (GWAS). The need for fast haplotype estimation using parallel computing will become increasingly important as the data sizes of such projects continue to increase. The executable binaries and program sources of ParaHaplo are available at the following address: http://en.sourceforge.jp/projects/parallelgwas/releases/.

© 2010 Misawa and Kamatani; licensee BioMed Central Ltd.
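The Background statement that genotypes are unordered pairs of alleles, so that phase must be inferred, can be illustrated with a minimal sketch. It is not taken from ParaHaplo. The function name `compatible_haplotype_pairs` and the 0/1/2 minor-allele-count encoding are assumptions made for this example. The sketch only enumerates the haplotype pairs consistent with one unphased genotype; it does not estimate which pair is most likely.

```python
from itertools import product


def compatible_haplotype_pairs(genotype):
    """Enumerate unordered haplotype pairs consistent with an unphased genotype.

    genotype: sequence of per-SNP allele counts (0, 1, or 2), a hypothetical
    encoding chosen for this illustration. A count of 1 marks a heterozygous
    site, where the two haplotypes carry different alleles.
    """
    het_sites = [i for i, g in enumerate(genotype) if g == 1]
    # Homozygous sites are fixed: count 0 -> allele 0, count 2 -> allele 1.
    # Heterozygous sites get a placeholder 0 that is overwritten below.
    base = [g // 2 for g in genotype]
    if not het_sites:
        return [(tuple(base), tuple(base))]

    # Fix the first heterozygous site so each unordered pair is listed once.
    first, rest = het_sites[0], het_sites[1:]
    pairs = []
    for choice in product((0, 1), repeat=len(rest)):
        h1, h2 = base[:], base[:]
        h1[first], h2[first] = 0, 1
        for site, allele in zip(rest, choice):
            h1[site], h2[site] = allele, 1 - allele
        pairs.append((tuple(h1), tuple(h2)))
    return pairs


if __name__ == "__main__":
    for pair in compatible_haplotype_pairs((1, 2, 1, 0)):
        print(pair)
```

A genotype with k heterozygous sites has 2^(k-1) compatible pairs for k ≥ 1, so the number of candidate phasings grows exponentially with the number of heterozygous SNPs. This is one reason phase estimation over many SNPs is computationally demanding.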

Citation (APA)

Misawa, K., & Kamatani, N. (2010). ParaHaplo 2.0: A program package for haplotype-estimation and haplotype-based whole-genome association study using parallel computing. Source Code for Biology and Medicine, 5. https://doi.org/10.1186/1751-0473-5-5
