pyGenClean: Efficient tool for genetic data clean up before association testing

Louis Philippe Lemieux Perreault; Sylvie Provost; Marc André Legault; Amina Barhdadi; Marie Pierre Dubé

Conference ProceedingsOPEN ACCESS

pyGenClean: Efficient tool for genetic data clean up before association testing

Bioinformatics (2013) 29(13) 1704-1705

DOI: 10.1093/bioinformatics/btt261

14Citations

49Readers

Abstract

Summary: Genetic association studies making use of high-throughput genotyping arrays need to process large amounts of data in the order of millions of markers per experiment. The first step of any analysis with genotyping arrays is typically the conduct of a thorough data clean up and quality control to remove poor quality genotypes and generate metrics to inform and select individuals for downstream statistical analysis. We have developed pyGenClean, a bioinformatics tool to facilitate and standardize the genetic data clean up pipeline with genotyping array data. In conjunction with a source batch-queuing system, the tool minimizes data manipulation errors, accelerates the completion of the data clean up process and provides informative plots and metrics to guide decision making for statistical analysis. © The Author 2013.

Cite

CITATION STYLE

APA

Lemieux Perreault, L. P., Provost, S., Legault, M. A., Barhdadi, A., & Dubé, M. P. (2013). pyGenClean: Efficient tool for genetic data clean up before association testing. In Bioinformatics (Vol. 29, pp. 1704–1705). https://doi.org/10.1093/bioinformatics/btt261

pyGenClean: Efficient tool for genetic data clean up before association testing

Abstract

Cite

Register to see more suggestions