Reducing bias of allele frequency estimates by modeling snp genotype data with informative missingness

N/ACitations
Citations of this article
18Readers
Mendeley users who have this article in their library.

Abstract

The presence of missing single-nucleotide polymorphism (SNP) genotypes is common in genetic studies. For studies with low-density SNPs, the most commonly used approach to dealing with genotype missingness is to simply remove the observations with missing genotypes from the analyses. This naïve method is straightforward but is valid only when the missingness is random. However, a given assay often has a different capability in geno-typing heterozygotes and homozygotes, causing the phenomenon of "differential dropout" in the sense that the missing rates of heterozygotes and homozygotes are different. In practice, differential dropout among genotypes exists in even carefully designed studies, such as the data from the HapMap project and the Wellcome Trust Case Control Consortium. Under the assumption of Hardy-Weinberg equilibrium and no genotyping error, we here propose a statistical method to model the differential dropout among different genotypes. Compared with the naïve method, our method provides more accurate allele frequency estimates when the differential dropout is present. To demonstrate its practical use, we further apply our method to the HapMap data and a scleroderma data set. © 2012 Lin and Liu.

Cite

CITATION STYLE

APA

Lin, W. Y., & Liu, N. (2012). Reducing bias of allele frequency estimates by modeling snp genotype data with informative missingness. Frontiers in Genetics, 3(JUN). https://doi.org/10.3389/fgene.2012.00107

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free