Discrepancies in dbSNP confirmation rates and allele frequency distributions from varying genotyping error rates and patterns

Adele A. Mitchell; Michael E. Zwick; Aravinda Chakravarti; David J. Cutler

Journal ArticleOPEN ACCESS

Discrepancies in dbSNP confirmation rates and allele frequency distributions from varying genotyping error rates and patterns

Bioinformatics (2004) 20(7) 1022-1032

DOI: 10.1093/bioinformatics/bth034

36Citations

45Readers

Abstract

Summary: Three recent publications have examined the quality and completeness of public database single nucleotide polymorphism (dbSNP) and have come to dramatically different conclusions regarding dbSNPs false positive rate and the proportion of dbSNPs that are expected to be common. These studies employed different genotyping technologies and different protocols in determining minimum acceptable genotyping quality thresholds. Because heterozygous sites typically have lower quality scores than homozygous sites, a higher minimum quality threshold reduces the number of false positive SNPs, but yields fewer heterozygotes and leads to fewer confirmed SNPs. To account for the different confirmation rates and distributions of minor allele frequencies, we propose that the three confirmation studies have different false positive and false negative rates. We developed a mathematical model to predict SNP confirmation rates and the apparent distribution of minor allele frequencies under user-specified false positive and false negative rates. We applied this model to the three published studies and to our own resequencing effort. We conclude that the dbSNP false positive rate is ∼15-17% and that the reported confirmation studies have vastly different genotyping error rates and patterns. © Oxford University Press 2004; all rights reserved.

Cite

CITATION STYLE

APA

Mitchell, A. A., Zwick, M. E., Chakravarti, A., & Cutler, D. J. (2004). Discrepancies in dbSNP confirmation rates and allele frequency distributions from varying genotyping error rates and patterns. Bioinformatics, 20(7), 1022–1032. https://doi.org/10.1093/bioinformatics/bth034

Discrepancies in dbSNP confirmation rates and allele frequency distributions from varying genotyping error rates and patterns

Abstract

Cite

Register to see more suggestions