SNP ascertainment bias in population genetic analyses: Why it is important, and how to correct it

  • Joseph Lachance
  • Sarah A. Tishkoff

Abstract

Whole genome sequencing and SNP genotyping arrays can paint strikingly different pictures of demographic history and natural selection. This is because genotyping arrays contain biased sets of pre-ascertained SNPs. In this short review, we use comparisons between high-coverage whole genome sequences of African hunter-gatherers and data from genotyping arrays to highlight how SNP ascertainment bias distorts population genetic inferences. Sample sizes and the populations in which SNPs are discovered affect the characteristics of observed variants. We find that SNPs on genotyping arrays tend to be older and present in multiple populations. In addition, genotyping arrays cause allele frequency distributions to be shifted towards intermediate frequency alleles, and estimates of linkage disequilibrium are modified. Since population genetic analyses depend on allele frequencies, it is imperative that researchers are aware of the effects of SNP ascertainment bias. With this in mind, we describe multiple ways to correct for SNP ascertainment bias.
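The shift towards intermediate frequency alleles described in the abstract can be illustrated with a small calculation. The sketch below is not the authors' method. It assumes a standard neutral site frequency spectrum (the proportion of SNPs with derived-allele count i is proportional to 1/i) and a hypothetical array design in which a SNP is kept only if it is polymorphic in a small discovery panel drawn at random from the full sample. The function names and the panel size are illustrative assumptions.

```python
from math import comb


def neutral_sfs(n):
    """Expected proportion of SNPs with derived-allele count i = 1..n-1
    in a sample of n chromosomes, assuming the standard neutral model
    (proportional to 1/i)."""
    weights = [1.0 / i for i in range(1, n)]
    total = sum(weights)
    return [w / total for w in weights]


def prob_discovered(i, n, d):
    """Probability that a SNP with i derived copies among n chromosomes
    is polymorphic in a random discovery panel of d of those chromosomes
    (sampling without replacement)."""
    all_ancestral = comb(n - i, d) / comb(n, d)
    all_derived = comb(i, d) / comb(n, d)
    return 1.0 - all_ancestral - all_derived


def ascertained_sfs(n, d):
    """Frequency spectrum of the SNPs that survive ascertainment in a
    discovery panel of size d (an assumed, simplified array design)."""
    kept = [p * prob_discovered(i, n, d)
            for i, p in enumerate(neutral_sfs(n), start=1)]
    total = sum(kept)
    return [k / total for k in kept]


def mean_maf(sfs, n):
    """Mean minor allele frequency implied by a frequency spectrum."""
    return sum(p * min(i, n - i) / n for i, p in enumerate(sfs, start=1))


if __name__ == "__main__":
    n, d = 40, 4  # hypothetical sample size and discovery panel size
    full = neutral_sfs(n)
    biased = ascertained_sfs(n, d)
    print("singleton share, all SNPs:        ", round(full[0], 3))
    print("singleton share, ascertained SNPs:", round(biased[0], 3))
    print("mean MAF, all SNPs:        ", round(mean_maf(full, n), 3))
    print("mean MAF, ascertained SNPs:", round(mean_maf(biased, n), 3))
```

Under these assumptions, rare variants are unlikely to be seen in a small discovery panel, so the ascertained spectrum has fewer singletons and a higher mean minor allele frequency than the full spectrum. Dividing each ascertained frequency class by `prob_discovered` is one simple way to undo this reweighting when the discovery scheme is known. Real array designs are usually more complex than this single random panel.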

Author-supplied keywords

  • African hunter-gatherers
  • Human genetics
  • Population genetics
  • SNP ascertainment bias
  • Whole genome sequencing

