Power for Genetic Association Studies with Random Allele Frequencies and Genotype Distributions

34Citations
Citations of this article
59Readers
Mendeley users who have this article in their library.

This article is free to access.

Abstract

One of the first and most important steps in planning a genetic association study is the accurate estimation of the statistical power under a proposed study design and sample size. In association studies for candidate genes or in fine-mapping applications, allele and genotype frequencies are often assumed to be known when, in fact, they are unknown (i.e., random variables from some distribution). For example, if we consider a diallelic marker with allele frequencies of 0.5 and 0.5 and Hardy-Weinberg proportions, the three genotype frequencies are often assumed to be 0.25, 0.50, and 0.25, and the statistical power is calculated. Unfortunately, ignoring this source of variation can inflate the estimated power of the study. In the present article, we propose averaging the estimates of power over the distribution of the genotype frequencies to calculate the true estimate of power for a fixed allele frequency. For the usual situation, in which allele frequencies in a population are not known, we propose placing a prior distribution on the allele frequency, taking advantage of any available genotype information. This Bayesian approach provides a more accurate estimate of power. We present examples for quantitative and qualitative traits in cohort studies of unrelated individuals and results from an extensive series of examples that show that ignoring the uncertainty in allele frequencies can inflate the estimated power of the study. We also present the results from case-control studies and show that standard methods may also overestimate power. As discussed in this article, the approach of fixing allele frequencies even if they are not known is the common approach to power calculations. We show that ignoring the sources of variation in allele frequencies tends to result in overestimates of power and, consequently, in studies that are underpowered.

References Powered by Scopus

An exact test for Hardy-Weinberg and multiple alleles

260Citations
N/AReaders
Get full text

The Finland-United States investigation of non-insulin-dependent diabetes mellitus genetics (FUSION) study. I. An autosomal genome scan for genes that predispose to type 2 diabetes

231Citations
N/AReaders
Get full text

Genetic Variants in the Epithelial Sodium Channel in Relation to Aldosterone and Potassium Excretion and Risk for Hypertension

114Citations
N/AReaders
Get full text

Cited by Powered by Scopus

Estimation of effect size distribution from genome-wide association studies and implications for future discoveries

554Citations
N/AReaders
Get full text

Somatic mutations and germline sequence variants in the expressed tyrosine kinase genes of patients with de novo acute myeloid leukemia

200Citations
N/AReaders
Get full text

Genetics of age at menarche: A systematic review

87Citations
N/AReaders
Get full text

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Cite

CITATION STYLE

APA

Ambrosius, W. T., Lange, E. M., & Langefeld, C. D. (2004). Power for Genetic Association Studies with Random Allele Frequencies and Genotype Distributions. American Journal of Human Genetics, 74(4), 683–693. https://doi.org/10.1086/383282

Readers' Seniority

Tooltip

PhD / Post grad / Masters / Doc 17

33%

Researcher 17

33%

Professor / Associate Prof. 14

27%

Lecturer / Post doc 4

8%

Readers' Discipline

Tooltip

Agricultural and Biological Sciences 29

60%

Medicine and Dentistry 9

19%

Biochemistry, Genetics and Molecular Bi... 6

13%

Mathematics 4

8%

Save time finding and organizing research with Mendeley

Sign up for free