Two-stage extreme phenotype sequencing design for discovering and testing common and rare genetic variants: Efficiency and power


Abstract

Next-generation sequencing technology provides an unprecedented opportunity to identify rare susceptibility variants. Because it is not yet financially feasible to perform whole-genome sequencing on a large number of subjects, a two-stage design has been advocated as a practical option. In stage I, variants are discovered by sequencing the whole genomes of a small number of carefully selected individuals, typically those with extreme phenotypes. In stage II, the discovered variants are genotyped in a large number of individuals to assess associations. Using simulated data for unrelated individuals, we explore two important aspects of this two-stage design: the efficiency of discovering common and rare single-nucleotide polymorphisms (SNPs) in stage I and the impact of incomplete SNP discovery in stage I on the power of testing associations in stage II. We applied a sum test and a sum of squared score test for gene-based association analyses to evaluate the power of the two-stage design. We obtained the following results from extensive simulation studies and analysis of the GAW17 dataset. When individuals with trait values more extreme than the 99.7-99th quantile were included in stage I, the two-stage design could achieve power equal to or even higher than that of the one-stage design if the rare causal variants had large effect sizes. In such a design, fewer than half of the total SNPs, including more than half of the causal SNPs, were discovered; the discovered SNPs included nearly all SNPs with minor allele frequencies (MAFs) ≥5%, more than half of the SNPs with MAFs between 1% and 5%, and fewer than half of the SNPs with MAFs <1%. Although a one-stage design may be preferable for identifying multiple rare variants with small to moderate effect sizes, our observations support using the two-stage design as a cost-effective option for next-generation sequencing studies. © 2012 S. Karger AG, Basel.
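The two stages described in the abstract can be illustrated with a small simulation. The following is a minimal sketch under stated assumptions, not the authors' simulation code: the function names, sample size, MAF values, effect sizes, tail fraction, and the choice to sample both tails of the trait distribution are all hypothetical. A SNP is treated as "discovered" in stage I if at least one sequenced individual carries a minor allele, and the stage II score is a simple burden-style sum over discovered SNPs.

```python
import random


def simulate_genotypes(n_individuals, mafs, rng):
    """Draw minor-allele counts (0, 1 or 2) per SNP, assuming Hardy-Weinberg equilibrium."""
    return [
        [(rng.random() < maf) + (rng.random() < maf) for maf in mafs]
        for _ in range(n_individuals)
    ]


def select_extremes(traits, tail_fraction):
    """Indices of individuals in the lower and upper trait tails (assumes tail_fraction < 0.5)."""
    order = sorted(range(len(traits)), key=traits.__getitem__)
    k = max(1, int(len(traits) * tail_fraction))
    return order[:k] + order[-k:]


def discovered_snps(genotypes, selected):
    """Stage I: a SNP is discovered if any sequenced individual carries a minor allele."""
    n_snps = len(genotypes[0])
    return [j for j in range(n_snps) if any(genotypes[i][j] > 0 for i in selected)]


def sum_score(genotype_row, snps):
    """Stage II burden-style score: total minor-allele count over the given SNPs."""
    return sum(genotype_row[j] for j in snps)


# Hypothetical demo: 10 common, 20 low-frequency and 30 rare SNPs.
rng = random.Random(2012)
mafs = [0.2] * 10 + [0.02] * 20 + [0.002] * 30
effects = [0.0] * len(mafs)
for j in range(30, 35):  # assumed: five rare causal SNPs with a large effect
    effects[j] = 2.0

genotypes = simulate_genotypes(2000, mafs, rng)
traits = [
    sum(b * g for b, g in zip(effects, row)) + rng.gauss(0.0, 1.0)
    for row in genotypes
]

stage1 = select_extremes(traits, tail_fraction=0.01)
found = set(discovered_snps(genotypes, stage1))

# Report the discovered fraction within each MAF class named in the abstract.
for label, lo, hi in [("MAF >= 5%", 0.05, 1.0), ("1% <= MAF < 5%", 0.01, 0.05), ("MAF < 1%", 0.0, 0.01)]:
    idx = [j for j, m in enumerate(mafs) if lo <= m < hi]
    print(label, sum(j in found for j in idx) / len(idx))

# Stage II would genotype only the discovered SNPs in everyone and test the scores.
scores = [sum_score(row, sorted(found)) for row in genotypes]
```

Changing `tail_fraction` or the assumed effect sizes shows how stage I selection drives which MAF classes are recovered. The association test itself (the sum test or the sum of squared score test) is left out of this sketch.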



Cite


Kang, G., Lin, D., Hakonarson, H., & Chen, J. (2012). Two-stage extreme phenotype sequencing design for discovering and testing common and rare genetic variants: Efficiency and power. Human Heredity, 73(3), 139–147. https://doi.org/10.1159/000337300

