Abstract
As we enter the ``Post Genome-wide Association Study'' era, there is a widespread desire to move towards sequencing study samples, often after an initial genome-wide association analysis that has found some loci associated with a phenotype of interest in a given population. The rationale is clear: the initial study is highly unlikely to have found all, or even most, associated loci. Indeed, in terms of heritable variation, for virtually every phenotype, much remains to be explained. The details of how such a sequence-based study should be designed are unclear. In this article we review the rationale for such a sequence-based approach, such as the desire to find new (likely rare) variants that are also associated with the phenotype, and the wish to explain the ``missing heritability''. We go on to review simulation-based and data-based ways of developing guidelines for the design of such next-generation sequence-based studies. Finally, we discuss the desirability of using biological prior information to help lessen the problems caused by ``the curse of dimensionality'' in this context.
Cite
CITATION STYLE
Marjoram, P., & Thomas, D. C. (2014). Next-Generation Sequencing Studies: Optimal Design and Analysis, Missing Heritability and Rare Variants. Current Epidemiology Reports, 1(4), 213–219. https://doi.org/10.1007/s40471-014-0022-4
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.