Stratified polygenic risk prediction model with application to CAGI bipolar disorder sequencing data

Abstract

Genetic data comprise a wide range of marker types, including common, low-frequency, and rare variants. Multiple genetic markers and their interactions play central roles in the heritability of complex disease. In this study, we propose an algorithm that selects variables within strata defined by genetic architecture and interaction effects, using a dataset-adaptive W-test. The polygenic sets from all strata are then integrated to form a classification rule. The algorithm was applied to the Critical Assessment of Genome Interpretation 4 (CAGI 4) bipolar challenge sequencing data. On an independent test set, prediction accuracy using genetic markers was 60%. We found that epistasis among common genetic variants contributed most substantially to prediction precision. However, the sample size was not large enough to draw conclusions about the lack of predictability of low-frequency variants and their epistasis.
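
The stratification by variant frequency described in the abstract can be illustrated with a minimal sketch. It only groups variants into common, low-frequency, and rare strata by minor allele frequency (MAF); it does not implement the W-test or the classification rule. The cut-offs (0.05 and 0.01), the function names, and the input layout (a dict mapping variant IDs to 0/1/2 alternate-allele counts) are assumptions made for illustration, not values or interfaces taken from the paper.

```python
from collections import defaultdict

# Hypothetical MAF cut-offs; the abstract does not state the thresholds used.
COMMON_MAF = 0.05
LOW_FREQ_MAF = 0.01


def minor_allele_frequency(genotypes):
    """Return the MAF of one variant from 0/1/2 alternate-allele counts."""
    called = [g for g in genotypes if g is not None]  # skip missing calls
    if not called:
        return 0.0
    alt_freq = sum(called) / (2 * len(called))
    # The minor allele is whichever allele is less frequent.
    return min(alt_freq, 1.0 - alt_freq)


def stratify_by_maf(genotype_matrix):
    """Group variant IDs into common / low_frequency / rare strata."""
    strata = defaultdict(list)
    for variant_id, genotypes in genotype_matrix.items():
        maf = minor_allele_frequency(genotypes)
        if maf >= COMMON_MAF:
            strata["common"].append(variant_id)
        elif maf >= LOW_FREQ_MAF:
            strata["low_frequency"].append(variant_id)
        else:
            strata["rare"].append(variant_id)
    return dict(strata)
```

In a pipeline of the kind the abstract outlines, each stratum would then go through its own variable selection step before the selected sets are combined; that step is deliberately left out here because the abstract does not specify it.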

Cite

Wang, M. H., Chang, B., Sun, R., Hu, I., Xia, X., Wu, W. K. K., … Zee, B. C. Y. (2017). Stratified polygenic risk prediction model with application to CAGI bipolar disorder sequencing data. Human Mutation, 38(9), 1235–1239. https://doi.org/10.1002/humu.23229
