DAIRRy-BLUP: A high-performance computing approach to genomic prediction

Arne De Coninck; Jan Fostier; Steven Maenhout; Bernard De Baets

Journal ArticleOPEN ACCESS

DAIRRy-BLUP: A high-performance computing approach to genomic prediction

Genetics (2014) 197(3) 813-822

DOI: 10.1534/genetics.114.163683

8Citations

52Readers

Abstract

In genomic prediction, common analysis methods rely on a linear mixed-model framework to estimate SNP marker effects and breeding values of animals or plants. Ridge regression-best linear unbiased prediction (RR-BLUP) is based on the assumptions that SNP marker effects are normally distributed, are uncorrelated, and have equal variances. We propose DAIRRy-BLUP, a parallel, Distributed-memory RR-BLUP implementation, based on single-trait observations (y), that uses the Average Information algorithm for restricted maximum-likelihood estimation of the variance components. The goal of DAIRRy-BLUP is to enable the analysis of large-scale data sets to provide more accurate estimates of marker effects and breeding values. A distributed-memory framework is required since the dimensionality of the problem, determined by the number of SNP markers, can become too large to be analyzed by a single computing node. Initial results show that DAIRRy-BLUP enables the analysis of very large-scale data sets (up to 1,000,000 individuals and 360,000 SNPs) and indicate that increasing the number of phenotypic and genotypic records has a more significant effect on the prediction accuracy than increasing the density of SNP arrays. © 2014 by the Genetics Society of America.

Cite

CITATION STYLE

APA

De Coninck, A., Fostier, J., Maenhout, S., & De Baets, B. (2014). DAIRRy-BLUP: A high-performance computing approach to genomic prediction. Genetics, 197(3), 813–822. https://doi.org/10.1534/genetics.114.163683

DAIRRy-BLUP: A high-performance computing approach to genomic prediction

Abstract

Cite

Register to see more suggestions