DAIRRy-BLUP: A high-performance computing approach to genomic prediction

8Citations
Citations of this article
52Readers
Mendeley users who have this article in their library.

This article is free to access.

Abstract

In genomic prediction, common analysis methods rely on a linear mixed-model framework to estimate SNP marker effects and breeding values of animals or plants. Ridge regression-best linear unbiased prediction (RR-BLUP) is based on the assumptions that SNP marker effects are normally distributed, are uncorrelated, and have equal variances. We propose DAIRRy-BLUP, a parallel, Distributed-memory RR-BLUP implementation, based on single-trait observations (y), that uses the Average Information algorithm for restricted maximum-likelihood estimation of the variance components. The goal of DAIRRy-BLUP is to enable the analysis of large-scale data sets to provide more accurate estimates of marker effects and breeding values. A distributed-memory framework is required since the dimensionality of the problem, determined by the number of SNP markers, can become too large to be analyzed by a single computing node. Initial results show that DAIRRy-BLUP enables the analysis of very large-scale data sets (up to 1,000,000 individuals and 360,000 SNPs) and indicate that increasing the number of phenotypic and genotypic records has a more significant effect on the prediction accuracy than increasing the density of SNP arrays. © 2014 by the Genetics Society of America.

Cite

CITATION STYLE

APA

De Coninck, A., Fostier, J., Maenhout, S., & De Baets, B. (2014). DAIRRy-BLUP: A high-performance computing approach to genomic prediction. Genetics, 197(3), 813–822. https://doi.org/10.1534/genetics.114.163683

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free