Using bayesian multilevel whole genome regression models for partial pooling of training sets in genomic prediction

Frank Technow; L. Radu Totir

Journal ArticleOPEN ACCESS

Using bayesian multilevel whole genome regression models for partial pooling of training sets in genomic prediction

G3: Genes, Genomes, Genetics (2015) 5(8) 1603-1612

DOI: 10.1534/g3.115.019299

14Citations

60Readers

Abstract

Training set size is an important determinant of genomic prediction accuracy. Plant breeding programs are characterized by a high degree of structuring, particularly into populations. This hampers the establishment of large training sets for each population. Pooling populations increases training set size but ignores unique genetic characteristics of each. A possible solution is partial pooling with multilevel models, which allows estimating population-specific marker effects while still leveraging information across populations. We developed a Bayesian multilevel whole-genome regression model and compared its performance with that of the popular BayesA model applied to each population separately (no pooling) and to the joined data set (complete pooling). As an example, we analyzed a wide array of traits from the nested association mapping maize population. There we show that for small population sizes (e.g.,

Author supplied keywords

Cite

CITATION STYLE

APA

Technow, F., & Radu Totir, L. (2015). Using bayesian multilevel whole genome regression models for partial pooling of training sets in genomic prediction. G3: Genes, Genomes, Genetics, 5(8), 1603–1612. https://doi.org/10.1534/g3.115.019299

Using bayesian multilevel whole genome regression models for partial pooling of training sets in genomic prediction

Abstract

Author supplied keywords

Cite

Register to see more suggestions