Integrating genomic and infrared spectral data improves the prediction of milk protein composition in dairy cattle

Abstract

Background: Over the past decade, Fourier transform infrared (FTIR) spectroscopy has been used to predict novel milk protein phenotypes. Genomic data might help predict these phenotypes when integrated with milk FTIR spectra. The objective of this study was to investigate prediction accuracy for milk protein phenotypes when heterogeneous on-farm, genomic, and pedigree data were integrated with the spectra. To this end, we used the records of 966 Italian Brown Swiss cows with milk FTIR spectra, on-farm information, medium-density genetic markers, and pedigree data. True and total whey protein, five casein, and two whey protein traits were analyzed. Multiple kernel learning constructed from spectral and genomic (pedigree) relationship matrices and multilayer BayesB assigning separate priors for FTIR and markers were benchmarked against a baseline partial least squares (PLS) regression. Seven combinations of covariates were considered, and their predictive abilities were evaluated by repeated random sub-sampling and herd cross-validations (CV).

Results: Addition of on-farm effects such as herd, days in milk, and parity to the spectral data improved predictions compared to those obtained using the spectra alone. Integrating genomics and/or the top three markers with a large effect further enhanced the predictions. Pedigree data also improved prediction, but to a lesser extent than genomic data. Multiple kernel learning and multilayer BayesB increased predictive performance, whereas PLS did not. Overall, multilayer BayesB provided better predictions than multiple kernel learning, and lower prediction performance was observed in herd CV compared to repeated random sub-sampling CV.

Conclusions: Integration of genomic information with milk FTIR spectra can enhance milk protein trait predictions by 25% and 7% on average for repeated random sub-sampling and herd CV, respectively. Multiple kernel learning and multilayer BayesB outperformed PLS when used to integrate heterogeneous data for phenotypic predictions.
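To make the abstract's evaluation design more concrete, the following is a minimal sketch, not the authors' implementation. It builds one linear kernel from spectra and one from markers, combines them with fixed equal weights in a kernel ridge regression (a simplification swapped in here; the multiple kernel learning described in the abstract is not specified in enough detail to reproduce), and compares repeated random sub-sampling CV with herd CV. All data are simulated, and every function name, the ridge penalty `lam`, the kernel weights, and the dimensions are assumptions chosen for illustration.

```python
import numpy as np

def linear_kernel(M):
    """Relationship-style kernel: standardize columns, then M M' / n_columns."""
    M = (M - M.mean(axis=0)) / (M.std(axis=0) + 1e-12)
    return M @ M.T / M.shape[1]

def kernel_ridge_predict(K, y, train, test, lam=1.0):
    """Fit kernel ridge on the training rows and predict the test rows."""
    mu = y[train].mean()
    K_tt = K[np.ix_(train, train)]
    alpha = np.linalg.solve(K_tt + lam * np.eye(len(train)), y[train] - mu)
    return mu + K[np.ix_(test, train)] @ alpha

def random_subsampling_splits(n, n_repeats, test_frac, rng):
    """Repeated random sub-sampling: a fresh random train/test split each time."""
    n_test = int(round(test_frac * n))
    for _ in range(n_repeats):
        perm = rng.permutation(n)
        yield perm[n_test:], perm[:n_test]

def herd_splits(herds):
    """Herd CV: each herd is held out in turn, so no herd is in both sets."""
    for h in np.unique(herds):
        yield np.flatnonzero(herds != h), np.flatnonzero(herds == h)

def cv_correlation(K, y, splits, lam=1.0):
    """Mean correlation between predicted and observed values over the splits."""
    cors = [
        np.corrcoef(kernel_ridge_predict(K, y, tr, te, lam), y[te])[0, 1]
        for tr, te in splits
    ]
    return float(np.mean(cors))

# Simulated stand-ins for the real data (assumed sizes, not those of the study).
rng = np.random.default_rng(0)
n, n_markers, n_wavenumbers, n_herds = 120, 300, 80, 6
markers = rng.integers(0, 3, size=(n, n_markers)).astype(float)  # 0/1/2 genotypes
spectra = rng.normal(size=(n, n_wavenumbers))
herds = rng.integers(0, n_herds, size=n)
y = markers[:, :10].sum(axis=1) + spectra[:, :5].sum(axis=1) + rng.normal(size=n)

K_spec = linear_kernel(spectra)
K_gen = linear_kernel(markers)
K_both = 0.5 * K_spec + 0.5 * K_gen  # fixed equal weights: an assumption

results = {}
for name, K in [("spectra", K_spec), ("spectra+genomic", K_both)]:
    results[name] = {
        "random": cv_correlation(K, y, random_subsampling_splits(n, 10, 0.2, rng)),
        "herd": cv_correlation(K, y, herd_splits(herds)),
    }
    print(name, results[name])
```

In a fuller treatment the kernel weights would be estimated from the data, for example as variance components, rather than fixed at 0.5 each. The herd split keeps all cows from one herd together in the test set, so it checks prediction for a herd the model has not seen.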

Cite

Baba, T., Pegolo, S., Mota, L. F. M., Peñagaricano, F., Bittante, G., Cecchinato, A., & Morota, G. (2021). Integrating genomic and infrared spectral data improves the prediction of milk protein composition in dairy cattle. Genetics Selection Evolution, 53(1). https://doi.org/10.1186/s12711-021-00620-7
