Partial least squares discriminant analysis and Bayesian networks for metabolomic prediction of childhood asthma

Abstract

To explore novel methods for the analysis of metabolomics data, we compared the ability of Partial Least Squares Discriminant Analysis (PLS-DA) and Bayesian networks (BN) to build predictive plasma metabolite models of age-three asthma status in 411 three-year-olds (n = 59 cases and 352 controls) from the Vitamin D Antenatal Asthma Reduction Trial (VDAART). The standard PLS-DA approach predicted age-three asthma with an Area Under the Curve Convex Hull (AUCCH) of 81%. However, a permutation test indicated the possibility of overfitting. In contrast, a predictive Bayesian network including 42 metabolites had a significantly higher AUCCH of 92.1% (p for difference < 0.001), with no evidence that this accuracy was due to overfitting. Both models provided biologically informative insights into asthma; in particular, they suggested a role for dysregulated arginine metabolism and identified several exogenous metabolites that deserve further investigation as potential causative agents. Because the BN model was both more accurate than the PLS-DA model and less prone to overfitting, it may represent a viable alternative to typical analytical approaches for the investigation of metabolomics data.
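The AUCCH metric reported above can be illustrated with a minimal sketch. The abstract does not describe how the authors computed it, so the following is only a generic construction under stated assumptions: build the empirical ROC curve from predicted scores, take its upper convex hull, and integrate with the trapezoid rule. All function names (`roc_points`, `upper_hull`, `auc`, `aucch`) and the toy labels and scores are hypothetical and are not taken from the study.

```python
from typing import List, Sequence, Tuple

Point = Tuple[float, float]  # (false positive rate, true positive rate)


def roc_points(labels: Sequence[int], scores: Sequence[float]) -> List[Point]:
    """Empirical ROC points from binary labels (1 = case, 0 = control)."""
    n_pos = sum(1 for y in labels if y == 1)
    n_neg = len(labels) - n_pos
    if n_pos == 0 or n_neg == 0:
        raise ValueError("need at least one case and one control")
    # Sweep the threshold from the highest score down; ties move together.
    order = sorted(range(len(scores)), key=lambda i: -scores[i])
    points: List[Point] = [(0.0, 0.0)]
    tp = fp = 0
    i = 0
    while i < len(order):
        j = i
        while j < len(order) and scores[order[j]] == scores[order[i]]:
            if labels[order[j]] == 1:
                tp += 1
            else:
                fp += 1
            j += 1
        points.append((fp / n_neg, tp / n_pos))
        i = j
    return points


def upper_hull(points: Sequence[Point]) -> List[Point]:
    """Upper convex hull of the ROC points, ordered left to right."""
    hull: List[Point] = []
    for p in sorted(set(points)):
        # Drop the previous point while it lies on or below the chord to p.
        while len(hull) >= 2:
            (x1, y1), (x2, y2) = hull[-2], hull[-1]
            cross = (x2 - x1) * (p[1] - y1) - (y2 - y1) * (p[0] - x1)
            if cross >= 0:
                hull.pop()
            else:
                break
        hull.append(p)
    return hull


def trapezoid_area(points: Sequence[Point]) -> float:
    """Area under a piecewise-linear curve given by ordered points."""
    return sum((x2 - x1) * (y1 + y2) / 2
               for (x1, y1), (x2, y2) in zip(points, points[1:]))


def auc(labels: Sequence[int], scores: Sequence[float]) -> float:
    """Plain area under the empirical ROC curve."""
    return trapezoid_area(roc_points(labels, scores))


def aucch(labels: Sequence[int], scores: Sequence[float]) -> float:
    """Area under the convex hull of the empirical ROC curve."""
    return trapezoid_area(upper_hull(roc_points(labels, scores)))


# Toy data for illustration only (not from the study).
toy_labels = [1, 1, 0, 1, 0, 0]
toy_scores = [0.9, 0.8, 0.7, 0.6, 0.4, 0.2]
print(f"AUC   = {auc(toy_labels, toy_scores):.3f}")
print(f"AUCCH = {aucch(toy_labels, toy_scores):.3f}")
```

Because the hull lies on or above the empirical ROC curve, the AUCCH is never smaller than the plain AUC computed from the same scores, which is worth bearing in mind when comparing the figures above with AUC values reported elsewhere.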

Kelly, R. S., McGeachie, M. J., Lee-Sarwar, K. A., Kachroo, P., Chu, S. H., Virkud, Y. V., … Lasky-Su, J. (2018). Partial least squares discriminant analysis and Bayesian networks for metabolomic prediction of childhood asthma. Metabolites, 8(4). https://doi.org/10.3390/metabo8040068
