Predictive Modeling for Metabolomics Data

Abstract

In recent years, mass spectrometry (MS)-based metabolomics has been extensively applied to characterize biochemical mechanisms and to study physiological processes and phenotypic changes associated with disease. Metabolomics has also been important for identifying biomarkers suitable for clinical diagnosis. In this chapter, we review supervised learning algorithms for predictive modeling, such as random forest (RF), support vector machine (SVM), and partial least squares-discriminant analysis (PLS-DA). We also review feature selection methods for identifying the best combination of metabolites for an accurate predictive model. We conclude with best practices for reproducibility, including internal and external replication, reporting metrics to assess performance, and guidelines for avoiding overfitting and dealing with imbalanced classes. An analysis of an example dataset illustrates the use of different machine learning methods and performance metrics.

Cite

Ghosh, T., Zhang, W., Ghosh, D., & Kechris, K. (2020). Predictive Modeling for Metabolomics Data. In Methods in Molecular Biology (Vol. 2104, pp. 313–336). Humana Press Inc. https://doi.org/10.1007/978-1-0716-0239-3_16
