Genetic algorithms for simultaneous variable and sample selection in metabonomics

54Citations
Citations of this article
99Readers
Mendeley users who have this article in their library.

This article is free to access.

Abstract

Motivation: Metabolic profiles derived from high resolution 1 H-NMR data are complex, therefore statistical and machine learning approaches are vital for extracting useful information and biological insights. Focused modelling on targeted subsets of metabolites and samples can improve the predictive ability of models, and techniques such as genetic algorithms (GAs) have a proven utility in feature selection problems. The Consortium for Metabonomic Toxicology (COMET) obtained temporal NMR spectra of urine from rats treated with model toxins and stressors. Here, we develop a GA approach which simultaneously selects sets of samples and spectral regions from the COMET database to build robust, predictive classifiers of liver and kidney toxicity. Results: The results indicate that using simultaneous sample and variable selection improved performance by over 9% compared with either method alone. Simultaneous selection also halved computation time. Successful classifiers repeatedly selected particular variables indicating that this approach can aid defining biomarkers of toxicity. Novel visualizations of the results from multiple computations were developed to aid the interpretability of which samples and variables were frequently selected. This method provides an efficient way to determine the most discriminatory variables and samples for any post-genomic dataset. © The Author 2008. Published by Oxford University Press. All rights reserved.

Cite

CITATION STYLE

APA

Cavill, R., Keun, H. C., Holmes, E., Lindon, J. C., Nicholson, J. K., & Ebbels, T. M. D. (2009). Genetic algorithms for simultaneous variable and sample selection in metabonomics. Bioinformatics, 25(1), 112–118. https://doi.org/10.1093/bioinformatics/btn586

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free