Abstract
Background: A major goal of metagenomic studies is to identify specific functional adaptations of microbial communities to their habitats. The functional profile of a sample, and the abundance of each function, can be estimated by mapping metagenomic sequences to the global metabolic network, which consists of thousands of molecular reactions. Here we describe statistical methods that identify differentially abundant subnetworks between metagenomic samples.
Methods: First, we introduce a scoring function for an arbitrary subnetwork and find the maximum-weight subnetwork in the global network by greedy search. We then compute two p values, p_abund and p_struct, using nonparametric approaches to answer two statistical questions: (i) Is this subnetwork differentially abundant? (ii) What is the probability of finding a subnetwork this good by chance? Significant metabolic subnetworks are detected on the basis of these two p values. (Figure presented.)
Results: Simulated datasets. We randomly chose a metabolic subnetwork to be differentially abundant and then simulated the abundance values from Gaussian distributions. Figure 1 shows the performance of different methods in discovering the significant subnetwork. Real metagenomic datasets. We analyzed gut microbiomes from obese and lean subjects [1] and from infant and adult subjects (Kurokawa et al., 2007), and found several interesting pathways. For example, five pathways in fatty acid biosynthesis are enriched in obese subjects, which confirms the result of a previous study that obese subjects have an increased capacity for dietary energy harvest. In addition, four and three homocysteine pathways are enriched in obese and infant subjects, respectively (Figure 2), indicating that they are highly correlated with the homocysteine levels in blood serum.
Conclusions: We have developed statistical methods to find differentially abundant metabolic pathways in metagenomics. Their performance is better than that of previous approaches. Results from real metagenomic datasets confirm previous observations and also provide several new biological insights.
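The greedy search and the chance-based p value described under Methods can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration, since the abstract does not specify these details: the scoring function (sum of per-reaction scores divided by the square root of the subnetwork size), the stopping rule, the permutation scheme (shuffling reaction scores over a fixed network as a stand-in for p_struct), and all function names are hypothetical and are not taken from the authors' implementation.

```python
import math
import random


def subnetwork_score(nodes, score):
    """Assumed scoring function: sum of node scores scaled by sqrt(size)."""
    return sum(score[n] for n in nodes) / math.sqrt(len(nodes))


def greedy_subnetwork(graph, score, seed):
    """Grow a connected subnetwork from `seed`, adding the neighbouring
    reaction that raises the score most, until no neighbour improves it."""
    sub = {seed}
    best = subnetwork_score(sub, score)
    while True:
        frontier = {m for n in sub for m in graph[n]} - sub
        if not frontier:
            break
        cand = max(frontier, key=lambda m: subnetwork_score(sub | {m}, score))
        cand_score = subnetwork_score(sub | {cand}, score)
        if cand_score <= best:
            break
        sub.add(cand)
        best = cand_score
    return sub, best


def best_subnetwork(graph, score):
    """Run the greedy search from every reaction and keep the top result."""
    return max((greedy_subnetwork(graph, score, s) for s in graph),
               key=lambda result: result[1])


def permutation_p_value(graph, score, observed, n_perm=200, seed=0):
    """Assumed stand-in for p_struct: how often does a network with
    shuffled reaction scores yield a subnetwork at least as good?"""
    rng = random.Random(seed)
    nodes = list(graph)
    values = [score[n] for n in nodes]
    hits = 0
    for _ in range(n_perm):
        rng.shuffle(values)
        shuffled = dict(zip(nodes, values))
        if best_subnetwork(graph, shuffled)[1] >= observed:
            hits += 1
    # Add-one correction keeps the estimate strictly above zero.
    return (hits + 1) / (n_perm + 1)


# Toy network: a path of five reactions with hypothetical scores.
toy_graph = {"a": ["b"], "b": ["a", "c"], "c": ["b", "d"],
             "d": ["c", "e"], "e": ["d"]}
toy_score = {"a": 3.0, "b": 3.0, "c": 0.1, "d": -1.0, "e": -1.0}

toy_sub, toy_best = best_subnetwork(toy_graph, toy_score)
toy_p = permutation_p_value(toy_graph, toy_score, toy_best)
```

On the toy network, the search settles on the two adjacent high-scoring reactions. An analogous permutation of sample labels, rather than reaction scores, would be one way to approximate a p value for differential abundance, again under the same assumptions.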
Cite
Liu, B., & Pop, M. (2010). Statistical methods for comparing the abundances of metabolic pathways in metagenomics. Genome Biology, 11(S1). https://doi.org/10.1186/gb-2010-11-s1-o7