Identification of differentially expressed subnetworks based on multivariate ANOVA

29Citations
Citations of this article
56Readers
Mendeley users who have this article in their library.

Abstract

Background: Since high-throughput protein-protein interaction (PPI) data has recently become available for humans, there has been a growing interest in combining PPI data with other genome-wide data. In particular, the identification of phenotype-related PPI subnetworks using gene expression data has been of great concern. Successful integration for the identification of significant subnetworks requires the use of a search algorithm with a proper scoring method. Here we propose a multivariate analysis of variance (MANOVA)-based scoring method with a greedy search for identifying differentially expressed PPI subnetworks. Results: Given the MANOVA-based scoring method, we performed a greedy search to identify the subnetworks with the maximum scores in the PPI network. Our approach was successfully applied to human microarray datasets. Each identified subnetwork was annotated with the Gene Ontology (GO) term, resulting in the phenotype-related functional pathway or complex. We also compared these results with those of other scoring methods such as t statistic- and mutual information-based scoring methods. The MANOVA-based method produced subnetworks with a larger number of proteins than the other methods. Furthermore, the subnetworks identified by the MANOVA-based method tended to consist of highly correlated proteins. Conclusion: This article proposes a MANOVA-based scoring method to combine PPI data with expression data using a greedy search. This method is recommended for the highly sensitive detection of large subnetworks. © 2009 Hwang and Park; licensee BioMed Central Ltd.

Figures

  • Table 1: GO functional annotations for the ten most significant subnetworks (Serum Response data).
  • Table 2: GO functional annotations for the ten most significant subnetworks (Prostate Cancer Metastasis data).
  • Table 3: The number of significant networks identified by each scoring method.
  • Table 4: The cumulative distribution of the number of proteins in the subnetworks (Serum response data).
  • Table 5: The cumulative distribution of the number of proteins in the subnetworks (Prostate cancer data).
  • Table 6: Sensitivity and specificity of each method in the simulation study (ρ = 0.8, ρ' = 0.4).
  • Table 7: Sensitivity and specificity of each method in the simulation study (ρ = 0.8, ρ' = 0.1).

References Powered by Scopus

Gene set enrichment analysis: A knowledge-based approach for interpreting genome-wide expression profiles

36521Citations
N/AReaders
Get full text

Significance analysis of microarrays applied to the ionizing radiation response

10052Citations
N/AReaders
Get full text

Evidence for dynamically organized modularity in the yeast protein-protein interaction network

1469Citations
N/AReaders
Get full text

Cited by Powered by Scopus

Integrative approaches for finding modular structure in biological networks

412Citations
N/AReaders
Get full text

Stable feature selection for biomarker discovery

248Citations
N/AReaders
Get full text

Protein-protein interaction networks and subnetworks in the biology of disease

82Citations
N/AReaders
Get full text

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Cite

CITATION STYLE

APA

Hwang, T., & Park, T. (2009). Identification of differentially expressed subnetworks based on multivariate ANOVA. BMC Bioinformatics, 10. https://doi.org/10.1186/1471-2105-10-128

Readers over time

‘10‘11‘12‘13‘14‘15‘16‘17‘18‘19‘20‘25036912

Readers' Seniority

Tooltip

PhD / Post grad / Masters / Doc 23

43%

Researcher 22

42%

Professor / Associate Prof. 8

15%

Readers' Discipline

Tooltip

Agricultural and Biological Sciences 29

64%

Computer Science 7

16%

Biochemistry, Genetics and Molecular Bi... 5

11%

Mathematics 4

9%

Save time finding and organizing research with Mendeley

Sign up for free
0