Finding consistent disease subnetworks across microarray datasets

Donny Soh; Difeng Dong; Yike Guo; Limsoon Wong

Journal ArticleOPEN ACCESS

Finding consistent disease subnetworks across microarray datasets

BMC Bioinformatics (2011) 12(SUPPL. 13)

DOI: 10.1186/1471-2105-12-S13-S15

24Citations

39Readers

Abstract

Background: While contemporary methods of microarray analysis are excellent tools for studying individual microarray datasets, they have a tendency to produce different results from different datasets of the same disease. We aim to solve this reproducibility problem by introducing a technique (SNet). SNet provides both quantitative and descriptive analysis of microarray datasets by identifying specific connected portions of pathways that are significant. We term such portions within pathways as " subnetworks" .Results: We tested SNet on independent datasets of several diseases, including childhood ALL, DMD and lung cancer. For each of these diseases, we obtained two independent microarray datasets produced by distinct labs on distinct platforms. In each case, our technique consistently produced almost the same list of significant nontrivial subnetworks from two independent sets of microarray data. The gene-level agreement of these significant subnetworks was between 51.18% to 93.01%. In contrast, when the same pairs of microarray datasets were analysed using GSEA, t-test and SAM, this percentage fell between 2.38% to 28.90% for GSEA, 49.60% tp 73.01% for t-test, and 49.96% to 81.25% for SAM. Furthermore, the genes selected using these existing methods did not form subnetworks of substantial size. Thus it is more probable that the subnetworks selected by our technique can provide the researcher with more descriptive information on the portions of the pathway actually affected by the disease.Conclusions: These results clearly demonstrate that our technique generates significant subnetworks and genes that are more consistent and reproducible across datasets compared to the other popular methods available (GSEA, t-test and SAM). The large size of subnetworks which we generate indicates that they are generally more biologically significant (less likely to be spurious). In addition, we have chosen two sample subnetworks and validated them with references from biological literature. This shows that our algorithm is capable of generating descriptive biologically conclusions. © 2011 Soh et al; licensee BioMed Central Ltd.

References Powered by Scopus

View more at Scopus

Cited by Powered by Scopus

View more at Scopus

Cite

CITATION STYLE

APA

Soh, D., Dong, D., Guo, Y., & Wong, L. (2011). Finding consistent disease subnetworks across microarray datasets. BMC Bioinformatics, 12(SUPPL. 13). https://doi.org/10.1186/1471-2105-12-S13-S15

Readers' Seniority

PhD / Post grad / Masters / Doc 17

57%

Researcher 9

30%

Professor / Associate Prof. 4

13%

Readers' Discipline

Agricultural and Biological Sciences 18

60%

Computer Science 8

27%

Medicine and Dentistry 3

10%

Chemistry 1

Finding consistent disease subnetworks across microarray datasets

Abstract

References Powered by Scopus

Gene set enrichment analysis: A knowledge-based approach for interpreting genome-wide expression profiles

Significance analysis of microarrays applied to the ionizing radiation response

Molecular classification of cancer: Class discovery and class prediction by gene expression monitoring

Cited by Powered by Scopus

Stability of feature selection algorithm: A review

How advancement in biological network analysis methods empowers proteomics

Finding consistent disease subnetworks using PFSNet

Register to see more suggestions

Cite

Readers' Seniority

Readers' Discipline