Reproducibility of finding enriched gene sets in biological data analysis


Abstract

The introduction of high-throughput measurement methods into molecular biology spurred the development of algorithms that search for disturbances in complex signalling systems, such as pathways or gene ontologies. Many new solutions have appeared in recent years, but the results obtained with these techniques are ambiguous. In this work, five algorithms for pathway enrichment analysis were compared using six microarray datasets covering cases of the same disease. The number of enriched pathways at different significance levels and the false positive rate of detecting enriched pathways were estimated, and the reproducibility of the results between datasets was checked. The best performance was obtained for the PLAGE method. However, given the biological knowledge about the analyzed disease condition, many of the findings may be false positives. Among the other methods, the GSVA algorithm gave the most reproducible results across the tested datasets, which was also validated in biological repositories. Similarly good outcomes were given by the GSEA method. ORA and PADOG gave poor sensitivity and reproducibility, which stands in contrast to previous research.
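
As an illustration of what checking reproducibility between datasets can look like, the sketch below selects the pathways called enriched at a chosen significance level in each dataset and averages the pairwise Jaccard overlap of those sets. This is an assumption made for illustration only: the abstract does not state which reproducibility measure the authors used, and the function names, pathway names, and p-values are all hypothetical.

```python
from itertools import combinations


def enriched(pvalues, alpha):
    """Return the set of pathway names whose p-value is below alpha."""
    return {name for name, p in pvalues.items() if p < alpha}


def jaccard(a, b):
    """Jaccard index of two sets; defined here as 1.0 when both are empty."""
    union = a | b
    return len(a & b) / len(union) if union else 1.0


def mean_pairwise_jaccard(results, alpha):
    """Average Jaccard overlap of enriched sets over all dataset pairs.

    `results` holds one {pathway: p-value} dict per dataset and must
    contain at least two datasets.
    """
    sets = [enriched(r, alpha) for r in results]
    pairs = list(combinations(sets, 2))
    return sum(jaccard(a, b) for a, b in pairs) / len(pairs)


# Hypothetical p-values from one enrichment method on three datasets.
toy = [
    {"pathway_A": 0.001, "pathway_B": 0.04, "pathway_C": 0.30},
    {"pathway_A": 0.002, "pathway_B": 0.20, "pathway_C": 0.03},
    {"pathway_A": 0.010, "pathway_B": 0.03, "pathway_C": 0.60},
]

for alpha in (0.05, 0.005):
    counts = [len(enriched(r, alpha)) for r in toy]
    score = mean_pairwise_jaccard(toy, alpha)
    print(f"alpha={alpha}: enriched per dataset={counts}, overlap={score:.3f}")
```

A score near 1 means the method reports nearly the same pathways in every dataset at that threshold, while a score near 0 means the enriched sets barely overlap. Overlap alone does not separate true findings from reproducible false positives, which is the caveat the abstract raises for PLAGE.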

Cite

Zyla, J., Marczyk, M., & Polanska, J. (2017). Reproducibility of finding enriched gene sets in biological data analysis. In Advances in Intelligent Systems and Computing (Vol. 616, pp. 146–154). Springer Verlag. https://doi.org/10.1007/978-3-319-60816-7_18
