Mining data and metadata from the gene expression omnibus

56Citations
Citations of this article
119Readers
Mendeley users who have this article in their library.

This article is free to access.

Abstract

Publicly available gene expression datasets deposited in the Gene Expression Omnibus (GEO) are growing at an accelerating rate. Such datasets hold great value for knowledge discovery, particularly when integrated. Although numerous software platforms and tools have been developed to enable reanalysis and integration of individual, or groups, of GEO datasets, large-scale reuse of those datasets is impeded by minimal requirements for standardized metadata both at the study and sample levels as well as uniform processing of the data across studies. Here, we review methodologies developed to facilitate the systematic curation and processing of publicly available gene expression datasets from GEO. We identify trends for advanced metadata curation and summarize approaches for reprocessing the data within the entire GEO repository.

Cite

CITATION STYLE

APA

Wang, Z., Lachmann, A., & Ma’ayan, A. (2019, February 7). Mining data and metadata from the gene expression omnibus. Biophysical Reviews. Springer Verlag. https://doi.org/10.1007/s12551-018-0490-8

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free