The number of publicly available gene expression datasets deposited in the Gene Expression Omnibus (GEO) is growing at an accelerating rate. Such datasets hold great value for knowledge discovery, particularly when integrated. Although numerous software platforms and tools have been developed to enable reanalysis and integration of individual GEO datasets, or groups of them, large-scale reuse of those datasets is impeded by minimal requirements both for standardized metadata, at the study and sample levels, and for uniform processing of the data across studies. Here, we review methodologies developed to facilitate the systematic curation and processing of publicly available gene expression datasets from GEO. We identify trends in advanced metadata curation and summarize approaches for reprocessing the data within the entire GEO repository.
Wang, Z., Lachmann, A., & Ma’ayan, A. (2019, February 7). Mining data and metadata from the gene expression omnibus. Biophysical Reviews. Springer Verlag. https://doi.org/10.1007/s12551-018-0490-8