Towards large-scale sample annotation in gene expression repositories

Erik Pitzer; Ronilda Lacson; Christian Hinske; Jihoon Kim; Pedro A.F. Galante; Lucila Ohno-Machado

Conference ProceedingsOPEN ACCESS

Towards large-scale sample annotation in gene expression repositories

BMC Bioinformatics (2009) 10(SUPPL. 9)

DOI: 10.1186/1471-2105-10-S9-S9

6Citations

21Readers

Abstract

Background: Large repositories of biomedical research data are most useful to translational researchers if their data can be aggregated for efficient queries and analyses. However, inconsistent or non-existent annotations describing important sample details such as name of tissue or cell line, histopathological type, and subject characteristics like demographics, treatment, and survival are seldom present in data repositories, making it difficult to aggregate data. Results: We created a flexible software tool that allows efficient annotation of samples using a controlled vocabulary, and report on its use for the annotation of over 12,500 samples. Conclusion: While the amount of data is very large and seemingly poorly annotated, a lot of information is still within reach. Consistent tool-based re-annotation enables many new possibilities for large scale interpretation and analyses that would otherwise be impossible. © 2009 Pitzer et al; licensee BioMed Central Ltd.

Cite

CITATION STYLE

APA

Pitzer, E., Lacson, R., Hinske, C., Kim, J., Galante, P. A. F., & Ohno-Machado, L. (2009). Towards large-scale sample annotation in gene expression repositories. In BMC Bioinformatics (Vol. 10). BioMed Central. https://doi.org/10.1186/1471-2105-10-S9-S9

Towards large-scale sample annotation in gene expression repositories

Abstract

Cite

Register to see more suggestions