Building an efficient curation workflow for the Arabidopsis literature corpus.

16Citations
Citations of this article
48Readers
Mendeley users who have this article in their library.

This article is free to access.

Abstract

TAIR (The Arabidopsis Information Resource) is the model organism database (MOD) for Arabidopsis thaliana, a model plant with a literature corpus of about 39 000 articles in PubMed, with over 4300 new articles added in 2011. We have developed a literature curation workflow incorporating both automated and manual elements to cope with this flood of new research articles. The current workflow can be divided into two phases: article selection and curation. Structured controlled vocabularies, such as the Gene Ontology and Plant Ontology are used to capture free text information in the literature as succinct ontology-based annotations suitable for the application of computational analysis methods. We also describe our curation platform and the use of text mining tools in our workflow. Database URL: www.arabidopsis.org

Cite

CITATION STYLE

APA

Li, D., Berardini, T. Z., Muller, R. J., & Huala, E. (2012). Building an efficient curation workflow for the Arabidopsis literature corpus. Database : The Journal of Biological Databases and Curation, 2012. https://doi.org/10.1093/database/bas047

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free