In this paper, we propose an approach for identifying curatable articles from a large document set. This system considers three parts of an article (title and abstract, MeSH terms, and captions) as its three individual representations and utilizes two domain-specific resources (UMLS and a tumor name list) to reveal the deep knowledge contained in the article. An SVM classifier is trained and cross-validation is employed to find the best combination of representations. The experimental results show overall high performance.
Mendeley helps you to discover research relevant for your work.
CITATION STYLE
Hou, W. J., Lee, C., & Chen, H. H. (2006). Classifying biological full-text articles for multi-database curation. In EACL 2006 - 11th Conference of the European Chapter of the Association for Computational Linguistics, Proceedings of the Conference (pp. 159–162). Association for Computational Linguistics (ACL). https://doi.org/10.3115/1608974.1608997