Query selection methods for automated corpora construction with a use case in food-drug interactions

3Citations
Citations of this article
68Readers
Mendeley users who have this article in their library.

Abstract

In this paper, we address the problem of automatically constructing a relevant corpus of scientific articles about food-drug interactions. There is a growing number of scientific publications that describe food-drug interactions but currently building a high-coverage corpus that can be used for information extraction purposes is not trivial. We investigate several methods for automating the query selection process using an expert-curated corpus of food-drug interactions. Our experiments show that index terms features along with a decision tree classifier are the best approach for this task and that feature selection approaches and in particular gain ratio outperform frequency-based methods for query selection.

Cite

CITATION STYLE

APA

Bordea, G., Randriatsitohaina, T., Grabar, N., Mougin, F., & Hamon, T. (2019). Query selection methods for automated corpora construction with a use case in food-drug interactions. In BioNLP 2019 - SIGBioMed Workshop on Biomedical Natural Language Processing, Proceedings of the 18th BioNLP Workshop and Shared Task (pp. 115–124). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/w19-5013

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free