Query selection methods for automated corpora construction with a use case in food-drug interactions

Georgeta Bordea; Tsanta Randriatsitohaina; Natalia Grabar; Fleur Mougin; Thierry Hamon

Conference ProceedingsOPEN ACCESS

Query selection methods for automated corpora construction with a use case in food-drug interactions

BioNLP 2019 - SIGBioMed Workshop on Biomedical Natural Language Processing, Proceedings of the 18th BioNLP Workshop and Shared Task (2019) 115-124

DOI: 10.18653/v1/w19-5013

3Citations

74Readers

Abstract

In this paper, we address the problem of automatically constructing a relevant corpus of scientific articles about food-drug interactions. There is a growing number of scientific publications that describe food-drug interactions but currently building a high-coverage corpus that can be used for information extraction purposes is not trivial. We investigate several methods for automating the query selection process using an expert-curated corpus of food-drug interactions. Our experiments show that index terms features along with a decision tree classifier are the best approach for this task and that feature selection approaches and in particular gain ratio outperform frequency-based methods for query selection.

Cite

CITATION STYLE

APA

Bordea, G., Randriatsitohaina, T., Grabar, N., Mougin, F., & Hamon, T. (2019). Query selection methods for automated corpora construction with a use case in food-drug interactions. In BioNLP 2019 - SIGBioMed Workshop on Biomedical Natural Language Processing, Proceedings of the 18th BioNLP Workshop and Shared Task (pp. 115–124). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/w19-5013

Query selection methods for automated corpora construction with a use case in food-drug interactions

Abstract

Cite

Register to see more suggestions