Open information extraction (Open IE) is the task of extracting facts from plain text without limiting the analysis to a predefined set of relationships. Although a significant number of studies have addressed this problem in recent years, linguistic resources for languages other than English remain scarce. An annotated corpus, in particular, is an essential resource for evaluating Open IE methods. In this work, we present the challenges involved in creating a golden set corpus for the Open IE task in Portuguese. We describe our methodology, an annotation tool that supports the task, and the results of performing this annotation on a small validation corpus.
CITATION STYLE
Glauber, R., de Oliveira, L. S., Sena, C. F. L., Claro, D. B., & Souza, M. (2018). Challenges of an Annotation Task for Open Information Extraction in Portuguese. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 11122 LNAI, pp. 66–76). Springer Verlag. https://doi.org/10.1007/978-3-319-99722-3_7