Challenges of an Annotation Task for Open Information Extraction in Portuguese

Rafael Glauber; Leandro Souza de Oliveira; Cleiton Fernando Lima Sena; Daniela Barreiro Claro; Marlo Souza

Conference Proceedings

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2018) 11122 LNAI 66-76

DOI: 10.1007/978-3-319-99722-3_7

Abstract

Open information extraction (Open IE) is the task of extracting facts from plain text without limiting the analysis to a predefined set of relationships. Although a significant number of studies have focused on this problem in recent years, linguistic resources for languages other than English remain scarce. An annotated corpus is an essential resource for evaluating Open IE methods. In this work, we present the challenges involved in creating a gold-standard corpus for the Open IE task in Portuguese. We describe our methodology, an annotation tool that supports the task, and the results of performing this annotation on a small validation corpus.
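
To illustrate why an annotated corpus matters for evaluation, the following minimal Python sketch scores extracted facts against gold annotations. It is not the authors' tool or methodology: the `Triple` type, the exact-match criterion after lowercasing, and the example sentence facts are all assumptions made for illustration only.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Triple:
    """A hypothetical (argument 1, relation, argument 2) fact."""
    arg1: str
    rel: str
    arg2: str


def normalize(t: Triple) -> Triple:
    # Assumption: lowercase and collapse whitespace before comparing.
    return Triple(*(" ".join(s.lower().split()) for s in (t.arg1, t.rel, t.arg2)))


def precision_recall(predicted, gold):
    """Exact-match precision and recall of predicted triples against gold."""
    pred = {normalize(t) for t in predicted}
    ref = {normalize(t) for t in gold}
    matched = len(pred & ref)
    precision = matched / len(pred) if pred else 0.0
    recall = matched / len(ref) if ref else 0.0
    return precision, recall


# Illustrative gold annotations for a made-up Portuguese sentence.
gold = [
    Triple("Salvador", "é a capital da", "Bahia"),
    Triple("Salvador", "foi fundada em", "1549"),
]
# Illustrative system output: one correct fact, one incorrect fact.
predicted = [
    Triple("salvador", "é a capital da", "Bahia"),
    Triple("Salvador", "é", "a capital"),
]

p, r = precision_recall(predicted, gold)
print(f"precision={p:.2f} recall={r:.2f}")
```

Exact matching is the simplest possible criterion; a real evaluation might instead accept partial overlaps between argument spans, which is one reason annotation guidelines for such a corpus are hard to define.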

Cite

Glauber, R., de Oliveira, L. S., Sena, C. F. L., Claro, D. B., & Souza, M. (2018). Challenges of an Annotation Task for Open Information Extraction in Portuguese. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 11122 LNAI, pp. 66–76). Springer Verlag. https://doi.org/10.1007/978-3-319-99722-3_7
