Mining relations from unstructured content

Ismini Lourentzou; Alfredo Alba; Anni Coden; Anna Lisa Gentile; Daniel Gruhl; Steve Welch

Conference Proceedings

Mining relations from unstructured content

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2018) 10938 LNAI 363-375

DOI: 10.1007/978-3-319-93037-4_29

5Citations

9Readers

Get full text

Abstract

Extracting relations from unstructured Web content is a challenging task and for any new relation a significant effort is required to design, train and tune the extraction models. In this work, we investigate how to obtain suitable results for relation extraction with modest human efforts, relying on a dynamic active learning approach. We propose a method to reliably generate high quality training/test data for relation extraction - for any generic user-demonstrated relation, starting from a few user provided examples and extracting valuable samples from unstructured and unlabeled Web content. To this extent we propose a strategy which learns how to identify the best order to human-annotate data, maximizing learning performance early in the process. We demonstrate the viability of the approach (i) against state of the art datasets for relation extraction as well as (ii) a real case study identifying text expressing a causal relation between a drug and an adverse reaction from user generated Web content.

Cite

CITATION STYLE

APA

Lourentzou, I., Alba, A., Coden, A., Gentile, A. L., Gruhl, D., & Welch, S. (2018). Mining relations from unstructured content. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 10938 LNAI, pp. 363–375). Springer Verlag. https://doi.org/10.1007/978-3-319-93037-4_29

Mining relations from unstructured content

Abstract

Cite

Register to see more suggestions