Can we survive without labelled data in NLP? Transfer learning for open information extraction

Abstract

Various tasks in natural language processing (NLP) suffer from a lack of labelled training data, for which deep neural networks are notoriously hungry. In this paper, we relied on features learned to generate relation triples in the open information extraction (OIE) task. First, we studied how transferable these features are from one OIE domain to another, such as from a news domain to a bio-medical domain. Second, we analyzed their transferability to a semantically related NLP task, namely relation extraction (RE). We thereby contribute to answering the question: can OIE help us achieve adequate NLP performance without labelled data? In both experiments, inductive transfer learning achieved promising results comparable to traditional learning while relying on only a very small amount of target data. When transferring to the bio-medical OIE domain, we achieved an F-measure of 78.0%, only 1% lower than traditional learning. When transferring to RE with the inductive approach, we scored an F-measure of 67.2%, which was 3.8% lower than training and testing on the same task. Our analysis thus shows that OIE can act as a reliable source task.
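As a rough illustration of the inductive transfer setup the abstract describes, the sketch below (PyTorch) pretends a sentence encoder has already been pretrained on OIE triple generation, attaches a fresh relation-classification head, and fine-tunes on a small labelled RE batch. This is not the authors' actual architecture: all layer choices, dimensions, and the checkpoint path `oie_encoder.pt` are illustrative assumptions.

```python
# Minimal sketch of inductive transfer learning for OIE -> RE.
# Assumption: the encoder weights were pretrained on an OIE corpus;
# only a small labelled RE sample is used for fine-tuning.
import torch
import torch.nn as nn

class SentenceEncoder(nn.Module):
    """Shared encoder; its weights carry the features learned on OIE."""
    def __init__(self, vocab_size=10000, emb_dim=100, hidden=128):
        super().__init__()
        self.emb = nn.Embedding(vocab_size, emb_dim)
        self.lstm = nn.LSTM(emb_dim, hidden, batch_first=True,
                            bidirectional=True)

    def forward(self, token_ids):
        x = self.emb(token_ids)
        _, (h, _) = self.lstm(x)
        # Concatenate the final forward/backward states as a sentence vector.
        return torch.cat([h[0], h[1]], dim=-1)

class RelationClassifier(nn.Module):
    """New task-specific head for relation extraction (RE)."""
    def __init__(self, encoder, num_relations=10):
        super().__init__()
        self.encoder = encoder
        self.head = nn.Linear(2 * 128, num_relations)

    def forward(self, token_ids):
        return self.head(self.encoder(token_ids))

encoder = SentenceEncoder()
# Hypothetical checkpoint produced by pretraining on an OIE corpus:
# encoder.load_state_dict(torch.load("oie_encoder.pt"))

model = RelationClassifier(encoder)
optimizer = torch.optim.Adam(model.parameters(), lr=1e-4)
loss_fn = nn.CrossEntropyLoss()

# Dummy stand-in for a very small labelled RE sample.
tokens = torch.randint(0, 10000, (8, 20))  # 8 sentences, 20 token ids each
labels = torch.randint(0, 10, (8,))        # 8 relation labels

for _ in range(3):  # a few fine-tuning steps on the target task
    optimizer.zero_grad()
    loss = loss_fn(model(tokens), labels)
    loss.backward()
    optimizer.step()
```

The same pattern covers the domain-transfer experiment: keep the pretrained encoder and fine-tune on a small sample from the bio-medical OIE domain instead of a new task head.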

Citation (APA)

Sarhan, I., & Spruit, M. (2020). Can we survive without labelled data in NLP? Transfer learning for open information extraction. Applied Sciences (Switzerland), 10(17). https://doi.org/10.3390/APP10175758
