Multi-domain named entity recognition with genre-aware and agnostic inference

34Citations
Citations of this article
151Readers
Mendeley users who have this article in their library.

Abstract

Named entity recognition is a key component of many text processing pipelines and it is thus essential for this component to be robust to different types of input. However, domain transfer of NER models with data from multiple genres has not been widely studied. To this end, we conduct NER experiments in three predictive setups on data from: a) multiple domains; b) multiple domains where the genre label is unknown at inference time; c) domains not encountered in training. We introduce a new architecture tailored to this task by using shared and private domain parameters and multi-task learning. This consistently outperforms all other baseline and competitive methods on all three experimental setups, with differences ranging between +1.95 to +3.11 average F1 across multiple genres when compared to standard approaches. These results illustrate the challenges that need to be taken into account when building real-world NLP applications that are robust to various types of text and the methods that can help, at least partially, alleviate these issues.

Cite

CITATION STYLE

APA

Wang, J., Kulkarni, M., & Preotiuc-Pietro, D. (2020). Multi-domain named entity recognition with genre-aware and agnostic inference. In Proceedings of the Annual Meeting of the Association for Computational Linguistics (pp. 8476–8488). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2020.acl-main.750

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free