Multi-domain named entity recognition with genre-aware and agnostic inference

Jing Wang; Mayank Kulkarni; Daniel Preotiuc-Pietro

Conference ProceedingsOPEN ACCESS

Multi-domain named entity recognition with genre-aware and agnostic inference

Proceedings of the Annual Meeting of the Association for Computational Linguistics (2020) 8476-8488

DOI: 10.18653/v1/2020.acl-main.750

37Citations

153Readers

Abstract

Named entity recognition is a key component of many text processing pipelines and it is thus essential for this component to be robust to different types of input. However, domain transfer of NER models with data from multiple genres has not been widely studied. To this end, we conduct NER experiments in three predictive setups on data from: a) multiple domains; b) multiple domains where the genre label is unknown at inference time; c) domains not encountered in training. We introduce a new architecture tailored to this task by using shared and private domain parameters and multi-task learning. This consistently outperforms all other baseline and competitive methods on all three experimental setups, with differences ranging between +1.95 to +3.11 average F1 across multiple genres when compared to standard approaches. These results illustrate the challenges that need to be taken into account when building real-world NLP applications that are robust to various types of text and the methods that can help, at least partially, alleviate these issues.

Cite

CITATION STYLE

APA

Wang, J., Kulkarni, M., & Preotiuc-Pietro, D. (2020). Multi-domain named entity recognition with genre-aware and agnostic inference. In Proceedings of the Annual Meeting of the Association for Computational Linguistics (pp. 8476–8488). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2020.acl-main.750

Multi-domain named entity recognition with genre-aware and agnostic inference

Abstract

Cite

Register to see more suggestions