Split-NER: Named Entity Recognition via Two Question-Answering-based Classifications

6Citations
Citations of this article
12Readers
Mendeley users who have this article in their library.

Abstract

In this work, we address the NER problem by splitting it into two logical sub-tasks: (1) Span Detection which simply extracts mention spans of entities, irrespective of entity type; (2) Span Classification which classifies the spans into their entity types. Further, we formulate both sub-tasks as question-answering (QA) problems and produce two leaner models which can be optimized separately for each sub-task. Experiments with four cross-domain datasets demonstrate that this two-step approach is both effective and time efficient. Our system, SplitNER outperforms baselines on OntoNotes5.0, WNUT17 and a cybersecurity dataset and gives on-par performance on BioNLP13CG. In all cases, it achieves a significant reduction in training time compared to its QA baseline counterpart. The effectiveness of our system stems from fine-tuning the BERT model twice, separately for span detection and classification. The source code can be found at github.com/c3sr/split-ner.

Cite

CITATION STYLE

APA

Arora, J., & Park, Y. (2023). Split-NER: Named Entity Recognition via Two Question-Answering-based Classifications. In Proceedings of the Annual Meeting of the Association for Computational Linguistics (Vol. 2, pp. 416–426). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2023.acl-short.36

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free