Span-based Named Entity Recognition by Generating and Compressing Information


Abstract

The information bottleneck (IB) principle has proven effective in various NLP applications. Existing work, however, has used only one of the two kinds of IB model, generative or information compression, to improve the performance of a target task. In this paper, we propose combining the two types of IB models in a single system to enhance Named Entity Recognition (NER). On the generative side, we incorporate two unsupervised components, span reconstruction and synonym generation, into a span-based NER system: span reconstruction ensures that the contextualised span representation retains the span's content, while synonym generation pushes synonyms towards similar representations even in different contexts. On the compression side, we add a supervised IB layer that compresses the span representations so that they preserve only the features useful for NER. Experiments on five corpora indicate that jointly training both generative and information compression models enhances the performance of the baseline span-based NER system. Our source code is publicly available at https://github.com/nguyennth/joint-ib-models.
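
To make the two IB components concrete, below is a minimal PyTorch sketch of a span classifier trained jointly with a generative span-reconstruction head and a variational information-bottleneck (compression) head. This is not the authors' implementation (see the linked repository for that); the module structure, dimensions, loss weights, and the simplified single-token reconstruction target are all illustrative assumptions.

```python
# Minimal sketch of joint generative + compression IB training for a
# span classifier. Hypothetical names and hyperparameters throughout;
# not the authors' configuration.
import torch
import torch.nn as nn
import torch.nn.functional as F


class JointIBSpanModel(nn.Module):
    def __init__(self, span_dim=768, bottleneck_dim=128,
                 vocab_size=30522, num_labels=5, beta=1e-3, gamma=0.1):
        super().__init__()
        # Generative component: reconstruct the span's content from its
        # contextualised representation (one token per span here, a
        # simplification of token-level decoding).
        self.reconstruct = nn.Linear(span_dim, vocab_size)
        # Compression component: Gaussian posterior q(z | span).
        self.mu = nn.Linear(span_dim, bottleneck_dim)
        self.logvar = nn.Linear(span_dim, bottleneck_dim)
        self.classifier = nn.Linear(bottleneck_dim, num_labels)
        self.beta, self.gamma = beta, gamma  # assumed loss weights

    def forward(self, span_repr, span_token_ids, labels):
        # (1) Reconstruction loss keeps span content in the representation.
        recon_logits = self.reconstruct(span_repr)
        recon_loss = F.cross_entropy(recon_logits, span_token_ids)
        # (2) Variational IB: sample z via the reparameterisation trick,
        # classify from z, and penalise KL(q(z|x) || N(0, I)) so the
        # bottleneck discards nuisance information.
        mu, logvar = self.mu(span_repr), self.logvar(span_repr)
        z = mu + torch.randn_like(mu) * torch.exp(0.5 * logvar)
        task_loss = F.cross_entropy(self.classifier(z), labels)
        kl = -0.5 * torch.mean(
            torch.sum(1 + logvar - mu.pow(2) - logvar.exp(), dim=-1))
        return task_loss + self.beta * kl + self.gamma * recon_loss


# Toy usage with random tensors standing in for encoder outputs.
model = JointIBSpanModel()
spans = torch.randn(32, 768)                 # contextualised span vectors
token_ids = torch.randint(0, 30522, (32,))   # simplified recon targets
labels = torch.randint(0, 5, (32,))
loss = model(spans, token_ids, labels)
loss.backward()
```

In this reading of the abstract, the beta-weighted KL term controls how aggressively the bottleneck compresses, while the gamma-weighted reconstruction term counteracts over-compression by forcing the span representation to stay informative about the span itself.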

Citation (APA)

Nguyen, N. T. H., Miwa, M., & Ananiadou, S. (2023). Span-based Named Entity Recognition by Generating and Compressing Information. In EACL 2023 - 17th Conference of the European Chapter of the Association for Computational Linguistics, Proceedings of the Conference (pp. 1976–1988). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2023.eacl-main.146
