Named Entity Recognition via Noise Aware Training Mechanism with Data Filter

Abstract

Named entity recognition (NER) is a fundamental task in natural language processing, and there is a long-held belief that more data benefits the model. However, not all data help with generalization, and some samples may contain ambiguous entities or noisy labels. Existing methods cannot distinguish hard samples from noisy samples well, which becomes particularly challenging in the presence of overfitting. This paper proposes a new method called Noise-Aware-with-Filter (NAF) that addresses these issues from two sides. From the perspective of the data, we design a Logit-Maximum-Difference (LMD) mechanism, which maximizes the diversity between different samples to help the model identify noisy samples. From the perspective of the model, we design an Incomplete-Trust (In-trust) loss function, which augments the CRF loss (L_CRF) with a robust Distrust-Cross-Entropy (DCE) term. The proposed In-trust loss effectively alleviates the overfitting caused by previous loss functions. Experiments on six real-world Chinese and English NER datasets show that NAF outperforms previous methods and obtains state-of-the-art (SOTA) results on the CoNLL2003 and CoNLL++ datasets.
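
To make the In-trust idea concrete, the sketch below is a minimal, hypothetical PyTorch rendering based on one plausible reading of the abstract: a standard supervised term (token-level cross-entropy here, standing in for the paper's CRF loss L_CRF) combined with a Distrust-Cross-Entropy term that mixes the model's own prediction with the possibly noisy label before taking the log. The weights alpha, beta, and delta are illustrative assumptions, not values from the paper; the exact formulation is given in the paper itself.

    # Hypothetical sketch of an "Incomplete-Trust" style loss: a supervised term plus
    # a Distrust-Cross-Entropy (DCE) term that partially trusts the model's own
    # prediction p over the possibly noisy one-hot label q. Token-level cross-entropy
    # stands in for the paper's CRF loss; alpha, beta, delta are illustrative weights.
    import torch
    import torch.nn.functional as F

    def in_trust_loss(logits, labels, alpha=1.0, beta=1.0, delta=0.5, ignore_index=-100):
        # logits: (batch, seq_len, num_tags); labels: (batch, seq_len) tag ids.
        num_tags = logits.size(-1)
        flat_logits = logits.reshape(-1, num_tags)
        flat_labels = labels.reshape(-1)

        # Standard cross-entropy against the (possibly noisy) annotations.
        ce = F.cross_entropy(flat_logits, flat_labels, ignore_index=ignore_index)

        # DCE term: -p * log(delta * p + (1 - delta) * q), averaged over valid tokens.
        mask = flat_labels != ignore_index
        p = F.softmax(flat_logits[mask], dim=-1)
        q = F.one_hot(flat_labels[mask], num_classes=num_tags).float()
        mixed = (delta * p + (1.0 - delta) * q).clamp_min(1e-12)
        dce = -(p * torch.log(mixed)).sum(dim=-1).mean()

        return alpha * ce + beta * dce

    # Example: 2 sentences, 5 tokens each, 9 BIO tags.
    logits = torch.randn(2, 5, 9, requires_grad=True)
    labels = torch.randint(0, 9, (2, 5))
    loss = in_trust_loss(logits, labels)
    loss.backward()

The intuition captured here is that, because the model distribution p appears inside the logarithm, a confidently predicted tag keeps the penalty small even when the annotation disagrees, so the training signal does not fully trust noisy labels.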

Cite (APA)

Huang, X., Chen, Y., Wu, S., Zhao, J., Xie, Y., & Sun, W. (2021). Named Entity Recognition via Noise Aware Training Mechanism with Data Filter. In Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021 (pp. 4791–4803). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2021.findings-acl.423
