Named Entity Recognition via Noise Aware Training Mechanism with Data Filter


Abstract

Named entity recognition (NER) is a fundamental task in natural language processing, and there is a long-held belief that datasets benefit the model. However, not all data help with generalization, and some samples may contain ambiguous entities or noisy labels. Existing methods cannot distinguish hard samples from noisy samples well, and this becomes particularly challenging in the case of overfitting. This paper proposes a new method called Noise-Aware-with-Filter (NAF) to address these issues from two sides. From the perspective of the data, we design a Logit-Maximum-Difference (LMD) mechanism, which maximizes the diversity between different samples to help the model identify noisy samples. From the perspective of the model, we design an Incomplete-Trust (In-trust) loss function, which boosts L_CRF with a robust Distrust-Cross-Entropy (DCE) term. The proposed In-trust loss can effectively alleviate the overfitting caused by previous loss functions. Experiments on six real-world Chinese and English NER datasets show that NAF outperforms previous methods and obtains state-of-the-art (SOTA) results on the CoNLL2003 and CoNLL++ datasets.
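The abstract names the In-trust loss and its DCE term but does not give their formulas. The sketch below is only an illustration of the general idea of "incomplete trust" in a label: inside the logarithm, the given label is mixed with the model's own prediction, so a single wrong label cannot drive the loss arbitrarily high. The exact form used here, the weights `alpha`, `beta`, and `delta`, and the function names are all assumptions made for this example, not details confirmed by the text above. `base_loss` stands in for whatever sequence loss the model already uses (for instance the CRF term mentioned in the abstract).

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def distrust_cross_entropy(logits, label, delta=0.5, eps=1e-12):
    """Hypothetical DCE-style term for one token (assumed form).

    p is the model's predicted distribution and q the one-hot given label.
    The log argument blends both: delta * p + (1 - delta) * q.
    delta is an assumed trade-off between trusting the model and the label.
    """
    p = softmax(logits)
    q = [1.0 if i == label else 0.0 for i in range(len(logits))]
    return -sum(
        pi * math.log(delta * pi + (1.0 - delta) * qi + eps)
        for pi, qi in zip(p, q)
    )


def in_trust_loss(base_loss, logits, label, alpha=1.0, beta=1.0, delta=0.5):
    """Assumed combination: weighted sum of a base loss and the DCE-style term."""
    return alpha * base_loss + beta * distrust_cross_entropy(logits, label, delta)


if __name__ == "__main__":
    logits = [5.0, 0.0]  # model is confident in class 0
    print("label agrees:   ", distrust_cross_entropy(logits, label=0))
    print("label disagrees:", distrust_cross_entropy(logits, label=1))
```

In this sketch, a label that contradicts a confident prediction still costs more than one that agrees with it, but the penalty stays bounded because the model's own probability always contributes to the log argument. Whether that matches the behavior of the paper's actual loss should be checked against the full text.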

Huang, X., Chen, Y., Wu, S., Zhao, J., Xie, Y., & Sun, W. (2021). Named Entity Recognition via Noise Aware Training Mechanism with Data Filter. In Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021 (pp. 4791–4803). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2021.findings-acl.423
