Zero-shot cross-lingual named entity recognition (NER) aims at transferring knowledge from annotated, resource-rich data in source languages to unlabeled, resource-lean data in target languages. Existing mainstream methods based on the teacher-student distillation framework ignore the rich and complementary information lying in the intermediate layers of pre-trained language models, and domain-invariant information is easily lost during transfer. In this study, a mixture of short-channel distillers (MSD) method is proposed to fully exploit the rich hierarchical information in the teacher model and to transfer knowledge to the student model sufficiently and efficiently. Concretely, a multi-channel distillation framework is designed for sufficient information transfer by aggregating multiple distillers as a mixture. Besides, an unsupervised method adopting parallel domain adaptation is proposed to shorten the channels between the teacher and student models to preserve domain-invariant features. Experiments on four datasets across nine languages demonstrate that the proposed method achieves new state-of-the-art performance on zero-shot cross-lingual NER and shows strong generalization and compatibility across languages and domains.
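To make the multi-channel idea concrete, below is a minimal PyTorch sketch of a mixture of distillers, where each "channel" reads soft NER label distributions off one intermediate teacher layer and a learned softmax mixture aggregates the per-channel distillation losses. The class name `MixtureOfDistillers`, the layer choices, and the per-layer linear heads are illustrative assumptions, not the authors' exact architecture.

```python
# Hypothetical sketch of a mixture of short-channel distillers.
# Each channel distills soft token-label distributions taken from one
# intermediate teacher layer; a learned mixture weights the channels.
import torch
import torch.nn as nn
import torch.nn.functional as F

class MixtureOfDistillers(nn.Module):
    def __init__(self, hidden_size: int, num_labels: int, teacher_layers=(4, 8, 12)):
        super().__init__()
        self.teacher_layers = teacher_layers
        # One lightweight classifier head per selected teacher layer (one "channel").
        self.channel_heads = nn.ModuleList(
            nn.Linear(hidden_size, num_labels) for _ in teacher_layers
        )
        # Learnable mixture logits over channels, normalized by softmax.
        self.mix_logits = nn.Parameter(torch.zeros(len(teacher_layers)))

    def forward(self, teacher_hidden_states, student_logits, temperature: float = 1.0):
        """teacher_hidden_states: tuple of [batch, seq, hidden] tensors, one per
        layer (e.g. `outputs.hidden_states` from a Hugging Face encoder called
        with output_hidden_states=True); student_logits: [batch, seq, num_labels]."""
        weights = torch.softmax(self.mix_logits, dim=0)
        log_p_student = F.log_softmax(student_logits / temperature, dim=-1)
        loss = student_logits.new_zeros(())
        for w, layer, head in zip(weights, self.teacher_layers, self.channel_heads):
            # Short channel: soft labels come directly from an intermediate layer,
            # not only from the teacher's final output.
            p_teacher = F.softmax(head(teacher_hidden_states[layer]) / temperature, dim=-1)
            loss = loss + w * F.kl_div(
                log_p_student, p_teacher.detach(), reduction="batchmean"
            )
        return loss
```

In this sketch the mixture weights are trained jointly with the student, so channels carrying more transferable signal receive larger weight; the paper's parallel domain adaptation for preserving domain-invariant features is a separate, complementary component not shown here.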
Ma, J. Y., Chen, B., Gu, J. C., Ling, Z. H., Guo, W., Liu, Q., … Liu, C. (2022). WIDER & CLOSER: Mixture of Short-channel Distillers for Zero-shot Cross-lingual Named Entity Recognition. In Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, EMNLP 2022 (pp. 5171–5183). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2022.emnlp-main.345