Syntax-augmented multilingual BERT for cross-lingual transfer

Wasi Uddin Ahmad; Haoran Li; Kai-Wei Chang; Yashar Mehdad

Conference Proceedings

ACL-IJCNLP 2021 - 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, Proceedings of the Conference (2021) 1 4538-4554

DOI: 10.18653/v1/2021.acl-long.350

30 Citations

101 Readers

Abstract

In recent years, we have seen a colossal effort in pre-training multilingual text encoders using large-scale corpora in many languages to facilitate cross-lingual transfer learning. However, due to typological differences across languages, cross-lingual transfer is challenging. Nevertheless, language syntax, e.g., syntactic dependencies, can bridge the typological gap. Previous works have shown that pre-trained multilingual encoders, such as mBERT (Devlin et al., 2019), capture language syntax, which helps cross-lingual transfer. This work shows that explicitly providing language syntax and training mBERT with an auxiliary objective to encode the universal dependency tree structure helps cross-lingual transfer. We perform rigorous experiments on four NLP tasks: text classification, question answering, named entity recognition, and task-oriented semantic parsing. The experimental results show that syntax-augmented mBERT improves cross-lingual transfer on popular benchmarks, such as PAWS-X and MLQA, by 1.4 and 1.6 points, respectively, on average across all languages. In the generalized transfer setting, performance improves significantly, by 3.9 and 3.1 points on average on PAWS-X and MLQA, respectively.

Citation (APA)

Ahmad, W. U., Li, H., Chang, K. W., & Mehdad, Y. (2021). Syntax-augmented multilingual BERT for cross-lingual transfer. In ACL-IJCNLP 2021 - 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, Proceedings of the Conference (Vol. 1, pp. 4538–4554). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2021.acl-long.350
