Probing multilingual sentence representationswith x-probe

Vinit Ravishankar; Lilja Øvrelid; Erik Velldal

Conference ProceedingsOPEN ACCESS

Probing multilingual sentence representationswith x-probe

ACL 2019 - 4th Workshop on Representation Learning for NLP, RepL4NLP 2019 - Proceedings of the Workshop (2019) 156-168

DOI: 10.18653/v1/w19-4318

13Citations

87Readers

Abstract

This paper extends the task of probing sentence representations for linguistic insight in a multilingual domain. In doing so, we make two contributions: First, we provide datasets for multilingual probing, derived from Wikipedia, in five languages, viz. English, French, German, Spanish and Russian. Second, we evaluate six sentence encoders for each language, each trained by mapping sentence representations to English sentence representations, using sentences in a parallel corpus. We discover that cross-lingually mapped representations are often better at retaining certain linguistic information than representations derived from English encoders trained on natural language inference (NLI) as a downstream task.

Cite

CITATION STYLE

APA

Ravishankar, V., Øvrelid, L., & Velldal, E. (2019). Probing multilingual sentence representationswith x-probe. In ACL 2019 - 4th Workshop on Representation Learning for NLP, RepL4NLP 2019 - Proceedings of the Workshop (pp. 156–168). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/w19-4318

Probing multilingual sentence representationswith x-probe

Abstract

Cite

Register to see more suggestions