TAPE: Assessing Few-shot Russian Language Understanding

8Citations
Citations of this article
16Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Recent advances in zero-shot and few-shot learning have shown promise for a scope of research and practical purposes. However, this fast-growing area lacks standardized evaluation suites for non-English languages, hindering progress outside the Anglo-centric paradigm. To address this line of research, we propose TAPE (Text Attack and Perturbation Evaluation), a novel benchmark that includes six more complex NLU tasks for Russian, covering multi-hop reasoning, ethical concepts, logic and commonsense knowledge. The TAPE's design focuses on systematic zero-shot and few-shot NLU evaluation: (i) linguistic-oriented adversarial attacks and perturbations for analyzing robustness, and (ii) subpopulations for nuanced interpretation. The detailed analysis of testing the autoregressive baselines indicates that simple spelling-based perturbations affect the performance the most, while paraphrasing the input has a more negligible effect. At the same time, the results demonstrate a significant gap between the neural and human baselines for most tasks. We publicly release TAPE to foster research on robust LMs that can generalize to new tasks when little to no supervision is available.

Cite

CITATION STYLE

APA

Taktasheva, E., Shavrina, T., Fenogenova, A., Shevelev, D., Katricheva, N., Tikhonova, M., … Mikhailov, V. (2022). TAPE: Assessing Few-shot Russian Language Understanding. In Findings of the Association for Computational Linguistics: EMNLP 2022 (pp. 2472–2497). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2022.findings-emnlp.183

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free