We investigate the performance of sentence embeddings models on several tasks for the Russian language. In our comparison, we include such tasks as multiple choice question answering, next sentence prediction, and paraphrase identification. We employ FastText embeddings as a baseline and compare it to ELMo and BERT embeddings. We conduct two series of experiments, using both unsupervised (i.e., based on similarity measure only) and supervised approaches for the tasks. Finally, we present datasets for multiple choice question answering and next sentence prediction in Russian.
CITATION STYLE
Popov, D., Pugachev, A., Svyatokum, P., Svitanko, E., & Artemova, E. (2019). Evaluation of sentence embedding models for natural language understanding problems in Russian. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 11832 LNCS, pp. 205–217). Springer. https://doi.org/10.1007/978-3-030-37334-4_19
Mendeley helps you to discover research relevant for your work.