Abstract
The purpose of the research is to answer the question whether linguistic information is retained in vector representations of sentences. We introduce a method of analysing the content of sentence embeddings based on universal probing tasks, along with the classification datasets for two contrasting languages. We perform a series of probing and downstream experiments with different types of sentence embeddings, followed by a thorough analysis of the experimental results. Aside from dependency parser-based embeddings, linguistic information is retained best in the recently proposed LASER sentence embeddings.
Cite
CITATION STYLE
Krasnowska-Kieras, K., & Wróblewska, A. (2020). Empirical linguistic study of sentence embeddings. In ACL 2019 - 57th Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference (pp. 5729–5739). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/p19-1573
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.