We present a dataset of word embeddings for the Polish language. The embeddings can serve as input to artificial intelligence methods in place of one-hot representations. Spatial relations between the embeddings reflect relations between words, such as alternatives and analogies, which improves the generalization of methods that use them. The embeddings were trained on data from Wikipedia using the skip-gram and continuous bag-of-words methods originally introduced for the English language by Mikolov et al. The current version of the embeddings can be downloaded from http://publications.ics.p.lodz.pl/2016/word embeddings/.
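As a rough illustration of the contrast with one-hot vectors and of how analogies appear as spatial relations, consider the minimal sketch below. It assumes cosine similarity as the measure of closeness. The three-dimensional vectors in `toy_embeddings` are invented for this example and are not taken from the published dataset, and the helpers `cosine`, `one_hot`, and `analogy` are hypothetical names introduced here.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length, non-zero vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def one_hot(index, size):
    """One-hot vector: 1.0 at the word's index, 0.0 elsewhere."""
    return [1.0 if i == index else 0.0 for i in range(size)]


# Hypothetical toy vectors, invented for illustration only.
# Real embeddings have many more dimensions and are learned from text.
toy_embeddings = {
    "król": [0.9, 0.8, 0.1],       # king
    "królowa": [0.9, 0.1, 0.8],    # queen
    "mężczyzna": [0.1, 0.9, 0.1],  # man
    "kobieta": [0.1, 0.1, 0.9],    # woman
    "stół": [-0.5, 0.2, 0.1],      # table
}


def analogy(a, b, c, embeddings):
    """Solve a : b :: c : ? by finding the word closest to b - a + c."""
    target = [y - x + z
              for x, y, z in zip(embeddings[a], embeddings[b], embeddings[c])]
    # Exclude the query words themselves from the candidates.
    candidates = [w for w in embeddings if w not in (a, b, c)]
    return max(candidates, key=lambda w: cosine(target, embeddings[w]))


# Any two distinct one-hot vectors are orthogonal, so they carry no
# information about how related the two words are.
print(cosine(one_hot(0, 5), one_hot(1, 5)))  # → 0.0

# Dense vectors can place related words close together.
print(cosine(toy_embeddings["król"], toy_embeddings["królowa"]))

# mężczyzna : król :: kobieta : ?
print(analogy("mężczyzna", "król", "kobieta", toy_embeddings))  # → królowa
```

With one-hot vectors every pair of distinct words is equally dissimilar, whereas dense vectors let a downstream method transfer what it has learned about one word to its neighbours in the vector space.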
CITATION STYLE
Rogalski, M., & Szczepaniak, P. S. (2016). Word embeddings for the Polish language. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 9692, pp. 126–135). Springer Verlag. https://doi.org/10.1007/978-3-319-39378-0_12