A text classification problem in Kazakh language is examined. The amount of training data for the task in Kazakh is very limited, but plenty of labeled data in Russian are available. Language vector space transform is built and used to transfer knowledge from Russian into Kazakh language. The obtained classification quality is comparable to that of an approach that employed sophisticated automatic translation system.
CITATION STYLE
Smirnov, A., & Mendelev, V. (2017). Applying word embeddings to leverage knowledge available in one language in order to solve a practical text classification problem in another language. In Communications in Computer and Information Science (Vol. 661, pp. 248–254). Springer Verlag. https://doi.org/10.1007/978-3-319-52920-2_23
Mendeley helps you to discover research relevant for your work.