This paper presents an approach to Russian text vectorization based on SRSTI classifier. Our approach is based on using SRSTI categories as vector space dimensions. The categories are defined by lists of keywords. We explain our choice of SRSTI as a basis for vector space. We describe the keywords selection process, as well as vector calculation and comparison algorithm. We apply developed algorithm to marked-up SRSTI texts and user social profiles. We also suggest approaches to vector space improvement and evaluate them.
CITATION STYLE
Solomonova, Y., & Khlopotov, M. (2019). Russian Text Vectorization: An Approach Based on SRSTI Classifier. In Communications in Computer and Information Science (Vol. 1038 CCIS, pp. 754–764). Springer. https://doi.org/10.1007/978-3-030-37858-5_64
Mendeley helps you to discover research relevant for your work.