The paper addresses the task of automatic morpheme segmentation involving both splitting words into morphs and classification of resulted morphs. For segmentation of Russian words, a new model based on Bi-LSTM neural network is proposed and experimentally evaluated on several training data sets differing in labeling. The proposed model has comparable quality with the best supervised machine learning models for morpheme segmentation with classification, slightly outperforming them in word-level classification accuracy with score 89%.
CITATION STYLE
Bolshakova, E., & Sapin, A. (2019). Bi-LSTM model for morpheme segmentation of Russian words. In Communications in Computer and Information Science (Vol. 1119 CCIS, pp. 151–160). Springer. https://doi.org/10.1007/978-3-030-34518-1_11
Mendeley helps you to discover research relevant for your work.