Bi-LSTM model for morpheme segmentation of Russian words

Elena Bolshakova; Alexander Sapin

Conference Proceedings

Bi-LSTM model for morpheme segmentation of Russian words

Communications in Computer and Information Science (2019) 1119 CCIS 151-160

DOI: 10.1007/978-3-030-34518-1_11

5Citations

1Readers

Get full text

Abstract

The paper addresses the task of automatic morpheme segmentation involving both splitting words into morphs and classification of resulted morphs. For segmentation of Russian words, a new model based on Bi-LSTM neural network is proposed and experimentally evaluated on several training data sets differing in labeling. The proposed model has comparable quality with the best supervised machine learning models for morpheme segmentation with classification, slightly outperforming them in word-level classification accuracy with score 89%.

Author supplied keywords

Cite

CITATION STYLE

APA

Bolshakova, E., & Sapin, A. (2019). Bi-LSTM model for morpheme segmentation of Russian words. In Communications in Computer and Information Science (Vol. 1119 CCIS, pp. 151–160). Springer. https://doi.org/10.1007/978-3-030-34518-1_11

Bi-LSTM model for morpheme segmentation of Russian words

Abstract

Author supplied keywords

Cite

Register to see more suggestions