This paper deals with automatic morphological segmentation of Czech lemmas contained in the word-formation network DeriNet. Capturing derivational relations between base and derived lemmas, and segmenting lemmas into sequences of morphemes are two closely related formal models of how words come into existence. Thus we propose a novel segmentation method that benefits from the existence of the network; our solution constitutes new state-of-the-art for the Czech language.
CITATION STYLE
Bodnár, J., Žabokrtský, Z., & Ševčíková, M. (2020). Semi-supervised induction of morpheme boundaries in czech using a word-formation network. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 12284 LNAI, pp. 189–196). Springer Science and Business Media Deutschland GmbH. https://doi.org/10.1007/978-3-030-58323-1_20
Mendeley helps you to discover research relevant for your work.