Unsupervised and Knowledge-free morpheme segmentation and analysis

Stefan Bordag

Conference Proceedings

Unsupervised and Knowledge-free morpheme segmentation and analysis

Bordag S

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2008) 5152 LNCS 881-891

DOI: 10.1007/978-3-540-85760-0_113

15Citations

17Readers

Get full text

Abstract

This paper presents a revised version of an unsupervised and knowledge-free morpheme boundary detection algorithm based on letter successor variety (LSV) and a trie classifier [1]. Additional knowledge about relatedness of the found morphs is obtained from a morphemic analysis based on contextual similarity. For the boundary detection the challenge of increasing recall of found morphs while retaining a high precision is tackled by adding a compound splitter, iterating the LSV analysis and dividing the trie classifier into two distinctly applied clasifiers. The result is a significantly improved overall performance and a decreased reliance on corpus size. Further possible improvements and analyses are discussed. © 2008 Springer-Verlag Berlin Heidelberg.

Author supplied keywords

Cite

CITATION STYLE

APA

Bordag, S. (2008). Unsupervised and Knowledge-free morpheme segmentation and analysis. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 5152 LNCS, pp. 881–891). Springer Verlag. https://doi.org/10.1007/978-3-540-85760-0_113

Unsupervised and Knowledge-free morpheme segmentation and analysis

Abstract

Author supplied keywords

Cite

Register to see more suggestions