Allomorfessor extends the unsupervised morpheme segmentation method Morfessor to account for the linguistic phenomenon of allomorphy, where one morpheme has several different surface forms. The method discovers common base forms for allomorphs from an unannotated corpus by finding small modifications, called mutations, for them. Using Maximum a Posteriori estimation, the model is able to decide the amount and types of the mutations needed for the particular language. In Morpho Challenge 2009 evaluations, the effect of the mutations was discovered to be rather small. However, Allomorfessor performed generally well, achieving the best results for English in the linguistic evaluation, and being in the top three in the application evaluations for all languages. © 2010 Springer-Verlag Berlin Heidelberg.
CITATION STYLE
Virpioja, S., Kohonen, O., & Lagus, K. (2010). Unsupervised morpheme analysis with Allomorfessor. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 6241 LNCS, pp. 609–616). https://doi.org/10.1007/978-3-642-15754-7_73
Mendeley helps you to discover research relevant for your work.