We address the problem of learning a morphological automaton directly from a monolingual text corpus without recourse to additional resources. Like previous work in this area, our approach exploits orthographic regularities in a search for possible morphological segmentation points. Instead of affixes, however, we search for affix transformation rules that express correspondences between term clusters induced from the data. This focuses the system on substrings having syntactic function, and yields clusterto- cluster transformation rules which enable the system to process unknown morphological forms of known words accurately. A stem-weighting algorithm based on Hubs and Authorities is used to clarify ambiguous segmentation points. We evaluate our approach using the CELEX database. © 2005 Association for Computational Linguistics.
CITATION STYLE
Freitag, D. (2005). Morphology induction from term clusters. In CoNLL 2005 - Proceedings of the Ninth Conference on Computational Natural Language Learning (pp. 128–135). Association for Computational Linguistics (ACL). https://doi.org/10.3115/1706543.1706566
Mendeley helps you to discover research relevant for your work.