In the world of non-proprietary NLP software the standard, and perhaps the best, HMM-based POS tagger is TnT (Brants, 2000). We argue here that some of the criticism aimed at HMM performance on languages with rich morphology should more properly be directed at TnT’s peculiar license, free but not open source, since it is those details of the implementation which are hidden from the user that hold the key for improved POS tagging across a wider variety of languages. We present HunPos1, a free and open source (LGPL-licensed) alternative, which can be tuned by the user to fully utilize the potential of HMM architectures, offering performance comparable to more complex models, but preserving the ease and speed of the training and tagging process.
CITATION STYLE
Halácsy, P., Kornai, A., & Oravecz, C. (2007). HunPos – An open source trigram tagger. In Proceedings of the Annual Meeting of the Association for Computational Linguistics (pp. 209–212). Association for Computational Linguistics (ACL).
Mendeley helps you to discover research relevant for your work.