In this paper we present PoLem, a tool for the lemmatization of multi-word common noun phrases and named entities in Polish. The tool is based on a set of manually crafted rules and heuristics that draw on several dictionaries (including morphological, named-entity, and inflection-pattern dictionaries). It achieves a lemmatization accuracy of 97.99% on a dataset of multi-word common noun phrases and 86.17% in case-sensitive evaluation on a dataset of named entities.
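To make the distinction between case-sensitive and case-insensitive evaluation concrete, the following is a minimal sketch of how exact-match lemmatization accuracy could be computed. It is not the evaluation script used in the paper: the function name `lemmatization_accuracy`, its parameters, and the toy phrases are all assumptions introduced for illustration.

```python
def lemmatization_accuracy(predicted, gold, case_sensitive=True):
    """Return the fraction of phrases whose predicted lemma equals the gold lemma.

    Hypothetical helper: exact string match, optionally ignoring letter case.
    """
    if len(predicted) != len(gold):
        raise ValueError("predicted and gold must have the same length")
    if not gold:
        return 0.0
    if not case_sensitive:
        # Case-insensitive mode: compare lowercased forms.
        predicted = [p.lower() for p in predicted]
        gold = [g.lower() for g in gold]
    correct = sum(p == g for p, g in zip(predicted, gold))
    return correct / len(gold)


# Toy data invented for this sketch (not taken from the paper's datasets).
gold = [
    "Rada Ministrów",
    "czerwony samochód",
    "Jan Kowalski",
    "Uniwersytet Warszawski",
]
predicted = [
    "rada ministrów",             # right lemma, wrong capitalization
    "czerwony samochód",          # correct
    "Jan Kowalski",               # correct
    "Uniwersytet Warszawskiego",  # adjective left in the genitive: wrong
]

strict = lemmatization_accuracy(predicted, gold, case_sensitive=True)
relaxed = lemmatization_accuracy(predicted, gold, case_sensitive=False)
print(f"case-sensitive: {strict:.2%}, case-insensitive: {relaxed:.2%}")
```

In this toy example the capitalization error counts against the tool only in case-sensitive mode, which is why a case-sensitive score on named entities is the stricter of the two figures.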
CITATION
Marcińczuk, M. (2017). Lemmatization of multi-word common noun phrases and named entities in Polish. In International Conference Recent Advances in Natural Language Processing, RANLP (Vol. 2017-September, pp. 483–491). Incoma Ltd. https://doi.org/10.26615/978-954-452-049-6_064