Stemming is a widely accepted practice in Document Information Retrieval Systems (DIRs), because it is more benefical than harmful [3] as well as having the virtue of improving retrieval efficiency by reducing the size of the term index. We will present a technique of semi-automatic stemming that is fine designed for JAVA environment. The method works without deep knowledge of grammar rules of a language in contradistinction to well-known Porter’s algorithm [8]. From that point of view, we can call our method universal for more languages. We will also present tests to show quality of the method and its errorrate.
CITATION STYLE
Galambos, L. (2001). Lemmatizer for document information retrieval systems in JAVA. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 2234, pp. 243–252). Springer Verlag. https://doi.org/10.1007/3-540-45627-9_21
Mendeley helps you to discover research relevant for your work.