Lemmatizer for document information retrieval systems in JAVA

Abstract

Stemming is a widely accepted practice in Document Information Retrieval Systems (DIRs) because it is more beneficial than harmful [3] and because it improves retrieval efficiency by reducing the size of the term index. We present a semi-automatic stemming technique designed specifically for the Java environment. Unlike the well-known Porter’s algorithm [8], the method works without deep knowledge of the grammar rules of a language. From that point of view, the method can be called universal, since it is applicable to multiple languages. We also present tests that show the quality of the method and its error rate.
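
The effect of stemming on index size can be illustrated with a minimal sketch. This is not the semi-automatic method of the paper, nor Porter's algorithm: the class `StemIndexSketch`, its suffix list, and the methods `stem` and `termIndex` are hypothetical names, and the stemmer is a naive suffix stripper used only as a stand-in to show several surface forms collapsing into one index term.

```java
import java.util.Arrays;
import java.util.List;
import java.util.Set;
import java.util.TreeSet;

public class StemIndexSketch {
    // Hypothetical suffix list chosen only for this illustration;
    // longer suffixes come first so "ions" is tried before "ion" and "s".
    private static final String[] SUFFIXES = {"ions", "ion", "ing", "ed", "s"};

    // Naive stand-in stemmer: strips the first matching suffix,
    // provided at least three characters of the word remain.
    static String stem(String term) {
        String t = term.toLowerCase();
        for (String suffix : SUFFIXES) {
            if (t.endsWith(suffix) && t.length() - suffix.length() >= 3) {
                return t.substring(0, t.length() - suffix.length());
            }
        }
        return t;
    }

    // Builds the set of distinct index terms, with or without stemming.
    static Set<String> termIndex(List<String> tokens, boolean useStemming) {
        Set<String> index = new TreeSet<>();
        for (String token : tokens) {
            index.add(useStemming ? stem(token) : token.toLowerCase());
        }
        return index;
    }

    public static void main(String[] args) {
        List<String> tokens = Arrays.asList(
                "connect", "connected", "connecting", "connection", "connections");
        System.out.println(termIndex(tokens, false).size()); // prints 5
        System.out.println(termIndex(tokens, true));         // prints [connect]
    }
}
```

In this toy input, five distinct surface forms become a single index entry. A real stemmer must also handle cases where blind suffix stripping conflates unrelated words, which is the kind of error an error-rate evaluation would measure.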

Cite

Galambos, L. (2001). Lemmatizer for document information retrieval systems in JAVA. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 2234, pp. 243–252). Springer Verlag. https://doi.org/10.1007/3-540-45627-9_21
