STEMBR: A stemming algorithm for the Brazilian Portuguese language

Abstract

Stemming algorithms have traditionally been utilized in information retrieval systems as they generate a more concise word representation. However, the efficiency of these algorithms varies according to the language they are used with. This paper presents STEMBR, a stemmer for Brazilian Portuguese whereby the suffix treatment is based on a statistical study of the frequency of the last letter for words found in Brazilian web pages. The proposed stemmer is compared with another algorithm specifically developed for Portuguese. The results show the efficiency of our stemmer. © Springer-Verlag Berlin Heidelberg 2005.
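The last-letter frequency study mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `last_letter_frequencies` and the sample word list are hypothetical, and the abstract does not say how the corpus was tokenized or how the frequencies were turned into suffix rules.

```python
from collections import Counter


def last_letter_frequencies(words):
    """Return the relative frequency of each final letter across the given words."""
    # Count the final character of every non-empty word, ignoring case.
    counts = Counter(w[-1].lower() for w in words if w)
    total = sum(counts.values())
    # Most frequent endings come first in the returned dict.
    return {letter: n / total for letter, n in counts.most_common()}


# Hypothetical sample; the paper's corpus was drawn from Brazilian web pages.
sample = ["casa", "casas", "menino", "meninos", "falar", "falou", "cidade", "nacional"]
freqs = last_letter_frequencies(sample)
```

A ranking like this would presumably indicate which word endings are common enough to deserve dedicated suffix rules, though the actual rule set is described only in the full paper.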

Citation (APA)

Alvares, R. V., Garcia, A. C. B., & Ferraz, I. (2005). STEMBR: A stemming algorithm for the Brazilian Portuguese language. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 3808 LNCS, pp. 693–701). https://doi.org/10.1007/11595014_67
