Stemming algorithms have traditionally been utilized in information retrieval systems as they generate a more concise word representation. However, the efficiency of these algorithms varies according to the language they are used with. This paper presents STEMBR, a stemmer for Brazilian Portuguese whereby the suffix treatment is based on a statistical study of the frequency of the last letter for words found in Brazilian web pages. The proposed stemmer is compared with another algorithm specifically developed for Portuguese. The results show the efficiency of our stemmer. © Springer-Verlag Berlin Heidelberg 2005.
CITATION STYLE
Alvares, R. V., Garcia, A. C. B., & Ferraz, I. (2005). STEMBR: A stemming algorithm for the Brazilian Portuguese language. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 3808 LNCS, pp. 693–701). https://doi.org/10.1007/11595014_67
Mendeley helps you to discover research relevant for your work.