STEMBR: A stemming algorithm for the Brazilian Portuguese language

Reinaldo Viana Alvares; Ana Cristina Bicharra Garcia; Inhaúma Ferraz

Conference Proceedings

STEMBR: A stemming algorithm for the Brazilian Portuguese language

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2005) 3808 LNCS 693-701

DOI: 10.1007/11595014_67

13Citations

24Readers

Get full text

Abstract

Stemming algorithms have traditionally been utilized in information retrieval systems as they generate a more concise word representation. However, the efficiency of these algorithms varies according to the language they are used with. This paper presents STEMBR, a stemmer for Brazilian Portuguese whereby the suffix treatment is based on a statistical study of the frequency of the last letter for words found in Brazilian web pages. The proposed stemmer is compared with another algorithm specifically developed for Portuguese. The results show the efficiency of our stemmer. © Springer-Verlag Berlin Heidelberg 2005.

Cite

CITATION STYLE

APA

Alvares, R. V., Garcia, A. C. B., & Ferraz, I. (2005). STEMBR: A stemming algorithm for the Brazilian Portuguese language. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 3808 LNCS, pp. 693–701). https://doi.org/10.1007/11595014_67

STEMBR: A stemming algorithm for the Brazilian Portuguese language

Abstract

Cite

Register to see more suggestions