Malay may be written in either Roman (Rumi) or Jawi script. Most Malay stemmers handle only Rumi affixes. This paper proposes a stemmer for Jawi text that uses two sets of Jawi rules: one set stems the various forms of derived words, and the other replaces a dictionary lookup by producing the root word for each derivative. The stemmer was tested on 1185 derived words containing prefixes, circumfixes, suffixes, and infixes. The results show that 84.89% of the Jawi root words were stemmed successfully. © 2011 Springer-Verlag.
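The two-rule-set idea can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the rule tables, the Jawi spellings, the minimum root length, and the function name `stem` are hypothetical and are not taken from the paper, whose actual rules are not given in the abstract. One table strips affixes; the other rebuilds a root directly from a derived form, standing in for a dictionary lookup.

```python
# Hypothetical rule tables for illustration only; NOT the rules from the paper.

# Rule set 1: affixes to strip, as (prefix, suffix) pairs. "" means "none".
AFFIX_RULES = [
    ("بر", ""),    # prefix ber-
    ("د", "كن"),   # circumfix di-...-kan
    ("", "ن"),     # suffix -an
]

# Rule set 2: root restoration, as (surface prefix, restored initial letter).
# Stands in for a dictionary: meN- before a t-initial root (menulis -> tulis).
RESTORE_RULES = [
    ("من", "ت"),
]

MIN_ROOT_LEN = 3  # assumed guard against stripping a word down to nothing


def stem(word: str) -> str:
    """Return a candidate root for a Jawi word, or the word itself."""
    # Try restoration rules first: they rewrite the start of the word.
    for surface, initial in RESTORE_RULES:
        if word.startswith(surface):
            root = initial + word[len(surface):]
            if len(root) >= MIN_ROOT_LEN:
                return root
    # Otherwise strip the first matching prefix/suffix pair.
    for prefix, suffix in AFFIX_RULES:
        if word.startswith(prefix) and word.endswith(suffix):
            root = word[len(prefix):len(word) - len(suffix)]
            if len(root) >= MIN_ROOT_LEN:
                return root
    return word


if __name__ == "__main__":
    for derived in ["برجالن", "دتوليسكن", "ماكنن", "منوليس"]:
        print(derived, "->", stem(derived))
```

A first-match strategy like this one overstems easily (any word ending in ن loses its last letter), which is one reason a real stemmer needs a much larger, ordered rule set and would not reach perfect accuracy.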
Sulaiman, S., Omar, K., Omar, N., Murah, M. Z., & Abdul Rahman, H. (2011). A Malay stemmer for Jawi characters. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 7106 LNAI, pp. 668–676). https://doi.org/10.1007/978-3-642-25832-9_68