A Malay stemmer for Jawi characters

Abstract

Malay can be written in either Roman (Rumi) or Jawi characters, but most Malay stemmers handle only Rumi affixes. This paper proposes a stemmer for Jawi characters that uses two sets of Jawi rules: one set stems the various forms of derived words, and the other produces the root word for each derivative, replacing the use of a dictionary. The stemmer was tested on 1185 derived words formed with prefixes, circumfixes, suffixes, and infixes. The results show that 84.89% of Jawi root words were stemmed successfully. © 2011 Springer-Verlag.
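
To make the two-rule-set idea concrete, the following is a minimal Python sketch of dictionary-free, rule-based stemming on Jawi strings. It is not the paper's algorithm: the rule tables, the `MIN_ROOT_LEN` guard, and the `stem` function are all hypothetical illustrations, and the handful of affix rules shown (di-, men- before a t-initial root, -an) are assumptions chosen only to show how one rule can strip an affix while another restores the root spelling.

```python
# Illustrative rules only; these are NOT the rule sets from the paper.
# Each prefix rule pairs the letters to strip with the letters to restore,
# so the root is produced by rule rather than looked up in a dictionary.
PREFIX_RULES = [
    ("من", "ت"),  # assumed: men- + t-initial root (menulis -> tulis)
    ("د", ""),    # assumed: di- (ditulis -> tulis)
]
SUFFIX_RULES = ["ن"]  # assumed: -an (makanan -> makan)

MIN_ROOT_LEN = 3  # hypothetical guard against stripping very short words


def stem(word: str) -> str:
    """Strip at most one suffix and one prefix, restoring root letters."""
    for suffix in SUFFIX_RULES:
        if word.endswith(suffix) and len(word) - len(suffix) >= MIN_ROOT_LEN:
            word = word[: -len(suffix)]
            break
    for prefix, restore in PREFIX_RULES:
        new_len = len(word) - len(prefix) + len(restore)
        if word.startswith(prefix) and new_len >= MIN_ROOT_LEN:
            word = restore + word[len(prefix):]
            break
    return word


for derived in ["منوليس", "دتوليس", "ماكنن"]:
    print(derived, "->", stem(derived))
```

A sketch this small over-stems any root that happens to begin or end with an affix-like letter, which is one reason a rule-only stemmer cannot be expected to reach 100% accuracy.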

Cite

Sulaiman, S., Omar, K., Omar, N., Murah, M. Z., & Abdul Rahman, H. (2011). A Malay stemmer for Jawi characters. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 7106 LNAI, pp. 668–676). https://doi.org/10.1007/978-3-642-25832-9_68
