An improved arabic word's roots extraction method using n-gram technique

15Citations
Citations of this article
16Readers
Mendeley users who have this article in their library.

Abstract

Arabic language is distinguished by its morphological richness, which forces the workers in the field of Arabic language Processing (i.e., information retrieval, document's classification, text summarizing) to deal with many words that seem to be different but in reality they came from an identical root word. One of the methods to overcome this problem is to return the words to their roots. This research aims to provide a new algorithm, that returns roots of Arabic words using n-gram technique without using morphological rules in order to avoid the complexity arising from the morphological richness of the language in one hand and the multiplicity of morphological rules in other hand. The proposed algorithm uses a list that contains over 4,500 identical roots words. © 2014 Science Publications.

Cite

CITATION STYLE

APA

Yousef, N., Abu-Errub, A., Odeh, A., & Khafajeh, H. (2014). An improved arabic word’s roots extraction method using n-gram technique. Journal of Computer Science, 10(4), 716–719. https://doi.org/10.3844/jcssp.2014.716.719

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free