Arabic Stemming techniques as feature extraction applied in Arabic text classification

3Citations
Citations of this article
7Readers
Mendeley users who have this article in their library.
Get full text

Abstract

In this paper, we conduct a comparative study about the impact of stemming algorithms, as feature extraction systems, on the task of classification of Arabic text documents. Stemming is forceful and fierce as in reducing words to their three-letters roots. Which may influence the semantics, as various words with divers implications may share the same root. Light stemming, by examination, expels oftentimes utilized prefixes and suffixes in Arabic words. Light stemming doesn’t extract the root and thus doesn’t influence the semantics of words. However, the result of the light stemming is not necessarily a word. For the evaluation, we used corpus contains 5,070 records that fall into six classes. A several tests were done utilizing two separate illustrations of the same corpus. The K-Nearest Neighbors (KNN) classifier was utilized for the classification task. The recall measure is used to evaluate the performance of these methods.

Cite

CITATION STYLE

APA

Boukil, S., El Adnani, F., El Moutaouakkil, A. E., Cherrat, L., & Ezziyyani, M. (2018). Arabic Stemming techniques as feature extraction applied in Arabic text classification. In Lecture Notes in Networks and Systems (Vol. 25, pp. 349–361). Springer. https://doi.org/10.1007/978-3-319-69137-4_31

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free