Arabic Stemming techniques as feature extraction applied in Arabic text classification

Samir Boukil; Fatiha El Adnani; Abd Elmajid El Moutaouakkil; Loubna Cherrat; Mostafa Ezziyyani

Book Chapter

Arabic Stemming techniques as feature extraction applied in Arabic text classification

Springer, (2018), 349-361

DOI: 10.1007/978-3-319-69137-4_31

3Citations

7Readers

Get full text

Abstract

In this paper, we conduct a comparative study about the impact of stemming algorithms, as feature extraction systems, on the task of classification of Arabic text documents. Stemming is forceful and fierce as in reducing words to their three-letters roots. Which may influence the semantics, as various words with divers implications may share the same root. Light stemming, by examination, expels oftentimes utilized prefixes and suffixes in Arabic words. Light stemming doesn’t extract the root and thus doesn’t influence the semantics of words. However, the result of the light stemming is not necessarily a word. For the evaluation, we used corpus contains 5,070 records that fall into six classes. A several tests were done utilizing two separate illustrations of the same corpus. The K-Nearest Neighbors (KNN) classifier was utilized for the classification task. The recall measure is used to evaluate the performance of these methods.

Author supplied keywords

Cite

CITATION STYLE

APA

Boukil, S., El Adnani, F., El Moutaouakkil, A. E., Cherrat, L., & Ezziyyani, M. (2018). Arabic Stemming techniques as feature extraction applied in Arabic text classification. In Lecture Notes in Networks and Systems (Vol. 25, pp. 349–361). Springer. https://doi.org/10.1007/978-3-319-69137-4_31

Arabic Stemming techniques as feature extraction applied in Arabic text classification

Abstract

Author supplied keywords

Cite

Register to see more suggestions