Classification of arabic writer based on clustering techniques

5Citations
Citations of this article
6Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Arabic text categorization for pattern recognitions is challenging. We propose for the first time a novel holistic method based on clustering for classifying Arabic writer. The categorization is accomplished stage-wise. Firstly, these document images are sectioned into lines, words, and characters. Secondly, their structural and statistical features are obtained from sectioned portions. Thirdly, F-Measure is used to evaluate the performance of the extracted features and their combination in different linkage methods for each distance measures and different numbers of groups. Finally, experiments are conducted on the standard KHATT dataset of Arabic handwritten text comprised of varying samples from 1000 writers. The results in the generation step are obtained from multiple runs of individual clustering methods for each distance measures. The best results are achieved when intensity, lines slope and their combination set of features are applied. It is demonstrated that different numbers of clusters having good set of features can deliver significant improvements for the handwritten structures clustering.

Cite

CITATION STYLE

APA

Ahmed, A. A., Al-Tamimi, M. S., Al-Sanjary, O. I., & Sulong, G. (2018). Classification of arabic writer based on clustering techniques. In Lecture Notes on Data Engineering and Communications Technologies (Vol. 5, pp. 48–58). Springer Science and Business Media Deutschland GmbH. https://doi.org/10.1007/978-3-319-59427-9_6

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free