Characters Segmentation from Arabic Handwritten Document Images: Hybrid Approach

5Citations
Citations of this article
12Readers
Mendeley users who have this article in their library.

Abstract

Character segmentation in Unconstrained Arabic handwriting is a complex and challenging task due to the overlapping and touching of words or letters. Such issues have not been widely investigated in the literature. Addressing these issues in the segmentation stage reduces errors in the segmentation process, which plays a significant role in enhancing the accuracy of the Arabic optical character recognition. Therefore, this paper proposes a hybrid approach to improve the accuracy for interconnection, overlapping or touching character segmentation. The proposed method includes several stages: removing extra shapes such as signatures from the document. Using morphological operations, connected components and bounding box detection, detect and extract individual words directly from the document. Finally, the touching characters segmentation is achieved based on background thinning and computational analysis of the word's region. The proposed method has been tested on KHATT, IFN/ENIT database and our own collected dataset. The experimental results showed that the proposed method obtained high performance and improved the accuracy compared to other methods.

Cite

CITATION STYLE

APA

Boraik, O. A., Ravikumar, M., & Saif, M. A. N. (2022). Characters Segmentation from Arabic Handwritten Document Images: Hybrid Approach. International Journal of Advanced Computer Science and Applications, 13(4), 395–403. https://doi.org/10.14569/IJACSA.2022.0130447

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free