A deep learning based Arabic script recognition system: Benchmark on KHAT

Riaz Ahmad; Saeeda Naz; Muhammad Afzal; Sheikh Rashid; Marcus Liwicki; Andreas Dengel

Journal Article (Open Access)

International Arab Journal of Information Technology (2020) 17(3) 299-305

DOI: 10.34028/iajit/17/3/3

Abstract

This paper presents a deep learning benchmark on the KFUPM Handwritten Arabic TexT (KHATT) dataset, which consists of complex patterns of handwritten Arabic text-lines. The paper makes three main contributions: (1) pre-processing, (2) a deep learning based approach, and (3) data augmentation. The pre-processing step prunes extra white space and de-skews skewed text-lines. The deep learning approach is based on Multi-Dimensional Long Short-Term Memory (MDLSTM) networks and Connectionist Temporal Classification (CTC). The MDLSTM has the advantage of scanning Arabic text-lines in all directions (horizontal and vertical), so that it covers dots, diacritics, strokes, and fine inflections. Combining data augmentation with the deep learning approach raises the Character Recognition (CR) rate to 80.02%, compared with a 75.08% baseline, a gain of 4.94 percentage points.
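The white-space pruning mentioned in the pre-processing step can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `prune_white_margins`, the `ink_threshold` parameter, and the assumption of dark ink on a light background in a grayscale NumPy array are all hypothetical choices made for illustration, and de-skewing is not covered.

```python
import numpy as np


def prune_white_margins(line_img, ink_threshold=128):
    """Crop blank rows and columns surrounding the ink in a text-line image.

    Hypothetical helper: assumes a 2-D grayscale array with dark ink on a
    light background, where pixels below ink_threshold count as ink.
    """
    ink = line_img < ink_threshold
    rows = np.flatnonzero(ink.any(axis=1))  # indices of rows containing ink
    cols = np.flatnonzero(ink.any(axis=0))  # indices of columns containing ink
    if rows.size == 0:
        # No ink found: return the image unchanged rather than an empty crop.
        return line_img
    return line_img[rows[0]:rows[-1] + 1, cols[0]:cols[-1] + 1]


# Toy example: a white 6x10 canvas with a 2x5 block of "ink".
img = np.full((6, 10), 255, dtype=np.uint8)
img[2:4, 3:8] = 0
cropped = prune_white_margins(img)
print(cropped.shape)
```

On the toy image, the crop keeps only the bounding box of the dark block; a real text-line would likely also need noise handling before thresholding.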

Ahmad, R., Naz, S., Afzal, M., Rashid, S., Liwicki, M., & Dengel, A. (2020). A deep learning based Arabic script recognition system: Benchmark on KHAT. International Arab Journal of Information Technology, 17(3), 299–305. https://doi.org/10.34028/iajit/17/3/3
