This paper presents a deep learning benchmark on a complex dataset known as KFUPM Handwritten Arabic TexT (KHATT). The KHATT data-set consists of complex patterns of handwritten Arabic text-lines. This paper contributes mainly in three aspects i.e., (1) pre-processing, (2) deep learning based approach, and (3) data-augmentation. The pre-processing step includes pruning of white extra spaces plus de-skewing the skewed text-lines. We deploy a deep learning approach based on Multi-Dimensional Long Short-Term Memory (MDLSTM) networks and Connectionist Temporal Classification (CTC). The MDLSTM has the advantage of scanning the Arabic text-lines in all directions (horizontal and vertical) to cover dots, diacritics, strokes and fine inflammation. The data-augmentation with a deep learning approach proves to achieve better and promising improvement in results by gaining 80.02% Character Recognition (CR) over 75.08% as baseline.
CITATION STYLE
Ahmad, R., Naz, S., Afzal, M., Rashid, S., Liwicki, M., & Dengel, A. (2020). A deep learning based arabic script recognition system: Benchmark on khat. International Arab Journal of Information Technology, 17(3), 299–305. https://doi.org/10.34028/iajit/17/3/3
Mendeley helps you to discover research relevant for your work.