A Holistic Approach to Urdu Language Word Recognition using Deep Neural Networks

10Citations
Citations of this article
17Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Urdu is one of the most popular languages in the world. It is a Persianized standard register of the Hindi language with considerable and valuable literature. While digital libraries are constantly replacing conventional libraries, a vast amount of Urdu literature is still handwritten. Digitizing this handwritten literature is essential to preserve it and make it more accessible. Nevertheless, the scarcity of Urdu Optical Character Recognition (OCR) research limits a digital library's scope to a manual document search. The limited research work in this area is mainly due to the complexity of Urdu Script. Unlike the English language, the Urdu writing style is cursive, bidirectional, and character shapes and sizes highly vary depending on their osition. Holistic word recognition is found to be a better solution among many other text segmentation techniques as it takes the complete word into account instead of segmenting it explicitly or implicitly. For this project, the data of five different Urdu words were collected for training and testing a convolutional neural network and 96% recognition accuracy was achieved.

Cite

CITATION STYLE

APA

Khan, H. R., Kazmi, M., Khalid, H., Abul Hasan, M., Fayyaz, N., & Qazi, S. A. (2021). A Holistic Approach to Urdu Language Word Recognition using Deep Neural Networks. Engineering, Technology and Applied Science Research, 11(3), 7140–7145. https://doi.org/10.48084/etasr.4143

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free