Ethiopic natural scene text recognition using deep learning approaches

2Citations
Citations of this article
5Readers
Mendeley users who have this article in their library.
Get full text

Abstract

The success of deep learning approaches for scene text recognition in English, Chinese and Arabic language inspired us to pose a benchmark scene text recognition for Ethiopic script. To transcribe the word images to the cross bonding text, we use a segmentation free end-to-end trainable Convolutional and Recurrent Neural Network (CRNN) hybrid architecture. In the network, robust representation features from cropped word images are extracted at convolutional layer and the extracted representations features are transcribed to a sequence of labels by the recurrent layer and transcription layer. The transcription is not bounded by lexicon or word length. Due to it is effective uses to transcribe sequence-to-sequence tasks, CTC loss is applied to train the network. In order to train the proposed model, we prepare synthetic word images from Unicode fonts of Ethiopic scripts, besides the model performance is evaluated on real scene text dataset collected from different sources. The experiment result of the proposed model, shows a promising result.

Cite

CITATION STYLE

APA

Addis, D., Liu, C. M., & Ta, V. D. (2020). Ethiopic natural scene text recognition using deep learning approaches. In Lecture Notes of the Institute for Computer Sciences, Social-Informatics and Telecommunications Engineering, LNICST (Vol. 308 LNICST, pp. 502–511). Springer. https://doi.org/10.1007/978-3-030-43690-2_36

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free