Extraction and Identification of Manipuri and Mizo Texts from Scene and Document Images

Loitongbam Sanayai Meetei; Thoudam Doren Singh; Sivaji Bandyopadhyay

Conference Proceedings (Open Access)

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2019) 11941 LNCS 405-414

DOI: 10.1007/978-3-030-34869-4_44

Abstract

The content inside an image can be exceptionally compelling. Text within an image is of special interest because, compared with other semantic content, it can be extracted effectively. Text detection is the task of detecting and localizing the portions of an image that contain text. Manipuri and Mizo are the lingua francas of Manipur and Mizoram, respectively, two neighboring states in northeastern India. Manipuri is currently written in both the Meetei Mayek script and the Bengali script, while Mizo is written in the Roman script with a circumflex accent added to the vowels. In this work, we report on text detection in natural scene images and document images containing Manipuri and Mizo text. We compare two approaches to text detection: Maximally Stable Extremal Regions (MSER) coupled with the Stroke Width Transform (SWT), and the Efficient and Accurate Scene Text Detector (EAST). The detected text regions in both languages are passed to Optical Character Recognition (OCR), followed by a post-OCR spelling correction step. In our text detection experiments, EAST outperformed the MSER with SWT approach.
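
The abstract does not say how the post-OCR spelling correction is carried out. As an illustration only, the sketch below shows one common approach: replacing each OCR token with its closest entry in a word list. The function names, the tiny lexicon, and the similarity cutoff are all hypothetical and are not taken from the paper.

```python
import difflib

def correct_token(token, lexicon, cutoff=0.8):
    """Return the closest lexicon entry to token, or token unchanged.

    Assumption: similarity is difflib's sequence-matching ratio; the
    paper's actual correction method is not described in the abstract.
    """
    if token in lexicon:
        return token
    # get_close_matches returns at most n candidates scoring >= cutoff,
    # ordered from most to least similar.
    matches = difflib.get_close_matches(token, lexicon, n=1, cutoff=cutoff)
    return matches[0] if matches else token

def correct_line(line, lexicon, cutoff=0.8):
    """Correct each whitespace-separated token of an OCR output line."""
    return " ".join(correct_token(t, lexicon, cutoff) for t in line.split())

# Hypothetical lexicon; a real one would hold Manipuri or Mizo vocabulary.
LEXICON = ["Manipur", "Mizoram", "Aizawl"]
```

With this toy lexicon, `correct_line("Manipvr and Mizoram", LEXICON)` maps the misread token `Manipvr` to `Manipur` and leaves tokens with no close lexicon match unchanged. In practice, a lexicon for Mizo would need to include the circumflex-accented vowel forms so that correctly recognized accents are not "corrected" away.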

Cite

Meetei, L. S., Singh, T. D., & Bandyopadhyay, S. (2019). Extraction and Identification of Manipuri and Mizo Texts from Scene and Document Images. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 11941 LNCS, pp. 405–414). Springer. https://doi.org/10.1007/978-3-030-34869-4_44
