A high performance domain specific OCR for Bangla script

Md Abul Hasnat; S. M. Murtoza Habib; Mumit Khan

Conference Proceedings

A high performance domain specific OCR for Bangla script

Novel Algorithms and Techniques in Telecommunications, Automation and Industrial Electronics (2008) 174-178

DOI: 10.1007/978-1-4020-8737-0_31

19Citations

9Readers

Get full text

Abstract

Research on recognizing Bengali script has been started since mid 1980's. A variety of different techniques have been applied and the performance is examined. In this paper we present a high performance domain specific OCR for recognizing Bengali script. We select the training data set from the script of the specified domain. We choose Hidden Markov Model (HMM) for character classification due to its simple and straightforward way of representation. We examine the primary error types that mainly occurred at preprocessing level and carefully handled those errors by adding special error correcting module as a part of recognizer. Finally we added a dictionary and some error specific rules to correct the probable errors after the word formation is done. The entire technique significantly increases the performance of the OCR for a specific domain to a great extent. © Springer Science+Business Media B.V. 2008.

Cite

CITATION STYLE

APA

Hasnat, M. A., Murtoza Habib, S. M., & Khan, M. (2008). A high performance domain specific OCR for Bangla script. In Novel Algorithms and Techniques in Telecommunications, Automation and Industrial Electronics (pp. 174–178). https://doi.org/10.1007/978-1-4020-8737-0_31

A high performance domain specific OCR for Bangla script

Abstract

Cite

Register to see more suggestions