Enhanced ensemble technique for optical character recognition

Imad Qasim Habeeb; Zeyad Qasim Al-Zaydi; Hanan Najm Abdulkhudhur

Conference Proceedings

Enhanced ensemble technique for optical character recognition

Communications in Computer and Information Science (2018) 938 213-225

DOI: 10.1007/978-3-030-01653-1_13

7Citations

5Readers

Get full text

Abstract

Optical character recognition (OCR) is the electronic transformation of images into a computer-encoded text. OCR systems often produce poor accuracy for noisy images. Ensemble recognition techniques are used to improve OCR accuracy. The idea of the ensemble recognition techniques is to produce N-versions of an input image. These versions are similar but not identical. They are passed through the OCR engine to turn them into different OCR outputs, which later leads to select the best between them. Existing ensemble techniques need to be more effective to reduce OCR error rate. This research proposed enhanced ensemble technique to overcome the drawbacks of existing techniques. The proposed technique was evaluated against three other relevant existing techniques. The performance measurements used in this research were Word Error Rate (WER) and Character Error Rate (CER). Experimental results showed a relative decrease of 14.37% and 40.13% over the WER and CER of the best existing technique. This study contributes to the OCR domain as the proposed technique could facilitate the automatic recognition of documents. Hence, it will lead to a better information extraction.

Author supplied keywords

Cite

CITATION STYLE

APA

Habeeb, I. Q., Al-Zaydi, Z. Q., & Abdulkhudhur, H. N. (2018). Enhanced ensemble technique for optical character recognition. In Communications in Computer and Information Science (Vol. 938, pp. 213–225). Springer Verlag. https://doi.org/10.1007/978-3-030-01653-1_13

Enhanced ensemble technique for optical character recognition

Abstract

Author supplied keywords

Cite

Register to see more suggestions