An OCR Engine for Printed Receipt Images using Deep Learning Techniques

Cagrı Sayallar; Ahmet Sayar; Nurcan Babalık

Journal ArticleOPEN ACCESS

An OCR Engine for Printed Receipt Images using Deep Learning Techniques

International Journal of Advanced Computer Science and Applications (2023) 14(2) 833-840

DOI: 10.14569/IJACSA.2023.0140295

7Citations

77Readers

Abstract

The digitization of receipts and invoices, and the recording of expenses in industry and accounting have begun to be used in the field of finance tracking. However, 100% success in character recognition for document digitization has not yet been achieved. In this study, a new Optical Character Recognition (OCR) engine called Nacsoft OCR was developed on Turkish receipt data by using artificial intelligence methods. The proposed OCR engine has been compared to widely used engines, Easy OCR, Tesseract OCR, and the Google Vision API. The benchmarking was made on English and Turkish receipts, and the accuracies of OCR engines in terms of character recognition and their speeds are presented. It is known that OCR character recognition engines perform better at word recognition when provided word position information. Therefore, the performance of the Nacsoft OCR engine in determining the word position was also compared with the performance of the other OCR engines, and the results were presented.

Author supplied keywords

Cite

CITATION STYLE

APA

Sayallar, C., Sayar, A., & Babalık, N. (2023). An OCR Engine for Printed Receipt Images using Deep Learning Techniques. International Journal of Advanced Computer Science and Applications, 14(2), 833–840. https://doi.org/10.14569/IJACSA.2023.0140295

An OCR Engine for Printed Receipt Images using Deep Learning Techniques

Abstract

Author supplied keywords

Cite

Register to see more suggestions