Tesseract Vs Gocr A Comparative Study

  • Dhiman S
  • Singh A


Optical Character Recognition (OCR) is a technique for converting scanned images into machine-readable text. The many OCR tools that have been available over the years each have their own strengths and weaknesses, and they produce different results depending on the metrics or parameters used to evaluate them. This paper compares two open source tools, Tesseract and GOCR. It first introduces both tools, then describes the architecture of Tesseract and how the tools work. The tools are compared on precision and accuracy across four parameters: image type, resolution, brightness, and font type.
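
The abstract does not state how precision and accuracy are computed, so the following is only a minimal sketch under assumed definitions: accuracy is taken as the share of ground-truth characters that appear in the OCR output, and precision as the share of output characters that match the ground truth. The function name `ocr_scores`, the labels `tool_a` and `tool_b`, and the sample strings are hypothetical and are not taken from the paper or from actual Tesseract or GOCR runs.

```python
from difflib import SequenceMatcher


def ocr_scores(reference: str, recognized: str) -> dict:
    """Character-level scores for one OCR result (assumed definitions)."""
    # autojunk=False disables difflib's popularity heuristic, which would
    # otherwise ignore very frequent characters in long texts.
    matcher = SequenceMatcher(None, reference, recognized, autojunk=False)
    # Total number of characters that line up between the two strings.
    matched = sum(block.size for block in matcher.get_matching_blocks())
    # Assumption: accuracy = matched characters / ground-truth characters.
    accuracy = matched / len(reference) if reference else 0.0
    # Assumption: precision = matched characters / characters the tool emitted.
    precision = matched / len(recognized) if recognized else 0.0
    return {"matched": matched, "accuracy": accuracy, "precision": precision}


if __name__ == "__main__":
    # Invented strings for illustration only, not real tool output.
    reference = "Optical Character Recognition"
    outputs = {
        "tool_a": "Optical Character Recognition",
        "tool_b": "0ptical Charactcr Recognition",
    }
    for name, text in outputs.items():
        scores = ocr_scores(reference, text)
        print(f"{name}: accuracy={scores['accuracy']:.2f} "
              f"precision={scores['precision']:.2f}")
```

In a comparison like the one described, scores of this kind could be collected once per tool for each setting of image type, resolution, brightness, and font type, and then tabulated side by side.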




Dhiman, S., & Singh, A. J. (2013). Tesseract Vs Gocr A Comparative Study. International Journal of Recent Technology and Engineering (IJRTE) (pp. 2277–3878). Retrieved from http://code.google.com/p/tesseract-ocr.
