An Open Source Tesseract based Tool for Extracting Text from Images with Application in Braille Translation for the Visually Impaired

  • Chakraborty P
  • Mallik A
N/ACitations
Citations of this article
8Readers
Mendeley users who have this article in their library.

Abstract

Many valuable paper documents are usually scanned and kept as images for backup. Extracting text from the images is quite helpful and thus a need for some tool for this extraction is always there. One of the important applications of this tool is its use in Braille Translation. Braille has been the primary writing and reading system used by the visually impaired since the 19 th century. This application that extracts text from images and then converts it to Braille will prove to be quite useful for converting old valuable documents or books into Braille format. In this paper the complete methodology used for the extraction of texts from scanned images and for the translation of texts to Braille is presented. The scanned images are initially pre-processed and converted to grayscale and then passed through an adaptive threshold function for conversion to binary image. Then it is sent for Recognition using Google's powerful Tesseract recognition engine which is considered to be the best Open Source OCR Engine currently available. The generated text is then post-processed using a spell checking API JOrtho for removing the errors in the previous step. The final corrected text is then translated to a six dot cell Braille format using a set of rules provided by www.iceb.org. The translation to Braille includes conversion of numbers, alphabets, symbols and compound letters. The translated text can then be saved for printing the document later or for sending it to a Refreshable Braille Display.

Cite

CITATION STYLE

APA

Chakraborty, P., & Mallik, A. (2013). An Open Source Tesseract based Tool for Extracting Text from Images with Application in Braille Translation for the Visually Impaired. International Journal of Computer Applications, 68(16), 26–32. https://doi.org/10.5120/11664-7254

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free