Abstract
Optical Character Recognition (OCR) refers to the process of converting printed, hand printed and handwritten Tamil text documents into software translated Tamil Text. As part of the preprocessing phase the image file is checked for skewing. If the image is skewed, it is corrected by a simple rotation technique. Then the image is passed through a noise elimination phase and is binarized. The preprocessed image is segmented. Thus a database of character image glyphs is created out of the segmentation phase. Then all the image glyphs are considered for recognition. Each image glyph is passed through various routines which extract the features of the glyph. The glyphs are now set ready for classification and recognition based on the above said features. The extracted features are considered for recognition using Hidden Markov Model (HMM). The recognition rate achieved is 89%. © 2012 ICST Institute for Computer Science, Social Informatics and Telecommunications Engineering.
Author supplied keywords
Cite
CITATION STYLE
Kannan, R. J., Suresh, R. M., & Selvakumar, A. (2012). Applications of hidden Markov model to recognize handwritten Tamil characters. In Lecture Notes of the Institute for Computer Sciences, Social-Informatics and Telecommunications Engineering (Vol. 108 LNICST, pp. 282–290). https://doi.org/10.1007/978-3-642-35615-5_43
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.