Innovative feature sets for machine learning based Telugu character recognition

12Citations
Citations of this article
13Readers
Mendeley users who have this article in their library.

Abstract

In this Information age, all sources of information like historic documents, books, manuscripts are digitized and are available all over the world through internet in the form of scanned copies. These scanned images contain valuable information which are available either in colour or black and white for pleasant viewing. Optical Character Recognition (OCR) technology provides facility to search for keywords in these digital copies. In this paper, new method in which building an OCR system for Telugu language script; mainly focussing on the character recognition module. Features extracted through Discrete Wavelet Transform (DWT), Projection Profile (PP) and Singular Value Decomposition (SVD) is evaluated using k-Nearest Neighbour (k-NN) and Support Vector Machine (SVM) classifiers. Most productive results are obtained from the DWT features with SVM classifiers.

Cite

CITATION STYLE

APA

Jyothi, J., Manjusha, K., Anand Kumar, M., & Soman, K. P. (2015). Innovative feature sets for machine learning based Telugu character recognition. Indian Journal of Science and Technology, 8(24). https://doi.org/10.17485/ijst/2015/v8i24/79996

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free