Telugu language is one of the most spoken Indian languages throughout the world. Since it has an old heritage, so Telugu literature and newspaper publications can be scanned to identify individual words. Identification of Telugu word images poses serious problems owing to its complex structure and larger set of individual characters. This paper aims to develop a novel methodology to achieve the same using SIFT (Scale Invariant Feature Transform) features of telugu words and classifying these features using BoVW (bag of visual words). The features are clustered to create a dictionary using k-means clustering. These words are used to create a visual codebook of the word images and the classification is achieved through SVM (Support Vector Machine).
CITATION STYLE
Lakshmi, K. M., & Babu, T. R. (2019). Efficient technique for word identification and recognition in Telugu documents. International Journal of Recent Technology and Engineering, 8(2), 6053–6057. https://doi.org/10.35940/ijrte.B3793.078219
Mendeley helps you to discover research relevant for your work.