Scanning a document almost inevitably introduces skew into the resulting image. Estimating the inclination of skewed documents written in South Indian languages (Kannada, Telugu, Tamil and Malayalam) is particularly challenging because of the additional modified characters, which are attached as extensions and remain as disjoint protrusions of a main character. In this paper, we present a novel skew estimation technique, script independent across South Indian scripts, based on Gaussian Mixture Models (GMM). The Expectation-Maximization (EM) algorithm is used to learn the mixture of Gaussians. The cluster means are then subjected to moments to estimate the skew angle. Experiments are conducted on printed and handwritten documents corrupted by noise. Our method shows significantly improved performance compared to existing methods. © Springer-Verlag Berlin Heidelberg 2007.
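The pipeline described in the abstract (fit a mixture of Gaussians with EM, then apply moments to the cluster means) can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the number of components, the isotropic initialisation, the synthetic single text line, and the use of second-order central moments for the orientation are all assumptions made for illustration. Coordinates are mathematical (y pointing up); with image coordinates the sign of the angle flips.

```python
import numpy as np

def fit_gmm_em(points, k, n_iter=100, seed=0):
    """Fit a k-component full-covariance GMM with plain EM; return the means.

    Hypothetical helper: the abstract only says EM is used to learn the mixture.
    """
    rng = np.random.default_rng(seed)
    n, d = points.shape
    means = points[rng.choice(n, size=k, replace=False)].copy()
    # Assumed initialisation: isotropic covariances scaled to the data spread.
    init_var = np.trace(np.cov(points.T)) / (d * k ** 2) + 1e-6
    covs = np.tile(init_var * np.eye(d), (k, 1, 1))
    weights = np.full(k, 1.0 / k)
    for _ in range(n_iter):
        # E-step: responsibilities, computed in the log domain for stability.
        log_r = np.empty((n, k))
        for j in range(k):
            diff = points - means[j]
            inv = np.linalg.inv(covs[j])
            maha = np.einsum("ni,ij,nj->n", diff, inv, diff)
            _, logdet = np.linalg.slogdet(covs[j])
            log_r[:, j] = np.log(weights[j]) - 0.5 * (maha + logdet + d * np.log(2 * np.pi))
        log_r -= log_r.max(axis=1, keepdims=True)
        resp = np.exp(log_r)
        resp /= resp.sum(axis=1, keepdims=True)
        # M-step: update weights, means and covariances.
        nk = resp.sum(axis=0) + 1e-12
        weights = nk / n
        means = (resp.T @ points) / nk[:, None]
        for j in range(k):
            diff = points - means[j]
            covs[j] = (resp[:, j, None] * diff).T @ diff / nk[j] + 1e-6 * np.eye(d)
    return means

def skew_angle_from_means(means):
    """Orientation (degrees) from second-order central moments of the means."""
    centered = means - means.mean(axis=0)
    mu20 = np.sum(centered[:, 0] ** 2)
    mu02 = np.sum(centered[:, 1] ** 2)
    mu11 = np.sum(centered[:, 0] * centered[:, 1])
    return np.degrees(0.5 * np.arctan2(2.0 * mu11, mu20 - mu02))

def synthetic_text_line(angle_deg, n=600, seed=1):
    """Toy stand-in for foreground pixels of one text line, rotated by angle_deg."""
    rng = np.random.default_rng(seed)
    x = rng.uniform(0.0, 400.0, n)   # position along the line
    y = rng.normal(0.0, 3.0, n)      # jitter across the line
    t = np.radians(angle_deg)
    rot = np.array([[np.cos(t), -np.sin(t)], [np.sin(t), np.cos(t)]])
    return np.column_stack([x, y]) @ rot.T

if __name__ == "__main__":
    pts = synthetic_text_line(7.0)
    print("estimated skew (degrees):", skew_angle_from_means(fit_gmm_em(pts, k=8)))
```

Using the cluster means rather than every foreground pixel is what makes such an approach plausible for scripts with disjoint protrusions: each mean summarises a local group of pixels, so isolated strokes above or below the main character have less influence on the moment computation. A real page with several text lines would need per-line or per-block handling, which this sketch does not attempt.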
CITATION
Aradhya, V. N. M., Rao, A., & Kumar, G. H. (2007). Language independent skew estimation technique based on Gaussian Mixture Models: A case study on South Indian scripts. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 4815 LNCS, pp. 487–494). Springer Verlag. https://doi.org/10.1007/978-3-540-77046-6_60