Segmentation of printed Devnagari documents

Abstract

Document segmentation is one of the most important phases in machine recognition of any language. Correct segmentation of individual symbols determines the success of a character recognition technique. Segmentation decomposes an image of a sequence of characters into sub-images of individual symbols by first segmenting lines and words. Devnagari is the most popular script in India. It is used for writing the Hindi, Marathi, Sanskrit and Nepali languages. Moreover, Hindi is the third most popular language in the world. Devnagari documents consist of vowels, consonants and various modifiers. Hence proper segmentation of a Devnagari word is challenging. This paper proposes a simple approach based on bounding boxes to segment Devnagari documents. Various challenges in the segmentation of Devnagari script are also discussed. © 2011 Springer-Verlag.
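
The abstract does not spell out the algorithm, so the following is only a minimal sketch of one common way to obtain line and word boxes, not the authors' method. It assumes a binarized page held in a 2D boolean NumPy array (True for ink) and uses projection profiles: rows containing ink delimit text lines, and columns containing ink within a line delimit horizontally connected regions. In printed Devnagari these regions usually correspond to words, because the header line joins the characters of a word. The function names `runs`, `segment_lines` and `segment_boxes` are hypothetical.

```python
import numpy as np

def runs(mask):
    """Return (start, end) pairs for each run of True values; end is exclusive."""
    padded = np.concatenate(([False], mask, [False]))
    # Indices where the padded mask switches between False and True.
    changes = np.flatnonzero(padded[1:] != padded[:-1])
    return list(zip(changes[0::2].tolist(), changes[1::2].tolist()))

def segment_lines(image):
    """Rows that contain any ink, grouped into (top, bottom) text-line ranges."""
    return runs(image.any(axis=1))

def segment_boxes(image):
    """Boxes (top, left, bottom, right) for each ink region within each line."""
    boxes = []
    for top, bottom in segment_lines(image):
        line = image[top:bottom]
        # Columns that contain any ink inside this line strip.
        for left, right in runs(line.any(axis=0)):
            boxes.append((top, left, bottom, right))
    return boxes

# Toy page: two regions on the first line, one region on the second.
page = np.zeros((7, 10), dtype=bool)
page[1:3, 1:4] = True
page[1:3, 6:9] = True
page[4:6, 2:8] = True
print(segment_boxes(page))
```

A real page would also need binarization, skew correction and a rule for separating characters beneath the header line, none of which this sketch attempts.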

Cite

Dongre, V. J., & Mankar, V. H. (2011). Segmentation of printed devnagari documents. In Communications in Computer and Information Science (Vol. 198 CCIS, pp. 211–218). https://doi.org/10.1007/978-3-642-22555-0_23
