Soft clustering for segmenting touching characters in printed scripts

1Citations
Citations of this article
3Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Segmentation of characters from the printed script is an important preprocessing step in automatic Optical Character Recognition (OCR). The performances of the various machine learning algorithms depend on the results of segmentation of the characters. The situation is more challenging when the scripts contain touching characters. Touching characters are predominant in different Indian scripts like Assamese, Bangla, Devanagari, Oriya, Gurmukhi, and many others. In such cases, the accuracy of an OCR system depends on the quality of segmentation of touching characters. In this paper, we explore the effectiveness of fuzzy, rough, and rough fuzzy k-means clustering to segment touching characters. We use different compound characters dataset from Devanagari, Assamese, and Bangla printed scripts for experimentation. Our results reveal that soft k-means are an effective alternative method for segmenting touching characters.

Cite

CITATION STYLE

APA

Nath, K., & Roy, S. (2019). Soft clustering for segmenting touching characters in printed scripts. In Lecture Notes in Electrical Engineering (Vol. 521, pp. 289–297). Springer Verlag. https://doi.org/10.1007/978-981-13-1906-8_30

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free