Classification of handwritten document image into text and non-text regions

5Citations
Citations of this article
1Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Segmentation of document image into text and non-text regions is an essential process in document layout analysis which is one of the preprocessing steps in optical character recognition. Usually handwritten documents has no specific layout. It may contain non text regions such as diagrams, graphics, tables etc. In this work we propose a novel approach to segment text and non text components in Malayalam handwritten document image using Simplified Fuzzy ARTMAP (SFAM) classifier. Binarized document image is dilated horizontally and vertically and merged together. Perform connected component labelling on the smeared image. A set of geometrical and statistical features are extracted from each component and given to SFAM for classifying it into text and non text components. Experimental results are promising and it can be extended to other scripts also. © 2013 Springer.

Cite

CITATION STYLE

APA

Vidya, V., Indhu, T. R., & Bhadran, V. K. (2013). Classification of handwritten document image into text and non-text regions. In Lecture Notes in Electrical Engineering (Vol. 222 LNEE, pp. 103–112). https://doi.org/10.1007/978-81-322-1000-9_10

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free