Information density based image binarization for text document containing graphics

Soma Datta; Nabendu Chaki; Sankhayan Choudhury

Conference ProceedingsOPEN ACCESS

Information density based image binarization for text document containing graphics

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2016) 9842 LNCS 105-115

DOI: 10.1007/978-3-319-45378-1_10

2Citations

3Readers

Abstract

In this work, a new clustering based binarization technique has been proposed. Clustering is done depending on the information density of the input image. Here input image is considered as a set of text, images as foreground and some random noises, marks of ink, spots of oil, etc. in the background. It is often quite difficult to separate the foreground from the background based on existing binarization technique. The existing methods offer good result if the input image contains only text. Experimental results indicate that this method is particularly good for degraded text document containing graphic images as well. USCSIPI database is used for testing phase. It is compared with iterative partitioning, Otsu’s method for seven different metrics.

Author supplied keywords

Cite

CITATION STYLE

APA

Datta, S., Chaki, N., & Choudhury, S. (2016). Information density based image binarization for text document containing graphics. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 9842 LNCS, pp. 105–115). Springer Verlag. https://doi.org/10.1007/978-3-319-45378-1_10

Information density based image binarization for text document containing graphics

Abstract

Author supplied keywords

Cite

Register to see more suggestions