A robust approach to extraction of texts from camera captured images

Sudipto Banerjee; Koustav Mullick; Ujjwal Bhattacharya

Conference Proceedings

A robust approach to extraction of texts from camera captured images

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2014) 8357 LNCS 30-46

DOI: 10.1007/978-3-319-05167-3_3

1Citations

9Readers

Get full text

Abstract

Here, we present our recent study of a robust but simple approach to extraction of texts from camera-captured images. In the proposed approach, we first identify pixels which are highly specular. Connected components of this set of specular pixels are obtained. Pixels belonging to each such component are separately binarized using the well-known Otsu's approach. We next apply smoothing on the whole image before obtaining its Canny edge representation. Bounding rectangle of each connected component of the Canny edge image is obtained and multiple components with pairwise overlapping bounding boxes are merged. Otsu's thresholding technique is applied separately on different parts of input image defined by the resulting bounding boxes. Although Otsu's thresholding approach does not generally provide acceptable performance on camera captured images, we observed its suitability when applied severally as in the above. The binarized specular components obtained at the initial stage replace the corresponding regions of the latter binarized image. Finally, a set of postprocessing operations is used to remove certain non-text components of the binarized image. © 2014 Springer International Publishing Switzerland.

Cite

CITATION STYLE

APA

Banerjee, S., Mullick, K., & Bhattacharya, U. (2014). A robust approach to extraction of texts from camera captured images. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 8357 LNCS, pp. 30–46). Springer Verlag. https://doi.org/10.1007/978-3-319-05167-3_3

A robust approach to extraction of texts from camera captured images

Abstract

Cite

Register to see more suggestions