Segmentation of scanned images of newspapers and magazines

4Citations
Citations of this article
7Readers
Mendeley users who have this article in their library.
Get full text

Abstract

In this chapter, we present a method for the segmentation of scanned images of newspapers and magazines. For further Mixed Raster Content (MRC) compression, we classify blocks of an image into background, or picture areas, or text regions. The method is relatively simple and fast, because it is intended for implementation in firmware. Textural features are calculated for image blocks. We train three one-to-rest classifiers by the AdaBoost technique on a publicly available dataset. The output of the classifiers can be treated as a posteriori probability. We smooth these probabilities among adjacent blocks. After smoothing, a voting procedure sets the class for each block. We argue that the Dual Leave-Group-of-Sources-Out cross-validation scheme is beneficial for the tuning of algorithm parameters. We discuss the advantages and shortcomings of several segmentation quality metrics.

Cite

CITATION STYLE

APA

Safonov, I. V., Kurilin, I. V., Rychagov, M. N., & Tolstaya, E. V. (2019). Segmentation of scanned images of newspapers and magazines. In Signals and Communication Technology (pp. 107–122). Springer Science and Business Media Deutschland GmbH. https://doi.org/10.1007/978-3-030-05342-0_5

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free