Segmentation of scanned images of newspapers and magazines

Ilia V. Safonov; Ilya V. Kurilin; Michael N. Rychagov; Ekaterina V. Tolstaya

Book Chapter


Springer Science and Business Media Deutschland GmbH (2019), pp. 107-122

DOI: 10.1007/978-3-030-05342-0_5


Abstract

In this chapter, we present a method for segmenting scanned images of newspapers and magazines. To prepare for subsequent Mixed Raster Content (MRC) compression, we classify each image block as background, picture, or text. The method is relatively simple and fast because it is intended for implementation in firmware. Textural features are calculated for each image block. We train three one-vs-rest classifiers with the AdaBoost technique on a publicly available dataset. The classifier outputs can be treated as a posteriori probabilities, which we smooth across adjacent blocks. After smoothing, a voting procedure assigns a class to each block. We argue that the Dual Leave-Group-of-Sources-Out cross-validation scheme is beneficial for tuning the algorithm parameters. We also discuss the advantages and shortcomings of several segmentation quality metrics.
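The smoothing and voting steps described in the abstract can be illustrated with a minimal sketch. The abstract does not specify the smoothing kernel or the voting rule, so the 3x3 mean filter and the highest-score vote below are assumptions chosen for illustration, and the names `smooth_scores`, `vote`, and `CLASSES` are hypothetical. The per-block scores stand in for the outputs of the three classifiers.

```python
# Sketch only: a 3x3 mean filter stands in for the chapter's smoothing,
# and a highest-score rule stands in for its voting procedure
# (both are assumptions, not the authors' stated method).
CLASSES = ("background", "picture", "text")


def smooth_scores(scores):
    """Average each block's score with its in-bounds 8-neighbours."""
    rows, cols = len(scores), len(scores[0])
    smoothed = [[0.0] * cols for _ in range(rows)]
    for r in range(rows):
        for c in range(cols):
            window = [
                scores[rr][cc]
                for rr in range(max(0, r - 1), min(rows, r + 2))
                for cc in range(max(0, c - 1), min(cols, c + 2))
            ]
            smoothed[r][c] = sum(window) / len(window)
    return smoothed


def vote(score_maps):
    """Pick, per block, the class whose smoothed score is highest."""
    smoothed = {name: smooth_scores(grid) for name, grid in score_maps.items()}
    rows = len(smoothed[CLASSES[0]])
    cols = len(smoothed[CLASSES[0]][0])
    return [
        [max(CLASSES, key=lambda name: smoothed[name][r][c]) for c in range(cols)]
        for r in range(rows)
    ]


# Toy 3x3 block grid: the centre block looks like a picture on its own,
# but it sits inside a region that scores strongly as text.
text = [[0.9, 0.9, 0.9], [0.9, 0.2, 0.9], [0.9, 0.9, 0.9]]
picture = [[0.1, 0.1, 0.1], [0.1, 0.7, 0.1], [0.1, 0.1, 0.1]]
background = [[0.0] * 3 for _ in range(3)]

labels = vote({"background": background, "picture": picture, "text": text})
```

In this toy grid the centre block would be labelled as a picture from its raw scores alone (0.7 against 0.2), but after averaging with its neighbours it is labelled as text, which is the kind of isolated misclassification that smoothing among adjacent blocks is meant to suppress.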


Cite


Safonov, I. V., Kurilin, I. V., Rychagov, M. N., & Tolstaya, E. V. (2019). Segmentation of scanned images of newspapers and magazines. In Signals and Communication Technology (pp. 107–122). Springer Science and Business Media Deutschland GmbH. https://doi.org/10.1007/978-3-030-05342-0_5
