Restoration of decorative headline images for document retrieval

Abstract

This paper describes a method for restoring decorative character images in the headlines of newspapers and magazines. Although headlines contain useful keywords for document retrieval, conventional OCR systems cannot always recognize them because the characters are often printed in reverse (white on black) and over various background textures. We designed filters that generate multiple candidate images by varying a small number of simple parameters (namely, the threshold for stroke-width filtering and whether black and white are reversed), so that one of the candidates contains a “normal” image whose characters are printed in black on a white background. If all the candidate images are recognized and indexed, the keywords in headlines can be expected to be retrievable without manual keyword entry or verification. In our experiment, about 90% of the characters in headline images segmented from newspapers were restored, in the sense that at least one of the restored candidate images contained the correct character images.
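The candidate-generation idea can be illustrated with a minimal sketch. It is not the paper's implementation: the abstract does not say how stroke width is measured, so the sketch assumes a binary image (1 = black, 0 = white) and approximates a pixel's stroke width as the shorter of the horizontal and vertical black runs passing through it. The function names (`invert`, `stroke_filter`, `candidates`) and the threshold values are hypothetical.

```python
from typing import List, Sequence, Tuple

Image = List[List[int]]  # assumption: binary image, 1 = black, 0 = white


def invert(img: Image) -> Image:
    """Reverse black and white."""
    return [[1 - p for p in row] for row in img]


def _transpose(img: Image) -> Image:
    return [list(col) for col in zip(*img)]


def _horizontal_runs(img: Image) -> Image:
    """For each black pixel, the length of the horizontal black run containing it."""
    out = []
    for row in img:
        res = [0] * len(row)
        x = 0
        while x < len(row):
            if row[x] == 1:
                start = x
                while x < len(row) and row[x] == 1:
                    x += 1
                for i in range(start, x):
                    res[i] = x - start
            else:
                x += 1
        out.append(res)
    return out


def stroke_width(img: Image) -> Image:
    """Assumed stroke-width estimate: min of horizontal and vertical run length."""
    horiz = _horizontal_runs(img)
    vert = _transpose(_horizontal_runs(_transpose(img)))
    return [[min(a, b) for a, b in zip(hr, vr)] for hr, vr in zip(horiz, vert)]


def stroke_filter(img: Image, min_width: int) -> Image:
    """Keep only black pixels whose estimated stroke width is at least min_width."""
    widths = stroke_width(img)
    return [
        [1 if p == 1 and w >= min_width else 0 for p, w in zip(row, wrow)]
        for row, wrow in zip(img, widths)
    ]


def candidates(img: Image, thresholds: Sequence[int]) -> List[Tuple[bool, int, Image]]:
    """One candidate per (reversed?, threshold) pair; each would be passed to OCR."""
    result = []
    for reversed_polarity in (False, True):
        base = invert(img) if reversed_polarity else img
        for t in thresholds:
            result.append((reversed_polarity, t, stroke_filter(base, t)))
    return result


# A thick 3x3 stroke plus an isolated one-pixel speck standing in for texture.
sample = [
    [0, 0, 0, 0, 0, 0],
    [0, 1, 1, 1, 0, 0],
    [0, 1, 1, 1, 0, 1],
    [0, 1, 1, 1, 0, 0],
    [0, 0, 0, 0, 0, 0],
]
cleaned = stroke_filter(sample, 2)  # the speck is removed, the stroke is kept
all_candidates = candidates(sample, [1, 2])  # 2 polarities x 2 thresholds
```

In this sketch every candidate is kept rather than trying to pick the "normal" one up front, which mirrors the abstract's point that recognizing and indexing all candidates avoids a manual verification step.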

Cite

Amano, T. (1999). Restoration of decorative headline images for document retrieval. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 1655, pp. 22–31). Springer Verlag. https://doi.org/10.1007/3-540-48172-9_3
