Building super-resolution image generator for OCR accuracy improvement

Xujun Peng; Chao Wang

Conference Proceedings

Building super-resolution image generator for OCR accuracy improvement

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2020) 12116 LNCS 145-160

DOI: 10.1007/978-3-030-57058-3_11

9Citations

9Readers

Get full text

Abstract

Super-resolving a low resolution (LR) document image can not only enhance the visual quality and readability of the text, but improve the optical character recognition (OCR) accuracy. However, even despite the ill-posed nature of image super-resolution (SR) problem, how do we treat the finer details of text with large upscale factors and suppress noises and artifacts at the same time, especially for low quality document images is still a challenging task. Thus, in order to boost the OCR accuracy, we propose a generative adversarial network (GAN) based framework in this paper, where a SR image generator and a document image quality discriminator are constructed. To obtain high quality SR document image, multiple losses are designed to encourage the generator to learn the structural properties of texts. Meanwhile, the quality discriminator is trained based on a relativistic loss function. Based on the proposed framework, the obtained SR document images not only maintain the details of textures but remove the background noises, which achieve better OCR performance on the public databases. The source codes and pre-trained models are available at https://gitlab.com/xujun.peng/doc-super-resolution.

Author supplied keywords

Cite

CITATION STYLE

APA

Peng, X., & Wang, C. (2020). Building super-resolution image generator for OCR accuracy improvement. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 12116 LNCS, pp. 145–160). Springer. https://doi.org/10.1007/978-3-030-57058-3_11

Building super-resolution image generator for OCR accuracy improvement

Abstract

Author supplied keywords

Cite

Register to see more suggestions