Document image enhancement methods are often used to improve the accuracy and efficiency of automated document analysis and recognition tasks such as character recognition. These document images could be degraded or damaged for various reasons including aging, fading handwriting, poor lighting conditions, watermarks, etc. In recent years, with the improvement of computer performance and the continuous development of deep learning, many methods have been proposed to enhance the quality of these document images. In this paper, we review six tasks of document degradation, namely, background texture, page smudging, fading, poor lighting conditions, watermarking, and blurring. We summarize the main models for each degradation problem as well as recent work, such as the binarization model that can be used to deal with the degradation of background textures, lettering smudges. When facing the problem of fading, a model for stroke connectivity can be used, while the other three degradation problems are mostly deep learning models. We discuss the current limitations and challenges of each degradation task and introduce the common public datasets and metrics. We identify several promising research directions and opportunities for future research.
CITATION STYLE
Zhou, Y., Zuo, S., Yang, Z., He, J., Shi, J., & Zhang, R. (2023, July 1). A Review of Document Image Enhancement Based on Document Degradation Problem. Applied Sciences (Switzerland). Multidisciplinary Digital Publishing Institute (MDPI). https://doi.org/10.3390/app13137855
Mendeley helps you to discover research relevant for your work.