Multiscale fully convolutional network-based approach for multilingual character segmentation

7Citations
Citations of this article
6Readers
Mendeley users who have this article in their library.

Abstract

Character segmentation is a challenging task for optical character recognition systems. Traditional methods usually utilize rule-based algorithms but most of them are not applicable in modern intelligent recognition applications that require high accuracy. It is especially the case for text containing Eastern Asian language characters with complex pictograph structures, such as Chinese. To alleviate this problem, this study proposes an encoder–decoder structure-based multiscale fully convolutional network (MSFCN) model for optical character segmentation. Comparing with other methods, MSFCN can not only effectively extract semantic details from images but also exploit boundary information of intervals between characters, thereby distinguishing characters from a background in pixel level. Extensive experiments have been conducted on two benchmark data sets of ICDAR2013 and MLCS. Obtained results prove that MSFCN achieves state-of-the-art segmentation performance and indicated its practical application value.

Cite

CITATION STYLE

APA

Yu, C., Liu, J., & Li, Y. (2021). Multiscale fully convolutional network-based approach for multilingual character segmentation. IET Computer Vision, 15(6), 449–461. https://doi.org/10.1049/cvi2.12034

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free