Abstract
Character segmentation is a challenging task for optical character recognition systems. Traditional methods usually utilize rule-based algorithms but most of them are not applicable in modern intelligent recognition applications that require high accuracy. It is especially the case for text containing Eastern Asian language characters with complex pictograph structures, such as Chinese. To alleviate this problem, this study proposes an encoder–decoder structure-based multiscale fully convolutional network (MSFCN) model for optical character segmentation. Comparing with other methods, MSFCN can not only effectively extract semantic details from images but also exploit boundary information of intervals between characters, thereby distinguishing characters from a background in pixel level. Extensive experiments have been conducted on two benchmark data sets of ICDAR2013 and MLCS. Obtained results prove that MSFCN achieves state-of-the-art segmentation performance and indicated its practical application value.
Cite
CITATION STYLE
Yu, C., Liu, J., & Li, Y. (2021). Multiscale fully convolutional network-based approach for multilingual character segmentation. IET Computer Vision, 15(6), 449–461. https://doi.org/10.1049/cvi2.12034
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.