An Efficient Prototype-Based Model for Handwritten Text Recognition with Multi-loss Fusion

Ming Ming Yu; Heng Zhang; Fei Yin; Cheng Lin Liu

Conference Proceedings

An Efficient Prototype-Based Model for Handwritten Text Recognition with Multi-loss Fusion

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2022) 13639 LNCS 404-418

DOI: 10.1007/978-3-031-21648-0_28

1Citations

2Readers

Get full text

Abstract

Prototype learning has achieved good performance in many fields, showing higher flexibility and generalization. In this paper, we propose an efficient text line recognition method based on prototype learning with feature-level sliding windows for classification. In this framework, we combine weakly supervised discrimination and generation loss for learning feature representations with intra-class compactness and inter-class separability. Then, dynamic weighting and pseudo-label filtering are also adopted to reduce the influence of unreliable pseudo-labels and improve training stability significantly. Furthermore, we introduce consistency regularization to obtain more reliable confidence distributions and pseudo-labels. Experimental results on digital and Chinese handwritten text datasets demonstrate the superiority of our method and justify advantages in transfer learning on small-size datasets.

Author supplied keywords

Cite

CITATION STYLE

APA

Yu, M. M., Zhang, H., Yin, F., & Liu, C. L. (2022). An Efficient Prototype-Based Model for Handwritten Text Recognition with Multi-loss Fusion. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 13639 LNCS, pp. 404–418). Springer Science and Business Media Deutschland GmbH. https://doi.org/10.1007/978-3-031-21648-0_28

An Efficient Prototype-Based Model for Handwritten Text Recognition with Multi-loss Fusion

Abstract

Author supplied keywords

Cite

Register to see more suggestions