An Efficient Prototype-Based Model for Handwritten Text Recognition with Multi-loss Fusion

1Citations
Citations of this article
2Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Prototype learning has achieved good performance in many fields, showing higher flexibility and generalization. In this paper, we propose an efficient text line recognition method based on prototype learning with feature-level sliding windows for classification. In this framework, we combine weakly supervised discrimination and generation loss for learning feature representations with intra-class compactness and inter-class separability. Then, dynamic weighting and pseudo-label filtering are also adopted to reduce the influence of unreliable pseudo-labels and improve training stability significantly. Furthermore, we introduce consistency regularization to obtain more reliable confidence distributions and pseudo-labels. Experimental results on digital and Chinese handwritten text datasets demonstrate the superiority of our method and justify advantages in transfer learning on small-size datasets.

Cite

CITATION STYLE

APA

Yu, M. M., Zhang, H., Yin, F., & Liu, C. L. (2022). An Efficient Prototype-Based Model for Handwritten Text Recognition with Multi-loss Fusion. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 13639 LNCS, pp. 404–418). Springer Science and Business Media Deutschland GmbH. https://doi.org/10.1007/978-3-031-21648-0_28

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free