Prototype learning has achieved good performance in many fields, showing higher flexibility and generalization. In this paper, we propose an efficient text line recognition method based on prototype learning with feature-level sliding windows for classification. In this framework, we combine weakly supervised discrimination and generation loss for learning feature representations with intra-class compactness and inter-class separability. Then, dynamic weighting and pseudo-label filtering are also adopted to reduce the influence of unreliable pseudo-labels and improve training stability significantly. Furthermore, we introduce consistency regularization to obtain more reliable confidence distributions and pseudo-labels. Experimental results on digital and Chinese handwritten text datasets demonstrate the superiority of our method and justify advantages in transfer learning on small-size datasets.
CITATION STYLE
Yu, M. M., Zhang, H., Yin, F., & Liu, C. L. (2022). An Efficient Prototype-Based Model for Handwritten Text Recognition with Multi-loss Fusion. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 13639 LNCS, pp. 404–418). Springer Science and Business Media Deutschland GmbH. https://doi.org/10.1007/978-3-031-21648-0_28
Mendeley helps you to discover research relevant for your work.