Handwritten text recognition remains an unsolved problem in machine learning. Nevertheless, the technology has improved considerably over the last decade, in part thanks to advances in recurrent neural networks. Unfortunately, due to their sequential nature, recurrent models cannot be effectively parallelised during training. Meanwhile, in natural language processing, the transformer has recently become the dominant architecture, replacing the recurrent networks that were once popular. These models are far more efficient to train than their predecessors because their primary building block, the self-attention network, processes sequences entirely non-recurrently. This work demonstrates that self-attention networks can replace the recurrent components of state-of-the-art handwriting recognition models and achieve competitive error rates, while significantly reducing both training time and parameter count.
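The non-recurrent computation the abstract refers to can be sketched as a single scaled dot-product self-attention step: every position attends to every other position in one matrix product, so no time-step loop is needed. The function name, shapes, and weight matrices below are illustrative assumptions, not details taken from the paper.

```python
import numpy as np

def self_attention(x, w_q, w_k, w_v):
    """Illustrative scaled dot-product self-attention.

    x: (seq_len, d_model) input sequence.
    w_q, w_k, w_v: (d_model, d_k) learned projection matrices.
    All positions are processed at once; there is no recurrence over time steps.
    """
    q = x @ w_q                                    # queries, (seq_len, d_k)
    k = x @ w_k                                    # keys,    (seq_len, d_k)
    v = x @ w_v                                    # values,  (seq_len, d_k)
    scores = q @ k.T / np.sqrt(k.shape[-1])        # pairwise similarities, (seq_len, seq_len)
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True) # row-wise softmax
    return weights @ v                             # context vectors, (seq_len, d_k)

rng = np.random.default_rng(0)
seq_len, d_model, d_k = 5, 8, 4
x = rng.standard_normal((seq_len, d_model))
w_q, w_k, w_v = (rng.standard_normal((d_model, d_k)) for _ in range(3))
out = self_attention(x, w_q, w_k, w_v)
print(out.shape)  # (5, 4): one context vector per input position
```

Because the attention matrix is computed for all positions simultaneously, the whole sequence can be processed in parallel on a GPU, which is the efficiency advantage over recurrent models highlighted above.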
Citation:
d’Arce, R., Norton, T., Hannuna, S., & Cristianini, N. (2022). Self-attention Networks for Non-recurrent Handwritten Text Recognition. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 13639 LNCS, pp. 389–403). Springer Science and Business Media Deutschland GmbH. https://doi.org/10.1007/978-3-031-21648-0_27