Making Large Collections of Handwritten Material Easily Accessible and Searchable

Anders Hast; Per Cullhed; Ekta Vats; Matteo Abrate

Conference Proceedings

Making Large Collections of Handwritten Material Easily Accessible and Searchable

Communications in Computer and Information Science (2019) 988 18-28

DOI: 10.1007/978-3-030-11226-4_2

1Citations

3Readers

Get full text

Abstract

Libraries and cultural organisations contain a rich amount of digitised historical handwritten material in the form of scanned images. A vast majority of this material has not been transcribed yet, owing to technological challenges and lack of expertise. This renders the task of making these historical collections available for public access challenging, especially in performing a simple text search across the collection. Machine learning based methods for handwritten text recognition are gaining importance these days, which require huge amount of pre-transcribed texts for training the system. However, it is impractical to have access to several thousands of pre-transcribed documents due to adversities transcribers face. Therefore, this paper presents a training-free word spotting algorithm as an alternative for handwritten text transcription, where case studies on Alvin (Swedish repository) and Clavius on the Web are presented. The main focus of this work is on discussing prospects of making materials in the Alvin platform and Clavius on the Web easily searchable using a word spotting based handwritten text recognition system.

Author supplied keywords

Cite

CITATION STYLE

APA

Hast, A., Cullhed, P., Vats, E., & Abrate, M. (2019). Making Large Collections of Handwritten Material Easily Accessible and Searchable. In Communications in Computer and Information Science (Vol. 988, pp. 18–28). Springer Verlag. https://doi.org/10.1007/978-3-030-11226-4_2

Making Large Collections of Handwritten Material Easily Accessible and Searchable

Abstract

Author supplied keywords

Cite

Register to see more suggestions