Libraries and cultural organisations contain a rich amount of digitised historical handwritten material in the form of scanned images. A vast majority of this material has not been transcribed yet, owing to technological challenges and lack of expertise. This renders the task of making these historical collections available for public access challenging, especially in performing a simple text search across the collection. Machine learning based methods for handwritten text recognition are gaining importance these days, which require huge amount of pre-transcribed texts for training the system. However, it is impractical to have access to several thousands of pre-transcribed documents due to adversities transcribers face. Therefore, this paper presents a training-free word spotting algorithm as an alternative for handwritten text transcription, where case studies on Alvin (Swedish repository) and Clavius on the Web are presented. The main focus of this work is on discussing prospects of making materials in the Alvin platform and Clavius on the Web easily searchable using a word spotting based handwritten text recognition system.
CITATION STYLE
Hast, A., Cullhed, P., Vats, E., & Abrate, M. (2019). Making Large Collections of Handwritten Material Easily Accessible and Searchable. In Communications in Computer and Information Science (Vol. 988, pp. 18–28). Springer Verlag. https://doi.org/10.1007/978-3-030-11226-4_2
Mendeley helps you to discover research relevant for your work.