Ancient printed documents are an immense source of knowledge, but their digital use is often complicated by the age and quality of the print. The maps of the Linguistic Atlas of France (ALF) consist of printed phonetic words that show how words were pronounced across the country. These words were printed in the Rousselot-Gilliéron alphabet (an extension of the Latin alphabet), whose large number of diacritics makes character recognition difficult. In this paper, we propose a phonetic character recognition process based on space-filling curves. The method is original, adapted to this particular data set, and classifies noisy and highly specific characters with more than 70% accuracy.
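To illustrate the general idea of a space-filling curve, the following minimal sketch reads a square glyph image along a Hilbert curve, so that pixels close together in the image tend to stay close together in the resulting one-dimensional sequence. The abstract does not say which curve, image size, or classifier the authors use; the choice of the Hilbert curve and the names `hilbert_d2xy` and `linearize` are assumptions made here for illustration only.

```python
def hilbert_d2xy(order, d):
    """Map index d along a Hilbert curve to (x, y) on a 2**order grid."""
    n = 1 << order
    x = y = 0
    t = d
    s = 1
    while s < n:
        rx = 1 & (t // 2)
        ry = 1 & (t ^ rx)
        # Rotate or flip the quadrant so the sub-curves connect end to end.
        if ry == 0:
            if rx == 1:
                x = s - 1 - x
                y = s - 1 - y
            x, y = y, x
        x += s * rx
        y += s * ry
        t //= 4
        s *= 2
    return x, y


def linearize(image):
    """Read a square image (list of rows) along a Hilbert curve.

    Assumes the side length is a power of two; a real glyph would
    first be padded or resized to such a size (hypothetical step).
    """
    n = len(image)
    if n == 0 or n & (n - 1) or any(len(row) != n for row in image):
        raise ValueError("image must be square with a power-of-two side")
    order = n.bit_length() - 1
    sequence = []
    for d in range(n * n):
        x, y = hilbert_d2xy(order, d)
        sequence.append(image[y][x])
    return sequence


if __name__ == "__main__":
    # A toy 4x4 binary "glyph": 1 marks ink, 0 marks background.
    glyph = [
        [0, 1, 1, 0],
        [0, 1, 0, 0],
        [0, 1, 0, 0],
        [0, 1, 1, 0],
    ]
    print(linearize(glyph))
```

The resulting sequence could then be passed to any one-dimensional feature extractor or classifier; which one suits the ALF characters is not stated in the abstract.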
Owczarek, V., Drapeau, J., Burie, J. C., Franco, P., Coustaty, M., Mullot, R., & Eglin, V. (2020). Classification of phonetic characters by space-filling curves. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 12116 LNCS, pp. 89–100). Springer. https://doi.org/10.1007/978-3-030-57058-3_7