The Effects of Character-Level Data Augmentation on Style-Based Dating of Historical Manuscripts

Abstract

Identifying the production dates of historical manuscripts is one of the main goals of paleographers studying ancient documents. Automated methods can provide paleographers with objective tools to estimate dates more accurately. Previously, statistical features have been used to date digitized historical manuscripts, based on the hypothesis that handwriting styles change over time. However, the scarcity of such documents makes it difficult to build robust systems. This article therefore explores the influence of data augmentation on the dating of historical manuscripts. Linear Support Vector Machines were trained with k-fold cross-validation on textural and grapheme-based features extracted from historical manuscripts of different collections, including the Medieval Paleographical Scale, early Aramaic manuscripts, and the Dead Sea Scrolls. Results show that training models with augmented data improves the performance of historical manuscript dating by 1%-3% in cumulative scores. The results also point to further possible gains from models tailored to specific features and to the documents' scripts.
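The abstract reports its gains in cumulative scores without defining the metric. A minimal sketch, assuming the definition commonly used in manuscript dating (the percentage of test documents whose absolute dating error is at most alpha years), could look like the following. The function name, the year values, and the alpha thresholds are hypothetical and chosen only for illustration; they are not taken from the paper.

```python
from typing import Sequence


def cumulative_score(true_years: Sequence[float],
                     predicted_years: Sequence[float],
                     alpha: float) -> float:
    """Percentage of documents dated within +/- alpha years of the true date.

    Assumed definition; the abstract does not spell the metric out.
    """
    if len(true_years) != len(predicted_years):
        raise ValueError("inputs must have the same length")
    if len(true_years) == 0:
        raise ValueError("inputs must not be empty")
    # Count predictions whose absolute error does not exceed alpha.
    hits = sum(1 for t, p in zip(true_years, predicted_years)
               if abs(t - p) <= alpha)
    return 100.0 * hits / len(true_years)


# Hypothetical key years and predictions, for illustration only.
true_years = [1300, 1325, 1350, 1375, 1400]
predictions = [1300, 1350, 1400, 1375, 1325]

for alpha in (0, 25, 50):
    score = cumulative_score(true_years, predictions, alpha)
    print(f"CS(alpha={alpha}) = {score:.1f}%")
```

Under this definition, a 1%-3% improvement means that a correspondingly larger share of test documents falls inside the chosen error window once augmented data is used for training.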

Cite

Koopmans, L., Dhali, M. A., & Schomaker, L. (2023). The Effects of Character-Level Data Augmentation on Style-Based Dating of Historical Manuscripts. In International Conference on Pattern Recognition Applications and Methods (Vol. 1, pp. 124–135). Science and Technology Publications, Lda. https://doi.org/10.5220/0011699500003411
