Enriching Digitized Medieval Manuscripts: Linking Image, Text and Lexical Knowledge

0Citations
Citations of this article
67Readers
Mendeley users who have this article in their library.

Abstract

This paper describes an on-going project of transcribing and annotating digitized manuscripts of medieval Spanish with paleographic and lexical information. We link lexical units from the manuscripts with the Multilingual Central Repository (MCR), making terms retrievable by any of the languages that integrate MCR. The goal of the project is twofold: creating a paleographic knowledge base from digitized medieval facsimiles, that will allow paleographers, philologist, historical linguist, and humanities scholars in general, to analyze and retrieve graphemic, lexical and textual information from historical documents; and on the other hand, developing machine readable documents that will link image representations of graphemic and lexical units in a facsimile with Linked Open Data resources. This paper concentrates on the encoding and cross-linking procedures. c 2015 Association for Computational Linguistics and The Asian Federation of Natural Language Processing.

Cite

CITATION STYLE

APA

Álvarez, A. A. (2015). Enriching Digitized Medieval Manuscripts: Linking Image, Text and Lexical Knowledge. In Proceedings of the Annual Meeting of the Association for Computational Linguistics (Vol. 2015-text, pp. 73–77). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/w15-3711

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free