Harmonizing different lemmatization strategies for building a knowledge base of linguistic resources for Latin

Abstract

The interoperability between lemmatized corpora of Latin and other resources that use the lemma as an indexing key is hampered by the multiple lemmatization strategies that different projects adopt. In this paper we discuss how we tackle the challenges raised by harmonizing different lemmatization criteria in a project that aims to connect linguistic resources for Latin using the Linked Data paradigm. The paper introduces the architecture supporting an open-ended, lemma-based Knowledge Base, built to make textual and lexical resources for Latin interoperable. In particular, the paper describes how three components are included in the Knowledge Base: its lexical basis, a word formation lexicon, and a lemmatized and syntactically annotated corpus.

Citation

Mambrini, F., & Passarotti, M. (2019). Harmonizing different lemmatization strategies for building a knowledge base of linguistic resources for Latin. In LAW 2019 - 13th Linguistic Annotation Workshop, Proceedings of the Workshop (pp. 71–80). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/w19-4009
