Legal NERC with ontologies, Wikipedia and curriculum learning

23Citations
Citations of this article
88Readers
Mendeley users who have this article in their library.

Abstract

In this paper, we present a Wikipediabased approach to develop resources for the legal domain. We establish a mapping between a legal domain ontology, LKIF (Hoekstra et al., 2007), and a Wikipediabased ontology, YAGO (Suchanek et al., 2007), and through that we populate LKIF. Moreover, we use the mentions of those entities inWikipedia text to train a specific Named Entity Recognizer and Classifier. We find that this classifier works well in the Wikipedia, but, as could be expected, performance decreases in a corpus of judgments of the European Court of Human Rights. However, this tool will be used as a preprocess for human annotation. We resort to a technique called curriculum learning aimed to overcome problems of overfitting by learning increasingly more complex concepts. However, we find that in this particular setting, the method works best by learning from most specific to most general concepts, not the other way round.

Cite

CITATION STYLE

APA

Cardellino, C., Teruel, M., Alemany, L. A., & Villata, S. (2017). Legal NERC with ontologies, Wikipedia and curriculum learning. In 15th Conference of the European Chapter of the Association for Computational Linguistics, EACL 2017 - Proceedings of Conference (Vol. 2, pp. 254–259). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/e17-2041

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free