Combining WordNet and Word Embeddings in Data Augmentation for Legal Texts

4Citations
Citations of this article
19Readers
Mendeley users who have this article in their library.

Abstract

Creating balanced labeled textual corpora for complex tasks, like legal analysis, is a challenging and expensive process that often requires the collaboration of domain experts. To address this problem, we propose a data augmentation method based on the combination of GloVe word embeddings and the WordNet ontology. We present an example of application in the legal domain, specifically on decisions of the Court of Justice of the European Union. Our evaluation with human experts confirms that our method is more robust than the alternatives.

Cite

CITATION STYLE

APA

Perçin, S., Galassi, A., Lagioia, B. F., Ruggeri, F., Santin, P., Sartor, G., & Torroni, P. (2022). Combining WordNet and Word Embeddings in Data Augmentation for Legal Texts. In NLLP 2022 - Natural Legal Language Processing Workshop 2022, Proceedings of the Workshop (pp. 47–52). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2022.nllp-1.4

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free