Organizing Portuguese Legal Documents through Topic Discovery

Daniela Vianna; Edleno Silva De Moura

Conference ProceedingsOPEN ACCESS

Organizing Portuguese Legal Documents through Topic Discovery

SIGIR 2022 - Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval (2022) 3388-3392

DOI: 10.1145/3477495.3536329

4Citations

18Readers

Get full text

Abstract

A significant challenge in the legal domain is to organize and summarize a constantly growing collection of legal documents, uncovering hidden topics, or themes, that later can support tasks such as legal case retrieval and legal judgment prediction. This massive amount of digital legal documents, combined with the inherent complexity of judiciary systems worldwide, presents a promising scenario for Machine Learning solutions, mainly those taking advantage of all the advancements in the area of Natural Language Processing (NLP). It is in this scenario that Jusbrasil, the largest legal tech company in Brazil, is situated. Using a dataset partially curated by the Jusbrasil legal team, we explore topic modeling solutions using state of the art language models, trained with legal Portuguese documents, to automatically organize and summarize this complex collection of documents. Instead of using an entire legal case, which usually is composed of many pages, we show that it is possible to efficiently organize the collection using the syllabus (in Portuguese, ementa jurisprudencial) from each court decision as they concisely summarize the main points presented by the entire decision.

Author supplied keywords

Cite

CITATION STYLE

APA

Vianna, D., & Silva De Moura, E. (2022). Organizing Portuguese Legal Documents through Topic Discovery. In SIGIR 2022 - Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval (pp. 3388–3392). Association for Computing Machinery, Inc. https://doi.org/10.1145/3477495.3536329

Organizing Portuguese Legal Documents through Topic Discovery

Abstract

Author supplied keywords

Cite

Register to see more suggestions