RulingBR: A Summarization Dataset for Legal Texts

Abstract

Text summarization consists of generating a shorter version of an input document that captures its main ideas. Despite recent developments in this area, most existing techniques have been tested mainly on English and Chinese, due in part to the scarcity of datasets in other languages. In addition, experiments have been run mostly on collections of news articles, which could bias the research. In this paper, we address both limitations by creating a dataset for the summarization of legal texts in Portuguese. The dataset, called RulingBR, contains about 10K rulings from the Brazilian Federal Supreme Court. We describe how the dataset was assembled and report the results of standard summarization methods, which may serve as baselines for future work.

de Vargas Feijó, D., & Moreira, V. P. (2018). RulingBR: A Summarization Dataset for Legal Texts. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 11122 LNAI, pp. 255–264). Springer Verlag. https://doi.org/10.1007/978-3-319-99722-3_26
