RulingBR: A Summarization Dataset for Legal Texts

Diego de Vargas Feijó; Viviane Pereira Moreira

Conference Proceedings

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2018) 11122 LNAI 255-264

DOI: 10.1007/978-3-319-99722-3_26

Abstract

Text summarization consists of generating a shorter version of an input document that captures its main ideas. Despite recent developments in this area, existing techniques have been tested mostly on English and Chinese, due in part to the low availability of datasets in other languages. In addition, experiments have been run mostly on collections of news articles, which could lead to some bias in the research. In this paper, we address both limitations by creating a dataset for the summarization of legal texts in Portuguese. The dataset, called RulingBR, contains about 10K rulings from the Brazilian Federal Supreme Court. We describe how the dataset was assembled and report the results of standard summarization methods, which may serve as a baseline for future work.

Cite

de Vargas Feijó, D., & Moreira, V. P. (2018). RulingBR: A Summarization Dataset for Legal Texts. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 11122 LNAI, pp. 255–264). Springer Verlag. https://doi.org/10.1007/978-3-319-99722-3_26
