PaCCSS-IT: A parallel corpus of complex-simple sentences for automatic text simplification

Abstract

In this paper we present PaCCSS-IT, a Parallel Corpus of Complex-Simple Sentences for ITalian. To build the resource, we develop a new method for automatically acquiring a corpus of complex-simple sentence pairs that captures structural transformations and is particularly suitable for text simplification. The method requires a large amount of text, which can be easily extracted from the web, making it suitable for less-resourced languages as well. We test it on Italian and make available the largest Italian corpus for automatic text simplification.

Cite

Brunato, D., Cimino, A., Dell’Orletta, F., & Venturi, G. (2016). PaCCSS-IT: A parallel corpus of complex-simple sentences for automatic text simplification. In EMNLP 2016 - Conference on Empirical Methods in Natural Language Processing, Proceedings (pp. 351–361). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/d16-1034
