Design and annotation of the first Italian corpus for text simplification

Abstract

In this paper, we present the design and construction of the first Italian corpus for automatic and semi-automatic text simplification. In line with current approaches, we propose a new annotation scheme specifically conceived to identify the types of changes an original sentence undergoes when it is manually simplified. The scheme was applied to two aligned Italian corpora, each containing original texts with their corresponding simplified versions, selected as representative of two different manual simplification strategies and addressing different target reader populations. Each corpus was annotated with the operations defined in the annotation scheme, covering different levels of linguistic description. The annotation results were analysed with the aim of capturing the peculiarities of, and differences between, the simplification strategies pursued in the two corpora.

Citation (APA)

Brunato, D., Dell’Orletta, F., Venturi, G., & Montemagni, S. (2015). Design and annotation of the first Italian corpus for text simplification. In LAW 2015 - 9th Linguistic Annotation Workshop, held in conjunction with NAACL 2015 - Proceedings of the Workshop (pp. 31–41). Association for Computational Linguistics (ACL). https://doi.org/10.3115/v1/w15-1604
