A data-driven methodology to assess text complexity based on syntactic and semantic measurements

Abstract

In this paper we propose a data-driven methodology to assess the text complexity of Spanish school texts. We model the problem as a classification task that can be solved using machine learning techniques. We show empirically that the discriminative power of the classifier depends on the school grade level. Our proposal includes multiple predictors that capture different dimensions of text complexity, such as coherence and cohesion. We provide an importance analysis of the predictors across several complexity levels. Finally, we assess model performance using accuracy and correlation measurements. The proposed model achieves accuracies of 0.7.
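
The evaluation step mentioned in the abstract (accuracy and correlation between predicted and actual complexity levels) can be illustrated with a minimal sketch. Everything below is an assumption made for illustration: the abstract does not say which correlation coefficient is used (Pearson is shown here), the integer grade-level encoding is hypothetical, and the sample labels are made up rather than taken from the paper.

```python
from math import sqrt


def accuracy(y_true, y_pred):
    """Fraction of texts whose predicted level equals the true level."""
    if len(y_true) != len(y_pred) or not y_true:
        raise ValueError("inputs must be non-empty and of equal length")
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


def pearson(xs, ys):
    """Pearson correlation between two equal-length numeric sequences."""
    if len(xs) != len(ys) or not xs:
        raise ValueError("inputs must be non-empty and of equal length")
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for constant input")
    return cov / sqrt(var_x * var_y)


# Hypothetical labels: complexity levels encoded as integers 1..5.
# These values are illustrative only and are NOT results from the paper.
true_levels = [1, 1, 2, 2, 3, 3, 4, 4, 5, 5]
pred_levels = [1, 2, 2, 2, 3, 3, 4, 4, 5, 4]

acc = accuracy(true_levels, pred_levels)
corr = pearson(true_levels, pred_levels)
print(f"accuracy = {acc:.2f}, Pearson r = {corr:.2f}")
```

Reporting both numbers is useful when the classes are ordered: accuracy counts only exact matches, while correlation also rewards predictions that land close to the true level.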

Cite

Palma, D., Soto, C., Veliz, M., Riffo, B., & Gutiérrez, A. (2020). A data-driven methodology to assess text complexity based on syntactic and semantic measurements. In Advances in Intelligent Systems and Computing (Vol. 1018, pp. 509–515). Springer Verlag. https://doi.org/10.1007/978-3-030-25629-6_79
