Automatic Text Difficulty Estimation Using Embeddings and Neural Networks

Anna Filighera; Tim Steuer; Christoph Rensing

Conference Proceedings

Automatic Text Difficulty Estimation Using Embeddings and Neural Networks

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2019) 11722 LNCS 335-348

DOI: 10.1007/978-3-030-29736-7_25

28Citations

19Readers

Get full text

Abstract

Text difficulty, also called reading difficulty, refers to the complexity of texts on a language level. For many educational applications, such as learning resource recommendation systems, the text difficulty of text is highly relevant information. However, manual annotation of text difficulty is very expensive and not feasible for large collections of texts. For this reason, many approaches to automatic text difficulty estimation have been proposed in the past. All text difficulty estimation models published thus far have one thing in common: they rely on manually engineered feature sets. This is problematic as features are tailored to a specific type of text and do not generalize well to other types and languages. To alleviate this problem we propose a novel approach using neural networks and embeddings to the task of text difficulty classification. Our approach distinguishes between 5 reading levels which correspond to non-overlapping age groups ranging from ages 7 to 16. It performs comparably to existing state-of-the-art approaches in terms of accuracy and Pearson correlation coefficient while being easier and cheaper to adapt to new types of text.

Author supplied keywords

Cite

CITATION STYLE

APA

Filighera, A., Steuer, T., & Rensing, C. (2019). Automatic Text Difficulty Estimation Using Embeddings and Neural Networks. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 11722 LNCS, pp. 335–348). Springer Verlag. https://doi.org/10.1007/978-3-030-29736-7_25

Automatic Text Difficulty Estimation Using Embeddings and Neural Networks

Abstract

Author supplied keywords

Cite

Register to see more suggestions