We present a low-resource, language-independent system for text difficulty assessment. We replicate and improve upon a baseline by Shen et al. (2013) on the Interagency Language Roundtable (ILR) scale. Our work demonstrates that the addition of morphological, information theoretic, and language modeling features to a traditional readability baseline greatly benefits our performance. We use the Margin-Infused Relaxed Algorithm and Support Vector Machines for experiments on Arabic, Dari, English, and Pashto, and provide a detailed analysis of our results.
CITATION STYLE
Salesky, E., & Shen, W. (2014). Exploiting Morphological, Grammatical, and Semantic Correlates for Improved Text Difficulty Assessment. In Proceedings of the Annual Meeting of the Association for Computational Linguistics (pp. 155–162). Association for Computational Linguistics (ACL). https://doi.org/10.3115/v1/w14-1819
Mendeley helps you to discover research relevant for your work.