Spoken text difficulty estimation using linguistic features

11Citations
Citations of this article
78Readers
Mendeley users who have this article in their library.

Abstract

We present an automated method for estimating the difficulty of spoken texts for use in generating items that assess non-native learners' listening proficiency. We collected information on the perceived difficulty of listening to various English monologue speech samples using a Likert-scale questionnaire distributed to 15 non-native English learners. We averaged the overall rating provided by three nonnative learners at different proficiency levels into an overall score of listenability. We then trained a multiple linear regression model with the listenability score as the dependent variable and features from both natural language and speech processing as the independent variables. Our method demonstrated a correlation of 0.76 with the listenability score, comparable to the agreement between the nonnative learners' ratings and the listenability score.

Cite

CITATION STYLE

APA

Yoon, S. Y., Cho, Y., & Napolitano, D. (2016). Spoken text difficulty estimation using linguistic features. In Proceedings of the 11th Workshop on Innovative Use of NLP for Building Educational Applications, BEA 2016 at the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL-HLT 2016 (pp. 267–276). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/w16-0531

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free