Abstract
We present an automated method for estimating the difficulty of spoken texts for use in generating items that assess non-native learners' listening proficiency. We collected information on the perceived difficulty of listening to various English monologue speech samples using a Likert-scale questionnaire distributed to 15 non-native English learners. We averaged the overall rating provided by three nonnative learners at different proficiency levels into an overall score of listenability. We then trained a multiple linear regression model with the listenability score as the dependent variable and features from both natural language and speech processing as the independent variables. Our method demonstrated a correlation of 0.76 with the listenability score, comparable to the agreement between the nonnative learners' ratings and the listenability score.
Cite
CITATION STYLE
Yoon, S. Y., Cho, Y., & Napolitano, D. (2016). Spoken text difficulty estimation using linguistic features. In Proceedings of the 11th Workshop on Innovative Use of NLP for Building Educational Applications, BEA 2016 at the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL-HLT 2016 (pp. 267–276). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/w16-0531
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.