In this paper we employ a novel approach to advancing our understanding of the development of writing in English and German children across school grades using classification tasks. The data used come from two recently compiled corpora: The English data come from the the GiC corpus (983 school children in second-, sixth-, ninth- and eleventh-grade) and the German data are from the FD-LEX corpus (930 school children in fifth- and ninthgrade). The key to this paper is the combined use of what we refer to as 'complexity contours', i.e. series of measurements that capture the progression of linguistic complexity within a text, and Recurrent Neural Network (RNN) classifiers that adequately capture the sequential information in those contours. Our experiments demonstrate that RNN classifiers trained on complexity contours achieve higher classification accuracy than one trained on text-average complexity scores. In a second step, we determine the relative importance of the features from four distinct categories through a Sensitivity-Based Pruning approach.
CITATION STYLE
Kerz, E., Qiao, Y., Wiechmann, D., & Ströbel, M. (2020). Becoming linguistically mature: Modeling English and German children’s writing development across school grades. In Proceedings of the Annual Meeting of the Association for Computational Linguistics (pp. 65–74). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2020.bea-1.6
Mendeley helps you to discover research relevant for your work.