NT2Lex: A CEFR-graded lexical resource for Dutch as a foreign language linked to Open Dutch WordNet

Anaïs Tack; Thomas François; Piet Desmet; Cédrick Fairon

Conference Proceedings, Open Access

Proceedings of the 13th Workshop on Innovative Use of NLP for Building Educational Applications, BEA 2018 at the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL-HLT 2018 (2018), pp. 137-146

DOI: 10.18653/v1/w18-0514

Abstract

In this paper, we introduce NT2Lex, a novel lexical resource for Dutch as a foreign language (NT2) which includes frequency distributions of 17,743 words and expressions attested in expert-written textbook texts and readers graded along the scale of the Common European Framework of Reference (CEFR). In essence, the lexicon informs us about what kind of vocabulary should be understood when reading Dutch as a non-native reader at a particular proficiency level. The main novelty of the resource with respect to the previously developed CEFR-graded lexicons concerns the introduction of corpus-based evidence for L2 word sense complexity through the linkage to Open Dutch WordNet (Postma et al., 2016). The resource thus contains, on top of the lemmatised and part-of-speech tagged lexical entries, a total of 11,999 unique word senses and 8,934 distinct synsets.

Cite

Tack, A., François, T., Desmet, P., & Fairon, C. (2018). NT2Lex: A CEFR-graded lexical resource for Dutch as a foreign language linked to Open Dutch WordNet. In Proceedings of the 13th Workshop on Innovative Use of NLP for Building Educational Applications, BEA 2018 at the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL-HLT 2018 (pp. 137–146). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/w18-0514
