ELQA: A Corpus of Metalinguistic Questions and Answers about English

Abstract

We present ELQA, a corpus of questions and answers in and about the English language. Collected from two online forums, the >70k questions (from English learners and others) cover wide-ranging topics including grammar, meaning, fluency, and etymology. The answers include descriptions of general properties of English vocabulary and grammar as well as explanations about specific (correct and incorrect) usage examples. Unlike most NLP datasets, this corpus is metalinguistic: it consists of language about language. As such, it can facilitate investigations of the metalinguistic capabilities of NLU models, as well as educational applications in the language learning domain. To study this, we define a free-form question answering task on our dataset and conduct evaluations on multiple LLMs (Large Language Models) to analyze their capacity to generate metalinguistic answers.

Cite

Behzad, S., Sakaguchi, K., Schneider, N., & Zeldes, A. (2023). ELQA: A Corpus of Metalinguistic Questions and Answers about English. In Proceedings of the Annual Meeting of the Association for Computational Linguistics (Vol. 1, pp. 2031–2047). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2023.acl-long.113
