Extracting Linguistic Signal From Item Text and Its Application to Modeling Item Characteristics

Abstract

This chapter discusses the evolution of natural language processing (NLP) approaches to text representation and how different ways of representing text can be utilized for a relatively understudied task in educational assessment – that of predicting item characteristics from item text. The first part of the chapter gives an introductory overview of the transition from hypothesis-driven linguistic features to non-contextualized and then contextualized embeddings. This overview is intended for assessment professionals who do not have a background in NLP. The second part demonstrates how these approaches could be applied to predicting item difficulty, response time, and item biserial for a set of clinical multiple-choice questions from a high-stakes licensing exam. These items are written to a common reading level so that they differ only in the difficulty of the construct they measure (i.e., clinical knowledge). The chapter concludes by discussing practical considerations for developing such models (e.g., the role of training data), as well as the implications of model interpretability and the use of the predictions in the context of high-stakes assessment.

Yaneva, V., Baldwin, P., Ha, L. A., & Runyon, C. (2023). Extracting Linguistic Signal From Item Text and Its Application to Modeling Item Characteristics. In Advancing Natural Language Processing in Educational Assessment (pp. 167–182). Taylor and Francis. https://doi.org/10.4324/9781003278658-14
