Learning Automated Essay Scoring Models Using Item-Response-Theory-Based Scores to Decrease Effects of Rater Biases

Abstract

In automated essay scoring (AES), scores are assigned to essays automatically as an alternative to grading by humans. Traditional AES typically relies on handcrafted features, whereas recent studies have proposed AES models based on deep neural networks that obviate the need for feature engineering. Such AES models generally require training on a large dataset of graded essays. However, the grades in such a training dataset are known to be biased by rater characteristics when each essay is scored by only a few raters drawn from a larger rater pool, and the performance of AES models drops when such biased data are used for training. Researchers in educational and psychological measurement have recently proposed item response theory (IRT) models that estimate essay scores while accounting for the effects of rater bias. This study therefore proposes a new method that trains AES models on IRT-based scores to mitigate the effects of rater bias in the training data.
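
For context, the rater-effect IRT models referenced here typically extend the many-facet Rasch model with rater-specific parameters. The following is a hedged sketch of one such formulation, a generalized many-facet Rasch model of the kind used in related work by the same research group; the notation is illustrative and not quoted from this article.

    % Probability that rater r assigns score category k (of K categories) to essay n,
    % given latent essay score \theta_n, rater consistency \alpha_r, rater severity \beta_r,
    % and rater-specific step parameters d_{rm} (all symbol names illustrative).
    P(x_{nr} = k \mid \theta_n) =
        \frac{\exp \sum_{m=1}^{k} \alpha_r (\theta_n - \beta_r - d_{rm})}
             {\sum_{l=1}^{K} \exp \sum_{m=1}^{l} \alpha_r (\theta_n - \beta_r - d_{rm})}

Under a model of this form, the estimated latent score \hat{\theta}_n acts as a bias-corrected grade for essay n; as the abstract describes, the AES model is then trained to predict these IRT-based scores rather than the raw rater-assigned grades.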

Citation (APA)

Uto, M., & Okano, M. (2021). Learning Automated Essay Scoring Models Using Item-Response-Theory-Based Scores to Decrease Effects of Rater Biases. IEEE Transactions on Learning Technologies, 14(6), 763–776. https://doi.org/10.1109/TLT.2022.3145352
