This paper compares the accuracy of several variations of the BLEU algorithm when applied to automatically evaluating student essays. The different configurations include closed-class word removal, stemming, two baseline word-sense disambiguation procedures, and translating the texts into a simple semantic representation. We also prove empirically that the accuracy is kept when the student answers are translated automatically. Although none of the representations clearly outperform the others, some conclusions are drawn from the results.
CITATION STYLE
Alfonseca, E., & Pérez, D. (2004). Automatic assessment of open ended questions with a BLEU-inspired algorithm and shallow NLP. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 3230, pp. 25–35). Springer Verlag. https://doi.org/10.1007/978-3-540-30228-5_3
Mendeley helps you to discover research relevant for your work.