Applying latent Dirichlet allocation to automatic essay grading

Tuomo Kakkonen; Niko Myller; Erkki Sutinen

Conference Proceedings

Applying latent Dirichlet allocation to automatic essay grading

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2006) 4139 LNAI 110-120

DOI: 10.1007/11816508_13

26Citations

34Readers

Get full text

Abstract

We report experiments on automatic essay grading using Latent Dirichlet Allocation (LDA). LDA is a "bag-of-words" type of language modeling and dimension reduction method, reported to out-perform other related methods, Latent Semantic Analysis (LSA) and Probabilistic Latent Semantic Analysis (PLSA) in Information Retrieval (IR) domain. We introduce LDA in detail and compare its strengths and weaknesses to LSA and PLSA. We also compare empirically the performance of LDA to LSA and PLSA. The experiments were run with three essay sets consisting in total of 283 essays from different domains. On contrary to the findings in IR, LDA achieved slightly worse results compared to LSA and PLSA in the experiments. We state the reasons for LSA and PLSA outperforming LDA and indicate further research directions. © Springer-Verlag Berlin Heidelberg 2006.

Cite

CITATION STYLE

APA

Kakkonen, T., Myller, N., & Sutinen, E. (2006). Applying latent Dirichlet allocation to automatic essay grading. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 4139 LNAI, pp. 110–120). Springer Verlag. https://doi.org/10.1007/11816508_13

Applying latent Dirichlet allocation to automatic essay grading

Abstract

Cite

Register to see more suggestions