We report experiments on automatic essay grading using Latent Dirichlet Allocation (LDA). LDA is a "bag-of-words" language modeling and dimensionality reduction method that has been reported to outperform two related methods, Latent Semantic Analysis (LSA) and Probabilistic Latent Semantic Analysis (PLSA), in the Information Retrieval (IR) domain. We introduce LDA in detail and compare its strengths and weaknesses with those of LSA and PLSA. We also compare the performance of LDA empirically with that of LSA and PLSA. The experiments were run on three essay sets comprising a total of 283 essays from different domains. Contrary to the findings in IR, LDA achieved slightly worse results than LSA and PLSA in these experiments. We give reasons why LSA and PLSA outperformed LDA and indicate directions for further research. © Springer-Verlag Berlin Heidelberg 2006.
Kakkonen, T., Myller, N., & Sutinen, E. (2006). Applying latent Dirichlet allocation to automatic essay grading. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 4139 LNAI, pp. 110–120). Springer Verlag. https://doi.org/10.1007/11816508_13