Faster teaching by POMDP planning

Anna N. Rafferty; Emma Brunskill; Thomas L. Griffiths; Patrick Shafto

Conference Proceedings

Faster teaching by POMDP planning

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2011) 6738 LNAI 280-287

DOI: 10.1007/978-3-642-21869-9_37

54Citations

56Readers

Get full text

Abstract

Both human and automated tutors must infer what a student knows and plan future actions to maximize learning. Though substantial research has been done on tracking and modeling student learning, there has been significantly less attention on planning teaching actions and how the assumed student model impacts the resulting plans. We frame the problem of optimally selecting teaching actions using a decision-theoretic approach and show how to formulate teaching as a partially-observable Markov decision process (POMDP) planning problem. We consider three models of student learning and present approximate methods for finding optimal teaching actions given the large state and action spaces that arise in teaching. An experimental evaluation of the resulting policies on a simple concept-learning task shows that framing teacher action planning as a POMDP can accelerate learning relative to baseline performance. © 2011 Springer-Verlag Berlin Heidelberg.

Cite

CITATION STYLE

APA

Rafferty, A. N., Brunskill, E., Griffiths, T. L., & Shafto, P. (2011). Faster teaching by POMDP planning. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 6738 LNAI, pp. 280–287). https://doi.org/10.1007/978-3-642-21869-9_37

Faster teaching by POMDP planning

Abstract

Cite

Register to see more suggestions