Scalable initial state interdiction for factored MDPs

Swetasudha Panda; Yevgeniy Vorobeychik

Conference Proceedings

Scalable initial state interdiction for factored MDPs

IJCAI International Joint Conference on Artificial Intelligence (2018) 2018-July 4801-4807

DOI: 10.24963/ijcai.2018/667

1Citations

6Readers

Get full text

Abstract

We propose a novel Stackelberg game model of MDP interdiction in which the defender modifies the initial state of the planner, who then responds by computing an optimal policy starting with that state. We first develop a novel approach for MDP interdiction in factored state space that allows the defender to modify the initial state. The resulting approach can be computationally expensive for large factored MDPs. To address this, we develop several interdiction algorithms that leverage variations of reinforcement learning using both linear and non-linear function approximation. Finally, we extend the interdiction framework to consider a Bayesian interdiction problem in which the inter-dictor is uncertain about some of the planner's initial state features. Extensive experiments demonstrate the effectiveness of our approaches.

Cite

CITATION STYLE

APA

Panda, S., & Vorobeychik, Y. (2018). Scalable initial state interdiction for factored MDPs. In IJCAI International Joint Conference on Artificial Intelligence (Vol. 2018-July, pp. 4801–4807). International Joint Conferences on Artificial Intelligence. https://doi.org/10.24963/ijcai.2018/667

Scalable initial state interdiction for factored MDPs

Abstract

Cite

Register to see more suggestions