Boundary extension features for width-based planning with simulators on continuous-state domains

Abstract

Width-based planning algorithms have been shown to be competitive with state-of-the-art heuristic search and SAT-based approaches without requiring a model of action effects and preconditions; they need only access to a black-box simulator. Search in width-based planners is guided by a measure of state novelty, which requires observations of simulator states to be given as a set of features. This paper proposes agnostic feature mapping mechanisms that define the features online, as exploration progresses and the domain of continuous state variables is revealed. We demonstrate the effectiveness of these features on the OpenAI Gym “classical control” suite of benchmarks. We compare our online planners with state-of-the-art deep reinforcement learning algorithms and show that width-based planners using our features find policies of the same quality with significantly fewer computational resources.
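
To make the idea of features defined online more concrete, the following is a minimal Python sketch of a width-1 novelty test in which a state counts as novel when one of its continuous variables falls outside the range observed so far. This is only one possible reading of "boundary extension", written under stated assumptions; the abstract does not give the paper's actual feature definition, and the class name `BoundaryNoveltyTable` and its method `is_novel` are hypothetical.

```python
from typing import List, Sequence


class BoundaryNoveltyTable:
    """Illustrative width-1 novelty test over continuous state variables.

    Assumption (not taken from the paper): a state is novel if at least one
    variable extends the interval [low, high] observed so far for that
    variable. The intervals are discovered online, as states are visited.
    """

    def __init__(self, num_vars: int) -> None:
        # Empty intervals: any first observation extends them.
        self.low: List[float] = [float("inf")] * num_vars
        self.high: List[float] = [float("-inf")] * num_vars

    def is_novel(self, state: Sequence[float]) -> bool:
        """Return True if the state extends a known boundary, and record it."""
        novel = False
        for i, x in enumerate(state):
            if x < self.low[i]:
                self.low[i] = x
                novel = True
            if x > self.high[i]:
                self.high[i] = x
                novel = True
        return novel


# Usage: a planner would prune simulator states for which is_novel is False.
table = BoundaryNoveltyTable(num_vars=2)
results = [
    table.is_novel([0.0, 0.0]),    # first state always extends the empty range
    table.is_novel([0.0, 0.0]),    # same state again: nothing is extended
    table.is_novel([0.5, 0.0]),    # variable 0 exceeds its previous maximum
    table.is_novel([0.25, 0.0]),   # inside the known ranges
    table.is_novel([0.25, -1.0]),  # variable 1 goes below its previous minimum
]
```

The sketch needs no prior knowledge of variable bounds, which is the property the abstract emphasizes; a practical planner would likely combine such a test with finer-grained features inside the known range.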

Cite

Teichteil-Königsbuch, F., Ramirez, M., & Lipovetzky, N. (2020). Boundary extension features for width-based planning with simulators on continuous-state domains. In IJCAI International Joint Conference on Artificial Intelligence (Vol. 2021-January, pp. 4183–4189). International Joint Conferences on Artificial Intelligence. https://doi.org/10.24963/ijcai.2020/578
