Equi-Reward Utility Maximizing Design in stochastic environments

21Citations
Citations of this article
21Readers
Mendeley users who have this article in their library.
Get full text

Abstract

We present the Equi-Reward Utility Maximizing Design (ER-UMD) problem for redesigning stochastic environments to maximize agent performance. ER-UMD fits well contemporary applications that require offline design of environments where robots and humans act and cooperate. To find an optimal modification sequence we present two novel solution techniques: a compilation that embeds design into a planning problem, allowing use of off-the-shelf solvers to find a solution, and a heuristic search in the modifications space, for which we present an admissible heuristic. Evaluation shows the feasibility of the approach using standard benchmarks from the probabilistic planning competition and a benchmark we created for a vacuum cleaning robot setting.

Cite

CITATION STYLE

APA

Keren, S., Pineda, L., Gal, A., Karpas, E., & Zilberstein, S. (2017). Equi-Reward Utility Maximizing Design in stochastic environments. In IJCAI International Joint Conference on Artificial Intelligence (Vol. 0, pp. 4353–4360). International Joint Conferences on Artificial Intelligence. https://doi.org/10.24963/ijcai.2017/608

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free