Abstract
We present the Equi-Reward Utility Maximizing Design (ER-UMD) problem for redesigning stochastic environments to maximize agent performance. ER-UMD fits well contemporary applications that require offline design of environments where robots and humans act and cooperate. To find an optimal modification sequence we present two novel solution techniques: a compilation that embeds design into a planning problem, allowing use of off-the-shelf solvers to find a solution, and a heuristic search in the modifications space, for which we present an admissible heuristic. Evaluation shows the feasibility of the approach using standard benchmarks from the probabilistic planning competition and a benchmark we created for a vacuum cleaning robot setting.
Cite
CITATION STYLE
Keren, S., Pineda, L., Gal, A., Karpas, E., & Zilberstein, S. (2017). Equi-Reward Utility Maximizing Design in stochastic environments. In IJCAI International Joint Conference on Artificial Intelligence (Vol. 0, pp. 4353–4360). International Joint Conferences on Artificial Intelligence. https://doi.org/10.24963/ijcai.2017/608
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.