How should we gather information to make effective decisions? A classical answer to this fundamental problem is given by the decision-theoretic value of information. Unfortunately, optimizing this objective is intractable, and myopic (greedy) approximations are known to perform poorly. In this paper, we introduce DiRECt, an efficient yet near-optimal algorithm for nonmyopically optimizing value of information. Crucially, DiRECt uses a novel surrogate objective that is (1) aligned with the value of information problem, (2) efficient to evaluate, and (3) adaptive submodular. The last property enables us to use an efficient greedy optimization while providing strong approximation guarantees. We demonstrate the utility of our approach on four diverse case studies: touch-based robotic localization, comparison-based preference learning, wildlife conservation management, and preference elicitation in behavioral economics. In the first application, we demonstrate DiRECt running in closed loop on an actual robotic platform.
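To make the objective concrete, the following minimal sketch computes the standard decision-theoretic value of information of a single test and picks a test myopically, which is the greedy baseline the abstract describes as performing poorly. It is not DiRECt and does not implement the paper's surrogate objective. The function names, the discrete hypothesis model, and the example utilities and likelihoods are all assumptions chosen for illustration.

```python
from typing import Dict, List

Belief = List[float]            # belief[h] = P(hypothesis h)
Utility = List[List[float]]     # utility[a][h] = payoff of action a under hypothesis h
Likelihood = List[List[float]]  # likelihood[h][o] = P(outcome o | hypothesis h)


def best_expected_utility(belief: Belief, utility: Utility) -> float:
    """Expected utility of the best single action under the given belief."""
    return max(sum(p * u for p, u in zip(belief, row)) for row in utility)


def value_of_information(belief: Belief, utility: Utility,
                         likelihood: Likelihood) -> float:
    """Expected gain in best-action utility from observing one test outcome."""
    n_outcomes = len(likelihood[0])
    expected_after = 0.0
    for o in range(n_outcomes):
        # Joint probability of each hypothesis together with outcome o.
        joint = [p * likelihood[h][o] for h, p in enumerate(belief)]
        p_outcome = sum(joint)
        if p_outcome == 0.0:
            continue  # this outcome cannot occur under the current belief
        posterior = [j / p_outcome for j in joint]
        expected_after += p_outcome * best_expected_utility(posterior, utility)
    return expected_after - best_expected_utility(belief, utility)


def myopic_choice(belief: Belief, utility: Utility,
                  tests: Dict[str, Likelihood]) -> str:
    """Pick the test with the largest one-step value of information."""
    return max(tests, key=lambda name: value_of_information(
        belief, utility, tests[name]))


# Hypothetical example: two hypotheses, two actions, three candidate tests.
belief = [0.5, 0.5]
utility = [[1.0, 0.0],   # action 0 pays off only under hypothesis 0
           [0.0, 1.0]]   # action 1 pays off only under hypothesis 1
tests = {
    "perfect": [[1.0, 0.0], [0.0, 1.0]],
    "noisy": [[0.8, 0.2], [0.2, 0.8]],
    "useless": [[0.5, 0.5], [0.5, 0.5]],
}
chosen = myopic_choice(belief, utility, tests)
```

In this toy setting the one-step rule is adequate because a single test can resolve the decision. The abstract's concern is with settings where several tests are needed in sequence, so that scoring tests one step at a time can mislead.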
CITATION STYLE
Chen, Y., Bagnell, J. A., Javdani, S., Srinivasa, S., Karbasi, A., & Krause, A. (2015). Submodular surrogates for value of information. In Proceedings of the National Conference on Artificial Intelligence (Vol. 5, pp. 3511–3518). AI Access Foundation. https://doi.org/10.1609/aaai.v29i1.9694