We introduce a new formulation of the Hidden Parameter Markov Decision Process (HiP-MDP), a framework for modeling families of related tasks using low-dimensional latent embeddings. Our new framework correctly models the joint uncertainty in the latent parameters and the state space. We also replace the original Gaussian Process-based model with a Bayesian Neural Network, enabling more scalable inference. Thus, we expand the scope of the HiP-MDP to applications with higher dimensions and more complex dynamics.
CITATION STYLE
Killian, T., Daulton, S., Konidaris, G., & Doshi-Velez, F. (2017). Robust and efficient transfer learning with hidden parameter Markov decision processes. In Advances in Neural Information Processing Systems (Vol. 2017-December, pp. 6251–6262). Neural information processing systems foundation. https://doi.org/10.1609/aaai.v31i1.11065
Mendeley helps you to discover research relevant for your work.