Generalized value iteration networks: Life beyond lattices

34Citations
Citations of this article
79Readers
Mendeley users who have this article in their library.

Abstract

In this paper, we introduce a generalized value iteration network (GVIN), which is an end-to-end neural network planning module. GVIN emulates the value iteration algorithm by using a novel graph convolution operator, which enables GVIN to learn and plan on irregular spatial graphs. We propose three novel differentiable kernels as graph convolution operators and show that the embedding-based kernel achieves the best performance. Furthermore, we present episodic Q-learning, an improvement upon traditional n-step Q-learning that stabilizes training for VIN and GVIN. Lastly, we evaluate GVIN on planning problems in 2D mazes, irregular graphs, and real-world street networks, showing that GVIN generalizes well for both arbitrary graphs and unseen graphs of larger scale and outperforms a naive generalization of VIN (discretizing a spatial graph into a 2D image).

Cite

CITATION STYLE

APA

Niu, S., Chen, S., Guo, H., Targonski, C., Smith, M. C., & Kovačević, J. (2018). Generalized value iteration networks: Life beyond lattices. In 32nd AAAI Conference on Artificial Intelligence, AAAI 2018 (pp. 6246–6253). AAAI press. https://doi.org/10.1609/aaai.v32i1.12081

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free