Generalized value iteration networks: Life beyond lattices

Sufeng Niu; Siheng Chen; Hanyu Guo; Colin Targonski; Melissa C. Smith; Jelena Kovačević

Conference ProceedingsOPEN ACCESS

Generalized value iteration networks: Life beyond lattices

32nd AAAI Conference on Artificial Intelligence, AAAI 2018 (2018) 6246-6253

DOI: 10.1609/aaai.v32i1.12081

40Citations

79Readers

Abstract

In this paper, we introduce a generalized value iteration network (GVIN), which is an end-to-end neural network planning module. GVIN emulates the value iteration algorithm by using a novel graph convolution operator, which enables GVIN to learn and plan on irregular spatial graphs. We propose three novel differentiable kernels as graph convolution operators and show that the embedding-based kernel achieves the best performance. Furthermore, we present episodic Q-learning, an improvement upon traditional n-step Q-learning that stabilizes training for VIN and GVIN. Lastly, we evaluate GVIN on planning problems in 2D mazes, irregular graphs, and real-world street networks, showing that GVIN generalizes well for both arbitrary graphs and unseen graphs of larger scale and outperforms a naive generalization of VIN (discretizing a spatial graph into a 2D image).

Cite

CITATION STYLE

APA

Niu, S., Chen, S., Guo, H., Targonski, C., Smith, M. C., & Kovačević, J. (2018). Generalized value iteration networks: Life beyond lattices. In 32nd AAAI Conference on Artificial Intelligence, AAAI 2018 (pp. 6246–6253). AAAI press. https://doi.org/10.1609/aaai.v32i1.12081

Generalized value iteration networks: Life beyond lattices

Abstract

Cite

Register to see more suggestions