Planning from Pixels in Atari with Learned Symbolic Representations

5Citations
Citations of this article
22Readers
Mendeley users who have this article in their library.

Abstract

Width-based planning methods have been shown to yield state-of-the-art performance in the Atari 2600 domain using pixel input. One successful approach, RolloutIW, represents states with the B-PROST boolean feature set. An augmented version of RolloutIW, π-IW, shows that learned features can be competitive with handcrafted ones for width-based search. In this paper, we leverage variational autoencoders (VAEs) to learn features directly from pixels in a principled manner, and without supervision. The inference model of the trained VAEs extracts boolean features from pixels, and RolloutIW plans with these features. The resulting combination outperforms the original RolloutIW and human professional play on Atari 2600 and drastically reduces the size of the feature set.

References Powered by Scopus

Human-level control through deep reinforcement learning

22504Citations
N/AReaders
Get full text

Representation learning: A review and new perspectives

9838Citations
N/AReaders
Get full text

Simple Statistical Gradient-Following Algorithms for Connectionist Reinforcement Learning

6295Citations
N/AReaders
Get full text

Cited by Powered by Scopus

Grounding LTLf Specifications in Image Sequences

6Citations
N/AReaders
Get full text

Is Policy Learning Overrated?: Width-Based Planning and Active Learning for Atari

0Citations
N/AReaders
Get full text

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Cite

CITATION STYLE

APA

Dittadi, A., Drachmann, F. K., & Bolander, T. (2021). Planning from Pixels in Atari with Learned Symbolic Representations. In 35th AAAI Conference on Artificial Intelligence, AAAI 2021 (Vol. 6A, pp. 4941–4949). Association for the Advancement of Artificial Intelligence. https://doi.org/10.1609/aaai.v35i6.16627

Readers' Seniority

Tooltip

PhD / Post grad / Masters / Doc 4

67%

Researcher 2

33%

Readers' Discipline

Tooltip

Computer Science 7

78%

Business, Management and Accounting 1

11%

Materials Science 1

11%

Save time finding and organizing research with Mendeley

Sign up for free