Feature learning and transfer performance prediction for video reinforcement learning tasks via a Siamese convolutional neural network

Jinhua Song; Yang Gao; Hao Wang

Conference Proceedings

Feature learning and transfer performance prediction for video reinforcement learning tasks via a Siamese convolutional neural network

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2018) 11301 LNCS 350-361

DOI: 10.1007/978-3-030-04167-0_32

2Citations

5Readers

Get full text

Abstract

In this paper, we handle the negative transfer problem by a deep learning method to predict the transfer performance (positive/negative transfer) between two reinforcement learning tasks. We consider same domain transfer for video reinforcement learning tasks such as video games which can be described as images and perceived by an agent with visual ability. Our method directly trains a neural network from raw task descriptions without other prior knowledge such as models of tasks, target task samples and human experience. The architecture of our neural network consists of two parts: a siamese convolutional neural network to learn the features of each pair of tasks and a softmax layer to predict the binary transfer performance. We conduct extensive experiments in the maze domain and the Ms. PacMan domain to evaluate the performance of our method. The results show the effectiveness and superiority of our method compared with the baseline methods.

Author supplied keywords

Cite

CITATION STYLE

APA

Song, J., Gao, Y., & Wang, H. (2018). Feature learning and transfer performance prediction for video reinforcement learning tasks via a Siamese convolutional neural network. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 11301 LNCS, pp. 350–361). Springer Verlag. https://doi.org/10.1007/978-3-030-04167-0_32

Feature learning and transfer performance prediction for video reinforcement learning tasks via a Siamese convolutional neural network

Abstract

Author supplied keywords

Cite

Register to see more suggestions