In this paper, we address the negative transfer problem with a deep learning method that predicts the transfer performance (positive or negative transfer) between two reinforcement learning tasks. We consider same-domain transfer for video reinforcement learning tasks, such as video games, which can be described as images and perceived by an agent with visual ability. Our method trains a neural network directly from raw task descriptions, without other prior knowledge such as models of the tasks, target task samples, or human experience. The architecture of our neural network consists of two parts: a siamese convolutional neural network that learns the features of each pair of tasks, and a softmax layer that predicts the binary transfer performance. We conduct extensive experiments in the maze domain and the Ms. PacMan domain to evaluate our method. The results show that our method is effective and outperforms the baseline methods.
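The two-part architecture described above can be illustrated with a minimal forward-pass sketch. This is not the authors' implementation: the abstract does not give layer sizes, the way the two branch outputs are combined, or the training procedure. The sketch assumes single-channel images, one convolutional layer with global average pooling, concatenation of the two feature vectors, and the class order [negative, positive]. All function names and the random, untrained weights are hypothetical. It uses only the Python standard library so the computation is easy to trace; a practical version would use a deep learning framework.

```python
import math
import random


def conv2d_valid(image, kernel):
    """Valid 2D cross-correlation of a single-channel image with one kernel."""
    kh, kw = len(kernel), len(kernel[0])
    out_h, out_w = len(image) - kh + 1, len(image[0]) - kw + 1
    return [
        [
            sum(image[i + di][j + dj] * kernel[di][dj]
                for di in range(kh) for dj in range(kw))
            for j in range(out_w)
        ]
        for i in range(out_h)
    ]


def encode(image, kernels):
    """Shared branch: convolution, ReLU, then global average pooling.

    Returns one feature per kernel.
    """
    features = []
    for kernel in kernels:
        fmap = conv2d_valid(image, kernel)
        activated = [max(0.0, v) for row in fmap for v in row]
        features.append(sum(activated) / len(activated))
    return features


def softmax(logits):
    """Numerically stable softmax over a list of logits."""
    peak = max(logits)
    exps = [math.exp(z - peak) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


def predict_transfer(source_img, target_img, kernels, weights, biases):
    """Return [P(negative transfer), P(positive transfer)] for a task pair.

    The class order is an assumption made for this sketch.
    """
    # Both task images pass through the SAME kernels; this weight sharing
    # is what makes the encoder siamese.
    pair = encode(source_img, kernels) + encode(target_img, kernels)
    logits = [
        sum(w * x for w, x in zip(row, pair)) + b
        for row, b in zip(weights, biases)
    ]
    return softmax(logits)


def random_matrix(rows, cols, rng):
    """Hypothetical helper: untrained weights drawn uniformly from [-1, 1]."""
    return [[rng.uniform(-1.0, 1.0) for _ in range(cols)] for _ in range(rows)]


rng = random.Random(0)
kernels = [random_matrix(3, 3, rng) for _ in range(4)]   # 4 shared 3x3 filters
weights = random_matrix(2, 2 * len(kernels), rng)        # softmax layer, 2 classes
biases = [0.0, 0.0]

# Stand-ins for two task images (e.g. two maze layouts), 8x8 pixels each.
source_task = [[rng.random() for _ in range(8)] for _ in range(8)]
target_task = [[rng.random() for _ in range(8)] for _ in range(8)]

probs = predict_transfer(source_task, target_task, kernels, weights, biases)
print(probs)
```

Because the weights here are random, the printed probabilities carry no meaning; the point is only the data flow: one shared encoder applied to both task images, followed by a two-way softmax over the combined features.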
Song, J., Gao, Y., & Wang, H. (2018). Feature learning and transfer performance prediction for video reinforcement learning tasks via a Siamese convolutional neural network. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 11301 LNCS, pp. 350–361). Springer Verlag. https://doi.org/10.1007/978-3-030-04167-0_32