Latent structure matching for knowledge transfer in reinforcement learning

1Citations
Citations of this article
19Readers
Mendeley users who have this article in their library.

Abstract

Reinforcement learning algorithms usually require a large number of empirical samples and give rise to a slow convergence in practical applications. One solution is to introduce transfer learning: Knowledge from well-learned source tasks can be reused to reduce sample request and accelerate the learning of target tasks. However, if an unmatched source task is selected, it will slow down or even disrupt the learning procedure. Therefore, it is very important for knowledge transfer to select appropriate source tasks that have a high degree of matching with target tasks. In this paper, a novel task matching algorithm is proposed to derive the latent structures of value functions of tasks, and align the structures for similarity estimation. Through the latent structure matching, the highly-matched source tasks are selected effectively, from which knowledge is then transferred to give action advice, and improve exploration strategies of the target tasks. Experiments are conducted on the simulated navigation environment and the mountain car environment. The results illustrate the significant performance gain of the improved exploration strategy, compared with traditional e-greedy exploration strategy. A theoretical proof is also given to verify the improvement of the exploration strategy based on latent structure matching.

References Powered by Scopus

A survey on transfer learning

18230Citations
N/AReaders
Get full text

Distributed optimization and statistical learning via the alternating direction method of multipliers

15941Citations
N/AReaders
Get full text

Nonlinear dimensionality reduction by locally linear embedding

13165Citations
N/AReaders
Get full text

Cited by Powered by Scopus

A taxonomy for similarity metrics between Markov decision processes

8Citations
N/AReaders
Get full text

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Cite

CITATION STYLE

APA

Zhou, Y., & Yang, F. (2020). Latent structure matching for knowledge transfer in reinforcement learning. Future Internet, 12(2). https://doi.org/10.3390/fi12020036

Readers over time

‘20‘21‘22‘23‘240481216

Readers' Seniority

Tooltip

PhD / Post grad / Masters / Doc 10

83%

Lecturer / Post doc 2

17%

Readers' Discipline

Tooltip

Computer Science 5

42%

Mathematics 3

25%

Engineering 2

17%

Business, Management and Accounting 2

17%

Save time finding and organizing research with Mendeley

Sign up for free
0