The extreme learning machine (ELM) combines good generalization performance with a simple structure and inexpensive computation. In this paper, these merits are applied to reinforcement learning: approximating the Q function with an ELM speeds up learning. Because the number of hidden-layer nodes equals the number of samples, a large sample set seriously slows learning. To address this, a rolling time-window mechanism is introduced into the algorithm, which limits the size of the sample space. Finally, the algorithm is compared on a boat problem with reinforcement learning based on a traditional BP neural network. Simulation results show that the proposed algorithm is faster and more effective. © 2012 Springer-Verlag.
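The idea in the abstract can be illustrated with a minimal sketch, which is not the authors' implementation. It assumes a sigmoid hidden layer with fixed random weights, output weights obtained with a Moore-Penrose pseudo-inverse, and a fixed-length buffer standing in for the rolling time window. The names `WindowedELM`, `encode`, and `q_target`, the one-hot action encoding, and the discount factor `gamma=0.95` are all hypothetical choices made for illustration.

```python
from collections import deque

import numpy as np


class WindowedELM:
    """ELM regressor refit on only the most recent `window` samples (illustrative)."""

    def __init__(self, n_inputs, n_hidden, window, seed=0):
        rng = np.random.default_rng(seed)
        # Hidden-layer weights and biases are drawn once and never trained.
        self.W = rng.uniform(-1.0, 1.0, size=(n_inputs, n_hidden))
        self.b = rng.uniform(-1.0, 1.0, size=n_hidden)
        self.beta = np.zeros(n_hidden)  # output weights, solved in fit()
        # deque(maxlen=...) drops the oldest sample once the window is full.
        self.inputs = deque(maxlen=window)
        self.targets = deque(maxlen=window)

    def _hidden(self, X):
        # Sigmoid activation of the random hidden layer.
        return 1.0 / (1.0 + np.exp(-(X @ self.W + self.b)))

    def add(self, x, target):
        self.inputs.append(np.asarray(x, dtype=float))
        self.targets.append(float(target))

    def fit(self):
        H = self._hidden(np.stack(self.inputs))
        # Least-squares output weights via the pseudo-inverse of H.
        self.beta = np.linalg.pinv(H) @ np.asarray(self.targets)

    def predict(self, X):
        X = np.atleast_2d(np.asarray(X, dtype=float))
        return self._hidden(X) @ self.beta


def encode(state, action, n_actions):
    """Concatenate the state with a one-hot action (assumed input encoding)."""
    one_hot = np.zeros(n_actions)
    one_hot[action] = 1.0
    return np.concatenate([np.atleast_1d(np.asarray(state, dtype=float)), one_hot])


def q_target(model, reward, next_state, n_actions, gamma=0.95):
    """Standard Q-learning target: r + gamma * max over a' of Q(s', a')."""
    candidates = np.stack([encode(next_state, a, n_actions) for a in range(n_actions)])
    return reward + gamma * float(np.max(model.predict(candidates)))
```

Setting `n_hidden` equal to `window` mirrors the abstract's statement that the number of hidden nodes equals the number of samples. The window then caps the size of the matrix that has to be pseudo-inverted at each refit, which is where the speed-up would come from in this sketch.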
Pan, J., Wang, X., Cheng, Y., & Cao, G. (2012). Reinforcement learning based on extreme learning machine. In Communications in Computer and Information Science (Vol. 304 CCIS, pp. 80–86). https://doi.org/10.1007/978-3-642-31837-5_12