A fuzzy Q learning based on a self-organizing fuzzy radial basis function (FRBF) network is proposed to solve the 'curse of dimensionality' problem caused by state space generalization in the paper. A FRBF network is used to represent continuous action and the corresponding Q value. The interpolation technique is adopted to represent the appropriate utility value for the wining local action of every fuzzy rule. Neurons can be organized by the FRBF network itself. The methods of the structure and parameter learning, based on new adding and merging neurons techniques and a gradient descent algorithm, are simple and effective, with a high accuracy and a compact structure. Simulation results on balancing control of inverted pendulum illustrate the performance and applicability of the proposed fuzzy Q learning scheme to real-world problems with continuous states and continuous actions. © Springer-Verlag Berlin Heidelberg 2006.
CITATION STYLE
Wang, X., Cheng, Y., & Sun, W. (2006). Q learning based on self-organizing fuzzy radial basis function network. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 3971 LNCS, pp. 607–615). Springer Verlag. https://doi.org/10.1007/11759966_90
Mendeley helps you to discover research relevant for your work.