Learning to control dynamic systems with automatic quantization


Abstract

Reinforcement learning is often used to learn to control dynamic systems, which are described by quantitative state variables. Most previous work that learns qualitative (symbolic) control rules cannot construct the symbols itself; that is, a correct partition of the state variables, or equivalently a correct set of qualitative symbols, is given to the learning program. We do not make this assumption in our work on learning to control dynamic systems. The learning task is divided into two phases. The first phase extracts symbols from quantitative inputs, a process commonly called quantization. The second phase evaluates the symbols obtained in the first phase and induces the best possible symbolic rules based on them. These two phases interact with each other, which makes the whole learning task very difficult. We demonstrate that our new method, called STAQ (Set Training with Automatic Quantization), can aggressively partition the input variables to ever finer resolution until correct control rules based on these partitions (symbols) are learned. In particular, we use STAQ to solve the well-known cart-pole balancing problem.
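The abstract does not describe STAQ's internal procedure, so the following is only a minimal sketch of the phase-1 idea: mapping continuous cart-pole state variables to discrete symbols through per-variable partitions that can be refined to finer resolution. The class and method names, the midpoint-split refinement rule, and the cart-pole variable ranges are illustrative assumptions, not the paper's algorithm.

```python
import bisect

class AutoQuantizer:
    """Phase-1 sketch: map continuous state variables to symbols via
    per-variable partitions that can be refined on demand.
    (Hypothetical design; the abstract does not give STAQ's internals.)"""

    def __init__(self, low, high):
        # Start each variable with a single interval (coarsest partition).
        self.cuts = [[lo, hi] for lo, hi in zip(low, high)]

    def symbol(self, state):
        # A symbol is the tuple of interval indices, one index per variable.
        return tuple(
            bisect.bisect_right(cuts, x) - 1
            for cuts, x in zip(self.cuts, state)
        )

    def refine(self, var, value):
        # Split the interval containing `value` at its midpoint, giving
        # the finer resolution the abstract's "aggressive" partitioning
        # refers to (midpoint splitting is an assumption).
        cuts = self.cuts[var]
        i = bisect.bisect_right(cuts, value) - 1
        cuts.insert(i + 1, (cuts[i] + cuts[i + 1]) / 2)


# Cart-pole state: cart position, cart velocity, pole angle, pole angular velocity.
q = AutoQuantizer(low=[-2.4, -3.0, -0.21, -3.0], high=[2.4, 3.0, 0.21, 3.0])
print(q.symbol([0.1, -0.5, 0.02, 0.0]))   # (0, 0, 0, 0) at the coarsest level
q.refine(var=2, value=0.02)               # finer partition of the pole angle
print(q.symbol([0.1, -0.5, 0.02, 0.0]))   # (0, 0, 1, 0) after refinement
```

In a full system along these lines, phase 2 would induce control rules over the symbol tuples and trigger refinement on variables whose current resolution cannot separate states that require different actions.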

Citation (APA)

Ling, C. X., & Buchal, R. (1993). Learning to control dynamic systems with automatic quantization. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 667 LNAI, pp. 372–377). Springer Verlag. https://doi.org/10.1007/3-540-56602-3_153
