Technical process control is a highly interesting area of application serving a high practical impact. Since classical controller design is, in general, a demanding job, this area constitutes a highly attractive domain for the application of learning approaches—in par- ticular, reinforcement learning (RL) methods. RL provides concepts for learning controllers that, by cleverly exploiting information from interactions with the process, can acquire high- quality control behaviour from scratch. This article focuses on the presentation of four typical benchmark problems whilst high- lighting important and challenging aspects of technical process control: nonlinear dynamics; varying set-points; long-term dynamic effects; influence of external variables; and the pri- macy of precision.We propose performance measures for controller quality that apply both to classical control design and learning controllers,measuring precision, speed, and stability of the controller. A second set of key-figures describes the performance from the perspec- tive of a learning approach while providing information about the efficiency of the method with respect to the learning effort needed. For all four benchmark problems, extensive and detailed information is provided with which to carry out the evaluations outlined in this article. A close evaluation of our own RL learning scheme, NFQCA (Neural Fitted Q Iteration with Continuous Actions), in acordance with the proposed scheme on all four benchmarks, thereby provides performance figures on both control quality and learning behavior.
CITATION STYLE
Hafner, R., & Riedmiller, M. (2011). Reinforcement learning in feedback control. Machine Learning, 84(1–2), 137–169. https://doi.org/10.1007/s10994-011-5235-x
Mendeley helps you to discover research relevant for your work.