In agent control problems, combining reinforcement learning with planning has attracted considerable attention. The two methods focus on micro-level and macro-level actions, respectively, and their advantages can be realized together when they cooperate well. Essential to this cooperation is finding an appropriate boundary that assigns a distinct function to each method. Such a boundary can be represented by the parameters of a planning algorithm. In this paper, we develop an optimization strategy for planning parameters based on an analysis of the connection between reaction and planning, and we also propose a non-gradient method to accelerate the optimization. The complete algorithm finds a satisfactory setting of the planning parameters, making full use of the reaction capability of specific agents.
CITATION STYLE
Chen, X. (2021). Adjust Planning Strategies to Accommodate Reinforcement Learning Agents. In Journal of Physics: Conference Series (Vol. 1757). IOP Publishing Ltd. https://doi.org/10.1088/1742-6596/1757/1/012066