Heuristic function negotiation for markov decision process and its application in UAV simulation

2Citations
Citations of this article
8Readers
Mendeley users who have this article in their library.

Abstract

The traditional reinforcement learning (RL) methods can solve Markov Decision Processes (MDPs) online, but these learning methods cannot effectively use a priori knowledge to guide the learning process. The exploration of the optimal policy is time-consuming and does not employ the information about specific issues. To tackle the problem, this paper proposes heuristic function negotiation (HFN) as an online learning framework. The HFN framework extends MDPs and introduces heuristic functions. HFN changes the state-action dual layer structure of traditional RL to the triple layer structure, in which multiple heuristic functions can be set to meet the needs required to solve the problem. The HFN framework can use different algorithms to let the functions negotiate to determine the appropriate action, and adjust the impact of each function according to the rewards. The HFN framework introduces domain knowledge by setting heuristic functions and thus speeds up the problem solving of MDPs. Furthermore, user preferences can be reflected in the learning process, which improves the flexibility of RL. The experiments show that, by setting reasonable heuristic functions, the learning results of the HFN framework are more efficient than traditional RL. We also apply HFN to the air combat simulation of unmanned aerial vehicles (UAVs), which shows that different function settings lead to different combat behaviors. Copyright © 2014 The Institute of Electronics, Inf rmation and Communication Engineers.

Cite

CITATION STYLE

APA

Zhao, F., Qin, Z., & Shao, Z. (2014). Heuristic function negotiation for markov decision process and its application in UAV simulation. IEICE Transactions on Information and Systems, E97-D(1), 89–97. https://doi.org/10.1587/transinf.E97.D.89

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free