Abstract
According to the requirements of the live-virtual-constructive (LVC) tactical confrontation (TC) on the virtual entity (VE) decision model of graded combat capability, diversified actions, real-time decision-making, and generalization for the enemy, the confrontation process is modeled as a zero-sum stochastic game (ZSG). By introducing the theory of dynamic relative power potential field, the problem of reward sparsity in the model can be solved. By reward shaping, the problem of credit assignment between agents can be solved. Based on the idea of meta-learning, an extensible multi-agent deep reinforcement learning (EMADRL) framework and solving method is proposed to improve the effectiveness and efficiency of model solving. Experiments show that the model meets the requirements well and the algorithm learning efficiency is high.
Author supplied keywords
Cite
CITATION STYLE
Gao, A., Guo, Q., Dong, Z., Tang, Z., Zhang, Z., & Feng, Q. (2022). Research on virtual entity decision model for LVC tactical confrontation of army units. Journal of Systems Engineering and Electronics, 33(5), 1249–1267. https://doi.org/10.23919/JSEE.2022.000119
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.