Adaptive regret minimization for learning complex team-based tactics

Citations: 2 · Mendeley readers: 13

This article is free to access.
Abstract

This paper presents an approach and analysis for performing decentralized cooperative control of a team of decoys to achieve the Honeypot Ambush tactic. In this tactic, threats are lured into a designated region where they can be easily defeated. The decoys learn to cooperate through a game-theory-based online-learning method, known as regret minimization, which maximizes the team’s global reward. The decoy agents are assumed to have physical limitations and to be subject to stringent range constraints required for deceiving the networked threats. By employing an efficient coordination mechanism, the agents learn to be less greedy and allow weaker agents to catch up on their rewards, improving team performance. Such a coordination solution corresponds to convergence to a coarse correlated equilibrium. The numerical results verify the effectiveness of the proposed solution in achieving a globally satisfactory outcome and in adapting to a wide spectrum of scenarios.
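The abstract's core learning rule, regret minimization with convergence to coarse correlated equilibrium, can be illustrated generically. The sketch below is not the authors' decoy-control implementation; it is a standard regret-matching loop (in the style of Hart and Mas-Colell) on a hypothetical two-player matrix game, where each player shifts probability toward actions with positive cumulative regret. The empirical joint play of such no-external-regret learners converges to the set of coarse correlated equilibria.

```python
import random

def regret_matching(payoffs, n_rounds=5000, seed=0):
    """Run regret matching for two players on a matrix game.

    payoffs[i][a0][a1] is player i's utility when player 0 plays a0
    and player 1 plays a1. Returns each player's average positive
    external regret, which should shrink toward zero as n_rounds grows.
    """
    rng = random.Random(seed)
    n_actions = [len(payoffs[0]), len(payoffs[0][0])]
    cum_regret = [[0.0] * n_actions[i] for i in range(2)]

    def pick(regrets):
        # Sample an action with probability proportional to positive regret;
        # fall back to uniform when no action has positive regret.
        pos = [max(r, 0.0) for r in regrets]
        total = sum(pos)
        if total <= 0.0:
            return rng.randrange(len(regrets))
        x = rng.random() * total
        for a, p in enumerate(pos):
            x -= p
            if x <= 0.0:
                return a
        return len(regrets) - 1

    for _ in range(n_rounds):
        joint = [pick(cum_regret[0]), pick(cum_regret[1])]
        for i in range(2):
            realized = payoffs[i][joint[0]][joint[1]]
            for alt in range(n_actions[i]):
                # Regret of not having played `alt` instead of the chosen action.
                deviated = joint[:]
                deviated[i] = alt
                cum_regret[i][alt] += payoffs[i][deviated[0]][deviated[1]] - realized

    return [max(max(r, 0.0) for r in cum_regret[i]) / n_rounds for i in range(2)]

# Hypothetical example: matching pennies, a zero-sum game whose only
# equilibrium is mixed; average regret decays roughly as O(1/sqrt(T)).
pennies = [[[1, -1], [-1, 1]], [[-1, 1], [1, -1]]]
avg_regret = regret_matching(pennies, n_rounds=5000)
```

The team-oriented behavior described in the abstract, less greedy agents letting weaker teammates catch up, would correspond to shaping each agent's utility with a global team reward rather than the purely individual payoffs used in this toy game.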

Citation (APA)

Nguyen, D. D., Rajagopalan, A., Kim, J., & Lim, C. C. (2019). Adaptive regret minimization for learning complex team-based tactics. IEEE Access, 7, 103019–103030. https://doi.org/10.1109/ACCESS.2019.2930640
