HiMacMic: Hierarchical Multi-Agent Deep Reinforcement Learning with Dynamic Asynchronous Macro Strategy

Abstract

Multi-agent deep reinforcement learning (MADRL) has been widely used in scenarios such as robotics and game AI. However, existing methods mainly focus on optimizing agents' micro policies without considering a macro strategy, and consequently perform poorly in complex or sparse-reward scenarios such as the StarCraft Multi-Agent Challenge (SMAC) and Google Research Football (GRF). To this end, we propose a hierarchical MADRL framework called "HiMacMic" with a dynamic asynchronous macro strategy. Spatially, HiMacMic determines a critical position using a positional heat map. Temporally, the macro strategy dynamically sets its own deadline and is updated asynchronously across agents. We validate HiMacMic on four widely used benchmarks, namely Overcooked, GRF, SMAC, and SMAC-v2, covering nine scenarios. Results show that HiMacMic not only converges faster and reaches higher final performance than ten existing approaches, but also adapts to different environment settings.
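As a rough illustration of the macro/micro split described in the abstract, the Python sketch below pairs a macro policy that samples a goal position from a positional heat map, together with a deadline, with a micro policy that steps toward that goal; each agent refreshes its macro goal asynchronously when its own deadline expires or the goal is reached. This is a minimal sketch under assumed interfaces: the class names (MacroPolicy, MicroPolicy, Agent), the greedy micro step, and the random deadline are all illustrative assumptions, not the authors' implementation.

```python
import numpy as np

class MacroPolicy:
    """Picks a goal position from a positional heat map plus a deadline.
    (Hypothetical sketch; the paper's macro strategy is learned.)"""
    def __init__(self, grid_shape, max_deadline=10):
        self.grid_shape = grid_shape
        self.max_deadline = max_deadline

    def select(self, heat_map):
        # Treat the heat map as unnormalized scores over grid cells and
        # sample a critical position from the induced distribution.
        probs = heat_map.ravel() / heat_map.sum()
        idx = np.random.choice(probs.size, p=probs)
        goal = np.unravel_index(idx, self.grid_shape)
        # The macro strategy also decides how long the goal stays valid.
        deadline = np.random.randint(1, self.max_deadline + 1)
        return np.array(goal), deadline

class MicroPolicy:
    """Low-level policy conditioned on the agent's current macro goal."""
    def act(self, pos, goal):
        # Greedy step toward the goal, one axis at a time; a stand-in
        # for a learned micro policy.
        diff = goal - pos
        step = np.zeros_like(diff)
        axis = np.argmax(np.abs(diff))
        step[axis] = np.sign(diff[axis])
        return step

class Agent:
    def __init__(self, pos, macro, micro):
        self.pos = np.array(pos)
        self.macro, self.micro = macro, micro
        self.goal, self.ttl = None, 0

    def step(self, heat_map):
        # Asynchronous macro update: each agent refreshes its own goal
        # only when its deadline expires or the goal is reached.
        if self.ttl <= 0 or np.array_equal(self.pos, self.goal):
            self.goal, self.ttl = self.macro.select(heat_map)
        self.pos = self.pos + self.micro.act(self.pos, self.goal)
        self.ttl -= 1

grid = (8, 8)
heat_map = np.random.rand(*grid)  # stand-in for a learned positional heat map
agents = [Agent((i, 0), MacroPolicy(grid), MicroPolicy()) for i in range(3)]
for t in range(20):
    for agent in agents:
        agent.step(heat_map)
```

Because each agent draws its own deadline and renews its goal on its own schedule, macro updates naturally desynchronize across agents, which is the "dynamic asynchronous" behavior the abstract highlights.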

Cite (APA)

Zhang, H., Li, G., Liu, C. H., Wang, G., & Tang, J. (2023). HiMacMic: Hierarchical Multi-Agent Deep Reinforcement Learning with Dynamic Asynchronous Macro Strategy. In Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (pp. 3239–3248). Association for Computing Machinery. https://doi.org/10.1145/3580305.3599379
