We consider the challenges of learning multi-agent/robot macro-action-based deep Q-nets including how to properly update each macro-action value and accurately maintain macro-action-observation trajectories. We address these challenges by first proposing two fundamental frameworks for learning macro-action-value function and joint macro-actionvalue function. Furthermore, we present two new approaches of learning decentralized macro-action-based policies, which involve a new double Q-update rule that facilitates the learning of decentralized Q-nets by using a centralized Q-net for action selection. Our approaches are evaluated both in simulation and on real robots.
CITATION STYLE
Xiao, Y., Hoffman, J., Xia, T., & Amato, C. (2020). Multi-agent/robot deep reinforcement learning with macro-actions (student abstract). In AAAI 2020 - 34th AAAI Conference on Artificial Intelligence (pp. 13965–13966). AAAI press. https://doi.org/10.1609/aaai.v34i10.7255
Mendeley helps you to discover research relevant for your work.