A Cordial Sync: Going Beyond Marginal Policies for Multi-agent Embodied Tasks

Unnat Jain; Luca Weihs; Eric Kolve; Ali Farhadi; Svetlana Lazebnik; Aniruddha Kembhavi; Alexander Schwing

Conference Proceedings

A Cordial Sync: Going Beyond Marginal Policies for Multi-agent Embodied Tasks

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2020) 12350 LNCS 471-490

DOI: 10.1007/978-3-030-58558-7_28

11Citations

67Readers

Get full text

Abstract

Autonomous agents must learn to collaborate. It is not scalable to develop a new centralized agent every time a task’s difficulty outpaces a single agent’s abilities. While multi-agent collaboration research has flourished in gridworld-like environments, relatively little work has considered visually rich domains. Addressing this, we introduce the novel task FurnMove in which agents work together to move a piece of furniture through a living room to a goal. Unlike existing tasks, FurnMove requires agents to coordinate at every timestep. We identify two challenges when training agents to complete FurnMove: existing decentralized action sampling procedures do not permit expressive joint action policies and, in tasks requiring close coordination, the number of failed actions dominates successful actions. To confront these challenges we introduce SYNC-policies (synchronize your actions coherently) and CORDIAL (coordination loss). Using SYNC-policies and CORDIAL, our agents achieve a 58% completion rate on FurnMove, an impressive absolute gain of 25 % points over competitive decentralized baselines. Our dataset, code, and pretrained models are available at https://unnat.github.io/cordial-sync.

Author supplied keywords

Cite

CITATION STYLE

APA

Jain, U., Weihs, L., Kolve, E., Farhadi, A., Lazebnik, S., Kembhavi, A., & Schwing, A. (2020). A Cordial Sync: Going Beyond Marginal Policies for Multi-agent Embodied Tasks. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 12350 LNCS, pp. 471–490). Springer Science and Business Media Deutschland GmbH. https://doi.org/10.1007/978-3-030-58558-7_28

A Cordial Sync: Going Beyond Marginal Policies for Multi-agent Embodied Tasks

Abstract

Author supplied keywords

Cite

Register to see more suggestions