Distributed and Multi-Agent Planning
2014
Abstract
Decentralised planning in partially observable multi-agent domains is limited by the interacting agents' incomplete knowledge of their peers, which impacts their ability to work jointly towards a common goal. In this context, communication is often used as a means of observation exchange, which helps each agent in reducing uncertainty and acquiring a more centralised view of the world. However, despite these merits, planning with communicated observations is highly sensitive to communication channel noise and synchronisation issues, e.g. message losses, delays, and corruptions. In this paper, we propose an alternative approach to partially observable uncoordinated collaboration, where agents simultaneously execute and communicate their actions to their teammates. Our method extends a state-of-the-art Monte-Carlo planner for use in multi-agent systems, where communicated actions are incorporated directly in the sampling and learning process. We evaluate our approach in a benchmark multi-agent domain, and a more complex multi-robot problem with a larger action space. The experimental results demonstrate that our approach can lead to robust collaboration under challenging communication constraints and high noise levels, even in the presence of teammates who do not use any communication.
References (20)
- Barrett, S.; Agmon, N.; Hazon, N.; Kraus, S.; and Stone, P. 2013a. Communicating with Unknown Teammates. In AAMAS Adaptive Learning Agents (ALA) Workshop.
- Barrett, S.; Stone, P.; Kraus, S.; and Rosenfeld, A. 2013b. Teamwork with Limited Knowledge of Teammates. In AAAI.
- Becker, R.; Lesser, V.; and Zilberstein, S. 2005. Analyzing Myopic Approaches for Multi-Agent Communication. In Proceedings of Intelligent Agent Technology, 550-557.
- Bernstein, D. S.; Givan, R.; Immerman, N.; and Zilberstein, S. 2002. The Complexity of Decentralized Control of Markov Decision Processes. Math. Oper. Res. 27(4):819- 840.
- Coles, A. J.; Coles, A.; Olaya, A. G.; Celorrio, S. J.; López, C. L.; Sanner, S.; and Yoon, S. 2012. A Survey of the Seventh International Planning Competition. AI Magazine 33(1).
- Gelly, S.; Kocsis, L.; Schoenauer, M.; Sebag, M.; Silver, D.; Szepesvári, C.; and Teytaud, O. 2012. The grand challenge of computer Go: Monte Carlo tree search and extensions. Commun. ACM 55(3):106-113.
- Kaelbling, L. P.; Littman, M. L.; and Cassandra, A. R. 1998. Planning and Acting in Partially Observable Stochastic Do- mains. Artificial Intelligence 101(1-2):99-134.
- Kocsis, L., and Szepesvári, C. 2006. Bandit Based Monte- Carlo Planning. In European Conference on Machine Learn- ing (ECML), 282-293.
- Oliehoek, F. A., and Spaan, M. T. J. 2012. Tree-Based Solu- tion Methods for Multiagent POMDPs with Delayed Com- munication. In AAAI.
- Oliehoek, F. A.; Spaan, M. T. J.; and Vlassis, N. 2007. Dec- POMDPs with delayed communication. In AAMAS Work- shop on Multi-agent Sequential Decision Making in Uncer- tain Domains.
- Papadimitriou, C., and Tsitsiklis, J. N. 1987. The Com- plexity of Markov Decision Processes. Math. Oper. Res. 12(3):441-450.
- Petrick, R.; Geib, C.; and Steedman, M. 2009. Inte- grating Low-Level Robot/Vision with High-Level Planning and Sensing in PACO-PLUS. Technical Report, PACO- PLUS Project Deliverable 4.3.5 (available at http:// www.paco-plus.org).
- Pynadath, D. V., and Tambe, M. 2002. The Communicative Multiagent Team Decision Problem: Analyzing Teamwork Theories and Models. J. Artif. Intell. Res. (JAIR) 16:389- 423. Roth, M.; Simmons, R.; and Veloso, M. 2005. Reasoning About Joint Beliefs for Execution-time Communication De- cisions. In AAMAS, 786-793.
- Seuken, S., and Zilberstein, S. 2007. Memory-Bounded Dynamic Programming for DEC-POMDPs. In IJCAI.
- Silver, D., and Veness, J. 2010. Monte-Carlo Planning in Large POMDPs. In NIPS, 2164-2172.
- Spaan, M. T. J.; Oliehoek, F. A.; and Vlassis, N. A. 2008. Multiagent Planning Under Uncertainty with Stochastic Communication Delays. In ICAPS, volume 8, 338-345.
- Stone, P.; Kaminka, G. A.; Kraus, S.; and Rosenschein, J. S. 2010. Ad Hoc Autonomous Agent Teams: Collaboration without Pre-Coordination. In AAAI.
- Wu, F.; Zilberstein, S.; and Chen, X. 2011a. Online Planning for Ad Hoc Autonomous Agent Teams. In IJCAI, 439-445.
- Wu, F.; Zilberstein, S.; and Chen, X. 2011b. Online Plan- ning for Multi-Agent Systems with Bounded Communica- tion. Artificial Intelligence 175(2):487-511.
- Zhang, C., and Lesser, V. R. 2013. Coordinating multi- agent reinforcement learning with limited communication. In AAMAS, 1101-1108.