
Cooperative Multiagent Systems

31 papers · 35 followers
About this topic
Cooperative Multiagent Systems (CMAS) refer to a field of study focused on the design, analysis, and implementation of systems composed of multiple autonomous agents that work collaboratively to achieve common goals, optimize performance, and solve complex problems through communication, coordination, and negotiation among agents.
In this thesis, we study the problem of the optimal decentralized control of a partially observed Markov process over a finite horizon. The mathematical model corresponding to the problem is a decentralized POMDP (DEC-POMDP). Many... more
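For background, the finite-horizon Dec-POMDP is commonly written as the tuple below (a standard formulation from the literature, not a detail specific to this thesis):

\[
\mathcal{M} = \langle \mathcal{I}, \mathcal{S}, \{\mathcal{A}_i\}, T, R, \{\Omega_i\}, O, b_0, H \rangle,
\]

where \mathcal{I} is the set of agents, \mathcal{S} the state space, \mathcal{A}_i agent i's action set, T(s' \mid s, \vec{a}) the transition function, R(s, \vec{a}) the joint reward, \Omega_i agent i's observation set, O(\vec{o} \mid s', \vec{a}) the observation function, b_0 the initial state distribution, and H the horizon. Each agent acts on its own observation history through a local policy \pi_i, and the decentralized control problem is

\[
\max_{\pi_1, \dots, \pi_n} \; \mathbb{E}\!\left[ \sum_{t=0}^{H-1} R(s_t, \vec{a}_t) \,\middle|\, b_0, \vec{\pi} \right].
\]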
In cooperative multi-agent sequential decision making under uncertainty, agents must coordinate to find an optimal joint policy that maximises joint value. Typical algorithms exploit additive structure in the value function, but in the... more
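As a rough sketch of the additive structure such algorithms exploit, the toy example below (Python, with hypothetical payoff components, not the paper's method) factors a joint Q-value into local and pairwise terms and picks the greedy joint action by enumeration:

import itertools
import numpy as np

# Two agents, three actions each; the joint value is assumed to factor as
# Q(a1, a2) = Q1(a1) + Q2(a2) + Q12(a1, a2).
rng = np.random.default_rng(0)
Q1 = rng.normal(size=3)          # local component of agent 1
Q2 = rng.normal(size=3)          # local component of agent 2
Q12 = rng.normal(size=(3, 3))    # pairwise coordination component

def joint_q(a1, a2):
    """Additively structured joint value."""
    return Q1[a1] + Q2[a2] + Q12[a1, a2]

# Brute-force maximisation over the joint action space; fine for tiny problems,
# whereas exploiting the factorisation (e.g. variable elimination) scales better.
best = max(itertools.product(range(3), range(3)), key=lambda a: joint_q(*a))
print("greedy joint action:", best, "value:", joint_q(*best))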
We consider multiagent systems situated in unpredictable environments. Agents, viewed as abductive logic programs whose abducibles are literals the agent could sense or receive from other agents, must cooperate to provide answers to users... more
The growing use of autonomous tractor fleets with detachable implements presents complex logistical challenges in agriculture. Current systems often rely on simple heuristics and avoid implement swapping, limiting efficiency. A central... more
Bayesian games can be used to model single-shot decision problems in which agents only possess incomplete information about other agents, and hence are important for multiagent coordination under uncertainty. Moreover they can be used to... more
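To make the model concrete, the toy two-agent Bayesian game below (made-up common prior and payoffs, not drawn from the paper) evaluates and maximises the expected utility of a joint policy mapping private types to actions:

import itertools
import numpy as np

# Two agents, two private types each, two actions each.
rng = np.random.default_rng(1)
p_types = rng.dirichlet(np.ones(4)).reshape(2, 2)   # common prior over type profiles
u = rng.normal(size=(2, 2, 2, 2))                   # shared cooperative payoff u[t1, t2, a1, a2]

def expected_utility(pol1, pol2):
    """pol1, pol2 map a type index to an action index."""
    return sum(
        p_types[t1, t2] * u[t1, t2, pol1[t1], pol2[t2]]
        for t1, t2 in itertools.product(range(2), range(2))
    )

# Exhaustive search over deterministic type-to-action policies (4 per agent).
policies = list(itertools.product(range(2), repeat=2))
best = max(itertools.product(policies, policies), key=lambda p: expected_utility(*p))
print("best joint policy:", best, "expected utility:", expected_utility(*best))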
This paper formulates the optimal decentralized control problem for a class of mathematical models in which the system to be controlled is characterized by a finite-state discrete-time Markov process. The states of this internal process... more
We address a long-standing open problem of reinforcement learning in decentralized partially observable Markov decision processes. Previous attempts focussed on different forms of generalized policy iteration, which at best led to local... more
There has been substantial progress on algorithms for single-agent sequential decision making using partially observable Markov decision processes (POMDPs). A number of efficient algorithms for solving POMDPs share two desirable... more
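The snippet does not spell out the two properties, but most belief-based POMDP solvers build on the Bayesian belief update sketched below (a generic textbook recursion, given only as background):

import numpy as np

def belief_update(b, a, o, T, O):
    """Bayes filter step for a discrete POMDP.
    b: belief over states, shape (S,)
    T[a, s, s'] = Pr(s' | s, a); O[a, s', o] = Pr(o | s', a)
    """
    predicted = b @ T[a]                  # Pr(s' | b, a)
    unnormalised = predicted * O[a, :, o]
    return unnormalised / unnormalised.sum()

# Tiny two-state example with made-up dynamics.
T = np.array([[[0.9, 0.1], [0.2, 0.8]]])   # one action
O = np.array([[[0.7, 0.3], [0.4, 0.6]]])   # one action, two observations
b = np.array([0.5, 0.5])
print(belief_update(b, a=0, o=1, T=T, O=O))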
Recent years have seen significant advances in techniques for optimally solving multiagent problems represented as decentralized partially observable Markov decision processes (Dec-POMDPs). A new method achieves scalability gains by... more
To compare the ability of agents to learn in open worlds, we need a framework with clear definitions of open world environments and how they can vary. This paper provides such a framework, proposing clear scientific definitions for open... more
This work examines the mean-square error performance of diffusion stochastic algorithms under a generalized coordinate-descent scheme. In this setting, the adaptation step by each agent is limited to a random subset of the coordinates of... more
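For orientation, an adapt-then-combine diffusion LMS recursion in which each agent updates only a random subset of coordinates per iteration might look like the sketch below (a generic illustration under assumed data models, not the paper's exact algorithm or its analysis):

import numpy as np

rng = np.random.default_rng(2)
N, M = 5, 8                          # number of agents, parameter dimension
w_true = rng.normal(size=M)          # common parameter every agent estimates
A = np.full((N, N), 1.0 / N)         # doubly stochastic combination matrix (fully connected)
W = np.zeros((N, M))                 # current estimates, one row per agent
mu, frac = 0.05, 0.5                 # step size, fraction of coordinates updated per step

for _ in range(2000):
    psi = W.copy()
    for k in range(N):
        u = rng.normal(size=M)                      # regressor observed by agent k
        d = u @ w_true + 0.1 * rng.normal()         # noisy scalar measurement
        grad = (d - u @ W[k]) * u                   # LMS gradient estimate
        mask = rng.random(M) < frac                 # random coordinate subset
        psi[k] = W[k] + mu * grad * mask            # adapt only the selected coordinates
    W = A @ psi                                     # combine neighbours' intermediate estimates

print("mean-square deviation:", np.mean((W - w_true) ** 2))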
Characterization of the uncertainty in robotic manipulators is the focus of this paper. Based on the random matrix theory (RMT), we propose uncertainty characterization schemes in which the uncertainty is modeled at the macro (system)... more
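One common way to realise such random-matrix models is to draw the uncertain system matrix (for example, the mass/inertia matrix) from a Wishart distribution centred on its nominal value; the fragment below illustrates only that general idea (the distribution and parameters here are assumptions, not the paper's scheme):

import numpy as np
from scipy.stats import wishart

# Nominal mass matrix of a 3-DOF manipulator at some configuration (made-up values).
M_nominal = np.array([[2.0, 0.3, 0.1],
                      [0.3, 1.5, 0.2],
                      [0.1, 0.2, 0.8]])
dispersion_dof = 30     # larger -> samples concentrate around the nominal matrix

# E[sample] = dispersion_dof * scale, so the scale is the nominal matrix divided by it.
samples = wishart.rvs(df=dispersion_dof, scale=M_nominal / dispersion_dof,
                      size=500, random_state=0)
print("empirical mean:\n", samples.mean(axis=0))    # approximates M_nominal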
Loosely interconnected cooperative systems such as cable robots are particularly susceptible to uncertainty. Such uncertainty is exacerbated by addition of the base mobility to realize reconfigurability within the system. However, it also... more
We formulate an approach to multiagent metareasoning that uses organizational design to focus each agent's reasoning on the aspects of its local problem that let it make the most worthwhile contributions to joint behavior. By... more
Many challenges in today's society can be tackled by distributed open systems. This is particularly true for domains that are commonly perceived under the umbrella of smart cities, such as intelligent transportation, smart energy grids,... more
We develop a Multi-Agent Reinforcement Learning (MARL) method to learn scalable control policies for target tracking. Our method can handle an arbitrary number of pursuers and targets; we show results for tasks consisting of up to 1000... more
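Supporting an arbitrary number of entities typically requires a policy input that is invariant to how they are ordered; a common device is to pool per-entity features, as in the sketch below (an illustrative encoding, not necessarily the representation used in this work):

import numpy as np

def encode_observation(own_state, target_states):
    """Fixed-size, permutation-invariant observation for a variable number of targets.
    own_state: (d,) features of the controlled agent
    target_states: (k, d) features of the k currently visible targets (k may vary)
    """
    targets = np.asarray(target_states)
    pooled = targets.mean(axis=0) if targets.size else np.zeros_like(own_state)
    return np.concatenate([own_state, pooled])

obs = encode_observation(np.array([0.0, 1.0]), np.array([[2.0, 0.5], [4.0, -1.0]]))
print(obs.shape, obs)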
Resolving multiagent team decision problems, where agents share a common goal, is challenging since the number of states and joint actions grows exponentially with the number of agents. Even if the resolution of such problems is theoretically... more
An information agent is viewed as a deductive database consisting of three parts: an observation database containing the facts the agent has observed or sensed from its surrounding environment; an input database containing the information... more
We consider a class of sequential decision-making problems under uncertainty that can encompass various types of supervised learning concepts. These problems have a completely observed state process and a partially observed modulation... more
This paper introduces a probabilistic algorithm for multi-robot decision-making under uncertainty, which can be posed as a Decentralized Partially Observable Markov Decision Process (Dec-POMDP). Dec-POMDPs are inherently synchronous... more
Planning under uncertainty for multiagent systems can be formalized as a decentralized partially observable Markov decision process. We advance the state of the art for optimal solution of this model, building on the Multiagent A*... more
A recent insight in the field of decentralized partially observable Markov decision processes (Dec-POMDPs) is that it is possible to convert a Dec-POMDP to a non-observable MDP, which is a special case of POMDP. This technical report... more
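The reduction typically replaces the hidden state with a plan-time sufficient statistic; in the occupancy-state form found in the general literature (the report's exact construction may differ), the non-observable MDP's state at step t is

\[
\eta_t(s, \vec{\theta}_t) \;=\; \Pr\!\left(s_t = s,\; \vec{\theta}_t \,\middle|\, b_0, \delta_0, \dots, \delta_{t-1}\right),
\]

where \vec{\theta}_t is the joint action-observation history and \delta_0, \dots, \delta_{t-1} are the decentralized decision rules applied so far. Both the expected reward \sum_{s, \vec{\theta}_t} \eta_t(s, \vec{\theta}_t)\, R\!\left(s, \delta_t(\vec{\theta}_t)\right) and the next statistic \eta_{t+1} depend only on (\eta_t, \delta_t), so planning becomes a search over these statistics with decision rules as actions.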
Yi Xiong (Department of Systems Engineering and Engineering Management, The Chinese University of Hong Kong, Hong Kong, China; yxiong@se.cuhk.edu.hk), Ningyuan Chen (Rotman School of Management, University of Toronto, Toronto, Canada),... more
This article presents the state-of-the-art in optimal solution methods for decentralized partially observable Markov decision processes (Dec-POMDPs), which are general models for collaborative multiagent planning under uncertainty.... more
The main objective of emergency medical assistance (EMA) services is to attend to patients with sudden illness at any possible location within an area of influence. This usually consists of providing "in situ" assistance and, if necessary,... more
Decentralized POMDPs provide a rigorous framework for multi-agent decision-theoretic planning. However, their high complexity has limited scalability. In this work, we present a promising new class of algorithms based on probabilistic... more
This article applies a performance metric to the multi-robot patrolling task to more efficiently distribute patrol areas among robot team members. The multi-robot patrolling task employs multiple robots to perform frequent visits to known... more
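A metric frequently used for this task is node idleness, the time elapsed since a location was last visited; the snippet below computes time-averaged and worst-case idleness from a visit log, purely to illustrate this family of metrics (the article's specific metric may differ):

from collections import defaultdict

def idleness_stats(visit_log, horizon, nodes):
    """visit_log: list of (time, node) visit events; horizon: total mission duration."""
    last_visit = defaultdict(float)
    area, worst = defaultdict(float), defaultdict(float)
    for t, node in sorted(visit_log):
        gap = t - last_visit[node]
        area[node] += gap * gap / 2.0          # idleness grows linearly between visits
        worst[node] = max(worst[node], gap)
        last_visit[node] = t
    for node in nodes:                          # account for the tail after the last visit
        gap = horizon - last_visit[node]
        area[node] += gap * gap / 2.0
        worst[node] = max(worst[node], gap)
    return {n: area[n] / horizon for n in nodes}, dict(worst)

avg, worst = idleness_stats([(3, "A"), (7, "B"), (9, "A")], horizon=12, nodes=["A", "B"])
print("time-averaged idleness:", avg, "worst-case:", worst)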
Stroke is the third highest cause of mortality and the leading cause of disability in Western countries. A significant number of the people who survive live with serious physical and psychological disabilities and require... more
In this paper we consider the problem of controlling multiple robots manipulating and transporting an object in three dimensions via cables. We develop robot configurations that ensure static equilibrium of the object at a desired pose... more
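For context, static equilibrium of a cable-suspended rigid body at a desired pose is usually expressed as a wrench balance of the following general form (a standard condition, stated independently of the specific robot configurations developed in the paper):

\[
\sum_{i=1}^{n} t_i \,\hat{\mathbf{u}}_i + m\mathbf{g} = \mathbf{0},
\qquad
\sum_{i=1}^{n} \mathbf{r}_i \times \left( t_i \,\hat{\mathbf{u}}_i \right) = \mathbf{0},
\qquad
t_i \ge 0 \;\; \forall i,
\]

where t_i is the tension in cable i, \hat{\mathbf{u}}_i the unit direction of that cable at its attachment point, \mathbf{r}_i the attachment point relative to the object's centre of mass, and m\mathbf{g} the object's weight; the non-negativity constraints reflect that cables can only pull.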