Minimizing Negative Side Effects in Cooperative Multi-Agent Systems using Distributed Coordination

Autores

  • Moumita Choudhury University of Massachusetts Amherst
  • Sandhya Saisubramanian
  • Hao Zhang
  • Shlomo Zilberstein

DOI:

https://doi.org/10.32473/flairs.37.1.135573

Palavras-chave:

Multi-agent System

Resumo

Autonomous agents operating in real-world environments frequently encounter undesirable outcomes or negative side effects (NSEs) when working collaboratively alongside other agents. Even when agents can execute their primary task optimally when operating in isolation, their training may not account for potential negative interactions that arise in the presence of other agents. We frame the challenge of minimizing NSEs as a lexicographic decentralized Markov decision process in which we assume independence of rewards and transitions with respect to the primary assigned tasks, but recognize that addressing negative side effects creates a form of dependence among the agents. We present a lexicographic Q-learning approach to mitigate the NSEs using human feedback models while maintaining near-optimality with respect to the assigned tasks---up to some given slack. Our empirical evaluation across two domains demonstrates that our collaborative approach effectively mitigates NSEs, outperforming non-collaborative methods.

Downloads

Publicado

2024-05-12

Como Citar

Choudhury, M., Saisubramanian, S., Zhang, H., & Zilberstein, S. (2024). Minimizing Negative Side Effects in Cooperative Multi-Agent Systems using Distributed Coordination. The International FLAIRS Conference Proceedings, 37(1). https://doi.org/10.32473/flairs.37.1.135573

Edição

Seção

Main Track Proceedings