Best Response Computation in Multiplayer Imperfect-Information Stochastic Games

Sam Ganzfried

doi:10.32473/flairs.v35i.130546

Best Response Computation in Multiplayer Imperfect-Information Stochastic Games

Autores/as

Sam Ganzfried Ganzfried Research

DOI:

https://doi.org/10.32473/flairs.v35i.130546

Palabras clave:

game theory, stochastic game, imperfect information

Resumen

Computing a best response is a fundamental task in game theory. One of its uses is to compute the degree of approximation error of an approximation of Nash equilibrium strategies. In many game classes best responses can be computed in polynomial time, but in imperfect-information stochastic games it is equivalent to solving a POMDP and is PSPACE-complete. Prior work has developed an algorithm for computing an approximation of Nash equilibrium strategies in a 4-player imperfect-information naval strategic planning problem which is modeled as a stochastic game. A heuristic approach was developed for computing best responses to determine the approximation error of these strategies, which was shown to have limited scalability. In this paper we present approaches that utilize parallelism to significantly speed up this computation allowing us to compute best responses in significantly larger games.

Descargas

PDF (English)

Publicado

2022-05-04

Cómo citar

Ganzfried, S. (2022). Best Response Computation in Multiplayer Imperfect-Information Stochastic Games. The International FLAIRS Conference Proceedings, 35. https://doi.org/10.32473/flairs.v35i.130546

Descargar cita

Número

Vol. 35 (2022): Proceedings of FLAIRS-35

Sección

Main Track

Licencia

Derechos de autor 2022 Sam Ganzfried

Esta obra está bajo una licencia internacional Creative Commons Atribución-NoComercial 4.0.

Best Response Computation in Multiplayer Imperfect-Information Stochastic Games

Autores/as

DOI:

Palabras clave:

Resumen

Descargas

Publicado

Cómo citar

Número

Sección

Licencia

Desarrollado por

Enviar un artículo

Idioma