Concurrent Stochastic Games with Stateful-Discounted and Parity Objectives: Complexity and Algorithms

Asadi, Ali; Chatterjee, Krishnendu; Saona, Raimundo; Svoboda, Jakub

doi:10.4230/LIPIcs.FSTTCS.2024.5

Abstract

We study two-player zero-sum concurrent stochastic games with finite state and action space played for an infinite number of steps. In every step, the two players simultaneously and independently choose an action. Given the current state and the chosen actions, the next state is obtained according to a stochastic transition function. An objective is a measurable function on plays (or infinite trajectories) of the game, and the value for an objective is the maximal expectation that the player can guarantee against the adversarial player. We consider: (a) stateful-discounted objectives, which are similar to the classic discounted-sum objectives, but states are associated with different discount factors rather than a single discount factor; and (b) parity objectives, which are a canonical representation for ω-regular objectives. For stateful-discounted objectives, given an ordering of the discount factors, the limit value is the limit of the value of the stateful-discounted objectives, as the discount factors approach zero according to the given order.
The computational problem we consider is the approximation of the value within an arbitrary additive error. The above problem is known to be in EXPSPACE for the limit value of stateful-discounted objectives and in PSPACE for parity objectives. The best-known algorithms for both the above problems are at least exponential time, with an exponential dependence on the number of states and actions. Our main results for the value approximation problem for the limit value of stateful-discounted objectives and parity objectives are as follows: (a) we establish TFNP[NP] complexity; and (b) we present algorithms that improve the dependency on the number of actions in the exponent from linear to logarithmic. In particular, if the number of states is constant, our algorithms run in polynomial time.

Rajeev Alur, Thomas A Henzinger, and Orna Kupferman. Alternating-time temporal logic. Journal of the ACM (JACM), 49(5):672-713, 2002. URL: https://doi.org/10.1145/585265.585270.
Luc Attia and Miquel Oliu-Barton. A formula for the value of a stochastic game. Proceedings of the National Academy of Sciences, 116(52):26435-26443, 2019.
Saugata Basu. New results on quantifier elimination over real closed fields and applications to constraint databases. Journal of the ACM (JACM), 46(4):537-555, 1999. URL: https://doi.org/10.1145/320211.320240.
Sougata Bose, Rasmus Ibsen-Jensen, and Patrick Totzke. Bounded-memory strategies in partial-information games. In Proceedings of the 39th Annual ACM/IEEE Symposium on Logic in Computer Science, pages 1-14, 2024. URL: https://doi.org/10.1145/3661814.3662096.
Krishnendu Chatterjee. Stochastic ω-regular games. University of California, Berkeley, 2007.
Krishnendu Chatterjee, Luca de Alfaro, and Thomas A Henzinger. The complexity of quantitative concurrent parity games. In Proceedings of the seventeenth annual ACM-SIAM symposium on Discrete algorithm, pages 678-687, 2006. URL: http://dl.acm.org/citation.cfm?id=1109557.1109631.
Krishnendu Chatterjee, Rupak Majumdar, and Thomas A Henzinger. Stochastic limit-average games are in EXPTIME. International Journal of Game Theory, 37(2):219-234, 2008. URL: https://doi.org/10.1007/S00182-007-0110-5.
Luca de Alfaro and Thomas A Henzinger. Concurrent omega-regular games. In Proceedings Fifteenth Annual IEEE Symposium on Logic in Computer Science, pages 141-154. IEEE, 2000. URL: https://doi.org/10.1109/LICS.2000.855763.
Luca de Alfaro, Thomas A Henzinger, and Orna Kupferman. Concurrent reachability games. In Proceedings 39th Annual Symposium on Foundations of Computer Science, pages 564-564. IEEE Computer Society, 1998.
Luca de Alfaro, Thomas A Henzinger, and Rupak Majumdar. Discounting the future in systems theory. In International Colloquium on Automata, Languages, and Programming, pages 1022-1037. Springer, 2003. URL: https://doi.org/10.1007/3-540-45061-0_79.
Luca de Alfaro, Thomas A Henzinger, and Freddy YC Mang. The control of synchronous systems. In International Conference on Concurrency Theory, pages 458-473. Springer, 2000.
Luca de Alfaro, Thomas A Henzinger, and Freddy YC Mang. The control of synchronous systems, part II. In International Conference on Concurrency Theory, pages 566-581. Springer, 2001.
Luca de Alfaro and Rupak Majumdar. Quantitative solution of omega-regular games. In Proceedings of the thirty-third annual ACM symposium on Theory of computing, pages 675-683, 2001. URL: https://doi.org/10.1145/380752.380871.
Kousha Etessami and Mihalis Yannakakis. Recursive concurrent stochastic games. Logical Methods in Computer Science, 4, 2008. URL: https://doi.org/10.2168/LMCS-4(4:7)2008.
Hugh Everett. Recursive games. Contributions to the Theory of Games, 3(39):47-78, 1957.
Jerzy Filar and Koos Vrieze. Competitive Markov Decision Processes. Springer-Verlag, 1997.
Søren Kristoffer Stiil Frederiksen and Peter Bro Miltersen. Approximating the value of a concurrent reachability game in the polynomial time hierarchy. In International Symposium on Algorithms and Computation, pages 457-467. Springer, 2013. URL: https://doi.org/10.1007/978-3-642-45030-3_43.
Hugo Gimbert and Wiesław Zielonka. Discounting infinite games but how and why? Electronic Notes in Theoretical Computer Science, 119(1):3-9, 2005. URL: https://doi.org/10.1016/J.ENTCS.2004.07.005.
Hugo Gimbert and Wiesław Zielonka. Blackwell optimal strategies in priority mean-payoff games. International Journal of Foundations of Computer Science, 23(03):687-711, 2012. URL: https://doi.org/10.1142/S0129054112400345.
Kristoffer Arnsfelt Hansen, Rasmus Ibsen-Jensen, and Peter Bro Miltersen. The complexity of solving reachability games using value and strategy iteration. In International Computer Science Symposium in Russia, pages 77-90. Springer, 2011. URL: https://doi.org/10.1007/978-3-642-20712-9_7.
Kristoffer Arnsfelt Hansen, Michal Koucky, Niels Lauritzen, Peter Bro Miltersen, and Elias P Tsigaridas. Exact algorithms for solving stochastic games. In Proceedings of the forty-third annual ACM symposium on Theory of computing, pages 205-214, 2011.
Kristoffer Arnsfelt Hansen, Michal Koucky, and Peter Bro Miltersen. Winning concurrent reachability games requires doubly-exponential patience. In 2009 24th Annual IEEE Symposium on Logic In Computer Science, pages 332-341. IEEE, 2009. URL: https://doi.org/10.1109/LICS.2009.44.
Donald A Martin. The determinacy of Blackwell games. The Journal of Symbolic Logic, 63(4):1565-1581, 1998. URL: https://doi.org/10.2307/2586667.
Jean-François Mertens and Abraham Neyman. Stochastic games. International Journal of Game Theory, 10:53-66, 1981.
Miquel Oliu-Barton. New algorithms for solving zero-sum stochastic games. Mathematics of Operations Research, 46(1):255-267, 2021. URL: https://doi.org/10.1287/MOOR.2020.1055.
Lloyd Stowell Shapley. Stochastic games. Proceedings of the national academy of sciences, 39(10):1095-1100, 1953.
Wolfgang Thomas. Languages, automata, and logic. In Handbook of Formal Languages: Volume 3 Beyond Words, pages 389-455. Springer, 1997. URL: https://doi.org/10.1007/978-3-642-59126-6_7.

Concurrent Stochastic Games with Stateful-Discounted and Parity Objectives: Complexity and Algorithms

Authors Ali Asadi , Krishnendu Chatterjee , Raimundo Saona , Jakub Svoboda

File

Document Identifiers

Subject Classification

ACM Subject Classification

Keywords

Metrics

Abstract

Cite As Get BibTex

Author Details

References

Thanks for your feedback!

Could not send message

Concurrent Stochastic Games with Stateful-Discounted and Parity Objectives: Complexity and Algorithms

Authors Ali Asadi , Krishnendu Chatterjee , Raimundo Saona , Jakub Svoboda

File

Document Identifiers

Related Versions

Subject Classification

ACM Subject Classification

Keywords

Metrics

Abstract

Cite As Get BibTex

Author Details

Funding

References

Thanks for your feedback!

Could not send message