Concurrent Stochastic Games with Stateful-Discounted and Parity Objectives: Complexity and Algorithms

Authors Ali Asadi , Krishnendu Chatterjee , Raimundo Saona , Jakub Svoboda



PDF
Thumbnail PDF

File

LIPIcs.FSTTCS.2024.5.pdf
  • Filesize: 0.8 MB
  • 17 pages

Document Identifiers

Author Details

Ali Asadi
  • Institute of Science and Technology Austria (ISTA), Klosterneuburg, Austria
Krishnendu Chatterjee
  • Institute of Science and Technology Austria (ISTA), Klosterneuburg, Austria
Raimundo Saona
  • Institute of Science and Technology Austria (ISTA), Klosterneuburg, Austria
Jakub Svoboda
  • Institute of Science and Technology Austria (ISTA), Klosterneuburg, Austria

Cite As Get BibTex

Ali Asadi, Krishnendu Chatterjee, Raimundo Saona, and Jakub Svoboda. Concurrent Stochastic Games with Stateful-Discounted and Parity Objectives: Complexity and Algorithms. In 44th IARCS Annual Conference on Foundations of Software Technology and Theoretical Computer Science (FSTTCS 2024). Leibniz International Proceedings in Informatics (LIPIcs), Volume 323, pp. 5:1-5:17, Schloss Dagstuhl – Leibniz-Zentrum für Informatik (2024) https://doi.org/10.4230/LIPIcs.FSTTCS.2024.5

Abstract

We study two-player zero-sum concurrent stochastic games with finite state and action space played for an infinite number of steps. In every step, the two players simultaneously and independently choose an action. Given the current state and the chosen actions, the next state is obtained according to a stochastic transition function. An objective is a measurable function on plays (or infinite trajectories) of the game, and the value for an objective is the maximal expectation that the player can guarantee against the adversarial player. We consider: (a) stateful-discounted objectives, which are similar to the classic discounted-sum objectives, but states are associated with different discount factors rather than a single discount factor; and (b) parity objectives, which are a canonical representation for ω-regular objectives. For stateful-discounted objectives, given an ordering of the discount factors, the limit value is the limit of the value of the stateful-discounted objectives, as the discount factors approach zero according to the given order.
The computational problem we consider is the approximation of the value within an arbitrary additive error. The above problem is known to be in EXPSPACE for the limit value of stateful-discounted objectives and in PSPACE for parity objectives. The best-known algorithms for both the above problems are at least exponential time, with an exponential dependence on the number of states and actions. Our main results for the value approximation problem for the limit value of stateful-discounted objectives and parity objectives are as follows: (a) we establish TFNP[NP] complexity; and (b) we present algorithms that improve the dependency on the number of actions in the exponent from linear to logarithmic. In particular, if the number of states is constant, our algorithms run in polynomial time.

Subject Classification

ACM Subject Classification
  • Theory of computation → Logic and verification
Keywords
  • Concurrent Stochastic Games
  • Parity Objectives
  • Discounted-sum Objectives

Metrics

  • Access Statistics
  • Total Accesses (updated on a weekly basis)
    0
    PDF Downloads

References

  1. Rajeev Alur, Thomas A Henzinger, and Orna Kupferman. Alternating-time temporal logic. Journal of the ACM (JACM), 49(5):672-713, 2002. URL: https://doi.org/10.1145/585265.585270.
  2. Luc Attia and Miquel Oliu-Barton. A formula for the value of a stochastic game. Proceedings of the National Academy of Sciences, 116(52):26435-26443, 2019. Google Scholar
  3. Saugata Basu. New results on quantifier elimination over real closed fields and applications to constraint databases. Journal of the ACM (JACM), 46(4):537-555, 1999. URL: https://doi.org/10.1145/320211.320240.
  4. Sougata Bose, Rasmus Ibsen-Jensen, and Patrick Totzke. Bounded-memory strategies in partial-information games. In Proceedings of the 39th Annual ACM/IEEE Symposium on Logic in Computer Science, pages 1-14, 2024. URL: https://doi.org/10.1145/3661814.3662096.
  5. Krishnendu Chatterjee. Stochastic ω-regular games. University of California, Berkeley, 2007. Google Scholar
  6. Krishnendu Chatterjee, Luca de Alfaro, and Thomas A Henzinger. The complexity of quantitative concurrent parity games. In Proceedings of the seventeenth annual ACM-SIAM symposium on Discrete algorithm, pages 678-687, 2006. URL: http://dl.acm.org/citation.cfm?id=1109557.1109631.
  7. Krishnendu Chatterjee, Rupak Majumdar, and Thomas A Henzinger. Stochastic limit-average games are in EXPTIME. International Journal of Game Theory, 37(2):219-234, 2008. URL: https://doi.org/10.1007/S00182-007-0110-5.
  8. Luca de Alfaro and Thomas A Henzinger. Concurrent omega-regular games. In Proceedings Fifteenth Annual IEEE Symposium on Logic in Computer Science, pages 141-154. IEEE, 2000. URL: https://doi.org/10.1109/LICS.2000.855763.
  9. Luca de Alfaro, Thomas A Henzinger, and Orna Kupferman. Concurrent reachability games. In Proceedings 39th Annual Symposium on Foundations of Computer Science, pages 564-564. IEEE Computer Society, 1998. Google Scholar
  10. Luca de Alfaro, Thomas A Henzinger, and Rupak Majumdar. Discounting the future in systems theory. In International Colloquium on Automata, Languages, and Programming, pages 1022-1037. Springer, 2003. URL: https://doi.org/10.1007/3-540-45061-0_79.
  11. Luca de Alfaro, Thomas A Henzinger, and Freddy YC Mang. The control of synchronous systems. In International Conference on Concurrency Theory, pages 458-473. Springer, 2000. Google Scholar
  12. Luca de Alfaro, Thomas A Henzinger, and Freddy YC Mang. The control of synchronous systems, part II. In International Conference on Concurrency Theory, pages 566-581. Springer, 2001. Google Scholar
  13. Luca de Alfaro and Rupak Majumdar. Quantitative solution of omega-regular games. In Proceedings of the thirty-third annual ACM symposium on Theory of computing, pages 675-683, 2001. URL: https://doi.org/10.1145/380752.380871.
  14. Kousha Etessami and Mihalis Yannakakis. Recursive concurrent stochastic games. Logical Methods in Computer Science, 4, 2008. URL: https://doi.org/10.2168/LMCS-4(4:7)2008.
  15. Hugh Everett. Recursive games. Contributions to the Theory of Games, 3(39):47-78, 1957. Google Scholar
  16. Jerzy Filar and Koos Vrieze. Competitive Markov Decision Processes. Springer-Verlag, 1997. Google Scholar
  17. Søren Kristoffer Stiil Frederiksen and Peter Bro Miltersen. Approximating the value of a concurrent reachability game in the polynomial time hierarchy. In International Symposium on Algorithms and Computation, pages 457-467. Springer, 2013. URL: https://doi.org/10.1007/978-3-642-45030-3_43.
  18. Hugo Gimbert and Wiesław Zielonka. Discounting infinite games but how and why? Electronic Notes in Theoretical Computer Science, 119(1):3-9, 2005. URL: https://doi.org/10.1016/J.ENTCS.2004.07.005.
  19. Hugo Gimbert and Wiesław Zielonka. Blackwell optimal strategies in priority mean-payoff games. International Journal of Foundations of Computer Science, 23(03):687-711, 2012. URL: https://doi.org/10.1142/S0129054112400345.
  20. Kristoffer Arnsfelt Hansen, Rasmus Ibsen-Jensen, and Peter Bro Miltersen. The complexity of solving reachability games using value and strategy iteration. In International Computer Science Symposium in Russia, pages 77-90. Springer, 2011. URL: https://doi.org/10.1007/978-3-642-20712-9_7.
  21. Kristoffer Arnsfelt Hansen, Michal Koucky, Niels Lauritzen, Peter Bro Miltersen, and Elias P Tsigaridas. Exact algorithms for solving stochastic games. In Proceedings of the forty-third annual ACM symposium on Theory of computing, pages 205-214, 2011. Google Scholar
  22. Kristoffer Arnsfelt Hansen, Michal Koucky, and Peter Bro Miltersen. Winning concurrent reachability games requires doubly-exponential patience. In 2009 24th Annual IEEE Symposium on Logic In Computer Science, pages 332-341. IEEE, 2009. URL: https://doi.org/10.1109/LICS.2009.44.
  23. Donald A Martin. The determinacy of Blackwell games. The Journal of Symbolic Logic, 63(4):1565-1581, 1998. URL: https://doi.org/10.2307/2586667.
  24. Jean-François Mertens and Abraham Neyman. Stochastic games. International Journal of Game Theory, 10:53-66, 1981. Google Scholar
  25. Miquel Oliu-Barton. New algorithms for solving zero-sum stochastic games. Mathematics of Operations Research, 46(1):255-267, 2021. URL: https://doi.org/10.1287/MOOR.2020.1055.
  26. Lloyd Stowell Shapley. Stochastic games. Proceedings of the national academy of sciences, 39(10):1095-1100, 1953. Google Scholar
  27. Wolfgang Thomas. Languages, automata, and logic. In Handbook of Formal Languages: Volume 3 Beyond Words, pages 389-455. Springer, 1997. URL: https://doi.org/10.1007/978-3-642-59126-6_7.
Questions / Remarks / Feedback
X

Feedback for Dagstuhl Publishing


Thanks for your feedback!

Feedback submitted

Could not send message

Please try again later or send an E-mail