Lower Bounds for Pseudo-Deterministic Counting in a Stream

Authors: Vladimir Braverman, Robert Krauthgamer, Aditya Krishnan, Shay Sapir




File

LIPIcs.ICALP.2023.30.pdf
  • Filesize: 0.71 MB
  • 14 pages

Author Details

Vladimir Braverman
  • Rice University, Houston, TX, USA
Robert Krauthgamer
  • Weizmann Institute of Science, Rehovot, Israel
Aditya Krishnan
  • Pinecone, San Francisco, CA, USA
Shay Sapir
  • Weizmann Institute of Science, Rehovot, Israel

Cite As

Vladimir Braverman, Robert Krauthgamer, Aditya Krishnan, and Shay Sapir. Lower Bounds for Pseudo-Deterministic Counting in a Stream. In 50th International Colloquium on Automata, Languages, and Programming (ICALP 2023). Leibniz International Proceedings in Informatics (LIPIcs), Volume 261, pp. 30:1-30:14, Schloss Dagstuhl – Leibniz-Zentrum für Informatik (2023)
https://doi.org/10.4230/LIPIcs.ICALP.2023.30

Abstract

Many streaming algorithms provide only a high-probability relative approximation. These two relaxations, of allowing approximation and randomization, seem necessary: for many streaming problems, both relaxations must be employed simultaneously to avoid an exponentially larger (and often trivial) space complexity. A common drawback of these randomized approximate algorithms is that independent executions on the same input may produce different outputs, depending on their random coins. Pseudo-deterministic algorithms combat this issue: for every input, they output with high probability the same "canonical" solution. We consider perhaps the most basic problem in data streams, that of counting the number of items in a stream of length at most n. Morris’s counter [CACM, 1978] is a randomized approximation algorithm for this problem that uses O(log log n) bits of space, for every fixed approximation factor (greater than 1). Goldwasser, Grossman, Mohanty and Woodruff [ITCS 2020] asked whether pseudo-deterministic approximation algorithms can match this space complexity. Our main result answers their question negatively, showing that such algorithms must use Ω(√(log n / log log n)) bits of space. Our approach is based on a problem that we call Shift Finding, which may be of independent interest. In this problem, one has query access to a shifted version of a known string F ∈ {0,1}^{3n}, which is guaranteed to start with n zeros and end with n ones, and the goal is to find the unknown shift using a small number of queries. We provide an algorithm for this problem that uses O(√n) queries. It remains open whether poly(log n) queries suffice; if so, then our techniques immediately imply a nearly-tight Ω(log n / log log n) space bound for pseudo-deterministic approximate counting.
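
To make the drawback described in the abstract concrete, the following is a minimal Python sketch of the textbook base-2 variant of Morris's counter. It is an illustration only and not code from the paper: the function name `morris_counter` and the use of seeded `random.Random` instances are assumptions made for the example. This basic variant achieves only a constant-factor approximation with constant probability; the space bound quoted above for every fixed approximation factor relies on refinements that are not shown here.

```python
import random


def morris_counter(stream_length, rng):
    """Run a base-2 Morris counter over `stream_length` items.

    Returns (x, estimate). Only x needs to be stored, and x is typically
    close to log2(stream_length), so it fits in about log log n bits.
    """
    x = 0
    for _ in range(stream_length):
        # Increment x with probability 2^(-x).
        if rng.random() < 2.0 ** (-x):
            x += 1
    # 2^x - 1 is an unbiased estimate of the number of items seen.
    return x, 2 ** x - 1


if __name__ == "__main__":
    n = 1000
    # Independent executions on the same input: the estimates generally differ,
    # which is the behavior a pseudo-deterministic algorithm must avoid.
    for seed in range(5):
        x, estimate = morris_counter(n, random.Random(seed))
        print(f"seed={seed}: x={x}, estimate={estimate}")
```

Each run uses the same stream length, yet the returned estimate depends on the random coins. A pseudo-deterministic counter would instead have to return one canonical value for each stream length with high probability.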

Subject Classification

ACM Subject Classification
  • Theory of computation → Streaming, sublinear and near linear time algorithms
  • Theory of computation → Lower bounds and information complexity
  • Theory of computation → Pseudorandomness and derandomization
Keywords
  • streaming algorithms
  • pseudo-deterministic
  • approximate counting

References

  1. Noga Alon, Yossi Matias, and Mario Szegedy. The space complexity of approximating the frequency moments. In Proceedings of the Twenty-Eighth Annual ACM Symposium on the Theory of Computing, pages 20-29, 1996. URL: https://doi.org/10.1145/237814.237823.
  2. Alexandr Andoni, Piotr Indyk, Dina Katabi, and Haitham Hassanieh. Shift finding in sub-linear time. In Proceedings of the Twenty-Fourth Annual ACM-SIAM Symposium on Discrete Algorithms, SODA, pages 457-465, 2013. URL: https://doi.org/10.1137/1.9781611973105.33.
  3. Omri Ben-Eliezer, Rajesh Jayaram, David P. Woodruff, and Eylon Yogev. A framework for adversarially robust streaming algorithms. J. ACM, 69(2):17:1-17:33, 2022. URL: https://doi.org/10.1145/3498334.
  4. Amit Chakrabarti, Prantar Ghosh, and Manuel Stoeckl. Adversarially robust coloring for graph streams. In 13th Innovations in Theoretical Computer Science Conference, ITCS, volume 215 of LIPIcs, pages 37:1-37:23. Schloss Dagstuhl - Leibniz-Zentrum für Informatik, 2022. URL: https://doi.org/10.4230/LIPIcs.ITCS.2022.37.
  5. Peter Dixon, A. Pavan, and N. V. Vinodchandran. On pseudodeterministic approximation algorithms. In 43rd International Symposium on Mathematical Foundations of Computer Science, MFCS, volume 117 of LIPIcs, pages 61:1-61:11. Schloss Dagstuhl - Leibniz-Zentrum für Informatik, 2018. URL: https://doi.org/10.4230/LIPIcs.MFCS.2018.61.
  6. Peter Dixon, A. Pavan, and N. V. Vinodchandran. Complete problems for multi-pseudodeterministic computations. In 12th Innovations in Theoretical Computer Science Conference, ITCS, volume 185 of LIPIcs, pages 66:1-66:16. Schloss Dagstuhl - Leibniz-Zentrum für Informatik, 2021. URL: https://doi.org/10.4230/LIPIcs.ITCS.2021.66.
  7. Peter Dixon, A. Pavan, Jason Vander Woude, and N. V. Vinodchandran. Pseudodeterminism: promises and lowerbounds. In STOC '22: 54th Annual ACM Symposium on Theory of Computing, pages 1552-1565, 2022. URL: https://doi.org/10.1145/3519935.3520043.
  8. Philippe Flajolet. Approximate counting: A detailed analysis. BIT, 25(1):113-134, 1985. URL: https://doi.org/10.1007/BF01934993.
  9. Eran Gat and Shafi Goldwasser. Probabilistic search algorithms with unique answers and their cryptographic applications. Electron. Colloquium Comput. Complex., TR11-136, 2011. URL: https://eccc.weizmann.ac.il/report/2011/136.
  10. Sumanta Ghosh and Rohit Gurjar. Matroid intersection: A pseudo-deterministic parallel reduction from search to weighted-decision. In Approximation, Randomization, and Combinatorial Optimization. Algorithms and Techniques, APPROX/RANDOM, volume 207 of LIPIcs, pages 41:1-41:16. Schloss Dagstuhl - Leibniz-Zentrum für Informatik, 2021. URL: https://doi.org/10.4230/LIPIcs.APPROX/RANDOM.2021.41.
  11. Michel X. Goemans, Shafi Goldwasser, and Dhiraj Holden. Doubly-efficient pseudo-deterministic proofs. CoRR, abs/1910.00994, 2019. URL: https://arxiv.org/abs/1910.00994.
  12. Oded Goldreich. Multi-pseudodeterministic algorithms. Electron. Colloquium Comput. Complex., TR19-012, 2019. URL: https://eccc.weizmann.ac.il/report/2019/012.
  13. Oded Goldreich, Shafi Goldwasser, and Dana Ron. On the possibilities and limitations of pseudodeterministic algorithms. In Innovations in Theoretical Computer Science, ITCS, pages 127-138. ACM, 2013. URL: https://doi.org/10.1145/2422436.2422453.
  14. Shafi Goldwasser and Ofer Grossman. Perfect bipartite matching in pseudo-deterministic RNC. Electron. Colloquium Comput. Complex., TR15-208, 2015. URL: https://eccc.weizmann.ac.il/report/2015/208.
  15. Shafi Goldwasser, Ofer Grossman, and Dhiraj Holden. Pseudo-deterministic proofs. In 9th Innovations in Theoretical Computer Science Conference, ITCS, volume 94 of LIPIcs, pages 17:1-17:18. Schloss Dagstuhl - Leibniz-Zentrum für Informatik, 2018. URL: https://doi.org/10.4230/LIPIcs.ITCS.2018.17.
  16. Shafi Goldwasser, Ofer Grossman, Sidhanth Mohanty, and David P. Woodruff. Pseudo-deterministic streaming. In 11th Innovations in Theoretical Computer Science Conference, ITCS, volume 151 of LIPIcs, pages 79:1-79:25, 2020. URL: https://doi.org/10.4230/LIPIcs.ITCS.2020.79.
  17. Shafi Goldwasser, Russell Impagliazzo, Toniann Pitassi, and Rahul Santhanam. On the pseudo-deterministic query complexity of NP search problems. In 36th Computational Complexity Conference, CCC, volume 200 of LIPIcs, pages 36:1-36:22. Schloss Dagstuhl - Leibniz-Zentrum für Informatik, 2021. URL: https://doi.org/10.4230/LIPIcs.CCC.2021.36.
  18. André Gronemeier and Martin Sauerhoff. Applying approximate counting for computing the frequency moments of long data streams. Theory Comput. Syst., 44(3):332-348, 2009. URL: https://doi.org/10.1007/s00224-007-9048-z.
  19. Ofer Grossman. Finding primitive roots pseudo-deterministically. Electron. Colloquium Comput. Complex., TR15-207, 2015. URL: https://eccc.weizmann.ac.il/report/2015/207.
  20. Ofer Grossman, Meghal Gupta, and Mark Sellke. Tight space lower bound for pseudo-deterministic approximate counting. arXiv preprint, 2023. URL: https://arxiv.org/abs/2304.01438.
  21. Ofer Grossman and Yang P. Liu. Reproducibility and pseudo-determinism in log-space. In Proceedings of the Thirtieth Annual ACM-SIAM Symposium on Discrete Algorithms, SODA, pages 606-620. SIAM, 2019. URL: https://doi.org/10.1137/1.9781611975482.38.
  22. Moritz Hardt and David P. Woodruff. How robust are linear sketches to adaptive inputs? In Proceedings of the Forty-Fifth Annual ACM Symposium on Theory of Computing, pages 121-130, 2013. URL: https://doi.org/10.1145/2488608.2488624.
  23. Haitham Hassanieh, Fadel Adib, Dina Katabi, and Piotr Indyk. Faster GPS via the sparse Fourier transform. In The 18th Annual International Conference on Mobile Computing and Networking, Mobicom, pages 353-364. ACM, 2012. URL: https://doi.org/10.1145/2348543.2348587.
  24. Dhiraj Holden. A note on unconditional subexponential-time pseudo-deterministic algorithms for BPP search problems. CoRR, abs/1707.05808, 2017. URL: https://arxiv.org/abs/1707.05808.
  25. Haim Kaplan, Yishay Mansour, Kobbi Nissim, and Uri Stemmer. Separating adaptive streaming from oblivious streaming using the bounded storage model. In Advances in Cryptology - CRYPTO, volume 12827 of Lecture Notes in Computer Science, pages 94-121. Springer, 2021. URL: https://doi.org/10.1007/978-3-030-84252-9_4.
  26. Zhenjian Lu, Igor Carboni Oliveira, and Rahul Santhanam. Pseudodeterministic algorithms and the structure of probabilistic time. In STOC '21: 53rd Annual ACM Symposium on Theory of Computing, pages 303-316, 2021. URL: https://doi.org/10.1145/3406325.3451085.
  27. Jérémie O. Lumbroso. How Flajolet processed streams with coin flips. CoRR, abs/1805.00612, 2018. URL: https://arxiv.org/abs/1805.00612.
  28. Robert Morris. Counting large numbers of events in small registers. Commun. ACM, 21(10):840-842, 1978. URL: https://doi.org/10.1145/359619.359627.
  29. Jelani Nelson and Huacheng Yu. Optimal bounds for approximate counting. In Proceedings of the 41st ACM Symposium on Principles of Database Systems, PODS, pages 119-127, 2022. URL: https://doi.org/10.1145/3517804.3526225.
  30. Igor Carboni Oliveira and Rahul Santhanam. Pseudodeterministic constructions in subexponential time. In Proceedings of the 49th Annual ACM Symposium on Theory of Computing, STOC, pages 665-677, 2017. URL: https://doi.org/10.1145/3055399.3055500.
  31. Igor Carboni Oliveira and Rahul Santhanam. Pseudo-derandomizing learning and approximation. In Approximation, Randomization, and Combinatorial Optimization. Algorithms and Techniques, APPROX/RANDOM, volume 116 of LIPIcs, pages 55:1-55:19. Schloss Dagstuhl - Leibniz-Zentrum für Informatik, 2018. URL: https://doi.org/10.4230/LIPIcs.APPROX-RANDOM.2018.55.
  32. Manuel Stoeckl. Streaming algorithms for the missing item finding problem. In Proceedings of the Annual ACM-SIAM Symposium on Discrete Algorithms (SODA), pages 793-818, 2023. URL: https://doi.org/10.1137/1.9781611977554.ch32.