Towards Optimal Dynamic Indexes for Approximate (and Exact) Triangle Counting

Authors Shangqi Lu, Yufei Tao



PDF
Thumbnail PDF

File

LIPIcs.ICDT.2021.6.pdf
  • Filesize: 0.98 MB
  • 23 pages

Document Identifiers

Author Details

Shangqi Lu
  • The Chinese University of Hong Kong, China
Yufei Tao
  • The Chinese University of Hong Kong, China

Cite AsGet BibTex

Shangqi Lu and Yufei Tao. Towards Optimal Dynamic Indexes for Approximate (and Exact) Triangle Counting. In 24th International Conference on Database Theory (ICDT 2021). Leibniz International Proceedings in Informatics (LIPIcs), Volume 186, pp. 6:1-6:23, Schloss Dagstuhl – Leibniz-Zentrum für Informatik (2021)
https://doi.org/10.4230/LIPIcs.ICDT.2021.6

Abstract

In ICDT'19, Kara, Ngo, Nikolic, Olteanu, and Zhang gave a structure which maintains the number T of triangles in an undirected graph G = (V, E) along with the edge insertions/deletions in G. Using O(m) space (m = |E|), their structure supports an update in O(√m log m) amortized time which is optimal (up to polylog factors) subject to the OMv-conjecture (Henzinger, Krinninger, Nanongkai, and Saranurak, STOC'15). Aiming to improve the update efficiency, we study: - the optimal tradeoff between update time and approximation quality. We require a structure to provide the (ε, Γ)-guarantee: when queried, it should return an estimate t of T that has relative error at most ε if T ≥ Γ, or an absolute error at most ε ⋅ Γ, otherwise. We prove that, under any ε ≤ 0.49 and subject to the OMv-conjecture, no structure can guarantee O(m^{0.5-δ}/Γ) expected amortized update time and O(m^{2/3-δ}) query time simultaneously for any constant δ > 0; this is true for Γ = m^c of any constant c in [0, 1/2). We match the lower bound with a structure that ensures Õ((1/ε)³ ⋅ √m/Γ) amortized update time with high probability, and O(1) query time. - (for exact counting) how to achieve arboricity-sensitive update time. For any 1 ≤ Γ ≤ √m, we describe a structure of O(min{α m + m log m, (m/Γ)²}) space that maintains T precisely, and supports an update in Õ(min{α + Γ, √m}) amortized time, where α is the largest arboricity of G in history (and does not need to be known). Our structure reconstructs the aforementioned ICDT'19 result up to polylog factors by setting Γ = √m, but achieves Õ(m^{0.5-δ}) update time as long as α = O(m^{0.5-δ}).

Subject Classification

ACM Subject Classification
  • Theory of computation → Database query processing and optimization (theory)
Keywords
  • Triangle Counting
  • Data Structures
  • Lower Bounds
  • Graph Algorithms

Metrics

  • Access Statistics
  • Total Accesses (updated on a weekly basis)
    0
    PDF Downloads

References

  1. Noga Alon, Raphael Yuster, and Uri Zwick. Finding and counting given length cycles. Algorithmica, 17(3):209-223, 1997. Google Scholar
  2. Sepehr Assadi, Michael Kapralov, and Sanjeev Khanna. A simple sublinear-time algorithm for counting arbitrary subgraphs via edge sampling. In Innovations in Theoretical Computer Science (ITCS), pages 6:1-6:20, 2019. Google Scholar
  3. Ziv Bar-Yossef, Ravi Kumar, and D. Sivakumar. Reductions in streaming algorithms, with an application to counting triangles in graphs. In Proceedings of the Annual ACM-SIAM Symposium on Discrete Algorithms (SODA), pages 623-632, 2002. Google Scholar
  4. Suman K. Bera and Amit Chakrabarti. Towards tighter space bounds for counting triangles and other substructures in graph streams. In Proceedings of Symposium on Theoretical Aspects of Computer Science (STACS), pages 11:1-11:14, 2017. Google Scholar
  5. Suman K. Bera and C. Seshadhri. How the degeneracy helps for triangle counting in graph streams. In Proceedings of ACM Symposium on Principles of Database Systems (PODS), pages 457-467, 2020. Google Scholar
  6. Edvin Berglin and Gerth Stølting Brodal. A simple greedy algorithm for dynamic graph orientation. Algorithmica, 82(2):245-259, 2020. Google Scholar
  7. Christoph Berkholz, Jens Keppeler, and Nicole Schweikardt. Answering conjunctive queries under updates. In Proceedings of ACM Symposium on Principles of Database Systems (PODS), pages 303-318, 2017. Google Scholar
  8. Christoph Berkholz, Jens Keppeler, and Nicole Schweikardt. Answering FO+MOD queries under updates on bounded degree databases. ACM Transactions on Database Systems (TODS), 43(2):7:1-7:32, 2018. Google Scholar
  9. Vladimir Braverman, Rafail Ostrovsky, and Dan Vilenchik. How hard is counting triangles in the streaming model? In Proceedings of International Colloquium on Automata, Languages and Programming (ICALP), pages 244-254, 2013. Google Scholar
  10. Laurent Bulteau, Vincent Froese, Konstantin Kutzkov, and Rasmus Pagh. Triangle counting in dynamic graph streams. Algorithmica, 76(1):259-278, 2016. Google Scholar
  11. Luciana S. Buriol, Gereon Frahling, Stefano Leonardi, Alberto Marchetti-Spaccamela, and Christian Sohler. Counting triangles in data streams. In Proceedings of ACM Symposium on Principles of Database Systems (PODS), pages 253-262, 2006. Google Scholar
  12. Yu Chen and Ke Yi. Random sampling and size estimation over cyclic joins. In Proceedings of International Conference on Database Theory (ICDT), pages 7:1-7:18, 2020. Google Scholar
  13. N. Chiba and T. Nishizeki. Arboricity and subgraph listing algorithms. SIAM Journal of Computing, 14(1):210-223, 1985. Google Scholar
  14. Thomas H. Cormen, Charles E. Leiserson, Ronald L. Rivest, and Clifford Stein. Introduction to Algorithms, Second Edition. The MIT Press, 2001. Google Scholar
  15. Graham Cormode and Hossein Jowhari. A second look at counting triangles in graph streams (corrected). Theoretical Computer Science, 683:22-30, 2017. Google Scholar
  16. Talya Eden, Amit Levi, Dana Ron, and C. Seshadhri. Approximately counting triangles in sublinear time. SIAM J. Comput., 46(5):1603-1646, 2017. Google Scholar
  17. David Eppstein and Emma S. Spiro. The h-index of a graph and its application to dynamic subgraph statistics. J. Graph Algorithms Appl., 16(2):543-567, 2012. Google Scholar
  18. David García-Soriano and Konstantin Kutzkov. Triangle counting in streamed graphs via small vertex covers. In SIAM International Conference on Data Mining (SDM), pages 352-360, 2014. Google Scholar
  19. Isaac Goldstein, Moshe Lewenstein, and Ely Porat. On the hardness of set disjointness and set intersection with bounded universe. In International Symposium on Algorithms and Computation (ISAAC), pages 7:1-7:22, 2019. Google Scholar
  20. Monika Henzinger, Sebastian Krinninger, Danupon Nanongkai, and Thatchaphol Saranurak. Unifying and strengthening hardness for dynamic problems via the online matrix-vector multiplication conjecture. In Proceedings of ACM Symposium on Theory of Computing (STOC), pages 21-30, 2015. Google Scholar
  21. Monika Henzinger, Sebastian Krinninger, Danupon Nanongkai, and Thatchaphol Saranurak. Unifying and strengthening hardness for dynamic problems via the online matrix-vector multiplication conjecture. CoRR, abs/1511.06773, 2015. Google Scholar
  22. Xiaocheng Hu, Yufei Tao, and Chin-Wan Chung. I/O-Efficient Algorithms on Triangle Listing and Counting. ACM Transactions on Database Systems (TODS), 39(4):27:1-27:30, 2014. Google Scholar
  23. Madhav Jha, C. Seshadhri, and Ali Pinar. A space efficient streaming algorithm for triangle counting using the birthday paradox. In Proceedings of ACM Knowledge Discovery and Data Mining (SIGKDD), pages 589-597, 2013. Google Scholar
  24. Hossein Jowhari and Mohammad Ghodsi. New streaming algorithms for counting triangles in graphs. In Computing and Combinatorics, pages 710-716, 2005. Google Scholar
  25. John Kallaugher and Eric Price. A hybrid sampling scheme for triangle counting. In Proceedings of the Annual ACM-SIAM Symposium on Discrete Algorithms (SODA), pages 1778-1797, 2017. Google Scholar
  26. Daniel M. Kane, Kurt Mehlhorn, Thomas Sauerwald, and He Sun. Counting arbitrary subgraphs in data streams. In Proceedings of International Colloquium on Automata, Languages and Programming (ICALP), pages 598-609, 2012. Google Scholar
  27. Ahmet Kara, Hung Q. Ngo, Milos Nikolic, Dan Olteanu, and Haozhe Zhang. Counting triangles under updates in worst-case optimal time. In Proceedings of International Conference on Database Theory (ICDT), pages 4:1-4:18, 2019. Google Scholar
  28. Casper Kejlberg-Rasmussen, Tsvi Kopelowitz, Seth Pettie, and Ely Porat. Dynamic set intersection. CoRR, abs/1407.6755, 2014. Google Scholar
  29. Tsvi Kopelowitz, Seth Pettie, and Ely Porat. Higher lower bounds from the 3sum conjecture. In Proceedings of the Annual ACM-SIAM Symposium on Discrete Algorithms (SODA), pages 1272-1287, 2016. Google Scholar
  30. Madhusudan Manjunath, Kurt Mehlhorn, Konstantinos Panagiotou, and He Sun. Approximate counting of cycles in streams. In Proceedings of European Symposium on Algorithms (ESA), 2011. Google Scholar
  31. Andrew McGregor, Sofya Vorotnikova, and Hoa T. Vu. Better algorithms for counting triangles in data streams. In Proceedings of ACM Symposium on Principles of Database Systems (PODS), pages 401-411, 2016. Google Scholar
  32. Rasmus Pagh and Charalampos E. Tsourakakis. Colorful triangle counting and a mapreduce implementation. Information Processing Letters (IPL), 112(7):277-281, 2012. Google Scholar
  33. Mihai Patrascu. Towards polynomial lower bounds for dynamic problems. In Proceedings of ACM Symposium on Theory of Computing (STOC), pages 603-610, 2010. Google Scholar
  34. A. Pavan, Kanat Tangwongsan, Srikanta Tirthapura, and Kun-Lung Wu. Counting and sampling triangles from a graph stream. Proceedings of the VLDB Endowment (PVLDB), 6(14):1870-1881, 2013. Google Scholar
  35. C. Seshadhri, Ali Pinar, and Tamara G. Kolda. Wedge sampling for computing clustering coefficients and triangle counts on large graphs. Statistical Analysis and Data Mining, 7(4):294-307, 2014. Google Scholar
  36. Cheng Sheng, Yufei Tao, and Jianzhong Li. Exact and approximate algorithms for the most connected vertex problem. ACM Transactions on Database Systems (TODS), 37(2):12:1-12:39, 2012. Google Scholar
  37. Charalampos E. Tsourakakis, Mihail N. Kolountzakis, and Gary L. Miller. Triangle sparsifiers. J. Graph Algorithms Appl., 15(6):703-726, 2011. Google Scholar
  38. Jeffrey Scott Vitter. Random sampling with a reservoir. ACM Trans. Math. Softw., 11(1):37-57, 1985. Google Scholar
Questions / Remarks / Feedback
X

Feedback for Dagstuhl Publishing


Thanks for your feedback!

Feedback submitted

Could not send message

Please try again later or send an E-mail