Generating Practical Random Hyperbolic Graphs in Near-Linear Time and with Sub-Linear Memory

Author Manuel Penschuck



PDF
Thumbnail PDF

File

LIPIcs.SEA.2017.26.pdf
  • Filesize: 0.91 MB
  • 21 pages

Document Identifiers

Author Details

Manuel Penschuck

Cite AsGet BibTex

Manuel Penschuck. Generating Practical Random Hyperbolic Graphs in Near-Linear Time and with Sub-Linear Memory. In 16th International Symposium on Experimental Algorithms (SEA 2017). Leibniz International Proceedings in Informatics (LIPIcs), Volume 75, pp. 26:1-26:21, Schloss Dagstuhl – Leibniz-Zentrum für Informatik (2017)
https://doi.org/10.4230/LIPIcs.SEA.2017.26

Abstract

Random graph models, originally conceived to study the structure of networks and the emergence of their properties, have become an indispensable tool for experimental algorithmics. Amongst them, hyperbolic random graphs form a well-accepted family, yielding realistic complex networks while being both mathematically and algorithmically tractable. We introduce two generators MemGen and HyperGen for the G_{alpha,C}(n) model, which distributes n random points within a hyperbolic plane and produces m=n*d/2 undirected edges for all point pairs close by; the expected average degree d and exponent 2*alpha+1 of the power-law degree distribution are controlled by alpha>1/2 and C. Both algorithms emit a stream of edges which they do not have to store. MemGen keeps O(n) items in internal memory and has a time complexity of O(n*log(log n) + m), which is optimal for networks with an average degree of d=Omega(log(log n)). For realistic values of d=o(n / log^{1/alpha}(n)), HyperGen reduces the memory footprint to O([n^{1-alpha}*d^alpha + log(n)]*log(n)). In an experimental evaluation, we compare HyperGen with four generators among which it is consistently the fastest. For small d=10 we measure a speed-up of 4.0 compared to the fastest publicly available generator increasing to 29.6 for d=1000. On commodity hardware, HyperGen produces 3.7e8 edges per second for graphs with 1e6 < m < 1e12 and alpha=1, utilising less than 600MB of RAM. We demonstrate nearly linear scalability on an Intel Xeon Phi.
Keywords
  • Random hyperbolic graph generator
  • streaming algorithm

Metrics

  • Access Statistics
  • Total Accesses (updated on a weekly basis)
    0
    PDF Downloads

References

  1. Alok Aggarwal and Jeffrey Scott Vitter. The input/output complexity of sorting and related problems. Communications of the ACM, 31(9), pages 1116-1127, 1988. Google Scholar
  2. Réka Albert and Albert-László Barabási. Statistical mechanics of complex networks. CoRR, cond-mat/0106096, 2001. URL: http://dx.doi.org/10.1103/RevModPhys.74.47.
  3. Lars Backstrom, Paolo Boldi, Marco Rosa, Johan Ugander, and Sebastiano Vigna. Four degrees of separation. In Web Science 2012, WebSci'12, Evanston, IL, USA - June 22-24, 2012, pages 33-42, 2012. URL: http://dx.doi.org/10.1145/2380718.2380723.
  4. Jon Louis Bentley and James B. Saxe. Generating sorted lists of random numbers. ACM Trans. Math. Softw., 6(3):359-364, 1980. URL: http://dx.doi.org/10.1145/355900.355907.
  5. Thomas Bläsius, Tobias Friedrich, Anton Krohmer, and Sören Laue. Efficient embedding of scale-free graphs in the hyperbolic plane. In 24th Annual European Symposium on Algorithms, ESA 2016, Aarhus, Denmark, 2016. URL: http://dx.doi.org/10.4230/LIPIcs.ESA.2016.16.
  6. Michel Bode, Nikolaos Fountoulakis, and Tobias Müller. On the giant component of random hyperbolic graphs, pages 425-429. Scuola Normale Superiore, Pisa, 2013. URL: http://dx.doi.org/10.1007/978-88-7642-475-5_68.
  7. Marián Boguñá, Fragkiskos Papadopoulos, and Dmitri Krioukov. Sustaining the internet with hyperbolic mapping. Nature Communications, Sep 2010. URL: http://dx.doi.org/10.1038/ncomms1063.
  8. Béla Bollobás. Random Graphs. Cambridge Studies in Advanced Mathematics. Cambridge University Press, 2 edition, 2001. URL: http://dx.doi.org/10.1017/CBO9780511814068.
  9. Karl Bringmann, Ralph Keusch, and Johannes Lengler. Geometric inhomogeneous random graphs. CoRR, abs/1511.00576, 2015. URL: http://arxiv.org/abs/1511.00576.
  10. Sergei N Dorogovtsev and José FF Mendes. Evolution of networks: From biological nets to the Internet and WWW. OUP Oxford, 2013. Google Scholar
  11. Minos N. Garofalakis, Johannes Gehrke, and Rajeev Rastogi, editors. Data Stream Management - Processing High-Speed Data Streams. Data-Centric Systems and Applications. Springer, 2016. URL: http://dx.doi.org/10.1007/978-3-540-28608-0.
  12. Luca Gugelmann, Konstantinos Panagiotou, and Ueli Peter. Random hyperbolic graphs: Degree sequence and clustering - (extended abstract). In Automata, Languages, and Programming - 39th International Colloquium, ICALP 2012, Warwick, UK, July 9-13, 2012, Proceedings, Part II, pages 573-585, 2012. URL: http://dx.doi.org/10.1007/978-3-642-31585-5_51.
  13. Piyush Gupta and P. R. Kumar. The capacity of wireless networks. IEEE Trans. Information Theory, 46(2):388-404, 2000. URL: http://dx.doi.org/10.1109/18.825799.
  14. Joseph JáJá. An Introduction to Parallel Algorithms. Addison-Wesley, 1992. Google Scholar
  15. Matthias Kretz. Extending C++ for explicit data-parallel programming via SIMD vector types. PhD thesis, Goethe University Frankfurt am Main, 2015. Google Scholar
  16. Dmitri V. Krioukov, Fragkiskos Papadopoulos, Maksim Kitsak, Amin Vahdat, and Marián Boguñá. Hyperbolic geometry of complex networks. Phys. Rev. E, 82:036106, Sep 2010. URL: http://dx.doi.org/10.1103/PhysRevE.82.036106.
  17. Ravi Kumar, Jasmine Novak, and Andrew Tomkins. Structure and evolution of online social networks. In Proceedings of the Twelfth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Philadelphia, PA, USA, August 20-23, 2006, pages 611-617, 2006. URL: http://dx.doi.org/10.1145/1150402.1150476.
  18. Sebastian Lamm. Communication efficient algorithms for generating massive networks. Master’s thesis, Karlsruhe Institute of Technology, 2017. URL: http://dx.doi.org/10.5445/IR/1000068617.
  19. Jure Leskovec and Andrej Krevl. SNAP Datasets: Stanford large network dataset collection. http://snap.stanford.edu/data, June 2014.
  20. Andrew McGregor. Graph stream algorithms: a survey. SIGMOD Record, 43(1):9-20, 2014. URL: http://dx.doi.org/10.1145/2627692.2627694.
  21. Robert Meusel, Sebastiano Vigna, Oliver Lehmberg, and Christian Bizer. Graph structure in the web - revisited: a trick of the heavy tail. In 23rd International World Wide Web Conference, WWW'14, Seoul, Republic of Korea, April 7-11, 2014, Companion Volume, pages 427-432, 2014. URL: http://dx.doi.org/10.1145/2567948.2576928.
  22. Ulrich Meyer, Peter Sanders, and Jop F. Sibeyn, editors. Algorithms for Memory Hierarchies, Advanced Lectures [Dagstuhl Research Seminar, March 10-14, 2002], volume 2625 of Lecture Notes in Computer Science. Springer, 2003. Google Scholar
  23. Seth A. Myers, Aneesh Sharma, Pankaj Gupta, and Jimmy J. Lin. Information network or social network?: the structure of the twitter follow graph. In 23rd International World Wide Web Conference, Seoul, Republic of Korea, 2014. URL: http://dx.doi.org/10.1145/2567948.2576939.
  24. Yuval Shavitt and Tomer Tankel. Hyperbolic embedding of internet graph for distance estimation and overlay construction. IEEE/ACM Trans. Netw., 16(1):25-36, 2008. URL: http://dx.doi.org/10.1145/1373452.1373455.
  25. Christian Staudt, Aleksejs Sazonovs, and Henning Meyerhenke. Networkit: A tool suite for large-scale complex network analysis. Network Science, 4(4):508–530, 2016. URL: http://dx.doi.org/10.1017/nws.2016.20.
  26. Moritz von Looz, Henning Meyerhenke, and Roman Prutkin. Generating random hyperbolic graphs in subquadratic time. In Algorithms and Computation - 26th International Symposium, ISAAC 2015, Nagoya, Japan, December 9-11, 2015, Proceedings, pages 467-478, 2015. URL: http://dx.doi.org/10.1007/978-3-662-48971-0_40.
  27. Moritz von Looz, Mustafa Safa Özdayi, Sören Laue, and Henning Meyerhenke. Generating massive complex networks with hyperbolic geometry faster in practice. In 2016 IEEE High Performance Extreme Computing Conference, HPEC 2016, Waltham, MA, USA, September 13-15, 2016, pages 1-6, 2016. URL: http://dx.doi.org/10.1109/HPEC.2016.7761644.