Brief Announcement: Distributed Sparsest Cut via Eigenvalue Estimation
Abstract
We give new, improved bounds for approximating the sparsest cut value or in other words the conductance of a graph in the model. As our main result, we present an algorithm running in rounds in which every vertex outputs a value satisfying . In most regimes, our algorithm improves significantly over the previously fastest algorithm for the problem [Chen, Meierhans, Probst Gutenberg, Saranurak; SODA 25]. Additionally, our result generalizes to -way conductance.
We obtain these results, by approximating the eigenvalues of the normalized Laplacian matrix , where, is the adjacency matrix and is the diagonal matrix with the weighted degrees on the diagonal. We show our algorithms are near-optimal by proving a lower bound for computing the smallest non-trivial eigenvalue of , even in the stronger LOCAL model
The previous state of the art sparsest cut algorithm is in the technical realm of expander decompositions. Our algorithms, on the other hand, are relatively simple and easy to implement. At the core, they rely on the well-known power method, which comes down to repeatedly multiplying the Laplacian with a vector. This operation can be performed in a single round in the model.
All our algorithms apply to weighted, undirected graphs. Our lower bounds apply even in unweighted graphs.
Keywords and phrases:
CONGEST, Sparsest Cut, Laplacian, Eigenvalues, Spectral Graph TheoryCopyright and License:
2012 ACM Subject Classification:
Theory of computation Graph algorithms analysis ; Theory of computation Distributed algorithmsAcknowledgements:
We would like to thank Leo Wennmann, for the many hours of discussion on this topic. We would like Joachim Orthaber and Malte Baumecker, for thinking along. We would like to thank Faith Ellen, Sebastian Brandt, Alexandre Nolin, and Eva Rotenberg, for the initial discussions that were the inspiration for this research. We would like to thank Schloss Dagstuhl, since the research was initiated during the Dagstuhl seminar Graph Algorithms: Distributed Meets Dynamic.Funding:
This research was funded in whole or in part by the Austrian Science Fund (FWF) https://doi.org/10.55776/P36280 and https://doi.org/10.55776/I6915. For open access purposes, the author has applied a CC BY public copyright license to any author-accepted manuscript version arising from this submission.Editor:
Dariusz R. KowalskiSeries and Publisher:
Leibniz International Proceedings in Informatics, Schloss Dagstuhl – Leibniz-Zentrum für Informatik
1 Introduction
A graph cut is a partition of the vertices into two disjoint subsets, where the size of the cut is the total number of edges crossing between them, and the sparsity of the cut measures how small this cut is relative to total edge weight of the smaller side, i.e., in comparison to the sum of its vertex degrees. Graph cuts are a central object in theoretical computer science and algorithm design. Cuts are especially important in communication networks because they represent the capacity for information to flow between different parts of the system. A sparse cut – where only a few edges connect two large subsets – can become a bottleneck, severely limiting bandwidth or throughput. A central question for guaranteeing the performance of network algorithms is to determine the minimum sparsity of cuts in a given communication network. For example the runtime of the famous PUSH-PULL protocol depends on the sparsest cut value [11]. More generally, graphs without sparse cuts support fast mixing of random walks [20], and enable efficient routing due to their strong connectivity [9, 10]. This motivates the use of expander decompositions, which partition a graph into high-conductance clusters (i.e, no sparse internal cuts), and sparse inter-cluster connections. Such decompositions are powerful algorithmic tools: thanks to efficient routing, many algorithms run significantly faster on high-conductance subgraphs, allowing problems to be solved locally within clusters before handling the sparse inter-cluster parts.
In this work, we present simple, time and bandwidth efficient distributed algorithms to approximate the sparsest cut value. Our randomized algorithms run in the classic model of distributed computing where a communication network is abstracted as an -vertex graph with vertices being computing entities and edges serving as bandwidth-limited communication links. Communication happens in synchronous rounds, in each of which each vertex can send a -bit message to each of its neighbors and the complexity measure is the number of communication rounds until the vertices have computed their outputs, e.g., until they have output an approximate value of the sparsest cut.
Formally, the sparsity of a set (or cut) in a graph is defined as as
where the volume of a set of vertices is the sum of of the degrees of the vertices in the set . The value is also known as the conductance of the set . The sparsest cut of a graph is the minimum sparsity of a subset . This is also known as the conductance of the graph.
The relative notion can be viewed as a normalized quantification of cut size. In contrast to computing minimum cuts, which quantify the absolute size of a cut, determining the sparsest cut or the conductance is not only NP-complete [17] but also APX-hard under the unique games conjecture [6]. Consequently, although no polynomial-time algorithm can approximate the optimum arbitrarily well (unless ), a variety of approximation algorithms with provable – albeit weaker – guarantees have been developed across different computational models. Next, we present our results and compare them with the state of the art in the field. As the sparsest cut and the conductance have been extensively studied we provide further background and related work in the full version.
2 Our Contribution on Sparsest Cut Approximation
Our first result is an efficient algorithm to approximate the sparsest cut.
Theorem 1.
There is a randomized algorithm that, given an undirected weighted -node graph with sparsest cut , w.h.p. gives an approximation of the sparsest cut in rounds. In particular, every vertex outputs such that .
In general, there are two approaches for developing approximation algorithms for the sparsest cut, the flow approach, that we detail in the full version, and the spectral approach, which we use to prove Theorem 1. On a high-level, one computes the spectrum (or parts of it) of the graph, i.e., its eigenvalues, and then uses Cheeger’s Inequality222Cheeger’s Inequality [2, 1, 21] relates the second-smallest eigenvalue of the normalized Laplacian to the graph’s conductance , showing that . to derive an approximation of from the eigenvalues. Due to the limitations imposed by Cheeger’s Inequality, the best possible approximation achievable through spectral methods is . Our technical result is slightly stronger than Theorem 1. In fact, we can attain this theoretical bound – up to an arbitrarily small additive error – in rounds. See the proof of Theorem 1 for the details.
For unweighted graphs, we compare our result against the previously best algorithm by Chen, Meierhans, Probst Gutenberg, and Saranurak that provides a -approximation of [7]333They do not state this result explicitly, but they implement the cut-matching game from [18] (see Appendix A) in the distributed setting. As stated in [18], this cut-matching game gives such an approximation.. Our approximation is more precise than [7] when , and is faster in all cases: [7] takes rounds. Upon inspection of their , they have a total round complexity of at least in the regime where their approximation is better.
Prior to [7], Das Sarma, Molla, and Pandurangan [19] gave a different algorithm to approximate the sparsest cut, which also works for weighted graphs. They computed an -approximation in rounds if the graph has a sparse cut of balance , i.e., both sides of the cut satisfy . Building on the methods of [19], Kuhn and Molla improved the approximation ratio to and the runtime to [14]. For balanced cuts , we improve their approximation ratio by a factor , while matching their running time. For unbalanced cuts, also their algorithm can take as much as time, so we do not only improve the approximation ratio, but also the running time exponentially.
-Way Sparsest Cut.
There exists a natural generalization of the sparsest cut to multi-cuts [15]: the -way sparsest cut or -way conductance is defined follows:
where the minimum is taken over disjoint, non-empty . Note that . The -way conductance captures how well-connected a graph is across multiple parts; it can describe multi-partition bottlenecks, which standard conductance may miss. This is in particular relevant for real-world networks that usually consists of multiple clusters.
We prove the following theorem for approximating the -way sparsest cut.
Theorem 2.
There is a randomized algorithm that, given a constant and undirected weighted graph with -way sparsest cut , w.h.p. gives an approximation of the -way sparsest cut in rounds. In particular, every vertex outputs such that .
To the best of our knowledge, we are the first to study the -way conductance in the distributed setting.
Lower bounds on approximating sparse cuts.
Generally, it is known that approximating the sparsest cut takes rounds. In particular, Das Sarma, Molla, and Pandurangan [19] show a lower bound for any multiplicative approximation in weighted graphs. They reduce the spanning connected subgraph problem to a sparsest cut instance with edge weights that are either or . This means that the lower bound shows that it is hard to decide whether holds or not. For bounded away from zero, there is no such lower bound. Our upper bound of rounds, Theorem 1, indeed takes rounds, since . However, for , the upper bound does not have a -term.
We prove a lower bound that holds even for unweighted graphs and in the model. is identical to , except that there is no bound on the message size. Even for a class of fixed diameter, any algorithm needs time for a -approximation.
Theorem 3.
Let be a parameter and consider the class of graphs with diameter . An algorithm that with probability satisfies that every vertex of a graph outputs a value , for sparsest cut , and at least one vertex has needs rounds in .
Note that the lower bound in Theorem 3 holds even in the case where only a single vertex needs to witness the cut.
The above proof is about discerning graphs with sparsity and . If every vertex needs to output a correct estimation, a diameter lower bound exists for larger values of .
Theorem 4.
Let be a constant and let . An algorithm that with probability satisfies that every vertex outputs a value such that , where is the sparsest cut value of the graph needs at least rounds in the model.
Our upper bound for approximating the sparsest cut w.h.p. is – only a factor away from optimal, since the graphs from the above observation have sparsest cut and diameter [8]. We note that with constant probability, we match the lower bound, see the full version for details.
3 Our Contributions on Spectrum Approximation
Our core technical contributions are efficient algorithms to compute the approximate (partial) spectrum of a graph. In essence, Theorems 1 and 2 then follow as direct consequences via Cheeger’s Inequality. Nevertheless, we chose to present these theorems first, as they are more accessible to a broader audience and can be compared with prior work.
Eigenvalues of the Normalized Laplacian.
The normalized Laplacian of a graph is the matrix defined as , where is the adjacency matrix and is the (weighted) degree matrix. Its spectrum captures many central properties of the graph, e.g., various types of its connectedness like -way conductance [15] and small set expansion [3], or how close it is to being bipartite [4, 16]. We provide the following theorem to compute an approximation of that spectrum.
Theorem 5 (Eigenvalue estimation).
Let be a parameter and be a constant. There are randomized algorithms that, given an undirected, weighted graph , with high probability approximate the eigenvalues of the graph’s normalized Laplacian as follows:
-
1.
: an additive -approximations in rounds,
-
2.
: an additive -approximation in rounds,
-
3.
: a multiplicative -approximation in rounds.
Eigenvalue is the eigenvalue approximating the sparsest cut via Cheeger’s Inequality. It could either be approximated via the second algorithm, or slightly slower via the first.
For the smaller eigenvalues, Theorem 5 states additive approximations avoiding the diameter dependence, e.g., when is an arbitrarily small constant Theorem 5 approximates in rounds even though the diameter may be up to linear in .
As can be arbitrary in Theorem 5, we can obtain multiplicative approximations by setting for any . However, this does not circumvent the diameter lower bound of Theorem 6 presented below. holds for any graph, so the diameter term is then always asymptotically smaller than the overall runtime of the algorithm.
To the best of our knowledge, the only prior distributed eigenvalue computation is via computing a spectral sparsifier, i.e., a sparse subgraph with edges that approximately preserve the graph’s spectrum. Next, the sparsifier can be gathered at one vertex of the graph that can locally compute the eigenvalues of and hence obtain approximations of all eigenvalues of . Computing is actually relatively fast and only requires rounds [13]. Collecting at a single vertex is the expensive part, requiring rounds. This is exponentially slower than Theorem 5, though this more general method immediately computes the full spectrum while we focus on single eigenvalues.
We note that all our upper bounds even hold in the more restricted Broadcast model in which a vertex needs to send the same message to all of its neighbors in each round. All our algorithms become a -factor faster if we only ask for constant probability as typical in property testing, see the discussion in the full version.
Lower Bound.
All of our multiplicative upper bounds take diameter time. This means that at the end of the algorithm, every vertex can easily output the same eigenvalue or sparsest cut estimate. One may ask whether one can obtain faster algorithms if the output is more local. Perhaps every vertex outputs the sparsest cut it sees in some neighborhood, and there is a guarantee that at least one vertex will see an approximately sparsest cut. In Theorem 3, we showed that this is not true for sparsest cut approximation. We also show that approximating at even one node requires rounds.
Theorem 6.
Any algorithm that with probability ensures that all vertices output a value , and at least one vertex additionally satisfies requires rounds.
An lower bound could also follow from our sparsest cut lower bounds via Cheeger’s Inequality, but such an approach incurs an inherent quadratic loss due to the approximation gap in Cheeger’s.
4 Techniques: The Power Method
As mentioned, Theorems 1 and 2 follow as direct consequences from our eigenvalue estimation results via Cheeger’s Inequality. Here, we focus on our core technical contribution: efficient algorithms to compute the approximate (partial) spectrum of a graph. The main ingredient of our algorithms is the well-known power method for approximating the largest eigenvalue of a matrix. Let be any matrix. The power method, also known as power iteration or Von Mises iteration, starts with a random vector and converges to the eigenvector corresponding to the largest eigenvalue of . It is a standard technique in numerical linear algebra, see, e.g., [12]. The procedure is as follows:
-
Approximate dominant eigenvalue: ;
-
Approximate dominant eigenvector: .
With constant probability over the randomness of the start vector , the power method outputs a vector whose Rayleigh coefficient is close to the largest eigenvalue of the matrix . In the full version, we provide both intuition and a formal proof.
In community detection, [5] used an algorithm that is in the same spirit as the power method but not entirely equal. In case of a sufficiently small balanced cut, and some additional technical assumptions, they can provide an approximate sparsest cut. Since the approximation factor depends on the technical assumptions, we will not state their exact guarantees here.
In the full version, we describe the power method in detail, provide a proof, and discuss the challenges of implementing it in a distributed setting.
References
- [1] Noga Alon. Eigenvalues and expanders. Comb., 6(2):83–96, 1986. doi:10.1007/BF02579166.
- [2] Noga Alon and V. D. Milman. lambda, isoperimetric inequalities for graphs, and superconcentrators. J. Comb. Theory B, 38(1):73–88, 1985. doi:10.1016/0095-8956(85)90092-9.
- [3] Sanjeev Arora, Boaz Barak, and David Steurer. Subexponential algorithms for unique games and related problems. J. ACM, 62(5):42:1–42:25, 2015. Announced at FOCS ’10. doi:10.1145/2775105.
- [4] Frank Bauer and Jürger Jost. Bipartite and neighborhood graphs and the spectrum of the normalized graph laplacian. Communications in Analysis and Geometry, 21, October 2009. doi:10.4310/CAG.2013.v21.n4.a2.
- [5] Luca Becchetti, Andrea E. F. Clementi, Emanuele Natale, Francesco Pasquale, and Luca Trevisan. Find your place: Simple distributed algorithms for community detection. SIAM J. Comput., 49(4):821–864, 2020. Announced at SODA’17. doi:10.1137/19M1243026.
- [6] Shuchi Chawla, Robert Krauthgamer, Ravi Kumar, Yuval Rabani, and D. Sivakumar. On the hardness of approximating multicut and sparsest-cut. Comput. Complex., 15(2):94–114, 2006. Announced at CCC’05. doi:10.1007/S00037-006-0210-9.
- [7] Daoyuan Chen, Simon Meierhans, Maximilian Probst Gutenberg, and Thatchaphol Saranurak. Parallel and distributed expander decomposition: Simple, fast, and near-optimal. In Yossi Azar and Debmalya Panigrahi, editors, Proceedings of the 2025 Annual ACM-SIAM Symposium on Discrete Algorithms, SODA 2025, New Orleans, LA, USA, January 12-15, 2025, pages 1705–1719. SIAM, 2025. doi:10.1137/1.9781611978322.53.
- [8] Flavio Chierichetti, George Giakkoupis, Silvio Lattanzi, and Alessandro Panconesi. Rumor spreading and conductance. J. ACM, 65(4):17:1–17:21, 2018. doi:10.1145/3173043.
- [9] Mohsen Ghaffari, Fabian Kuhn, and Hsin-Hao Su. Distributed MST and routing in almost mixing time. In Elad Michael Schiller and Alexander A. Schwarzmann, editors, Proceedings of the ACM Symposium on Principles of Distributed Computing, PODC 2017, Washington, DC, USA, July 25-27, 2017, pages 131–140. ACM, 2017. doi:10.1145/3087801.3087827.
- [10] Mohsen Ghaffari and Jason Li. New distributed algorithms in almost mixing time via transformations from parallel algorithms. In Ulrich Schmid and Josef Widder, editors, 32nd International Symposium on Distributed Computing, DISC 2018, New Orleans, LA, USA, October 15-19, 2018, volume 121 of LIPIcs, pages 31:1–31:16. Schloss Dagstuhl – Leibniz-Zentrum für Informatik, 2018. doi:10.4230/LIPICS.DISC.2018.31.
- [11] George Giakkoupis. Tight bounds for rumor spreading in graphs of a given conductance. In Thomas Schwentick and Christoph Dürr, editors, 28th International Symposium on Theoretical Aspects of Computer Science, STACS 2011, March 10-12, 2011, Dortmund, Germany, volume 9 of LIPIcs, pages 57–68. Schloss Dagstuhl – Leibniz-Zentrum für Informatik, 2011. doi:10.4230/LIPICS.STACS.2011.57.
- [12] Gene H Golub and Charles F Van Loan. Matrix computations. JHU press, 2013.
- [13] Ioannis Koutis and Shen Chen Xu. Simple parallel and distributed algorithms for spectral graph sparsification. ACM Trans. Parallel Comput., 3(2):14:1–14:14, 2016. doi:10.1145/2948062.
- [14] Fabian Kuhn and Anisur Rahaman Molla. Distributed sparse cut approximation. In Emmanuelle Anceaume, Christian Cachin, and Maria Gradinariu Potop-Butucaru, editors, 19th International Conference on Principles of Distributed Systems, OPODIS 2015, December 14-17, 2015, Rennes, France, volume 46 of LIPIcs, pages 10:1–10:14. Schloss Dagstuhl – Leibniz-Zentrum für Informatik, 2015. doi:10.4230/LIPICS.OPODIS.2015.10.
- [15] James R. Lee, Shayan Oveis Gharan, and Luca Trevisan. Multi-way spectral partitioning and higher-order cheeger inequalities. In Howard J. Karloff and Toniann Pitassi, editors, Proceedings of the 44th Symposium on Theory of Computing Conference, STOC 2012, New York, NY, USA, May 19 - 22, 2012, pages 1117–1130. ACM, 2012. doi:10.1145/2213977.2214078.
- [16] Shiping Liu. Multi-way dual cheeger constants and spectral bounds of graphs. Advances in Mathematics, 268:306–338, 2015. arXiv:1401.3147.
- [17] David W. Matula and Farhad Shahrokhi. Sparsest cuts and bottlenecks in graphs. Discret. Appl. Math., 27(1-2):113–123, 1990. doi:10.1016/0166-218X(90)90133-W.
- [18] Thatchaphol Saranurak and Di Wang. Expander decomposition and pruning: Faster, stronger, and simpler. In SODA, pages 2616–2635. SIAM, 2019. doi:10.1137/1.9781611975482.162.
- [19] Atish Das Sarma, Anisur Rahaman Molla, and Gopal Pandurangan. Distributed computation of sparse cuts via random walks. In Sajal K. Das, Dilip Krishnaswamy, Santonu Karkar, Amos Korman, Mohan J. Kumar, Marius Portmann, and Srikanth Sastry, editors, Proceedings of the 2015 International Conference on Distributed Computing and Networking, ICDCN 2015, Goa, India, January 4-7, 2015, pages 6:1–6:10. ACM, 2015. doi:10.1145/2684464.2684474.
- [20] Atish Das Sarma, Danupon Nanongkai, Gopal Pandurangan, and Prasad Tetali. Distributed random walks. J. ACM, 60(1):2:1–2:31, 2013. doi:10.1145/2432622.2432624.
- [21] Alistair Sinclair and Mark Jerrum. Approximate counting, uniform generation and rapidly mixing markov chains. Inf. Comput., 82(1):93–133, 1989. doi:10.1016/0890-5401(89)90067-9.
