Sparsest Cut and Eigenvalue Multiplicities on Low Degree Abelian Cayley Graphs
Abstract
Whether or not the Sparsest Cut problem admits an efficient -approximation algorithm is a fundamental algorithmic question with connections to geometry and the Unique Games Conjecture.
Revisiting spectral algorithms for Sparsest Cut, we present a novel, simple algorithm that combines eigenspace enumeration with a new algorithm for the Cut Improvement problem. The runtime of our algorithm is parametrized by a quantity that we call the solution dimension : the smallest such that the subspace spanned by the first Laplacian eigenvectors contains all but fraction of a sparsest cut.
Our algorithm matches the guarantees of prior methods based on the threshold-rank paradigm, while also extending beyond them. To illustrate this, we study its performance on low degree Cayley graphs over Abelian groups – canonical examples of graphs with poor expansion properties.
We prove that low degree Abelian Cayley graphs have small solution dimension, yielding an algorithm that computes a -approximation to the uniform Sparsest Cut of a degree- Cayley graph over an Abelian group of size in time . Along the way to bounding the solution dimension of Abelian Cayley graphs, we analyze their sparse cuts and spectra, proving that the collection of -approximate sparsest cuts has an -net of size and that the multiplicity of is bounded by . The latter bound is tight and improves on a previous bound of by Lee and Makarychev.
Keywords and phrases:
Sparsest Cut, Spectral Graph Theory, Cayley Graphs, Approximation AlgorithmsCategory:
APPROXFunding:
Chris Jones: CJ is supported in part by the European Research Council (ERC) under the European Union’s Horizon 2020 research and innovation programme (grant agreement Nos. 834861 and 101019547). CJ is also a member of the Bocconi Institute for Data Science and Analytics (BIDSA).Copyright and License:
2012 ACM Subject Classification:
Theory of computation Approximation algorithms analysis ; Mathematics of computing Spectra of graphsAcknowledgements:
We deeply thank Luca Trevisan for his motivation and wisdom on this problem, and for suggesting to look into Abelian Cayley graphs. CJ and JZ thank Lucas Pesenti and Robert Wang for discussions. We thank Madhu Sudan for pointing us to [6].Funding:
JR and SV are supported in part by a Simons Investigator Award to Salil Vadhan. Work began while JR and SV were visitors at Bocconi University.Editors:
Alina Ene and Eshan ChattopadhyaySeries and Publisher:
Leibniz International Proceedings in Informatics, Schloss Dagstuhl – Leibniz-Zentrum für Informatik
1 Introduction
For an undirected graph , the sparsest cut measures how poor of an expander the graph is, characterizing how slowly random walks on mix. For simplicity in this introduction, we will state our definitions just for the case of regular graphs:
Definition 1 (Sparsest Cut).
For a -regular graph on vertices and a partition of its vertex set , the (normalized) density of the cut is defined as:
The sparsest cut problem is, given , to find that minimizes We write to denote the minimum value of over all cuts .
The normalization factor of ensures that the largest possible value of is , because for a randomly chosen cut , the expectation of is , and the expectation of is . We’ll often switch back and forth between and the closely related quantity of conductance, defined as
We have , so the problem of approximating conductance is equivalent to the problem of approximating sparsest cut, up to multiplying the approximation factor by 2. Via known reductions, up to a constant factor of approximation, sparsest cut is also equivalent to approximating several other graph parameters such as edge expansion and balanced separator [5].
Because of its centrality to algorithms, computational complexity, combinatorics, and geometry, efficient algorithms for sparsest cut have been the main focus of a long line of work. A polynomial-time -approximation algorithm was obtained via a linear programming relaxation by Leighton and Rao [34], and later improved to by Arora, Rao, and Vazirani (ARV) [5] via semidefinite programming. To this day, the ARV algorithm remains the state-of-the-art for general graphs.
A central challenge to resolving the approximability of sparsest cut is its intricate relationship with the Unique Games Conjecture (UGC) and the Small Set Expansion Hypothesis (SSEH), themselves outstanding open problems with a close connection [25, 41, 42]. Assuming the SSEH, sparsest cut does not have a polynomial-time -approximation algorithm [42] (the same holds for “non-uniform” sparsest cut assuming the UGC [13, 26, 4]). Thus we have some evidence that beating the approximation factor of the ARV algorithm may be hard. On the other side of the coin, searching for better approximation algorithms for sparsest cut is an ostensible approach to developing algorithms for related problems, including small-set expansion and unique games.
Given the above situation, researchers have attempted to pair interesting instances with interesting algorithms to determine when constant-factor approximation to sparsest cut is possible. Below we summarize the most relevant algorithmic approaches and what is known about them.
Fiedler’s Algorithm
The simplest algorithm for sparsest cut is Fiedler’s Algorithm, which is just to threshold the second eigenvector [17]. Fiedler’s Algorithm always finds a cut such that , where is the second-smallest eigenvalue of the normalized Laplacian of . Indeed, this performance guarantee for Fiedler’s Algorithm is the proof of the right-hand side of Cheeger’s Inequality, which says:
This analysis of Fiedler’s algorithm shows that it gives an worst-case approximation ratio, which can be quite large. Indeed, examples show, however, that the approximation factor achieved by Fiedler’s Algorithm can be as bad as in general [20].
Expander-like graphs
Starting around 15 years ago, a series of works gave approximation algorithms for sparsest cut and unique games on graphs that are similar to expanders. A constant-factor approximation to sparsest cut is trivial on actual expanders (every cut is a constant-factor approximation since ), but it took a nontrivial work of Arora et al. [4] to approximate unique games on expanders. This in turn led to efficient algorithms for sparsest cut on graphs that are “expander-like” in the sense of having few small eigenvalues, as captured by the following definition:
Definition 2 (-threshold-rank).
For a graph and the -threshold-rank is the number of eigenvalues of the normalized Laplacian with value at most .
Recall that is an expander if and only if is bounded away from 0. Equivalently, there is a constant (independent of ) such that . For every constant , the works [3, 10] give a constant-factor approximation algorithm for sparsest cut that runs in time polynomial in but exponential in . The algorithm of Barak, Raghavendra, and Steurer [10] is based on a rounding of a semidefinite programming relaxation, the ’th level of the Sum-of-Squares hierarchy. Recent work has extended the Sum-of-Squares technique based on looser expansion properties, including being a small-set expander [7], coming from a high-dimensional expander [8], or having a “succinct characterization” of non-expanding sets [9].
The algorithm of Arora, Barak, and Steurer [3] is based on enumerating the subspace spanned by the first eigenvectors to find a vector that is very close to a sparse cut. Approaches like this, which utilize multiple low eigenvectors of the Laplacian matrix, are natural generalizations of Fiedler’s Algorithm, and we collectively refer to them as higher-order spectral algorithms.
Cheeger-lower-bound graphs
With a more careful use of the Sum-of-Squares hierarchy, Guruswami and Sinop [22] showed that we can replace in the aforementioned results with .
Theorem 3 ([22]).
For all constants , sparsest cut admits a -approximation algorithm in time where is the -threshold-rank.
When is small, it is not clear that graphs with small -threshold-rank are “expander-like” anymore. Instead, we can think of them as graphs where the lower bound of Cheeger’s Inequality () is nearly tight in the sense that has few eigenvalues between and .
The crucial feature of this class of graphs is that they have relatively few distinct near-sparsest cuts. Indeed, the indicator vector of a set with conductance must be close to the subspace spanned by the eigenvalues of magnitude . Thus if the latter subspace has dimension , all of the near-sparsest cuts can be covered by a net of size exponential in .
A similar consequence of small -threshold-rank is that these graphs can be decomposed into a small number of pieces with conductance at most , such as depicted in Figure 1. See full version for a simple statement and proof of this decomposition.
Cut improvement
As noted by Guruswami and Sinop [22], Theorem 3 can also be proven via a higher-order spectral algorithm, based on eigenspace enumeration plus an algorithm for the cut improvement problem.111Guruswami and Sinop also study the non-uniform sparsest cut problem, which is not covered by the results on cut improvement. Informally, cut improvement is the problem of: given a cut , compute the sparsest cut that has large intersection with . This is a problem of independent interest, with practical applications [2, 31, 18].
Andersen and Lang [2] gave a flow-based approximation algorithm for cut improvement which for our purposes can be summarized as follows.
Theorem 4 ([2]).
Let be an -vertex graph, let , and let . There is a polynomial-time algorithm that, given and such that , returns with and .
With this cut improvement algorithm in hand, we can derive Theorem 3 (with a factor 2 loss in approximation factor) as follows: by enumerating the eigenspace up to eigenvalue , we can obtain a cut that is -close to a sparsest cut , and then we use Theorem 4 to find such that
where the factor 2 loss comes from moving between conductance and density.
Buser graphs
A class of graphs which is in a sense complementary to the class of Cheeger-lower-bound graphs consists of graphs that satisfy a Buser Inequality . Examples of graphs with bounded degree which satisfy this inequality are: graphs with non-negative discrete curvature [27, 39], Cayley graphs over Abelian groups [27, 40], or more generally Cayley graphs over nilpotent groups [12], or graphs satisfying strong bounds on their volume growth [11]. All these graphs satisfy where is the degree of the graph.
For solving these instances, it has been observed experimentally that spectral algorithms are less successful than flow-based algorithms based on the Leighton–Rao LP and its successors [34, 18]. For example, the computations of [31, Table 2] demonstrate that: for low-dimensional mesh graphs, Leighton–Rao succeeds while simple spectral thresholding algorithms are suboptimal, whereas the opposite is true for an “expander with a planted cut” similar to the graph in Figure 1. Based on this intuition, one might plausibly believe that this class of graphs is outside the range of techniques used for Theorem 3.
Note that the Buser Inequality implies that Fiedler’s Algorithm achieves -approximation to sparsest cut since the conductance of the Fiedler cut is . Thus, Fiedler’s Algorithm can still achieve constant approximation ratio for constant , although the approximation ratio decays to for the hypercube graph (when it is perturbed slightly so that the -indicator of the Majority function becomes the unique second eigenvector).
1.1 Results
In this work:
- 1.
-
2.
We define a new complexity measure, the solution dimension SD, which determines the runtime of our eigenspace enumeration algorithm for sparsest cut. Comparing to Theorem 3, the solution dimension is no larger than the -threshold-rank, but surprisingly, it can be significantly smaller. We demonstrate that this is the case for Cayley graphs over Abelian groups, which are a specific family of graphs satisfying the previously mentioned Buser Inequality. This gives us a PTAS for sparsest cut in Abelian Cayley graphs of degree and a subexponential time -approximation for .
-
3.
To prove our upper bound on the solution dimension, we obtain several new facts about Abelian Cayley graphs. We bound the number of sparse cuts in an Abelian Cayley graph, showing that all -approximate sparse cuts are contained in the span of the first eigenvectors. We also establish a new and tight bound of on the multiplicity of improving a previous bound of proven by Lee and Makaryachev [32] . In contrast to their proof, which is based on Kleiner’s proof of Gromov’s theorem [29], we directly show that the multiplicity of the eigenvalue in any graph can be robustly upper bounded by a notion of volume growth of the graph.
We elaborate on these contributions below.
1.1.1 Cut Improvement
Our first result is a new algorithm for the cut improvement problem.
Theorem 5.
Let be a regular -vertex graph, let and let , . There is a polynomial time algorithm that, given and such that returns with
Compared to Theorem 4 of Andersen and Lang [2], this theorem offers a -approximation to the density rather than to the conductance , which will save us a factor of 2 in moving between the two. The algorithm of Andersen and Lang is based on iterative max flow computations on an augmented graph. In comparison, our algorithm seems conceptually simpler in that it uses the Leighton–Rao linear program with one extra constraint, followed by the simplest ball rounding scheme. While other algorithms have been proposed for the cut improvement problem, such as a spectral algorithm by Mahoney et al [36], we are unaware of other algorithms besides Theorems 4 and 5 that achieve approximation to the nearby sparse cut.
1.1.2 Solution Dimension
Our second contribution is a new characterization of graphs for which a combination of eigenspace enumeration with an algorithm for cut improvement is an effective approach to sparsest cut. We name the parameter controlling the runtime of our spectral enumeration algorithm the solution dimension of the graph. For a subspace let be the projection onto let be the set of unit vectors near and, for , and let be the projection of orthogonal to the all-1s vector.
Definition 6 (Sparsest Cut Solution Dimension).
Let let be an -vertex graph. Let be the diagonal matrix whose th entry is . Let be the sorted eigenvalues of the normalized Laplacian of and let be the associated eigenvectors. The - sparest cut solution dimension of , denoted by is the smallest such that there exists with:
-
(i)
-
(ii)
.222Because of the centering, it does not matter whether the indicator of is represented as a 0/1 vector or a vector, and and are treated equivalently. For regular graphs, can be ignored.
When we simply write
It can be shown that ; indeed every cut can be approximated in the span of eigenvectors of eigenvalue at most . (See full version for a proof.) In this way, Theorem 7 strengthens Theorem 3. Surprisingly, the -threshold-rank does not always match the solution dimension, allowing for potentially large speedups over Theorem 3. The cycle graph is perhaps the most basic example with solution dimension whereas . Put another way, it suffices for eigenspace enumeration to find a fraction of a sparse cut, instead of the entire cut, which can dramatically reduce the runtime of the enumeration.
Furthermore, while being small implies that there are few “distinct” sparsest cuts (since all of them are -close to a low-dimensional subspace), small solution dimension only requires that some approximate sparsest cut is close to the low eigenspace. From this perspective, understanding whether graphs with small solution dimension yet many distinct sparsest cuts exist – and what their structure might be – appears to be a potentially valuable direction for developing new algorithms for sparsest cut.
Combining Definition 6 with Theorem 5 we deduce:
Theorem 7.
For all sparsest cut on regular graphs admits a -approximation algorithm in time
1.1.3 Abelian Cayley Graphs
Our next contribution is to show that using the solution dimension framework gives a dramatic speedup generically for low-degree Cayley graphs over Abelian groups.
Definition 8 (Cayley Graph).
Let be a group and let be a multiset (called the set of generators) from such that the multiplicity of and is the same for all . The Cayley graph of generated by , denoted , is the graph with vertex set and edge set .
Even when restricted to constant degree, Cayley graphs provide a simple way to construct graphs with interesting algebraic and geometric properties. For example, the first expander construction by Margulis is a 12-regular Cayley graph over the group [37].
Constant-degree Cayley graphs over Abelian groups, and more generally nilpotent groups, are well-known in computer science and geometric group theory as non-expanding graphs which have slow mixing properties. Degree- Abelian Cayley graphs satisfy [28, 19], the Cheeger upper bound is nearly tight due to the Buser inequality [27, 40], the volume of a radius- ball is at most , and they are flat or positively curved using any of several definitions of discrete curvature [15, 27].
Trevisan posed the question of whether sparsest cut admits an -approximation in polynomial time on Abelian Cayley graphs [40]. We prove that higher-order spectral algorithms can efficiently solve sparsest cut on low-degree Abelian Cayley graphs using the solution dimension framework.
Theorem 9.
Let be a Cayley graph over an Abelian group with generating set . There is an algorithm that finds a set satisfying in time
Recall that, because of the aforementioned Buser Inequality, Fiedler’s algorithm gives an approximation algorithm on degree Abelian Cayley graphs. In contrast, Theorem 9 gives a approximation ratio. The price is that runtime is doubly exponential in the degree . Indeed, we obtain a PTAS for degree and a subexponential-time approximation scheme for .
Every finite Abelian group is isomorphic to a product of cyclic groups . The minimum number of cyclic factors in such a decomposition is also the minimum size of a generating set for . Thus our algorithm is useful for groups that can be decomposed into cyclic factors (such as cyclic groups). Despite the fact that requires generators, Cayley graphs over are easy instances. The solution dimension of Cayley graphs over satisfy that for every .
To prove Theorem 9 from Theorem 7, we bound the solution dimension of Abelian Cayley graphs by analyzing both their sparse cuts and their low eigenvectors. The first key ingredient is a new bound on the multiplicity of eigenvalues near .
Theorem 10.
Let be a Cayley graph over an Abelian group with generating set . For every ,
By plugging in , we obtain a bound of on the multiplicity of the eigenvalue itself, which improves a previous bound of proven by Lee and Makaryachev [32]. Moreover, our bound is robust, in that we also bound the number of eigenvalues that are at most by . For equal to a multiple of , our bound meets the trivial upper bound . We observe that this upper bound is tight up to the constant factor in the exponent, since there exist Cayley graphs over coming from linear error-correcting codes which have and eigenvalue multiplicity . See the full version for the details.
Theorem 10 may be of independent interest in spectral graph theory, due in part to a recent work by Jiang et al. which resolved a longstanding open question in geometry about the maximum number of equiangular lines in , using a key insight that the maximum multiplicity of in a graph with maximum degree is [23, 38, 24]. These works also mention Cayley graphs as useful special cases for analysis.
The second key ingredient behind Theorem 9 is a bound on sparse cuts in low-degree Abelian Cayley graphs, established by the following theorem.
Theorem 11.
Let be a Cayley graph over an Abelian group with generating set . Then for
Combining Theorem 10 and Theorem 11 with the Cheeger inequality returns the bound on the solution dimension. This implies the final result in Theorem 9. When this theorem can lead to a large gap between and and consequently to the large speedup we observe over Theorem 3.
In fact, our proof shows a stronger result that all of the sparsest cuts are -contained in the span of the first eigenvectors. That is, all of the sparsest cuts are -close to a hyperplane cut in the dimensional spectral embedding of . This upper bound on the number of sparse cuts is likely of independent interest. In particular, a line of recent work [7, 9] shows how to solve unique games instances over graphs with “certifiable” upper bounds on the number of solutions. We consider it likely that our results can therefore be extended to solve unique games instances in polynomial time on constant-degree Abelian Cayley graphs.333Bafna and Minzer [9] informally define a “globally hypercontractive graph” to be one with a succinct and algorithmic characterization of its small non-expanding sets. Our result informally shows that low-degree Abelian Cayley graphs have this property, although the formal definitions do not exactly meet.
Regarding Cayley graphs of larger degree, appears to be a natural threshold for our analysis. Indeed, all Abelian Cayley graphs with generators have expansion by the bound , whereas a random Abelian Cayley graph with generators will be an expander with high probability [1]. Notably, Cayley graphs over with which may have small solution dimension, fall outside the range of structural results above, since they necessarily have in order to be connected. Nonetheless, Cayley graphs over in fact admit an -approximation algorithm to sparsest cut. This follows from [30], but we give a simpler algorithm in the full version.
In light of the results above, we conjecture that the class of Abelian Cayley graphs does not contain hard examples for sparsest cut:
Conjecture 12.
Let be a Cayley graph over an Abelian group of size . There is an algorithm that finds a set satisfying in time .
A subexponential time algorithm would already be an interesting result, considering that there is a subexponential time algorithm for unique games [3], but efforts to lift this algorithm back to sparsest cut have not yet succeeded.
2 Techniques
We present here the main ideas behind our results. Details and proofs are mostly deferred to the full version, except for the proofs of Theorem 10 and Theorem 11, which we present in Sections 4 and 5, respectively.
Cut improvement (Theorem 5) and eigenspace enumeration (Theorem 7)
The starting point for Theorem 5 is the natural LP relaxation for sparsest cut studied in [34]. This LP relaxation assigns lengths to edges such that the average distance between all pairs of vertices is fixed, while the average edge length is minimized. The pairwise distances are constrained to form an -point semi-metric . The rounding procedure then computes a randomized line embedding that is -Lipschitz and has small average distortion with respect to this metric:
Finally, the line embedding can be easily converted into a cut, yielding a cut with sparsity at most The improvement of [5] comes from the fact that their program returns a metric of negative type and, for such metrics, one can efficiently construct a -Lipschitz line embedding with average distortion at most Hence, natural paths to improve over these results may require better constructions of the metric and the line embedding, which appears challenging and may be computationally hard in general [42].
Our key insight is that, if we enforce solutions to be close to a given partition in the sense that:
| (1) |
then any feasible metric must have two “well-separated” sets of size at least and which are at distance at least Any such feasible metric admits an efficiently computable line embedding with distortion at most which can then be easily rounded into an integral solution. In this approach, all feasible solutions must have large overlap with and hence the overall quality of the output depends on the starting point .
Finally, to obtain Theorem 7, we simply initialize the program at a good , by finding in time a partition that differs from a sparsest cut in at most an -fraction of the vertices. For this choice of the constraint Equation 1 we can simultaneously ensure that the returned metric is easily roundable and that there is a nearby optimal integral solution which we -approximate.
Eigenvalue multiplicity (Theorem 10) and slow decay of collision probability
The common approach to bounding the eigenvalue multiplicity of graph Laplacians boils down to relating the local volume growth of induced subgraphs with the spectrum of the whole graph [23, 32, 38]. We remark that weaker bounds are also immediate consequences of higher order Cheeger inequalities [35, 33].
Limiting our discussion to in the context of Abelian Cayley graphs the most relevant work is by Lee and Makarychev [32]. Their notion of volume growth is the doubling constant: , where is the ball of radius about the identity element of . By vertex transitivity the choice of the center of the ball is inconsequential. The importance of the doubling constant stems from the observation that it can be used to bound the packing number of the graph Building on a series of existing results [14, 29, 21], Lee and Makarychev use this observation to establish
where the second step leveraged the inequality Because Abelian Cayley graphs are known to have polynomial growth [16], this allows them to conclude
We take a much more direct (and arguably significantly simpler) approach which directly relates spectral and combinatorial quantities. The starting point is the notion of -step collision probability The -step collision probability equals the probability that two independent walks of length have the same endpoint when starting from the same vertex, sampled from the stationary distribution of .444A technical detail is that our analysis uses lazy random walks. Thus serves as a probabilistic analog of the size of the ball (a similar notion was also used in [38]). This motivates us to consider the following “smooth” version of the doubling constant:
An important fact is that for -regular graphs the collision probability equals the average of the eigenvalues of the adjacency matrix of the -th power of our starting graph. That is, we have for
| (2) |
This is the -th norm of the eigenvalues, also called the -th Schatten norm of the adjacency matrix of The crucial consequence of the expressiveness of Equation 2 is that the collision probability ratio is then entirely captured by the spectrum of
| (3) |
And now the key insight is that for an appropriate choice of Equation 3 relates the collision probability ratio, the multiplicity of and the eigenvalues of
The observation that, by taking longer random walks, we can learn critical information about the spectrum and avoid constructing a large packing, is the fundamental improvement over [32]. Our inequality is valid for all regular graphs and hence immediately implies a bound on the eigenvalue multiplicities for all graphs that satisfy a bound on the collision probability ratio.
Our second key contribution is an upper bound on for all Abelian Cayley graphs. To prove this statement, our challenge is to relate walks (not balls) of length to those of length in an Abelian Cayley graph. First, we can see that in an Abelian Cayley graph, the endpoint of a walk is completely determined by how many times each generator is chosen, and not on their order. Therefore, we may replace the random walk by a random draw from a multinomial distribution, using samples of items each occurring with probability .
From here we can directly relate the multinomial density for to that of . The binomial density for samples is approximately a Gaussian with twice the variance of the density of a Gaussian for samples, leading to a direct comparison inequality on the probability density functions, for all . For generators, this idea can be extended (with additional arguments) to obtain a comparison equality with factor which then implies
| (4) |
The bound in Equation 4 is best possible up to the constant in the exponent: there exist Abelian Cayley graphs with . The construction of such graphs comes from a characterization of eigenvalue multiplicity in Cayley graphs over in terms of binary linear codes. Every Cayley graph corresponds to a binary linear code of dimension and block length with the multiplicity of corresponding to the number of code words of minimum weight. Thus, the problem of constructing Abelian Cayley graphs with large eigenvalue multiplicity is equivalent to constructing binary linear codes with many code words of minimum weight. Using algebraic geometry codes, Ashikhmin, Barg, Vlădut̨ [6] constructed a family of binary linear codes with linear rate and distance, and with an exponential number of minimum-weight codewords, which furnishes the desired construction.
Sparse cuts are approximately low-dimensional (Theorem 11)
To show that a partition in a graph approximately lives in the span of the first few eigenvectors, we observe that it suffices to relate its conductance in with its conductance in in the sense:
| (5) |
for some sufficiently large and small Recall now that the expansion can be expressed spectrally as the Rayleigh quotient of the vector . Let be the representation of the indicator vector of the set in the eigenbasis of the graph. Then the above argument implies
From this inequality, simple manipulations show that with the choice all but of the spectral mass of must be contained on eigenvalues up to .
To establish a version of Equation 5 for Abelian Cayley graphs, we leverage the analysis of Buser inequality [27, 40]. The proof by Oveis Gharan and Trevisan [40] studies the probability of a length- random walk crossing an arbitrary cut . This measures the expansion of in the graph and their combinatorial analysis proves that Equation 5 holds for As before, the crucial property of Abelian Cayley graphs being used is that it suffices to count how many times each generator is used by the walk. In the proof of this inequality, it is also important that generators and cancel out, which allows us to bound the expected number of steps in the direction of generator by only . Finally, because this argument does not rely on the specific cut we immediately obtain that for Abelian Cayley graphs, all sparsest cut approximately live in the span of the first eigenvectors.
3 Preliminaries
We establish the notation used throughout the paper along with some preliminary notions.
The norm is the norm. For a subspace we denote by the orthogonal projection onto For a set we denote by its indicator vector.
In this paper, we study undirected graphs which may have self-loops or multiedges, which we refer to as graphs. We always use to denote the number of vertices in a graph and we assume . We denote by the normalized adjacency matrix,
where is the degree of in (counting a self-loop as degree 1). When the graph is regular we use to denote its degree. In some of these definitions we may omit when the context is clear.
Let be the diagonal matrix with entries . The normalized Laplacian of is defined as The eigenvalues of a graph are the eigenvalues of its normalized Laplacian and are denoted to the associated eigenvectors . We use for the eigenvalues of the normalized adjacency matrix, in descending order. Note that for all , . When is regular, equals which denotes the transition matrix of the simple random walk on . When is not regular, and are similar (under conjugation by ) so they still have the same eigenvalues.
For a set we let . For we denote by the multi-graph obtained by taking an edge for each length- walk in . In other words, .
Fact 13.
For all graphs , is a symmetric positive semidefinite (PSD) matrix with all eigenvalues in the range .
Fact 14.
Let . If is an eigenvalue of , then is an eigenvalue of Furthermore, if then for any
An immediate corollary is the following relation between the spectrum of and its powers.
Fact 15.
Let be a graph and let For any it holds
We will make use of the following definition for the span of eigenvectors associated with small eigenvalues of the Laplacian.
Definition 16 (low eigenspace).
For a graph and we define to be the span of the eigenvectors of associated with eigenvalues Notice that
The conductance and normalized density of a cut are defined for general graphs as follows.
Definition 17 (Conductance and density).
Let be a graph and let . The conductance and the (normalized) density are:
where the volume of is . The conductance and density of are then and .
The denominator of is the probability that two independent random vertices chosen according to the stationary distribution cross the cut, and the numerator is the probability that a random edge in the graph crosses the cut. Although and are the same, we use both notations to highlight that is symmetric with respect to complementing whereas is not.
Fact 18.
Let be a graph and . Then,
These combinatorial quantities can be interpreted spectrally as simple Rayleigh quotients. For a vector , let denote the projection of orthogonal to the all-1s vector.
Fact 19.
Let be a graph. For all ,
Cheeger’s inequality provides a quantitative relation between eigenvalues and conductance.
Theorem 20 (Cheeger inequality).
Let be a graph. Then
We will make use of Stirling’s approximation:
Fact 21 (Stirling’s approximation [43]).
for all .
4 The low eigenspace of Abelian Cayley graphs
In this section, we study the low eigenspace of Abelian Cayley graphs and prove Theorem 10. We do so by analyzing the collision probability of a random walk in . For a probability distribution , the collision probability of is defined by
Definition 22 (-step lazy collision probability).
Let be a graph and be the stationary distribution for . The -step lazy collision probability is defined by .
Recall that the stationary distribution is proportional to the degree, and is the uniform distribution when the graph is regular. Note that for vertex-transitive graphs such as Abelian Cayley graphs, the collision probability satisfies for all vertices . The next Lemma gives a spectral interpretation of as the power sums of the eigenvalues of the normalized adjacency matrix (a.k.a the moments of the empirical spectral distribution).
Lemma 23.
Let be a regular graph. Then
Proof.
Let be a simple random walk initialized at drawn from the stationary distribution. Let be an independent simple random walk initialized at . Since the simple random walk is a reversible Markov chain, . On the other hand, the diagonal elements of the transition matrix equal the returning probabilities of a random walk. Therefore
as desired.
To bound the multiplicity of eigenvalues close to , we analyze the ratio where and are parameters. Concretely, Theorem 10 will follow from a combination of the next two statements.
The first shows that the collision probability ratio can be used to bound the dimension of , which is equal to , using an appropriate choice of and . Interestingly, the argument works for regular graphs and not only for Abelian Cayley graphs.
Lemma 24.
Let be a connected regular graph on vertices. Suppose and let . Then for , we have
Proof.
First notice that for any it holds since By applying Lemma 23, we have
We show the following lower bound,
| (6) |
First observe that if , then . Now by ignoring all that are not in , we have
where the last inequality uses the fact that for . Now, note that
This implies, . In particular, we can use this to obtain
The final inequality uses the fact that and . This proves (6). Observe, that for all non-negative numbers with we have This implies the following inequality
| (7) |
Choosing gives the desired inequality. The factor of in the denominator comes from the fact that and the assumption .
The second statement we prove, which is restricted to Abelian Cayley graphs, upper bounds the ratio with a function that depends on the degree of the graph but not on or .
Lemma 25.
Let be a degree Abelian Cayley graph. Then, for every integer
Proof.
To simplify the analysis of the quantity , we
-
1.
replace the lazy random walk with non-lazy random walk by introducing new copies of the identity element as generators, and
-
2.
assume each generator occurs with even multiplicity. This can be done by making a copy of every generator. Note this does not change the random walk matrix and hence the collision probabilities are preserved.
In the above two operations we introduce new generators ( copies of the identity and copy of each the original generators). To simplify notation, we assume satisfies the assumptions above and replace with in the final bound.
Let be a simple random walk in initialized at the identity element, denoted 0. Because is Abelian, the position of at any time can be compressed into the count of the number of times that each generator has been used as a step, which we write as the tuple . The returning walks of length are exactly those such that (in ) and (in ). We have:
We define to be an integer vector whose entries are approximately .
Claim 26.
There exists such that , , and for all .
Let be as in the Claim. Then, by ignoring terms in the denominator except for those with for all ,
The point of the inequality is that it now suffices to show the direct comparison inequality for all with (dropping the constraint that in ). Towards this, we have
We prove by “discrete gradient descent” that this quantity is maximized when . Let be with replaced by and replaced by . The ratio of the consecutive terms is,
This is at least 1 if and only if . If this holds, the change increases the value. This implies that at the maximizer (since if , there is at least once coordinate which is smaller than and one coordinate which is larger than in which we can move to increase the value).
Finally, we bound the value at the maximizer.
Which concludes the proof.
We are now ready to prove Theorem 10.
Proof of Theorem 10.
5 The sparse cuts of Abelian Cayley graphs
In this section, we prove that all sparse cuts of an Abelian Cayley graph are approximately contained in the low eigenspace with thus obtaining Theorem 11.
Theorem 27.
Let with Let and For all such that we have
The proof extends the combinatorial proof of the Buser inequality in graphs due to Oveis Gharan and Trevisan [40]. Let be a sparsest cut in . We analyze the expansion of in the graph for an appropriate choice of . Following the proof of the Buser inequality [40], this quantity can be bounded in terms of the expansion in .
Lemma 28 ([40]).
.
Proof of Theorem 27.
By Fact 19 the expansion has a spectral representation,
| (8) |
Let be the representation of in the eigenbasis. The eigenvalues of are equal to . By combining Equation 8 and Lemma 28 we obtain,
We interpret the left-hand side probabilistically. Let denote the “spectral sample” distribution on taking value with probability proportional to i.e. the weight of on the th eigenvector. The normalizing factor for is using . Then we have,
Fixing a threshold , we upper bound where is the fraction of mass outside of the low eigenspace. Therefore,
Selecting and we conclude i.e. at least fraction of the mass of is on the low eigenspace. This finishes the proof of Theorem 27.
References
- [1] Noga Alon and Yuval Roichman. Random cayley graphs and expanders. Random Structures & Algorithms, 5(2):271–284, 1994. doi:10.1002/RSA.3240050203.
- [2] Reid Andersen and Kevin J Lang. An algorithm for improving graph partitions. In SODA, volume 8, pages 651–660, 2008. URL: http://dl.acm.org/citation.cfm?id=1347082.1347154.
- [3] Sanjeev Arora, Boaz Barak, and David Steurer. Subexponential algorithms for unique games and related problems. Journal of the ACM (JACM), 62(5):1–25, 2015. doi:10.1145/2775105.
- [4] Sanjeev Arora, Subhash A Khot, Alexandra Kolla, David Steurer, Madhur Tulsiani, and Nisheeth K Vishnoi. Unique games on expanding constraint graphs are easy. In Proceedings of the fortieth annual ACM symposium on Theory of computing, pages 21–28, 2008.
- [5] Sanjeev Arora, Satish Rao, and Umesh Vazirani. Expander flows, geometric embeddings and graph partitioning. Journal of the ACM (JACM), 56(2):1–37, 2009. doi:10.1145/1502793.1502794.
- [6] Alexei Ashikhmin, Alexander Barg, and Serge Vladut. Linear codes with exponentially many light vectors. Journal of combinatorial theory. Series A, 96(2):396–399, 2001. doi:10.1006/JCTA.2001.3206.
- [7] Mitali Bafna, Boaz Barak, Pravesh K Kothari, Tselil Schramm, and David Steurer. Playing unique games on certified small-set expanders. In Proceedings of the 53rd Annual ACM SIGACT Symposium on Theory of Computing, pages 1629–1642, 2021. doi:10.1145/3406325.3451099.
- [8] Mitali Bafna, Max Hopkins, Tali Kaufman, and Shachar Lovett. High dimensional expanders: Eigenstripping, pseudorandomness, and unique games. In Proceedings of the 2022 Annual ACM-SIAM Symposium on Discrete Algorithms (SODA), pages 1069–1128. SIAM, 2022. doi:10.1137/1.9781611977073.47.
- [9] Mitali Bafna and Dor Minzer. Solving unique games over globally hypercontractive graphs. arXiv preprint arXiv:2304.07284, 2023. doi:10.48550/arXiv.2304.07284.
- [10] Boaz Barak, Prasad Raghavendra, and David Steurer. Rounding semidefinite programming hierarchies via global correlation. In 2011 ieee 52nd annual symposium on foundations of computer science, pages 472–481. IEEE, 2011. doi:10.1109/FOCS.2011.95.
- [11] Brian Benson, Peter Ralli, and Prasad Tetali. Volume growth, curvature, and buser-type inequalities in graphs. International Mathematics Research Notices, 2021(22):17091–17139, 2021.
- [12] Emmanuel Breuillard and Matthew CH Tointon. Nilprogressions and groups with moderate growth. Advances in Mathematics, 289:1008–1055, 2016.
- [13] Shuchi Chawla, Robert Krauthgamer, Ravi Kumar, Yuval Rabani, and D Sivakumar. On the hardness of approximating multicut and sparsest-cut. computational complexity, 15:94–114, 2006. doi:10.1007/S00037-006-0210-9.
- [14] Tobias H Colding and William P Minicozzi. Harmonic functions on manifolds. Annals of mathematics, 146(3):725–747, 1997.
- [15] David Cushing, Supanat Kamtue, Riikka Kangaslampi, Shiping Liu, and Norbert Peyerimhoff. Curvatures, graph products and ricci flatness. Journal of Graph Theory, 96(4):522–553, 2021. doi:10.1002/jgt.22630.
- [16] Persi Diaconis and Laurent Saloff-Coste. Moderate growth and random walk on finite groups. Geometric & Functional Analysis GAFA, 4:1–36, 1994.
- [17] Miroslav Fiedler. Algebraic connectivity of graphs. Czechoslovak mathematical journal, 23(2):298–305, 1973.
- [18] Kimon Fountoulakis, Meng Liu, David F Gleich, and Michael W Mahoney. Flow-based algorithms for improving clusters: A unifying framework, software, and performance. SIAM Review, 65(1):59–143, 2023. doi:10.1137/20M1333055.
- [19] Joel Friedman, Ram Murty, and Jean-Pierre Tillich. Spectral estimates for abelian cayley graphs. Journal of Combinatorial Theory, Series B, 96(1):111–121, 2006. doi:10.1016/J.JCTB.2005.06.012.
- [20] Stephen Guattery and Gary L Miller. On the quality of spectral separators. SIAM Journal on Matrix Analysis and Applications, 19(3):701–719, 1998. doi:10.1137/S0895479896312262.
- [21] Anupam Gupta, Robert Krauthgamer, and James R Lee. Bounded geometries, fractals, and low-distortion embeddings. In 44th Annual IEEE Symposium on Foundations of Computer Science, 2003. Proceedings., pages 534–543. IEEE, 2003. doi:10.1109/SFCS.2003.1238226.
- [22] Venkatesan Guruswami and Ali Kemal Sinop. Approximating non-uniform sparsest cut via generalized spectra. In Proceedings of the twenty-fourth annual ACM-SIAM symposium on Discrete algorithms, pages 295–305. SIAM, 2013. doi:10.1137/1.9781611973105.22.
- [23] Zilin Jiang, Jonathan Tidor, Yuan Yao, Shengtong Zhang, and Yufei Zhao. Equiangular lines with a fixed angle. Annals of Mathematics, 194(3):729–743, 2021.
- [24] Zilin Jiang, Jonathan Tidor, Yuan Yao, Shengtong Zhang, and Yufei Zhao. Spherical two-distance sets and eigenvalues of signed graphs. Combinatorica, 43(2):203–232, 2023. doi:10.1007/S00493-023-00002-1.
- [25] Subhash Khot. On the power of unique 2-prover 1-round games. In Proceedings of the thiry-fourth annual ACM symposium on Theory of computing, pages 767–775, 2002. doi:10.1145/509907.510017.
- [26] Subhash A Khot and Nisheeth K Vishnoi. The unique games conjecture, integrality gap for cut problems and embeddability of negative-type metrics into -1. Journal of the ACM (JACM), 62(1):1–39, 2015. doi:10.1145/2629614.
- [27] Bo’az Klartag, Gady Kozma, Peter Ralli, and Prasad Tetali. Discrete curvature and abelian groups. Canadian Journal of Mathematics, 68(3):655–674, 2016.
- [28] Maria Klawe. Non-existence of one-dimensional expanding graphs. In 22nd Annual Symposium on Foundations of Computer Science (sfcs 1981), pages 109–114. IEEE, 1981. doi:10.1109/SFCS.1981.23.
- [29] Bruce Kleiner. A new proof of gromov’s theorem on groups of polynomial growth. Journal of the American Mathematical Society, 23(3):815–829, 2010.
- [30] Tsz Chiu Kwok, Lap Chi Lau, Yin Tat Lee, Shayan Oveis Gharan, and Luca Trevisan. Improved cheeger’s inequality: analysis of spectral partitioning algorithms through higher order spectral gap. In Proceedings of the Forty-Fifth Annual ACM Symposium on Theory of Computing, STOC ’13, pages 11–20, New York, NY, USA, 2013. Association for Computing Machinery. doi:10.1145/2488608.2488611.
- [31] Kevin J Lang, Michael W Mahoney, and Lorenzo Orecchia. Empirical evaluation of graph partitioning using spectral embeddings and flow. In International Symposium on Experimental Algorithms, pages 197–208. Springer, 2009. doi:10.1007/978-3-642-02011-7_19.
- [32] James R Lee and Yury Makarychev. Eigenvalue multiplicity and volume growth. arXiv preprint arXiv:0806.1745, 2008.
- [33] James R Lee, Shayan Oveis Gharan, and Luca Trevisan. Multiway spectral partitioning and higher-order cheeger inequalities. Journal of the ACM (JACM), 61(6):1–30, 2014. doi:10.1145/2665063.
- [34] Tom Leighton and Satish Rao. Multicommodity max-flow min-cut theorems and their use in designing approximation algorithms. Journal of the ACM (JACM), 46(6):787–832, 1999. doi:10.1145/331524.331526.
- [35] Anand Louis, Prasad Raghavendra, Prasad Tetali, and Santosh Vempala. Many sparse cuts via higher eigenvalues. In Proceedings of the forty-fourth annual ACM symposium on Theory of computing, pages 1131–1140, 2012. doi:10.1145/2213977.2214079.
- [36] Michael W Mahoney, Lorenzo Orecchia, and Nisheeth K Vishnoi. A spectral algorithm for improving graph partitions. Technical report, Technical report. Preprint, 2009.
- [37] Grigorii Aleksandrovich Margulis. Explicit constructions of concentrators. Problemy Peredachi Informatsii, 9(4):71–80, 1973.
- [38] Theo McKenzie, Peter Michael Reichstein Rasmussen, and Nikhil Srivastava. Support of closed walks and second eigenvalue multiplicity of graphs. In Proceedings of the 53rd Annual ACM SIGACT Symposium on Theory of Computing, pages 396–407, 2021. doi:10.1145/3406325.3451129.
- [39] Florentin Münch. Non-negative ollivier curvature on graphs, reverse poincaré inequality, buser inequality, liouville property, harnack inequality and eigenvalue estimates. Journal de Mathématiques Pures et Appliquées, 170:231–257, 2023.
- [40] Shayan Oveis Gharan and Luca Trevisan. ARV on abelian cayley graphs. In Theory blog, 2021.
- [41] Prasad Raghavendra and David Steurer. Graph expansion and the unique games conjecture. In Proceedings of the forty-second ACM symposium on Theory of computing, pages 755–764, 2010. doi:10.1145/1806689.1806792.
- [42] Prasad Raghavendra, David Steurer, and Madhur Tulsiani. Reductions between expansion problems. In 2012 IEEE 27th Conference on Computational Complexity, pages 64–73. IEEE, 2012. doi:10.1109/CCC.2012.43.
- [43] Wikipedia. Stirling’s approximation. https://en.wikipedia.org/wiki/Stirling%27s_approximation. Accessed: 2024-10-24.
