Rapid Mixing via Coupling Independence for Spin Systems with Unbounded Degree
Abstract
We develop a new framework to prove the mixing or relaxation time for the Glauber dynamics on spin systems with unbounded degree. It works for general spin systems including both -spin and multi-spin systems. As applications for this approach:
-
We prove the optimal relaxation time for the Glauber dynamics of random -list-coloring on an -vertices triangle-tree graph with maximum degree such that , where is the unique positive solution of the equation . This improves the relaxation time for Glauber dynamics obtained by the previous work of Jain, Pham, and Vuong (2022). Besides, our framework can also give a near-linear time sampling algorithm under the same condition.
-
We prove the optimal relaxation time and near-optimal mixing time for the Glauber dynamics on hardcore models with parameter in balanced bipartite graphs such that for the max degree in left part and the max degree of right part satisfies . This improves the previous result by Chen, Liu, and Yin (2023).
At the heart of our proof is the notion of coupling independence which allows us to consider multiple vertices as a huge single vertex with exponentially large domain and do a “coarse-grained” local-to-global argument on spin systems. The technique works for general (multi) spin systems and helps us obtain some new comparison results for Glauber dynamics.
Keywords and phrases:
coupling independence, Glauber dynamics, mixing times, relaxation times, spin systemsCategory:
RANDOMCopyright and License:
2012 ACM Subject Classification:
Theory of computation Random walks and Markov chainsFunding:
Weiming Feng acknowledges the support of Dr. Max Rössler, the Walter Haefner Foundation, and the ETH Zürich Foundation during his affiliation with ETH Zürich, where this work was done.Acknowledgements:
We would like to thank Heng Guo for bringing the paper [30] to our attention and for the helpful discussions at the beginning of this project. We thank Eric Vigoda for suggesting a better statement of technical results. We also thank Charlie Carlson, Yitong Yin, and Xinyuan Zhang for their helpful discussions.Editors:
Alina Ene and Eshan ChattopadhyaySeries and Publisher:
Leibniz International Proceedings in Informatics, Schloss Dagstuhl – Leibniz-Zentrum für Informatik
1 Introduction
The spin system is a fundamental probabilistic graphical model. It is defined on a graph , where every vertex is a random variable and every edge models the local interactions. Each variable takes a value from a discrete domain . Each vertex has a vector called the external field and each edge has a symmetric matrix called the interaction matrix. The spin system defines a Gibbs distribution over such that for any configuration ,
The spin system covers many important distributions including the uniform distribution of graph colorings, the Ising model, the hardcore gas model in Physics, and a broad class of undirected graphical models in machine learning [42].
Sampling from the Gibbs distribution is a central algorithmic task for spin systems. The Glauber dynamics is a fundamental Markov chain Monte Carlo (MCMC) method for sampling from high-dimensional distributions. Given a distribution over , it starts from an arbitrary , where is the support of . In each step, it updates the current state as follows:
-
pick a variable uniformly at random;
-
resample the value of from the conditional distribution .
It is well-known that if the state space is connected through the moves of Glauber dynamics, then the distribution is the unique stationary distribution for the Glauber dynamics.
In this paper, we study the convergence rate of the Glauber dynamics. Let denote the random sequence generated by the Glauber dynamics. Let denote the transition matrix of the Glauber dynamics. Many notions capture the convergence rate. The most standard one is the mixing time, which is defined by
| (1) |
where denotes the standard total variation distance between and the distribution of . In words, if the Glauber dynamics starts from the worst initial state , the mixing time is the minimum number such that the total variation distance between and is below a sufficiently small constant. Another widely used notion is the relaxation time. A standard fact says that only has non-negative real eigenvalues [22]. The gap is called the spectral gap of Glauber dynamics. The relaxation time is defined by
It is well known that , where .
Recently, a series of works [4, 1, 3] studied Glauber dynamics using high dimensional expanders. An important notion called spectral independence was developed during this process. Anari, Liu, and Oveis Gharan [3] first introduced spectral independence for Boolean distributions. The follow-up works [16, 24] then generalized it to non-Boolean distributions. For example, for a Boolean distribution over , the influence matrix is defined by . A distribution is -spectrally independent if the maximum eigenvalue of is at most . If every conditional distribution of is -spectrally independent, then by the local-to-global argument [1], both relaxation and mixing time of Glauber dynamics are bounded by , where is the number of variables. Given this polynomial bound , many works tried to obtain an improved or even the optimal mixing/relaxation time for Glauber dynamics, especially when is a Gibbs distribution defined by spin systems. Chen, Liu and Vigoda [21] proved that for spin systems on bounded degree graphs, the spectral independence implies both optimal mixing time and optimal relaxation time.
The next question is how to deal with spin systems on unbounded degree graphs. Many works [29, 11, 2, 15, 12] focused on this question. Significant progress was made, especially for 2-spin systems (). [29] first studied coloring and weighted independent sets (hardcore model) in high-girth graphs and proved the near-optimal relaxation time. [11] introduced a stronger variant of spectral independence called complete spectral independence, and proved the optimal relaxation time for anti-ferromagnetic 2-spin systems in the uniqueness regime. To obtain the optimal mixing time, [2] made the first step and defined a new notion called entropic independence. After a line of work [2, 15, 12], the optimal mixing time was established for a broad class of 2-spin systems.
Most of the previous techniques [11, 2, 15, 12] for unbounded degree graphs are restricted to the -spin systems. We consider the following question in this paper.
How to prove the optimal mixing/relaxation time
for Glauber dynamics on (multi) spin systems with unbounded degree?
To the best of our knowledge, the only previous result beyond 2-spin systems is the relaxation time for graph coloring [29]. However, [29] relies on the coupling analysis for colorings in [28], which makes it difficult to be generalized to other spin systems.
In this work, we develop a new framework for proving mixing/relaxation time for the Glauber dynamics on general spin systems including both -spin and multi-spin systems. Our new framework is based on a stronger variant of the spectral independence known as the coupling independence, which is already used implicitly or explicitly in many previous works [40, 5, 14, 18, 19, 34]. A spin system on is -coupling independent if for any and , there is a coupling where and such that
Here, denotes the hamming distance between and and the distribution induced by conditional on taking the value .
Given a spin system on a graph with Gibbs distribution , we show that if and all the conditional distributions induced by satisfy the coupling independence and the maximum degree of is greater than a large constant, then the following comparison results hold for Glauber dynamics.
-
Relaxation time comparison. The relaxation time satisfies , where is a conditional distribution obtained from by fixing the values on a subset of variables. The set is chosen intentionally such that the induced subgraph on other vertices has smaller maximum degree. For many spin systems, the distribution is in an “easy regime” so that the mixing/relaxation time for is easy to analyze. We can bound the relaxation time for via this comparison result (see Theorem 9).
-
Mixing time comparison. If is a monotone spin system (Definition 14) and the Glauber dynamics starts from a specific initial configuration, then the mixing time satisfies , where hides a factor (See Theorem 15).
We obtain the relaxation/mixing time bounds via the above comparison results. In the relaxation time comparison result, the constant factor in is independent of the degree of the graph. In applications, the distribution is in an “easy regime”, hence, we can use some standard technique to show . The comparison result gives the optimal relaxation time. Similarly, in the applications of monotone systems, we can obtain the near-optimal mixing time for general graphs. Our comparison results only hold for graphs with large maximum degrees. It does not cause any issue in applications, because coupling independence implies spectral independence, and for graphs with bounded maximum degree, [21] already established the optimal relaxation/mixing time.
Our proof techniques can also give a near-linear time (in input size) sampling algorithm (see Theorem 13). Furthermore, we introduce a general technique to establish coupling independence for 2-spin systems (Theorem 16). Specifically, many spectral independence results for 2-spin systems are proved by analyzing the decay of correlation in the self-avoiding walk tree [20, 13]. We show that all of such proofs can be translated to a proof of coupling independence.
Organization of the paper
In Section 1.1, we first exhibit some concrete applications. In Section 2, we give our technical results. In Section 3, we give an overview of proof techniques.
All the detailed analysis are given in the full version [10].
1.1 Applications
Let be a graph with maximum degree and a set of colors. Given a set of color lists , a proper list-coloring assigns a color to each vertex such that adjacent vertices receive different colors. In a special case when for all , the list coloring becomes the standard graph -coloring. We use to denote the uniform distribution over all proper list-colorings in . For the list coloring, in each step, the Glauber dynamics picks a random vertex and update its color to a random available color. There is a long line of works studying the mixing and relaxation time of Glauber dynamics e.g. [33, 45, 9].
In the era of spectral independence, the proper list-coloring has been re-studied by a series of works [16, 24, 19]. Though the technique varies, all these works established some coupling independence results for the proper list-coloring. For list colorings on triangle-free graphs, let denote the unique positive solution to the equation . When , the -coupling independence can be established by techniques in [16, 24]. Our framework gives the optimal relaxation time of Glauber dynamics even if the maximum degree of is unbounded.
Theorem 1 (Coloring: Relaxation Time).
Let be a constant. For any triangle-free graph and color lists , if for all , where is the maximum degree of , then relaxation time of Glauber dynamics is , where is the number of vertices in .
Under the condition of Theorem 1, the relaxation time of the Glauber dynamics has been studied by many previous works. Combining the spectral independence technique [1, 3] with the correlation decay analysis [27, 26], two independent works [16, 24] proved the polynomial relaxation time of Glauber dynamics. For graphs with bounded maximum degree , Chen, Liu and Vigoda [21] established the relaxation time, where hides a constant factor like . For general graphs with possibly unbounded maximum degree, Jain, Pham and Vuong [29] proved the first almost linear relaxation time . Their proof combined the techniques in [21] with the coupling analysis in [28]. Compared to previous results, Theorem 1 gives the optimal linear relaxation time for general graphs.
We prove Theorem 1 by first verifying the coupling independence condition (Definition 7) and then applying our comparison result (Theorem 9). Theorem 1 requires because the current best coupling independence result requires this number of colors but our comparison result does not require such a strong condition. It is conjectured that -coupling independence should hold for proper list-coloring in general graphs when .
Conjecture 2 (Folklore).
Let be a constant. For any graph with maximum degree and color lists such that for all , the uniform distribution over all the proper list-colorings of is -coupling independent.
Our comparison framework can prove optimal relaxation time for Glauber dynamics on proper list-colorings of graphs (with potentially unbounded degree) as long as Conjecture 2 holds.
Proposition 3.
If Conjecture 2 holds with , then for any list coloring instance in Conjecture 2, the relaxation time of Glauber dynamics is .
The standard relation between relaxation time and mixing time implies that the Glauber dynamics mixes in time , which yields a sampling algorithm for the uniform distribution of graph colorings in time because each step of Glauber dynamics can be simulated in time . However, in terms of sampling algorithm, our technique would directly give an algorithm (not the Glauber dynamics) in time . Since the input graph contains edges, the running time is near-linear in the input size.
Theorem 4 (Coloring: Algorithm).
Let be a constant. There exists an algorithm such that given any , any triangle-free graph and color lists , if for all , where is the maximum degree of , it returns a random sample satisfying in time , where is a constant depending only on .
The next example is the hardcore model. Let be a graph. Let be the fugacity. The hardcore model defines a distribution over all independent sets in such that . Let denote the maximum degree of graph . There is a critical threshold (a.k.a. uniqueness threshold) for the tree uniqueness phase transition [36]
such that if the correlation between two vertices decays in their distance; if , the long-range correlation exists. A computational phase transition occurs at the same threshold. If , polynomial time sampling algorithm exists [46]; if , the sampling problem is hard unless NP = RP [44, 25]. The mixing and relaxation time of the Glauber dynamics for hardcore model were also extensively studied [41, 28, 23]. Recent works analyzed Glauber dynamics via spectral independence [3]. The optimal mixing time and the optimal relaxation time were established when for general graphs [21, 15, 12].
However, for the hardcore model on bipartite graphs, the picture is not very clear. Consider the hardcore model in a bipartite graph . Let denote the maximum degree in the left part. Assume . It is recently known that the uniqueness threshold for the hardcore model on the bipartite graph can be refined to where is the maximum degree of the bipartite graph [39, 13]. The Glauber dynamics is also proved to have polynomial mixing time when [13].
On the other side, when , the lower bound in [44] does not hold for bipartite graphs and the problem is #BIS-hard [7], where #BIS is the problem of counting the independent sets in bipartite graphs. A line of works (e.g. [31, 8, 38, 17, 32]) studied various sampling algorithms in the low-temperature (large ) regime.
Within the critical threshold , we consider “balanced” bipartite graphs. Let be the maximum degree in . We say a bipartite graph is -balanced if .
Theorem 5 (Bipartite Hardcore: Relaxation Time).
Let and be two constants. For any hardcore model on a -balanced bipartite graph with fugacity , if , then the relaxation time of Glauber dynamics is , where is the number of vertices in .
For the mixing time, again, the standard relation gives mixing time of the Glauber dynamics. However, since the bipartite graph hardcore is a monotone system, our technique also implies the mixing time of Glauber dynamics starting from the independent set containing all vertices in the left part: . Formally, for any ,
Theorem 6 (Bipartite Hardcore: Mixing Time).
Let and be two constants. For any hardcore model on a -balanced bipartite graph with fugacity , if , then the mixing time of Glauber dynamic starting from satisfies , where is a constant depending only on and .
The previous work [13] established the relaxation time for the Glauber dynamics, which, by the standard relation, implies the mixing time. The previous result holds for general bipartite graphs as long as . For balanced bipartite graphs, we obtained the optimal relaxation time and the near-optimal mixing time , which significantly improved the dependency to and compared to the previous result. For example, in the near critical case when , previous result gives relaxation time and mixing time but our result gives relaxation time and mixing time. However, our result works only on balanced bipartite graphs. The result in [13] is still state-of-the-art for general bipartite graphs.
2 Technical Results
2.1 Coupling Independence
In this section, we give our general results for spin systems. Let be a graph. Let be a set of spins. For each vertex , let vector be the external field at vertex . For each edge , let symmetric matrix be the interaction matrix at edge . A spin system defines a Gibbs distribution over such that,
We use to denote the support of the Gibbs distribution .
Let be a subset of vertices. Given any pinning , we define a conditional distribution by for any configuration ,
| (2) |
In words, is a Gibbs distribution obtained for by removing all edges and putting a constraint that every vertex in must take the value . In particular, if is feasible (e.g. belongs to the support of the marginal distribution ), then is exactly the conditional distribution induced by given the condition . For all spin systems considered in this paper, it holds that for all . The distribution in (2) is well-defined. Furthermore, for any subset , we use to denote the marginal distribution on projected from .
The following condition plays a key role in the proof of our main results. Let be a function that maps every vertex to a positive integer. We call the function the Hamming weight function. For any two (possibly partial) configurations , where , define their weighted Hamming distance with respect to by
| (3) |
Definition 7 (Coupling Independence).
Let be a constant. A distribution over is said to be -coupling independent (-CI) if there exists Hamming weight function such that the following holds. For any pinning , where and disagree only at one vertex , there exists a coupling , where and , such that
The notion of coupling independence was introduced explicitly in [14] to study the spectral independence property. For example, for Boolean distributions111The influence matrix and spectral independence are also defined for general distributions with . See [16] for the detailed definition. (), given any pinning , the influence matrix [3] is defined by
| (4) |
where and denotes the pinning together with taking the value . A distribution is -spectrally independent if the maximum eigenvalue of is at most for any pinning . It is not hard to show that -coupling independence implies -spectral independence. Hence, recent works [14, 19, 18] utilized coupling independence to establish the spectral independence for various spin systems.
2.2 Compare Markov Chains via Coupling Independence
In this work, we find more applications for coupling independence beyond establishing spectral independence. We build some comparison results of Markov chains via coupling independence. As a by-product result, we also show that the coupling independence gives fast sampling algorithms.
Let be a Gibbs distribution over on graph . For any , we use to denote the induced subgraph of on vertex set .
Definition 8 (Relaxation Time with Pinning).
Let be a Gibbs distribution on graph with maximum degree . Let . Let denote all subsets such that the maximum degree of is at most . Define
In the above definition, is a distribution on . In every step, the Glauber dynamics picks uniformly at random then resamples the value on . If , the value of after resampling is always . Indeed, is essentially the same as . But, considering would help us simplify some results and proofs. The following is our main comparison result.
Theorem 9 (Relaxation Time Comparison).
Let and be two constants. There exists such that for any Gibbs distribution on graph with the maximum degree , if satisfies -coupling independence, then the relaxation time of Glauber dynamics on satisfies
The above theorem is a comparison result between two kinds of relaxation times. Consider the case when parameters of are close to the critical threshold so that the relaxation time is hard to analyze. By choosing a sufficiently small , suppose for any and any , the conditional distribution falls into an easy regime. The relaxation time is easy to analyze. Theorem 9 boosts the relaxation time from an easy regime to the hard regime if satisfies the coupling independence and the maximum degree is greater than a constant .
When applying Theorem 9 to a specific spin system with Gibbs distribution , we first need to show that the satisfies the coupling independence property. Next, we choose a small constant to guarantee that is easy to analyze. Now, the constant parameter in Theorem 9 is fixed. If the maximum degree is bounded, then since the coupling independence implies the spectral independence, the previous work [21] already established the optimal relaxation time for . If the maximum degree , we can apply our boosting result to bound the relaxation time. We show how to prove Theorem 1 and Proposition 3 via Theorem 9.
Proof Sketch of Theorem 1
Given a triangle-free graph and color lists with for all , let denote the uniform distribution over all proper list-colorings. By going through the analysis in [24], we can prove that satisfies -coupling independence. Let be a parameter to be fixed later. For any , any pinning , the distribution is essentially the same as the distribution because the coloring outside is fixed by . By self-reducibility, is a list coloring on with color list . Let and denote the degree of in and respectively. The new instance satisfies
where denotes the maximum degree of . By the definition of , . We have . If we set the parameter , then
| (5) |
In this easy regime, one can use path coupling [6] to show . To apply Theorem 9, we pick a small and . If , then
On the other hand, if , then the maximum degree is bounded, we can use the result in [21] to obtain the relaxation time in the same order. This gives the proof sketch of Theorem 1. The only missing component is how to establish the coupling independence, which can be found in the full version ([10, Section 9]).
Proof of Proposition 3
To obtain (5), we only need to use the fact that . If we replace with , then we can set and (5) still holds. The same analysis proves Proposition 3.
Remark 10 (Hardcore Model in Uniqueness Regime).
Theorem 9 could also rediscover some previous results. For example, for the hardcore model on a graph with fugacity , [11] proved the optimal relaxation time. For a fixed , the hardcore model falls into an easy regime if we can reduce the maximum degree of the graph by a constant factor. The hardcore model in the uniqueness regime satisfies -coupling independence (which can be proved by Theorem 16 in this paper). Using a similar argument as that for list coloring, one can rediscover the optimal relaxation time using Theorem 9.
We remark that the relaxation time result for the bipartite graph hardcore model (Theorem 5) is not a direct consequence from Theorem 9. We need to tweak the proof of Theorem 9 to prove Theorem 5. The proof of Theorem 5 is in the full version ([10, Section 8]).See Section 3 for a proof overview.
Remark 11 (Compare Theorem 9 to the Technique in [11]).
Another comparison result about relaxation time was given in [11]. The previous result considers general Boolean distribution (not necessarily Gibbs distribution) over . Given a vector , denotes the distribution such that for any , . The result says if is spectrally independent for all , then one can compare to , where is the vector with constant value . When applying results to Gibbs distributions, here are some differences between Theorem 9 and the previous result in [11].
-
Theorem 9 works for general domain but previous result works only for Boolean domain;
-
The condition is incomparable. Theorem 9 requires coupling independence for and a degree lower bound for the underlying graph but the previous result requires spectral independence for a family of distributions;
-
The easy regime is incomparable. The easy regime in Theorem 9 is the conditional distributions on a small degree subgraph but the easy regime in the previous result is ;
For many spin systems, one can use Theorem 9 to establish the optimal relaxation for Glauber dynamics, where is the number of variables in the spin system. By the standard relation between mixing and relaxation time, the mixing time of Glauber dynamics can usually be bound by . Each transition of Glauber dynamics can be simulated in time . Hence, one can obtain a sampling algorithm in time . Alternatively, we can give a faster sampling algorithm in time if the easy regime has near-linear mixing time.
Definition 12 (Mixing Time with Pinning).
Let be a Gibbs distribution on graph with maximum degree . Let . Let denote all subsets such that the maximum degree of is at most . Define
In words, for any pinning on with , is an upper bound for the mixing time of Glauber dynamics for such that starting from the worst initial , the total variation distance between and is at most .
Theorem 13 (Fast Sampling Algorithm).
Let and be two constants. There exists an algorithm such that given any and any Gibbs distribution on graph with the maximum degree , if satisfies -coupling independence such that the weighted hamming distance satisfies , then it returns a random sample satisfying in time
where is the number of vertices in and we use and .
In the above theorem, suppose for some universal constant , then the running time in above theorem should be for some universal constant . We then hide the constants and by in Theorem 13.
Theorem 4 can be obtained from Theorem 13. Consider the list coloring on a triangle free graphs with . The uniform distribution satisfies -coupling-independence with standard Hamming weight for all . Take be a small constant with . By (5), a simple path coupling [6] shows that . Hence, if the maximum degree is greater than , we run the algorithm in Theorem 13 and the running time is . Otherwise, the maximum degree is bounded and the result in [21] gives the mixing time of the Glauber dynamics, then we can simulate Glauber dynamics to obtain a sampling algorithm. The proof of Theorem 4 is in the full version ([10, Section 9]).
The algorithm in Theorem 13 is not the Glauber dynamics. Roughly speaking, the algorithm uses some strategy to pick vertices and uses the Glauber update to resample the value of the picked vertices. However, for monotone spin systems, we can compare this algorithm to Glauber dynamics via censoring inequality [43] and then we can bound the mixing time of Glauber dynamics.
Let over be the Gibbs distribution. Define a partial order for as follows. For each , pick a total order on . For any two ,
| (6) |
For two distributions and over , we say is stochastic dominated by (i.e., ) if there is a coupling between such that . Let be the transition matrix of the Glauber dynamics on , which can be written as
| (7) |
where performs updates at the such that , for all .
Definition 14 ([37, Chapter 22]).
We say is a monotone spin system if for every , is ordering persistant, which means for any with , it holds that .
By the definition of the partial ordering in , there is a unique maximum configuration for the ordering. Denote this state as . Recall denotes the mixing time of Glauber dynamics starting from .
Theorem 15 (Mixing Time Comparison).
Let and be two constants. For any monotone spin system on graph with the maximum degree , if satisfies -coupling-independence such that the Hamming weight satisfies , then the mixing time of Glauber dynamics starting from the maximum configuration satisfies
where is the number of vertices in graph .
The above theorem is of independent interest. Suppose is a monotone system with coupling independence property. The parameters of are in the critical regime and the underlying graph has an unbounded maximum degree. If we can choose a proper constant such that , then the theorem gives a linear-optimal mixing time of Glauber dynamics for starting from the maximum configuration.
To obtain the near-linear mixing time for , some previous works [12, 15, 2] developed comparison techniques for the modified log-Sobolev (MLS) constants. Roughly speaking, if one can lower bound the MLS constant of the Glauber dynamics for , then one can obtain the optimal mixing time. Previous works compared to , where is a distribution in the easy regime, and such comparison requires to satisfy certain entropic independence [2] condition. In general, it is not easy to verify the entropic independence condition and analyze even if is in an easy regime. Theorem 15 only requires the coupling independence condition and directly compares the mixing time. However, Theorem 15 requires monotone systems, and the final mixing result is restricted.
We remark that although the hardcore model in bipartite graphs is a monotone system, Theorem 6 is not a direct consequence from Theorem 15. We need to tweak the proof of Theorem 15 to prove Theorem 6. The proof of Theorem 6 is in the full version ([10, Section 8]). See Section 3 for a proof overview.
2.3 Establishing Coupling Independence
The next question is how to establish the coupling independence condition for spin systems. Previously, spectral independence was known for many spin systems. The coupling independence was often a by-product result when proving spectral independence. Hence, it is known for some specific spin systems such as subgraph world [14], -matching [18] and coloring in high girth graphs [19].
In this paper, we give a tool to turn many existing spectral independence results into coupling independence results. A large family of spin systems is -spin systems. Let be a graph with maximum degree . Let be the edge interactions such that . Let be the external field. Let be the Gibbs distribution on with parameters such that for any , , where is the number of vertices with and is the number of edges with . The 2-spin system is said to be ferromagnetic if and anti-ferromagnetic if .
Anari, Liu, and Oveis Gharan [3] analyzed the spectral independence for the hardcore model. Chen, Liu, and Vigoda [20] extended the analysis to general 2-spin systems. Recall that the influence matrix is defined in (4). The maximum eigenvalue can be upper bounded by the total influence from one vertex
| (8) |
The RHS is called the total influence bound. For 2-spin systems, the analysis is performed on the Self-Avoiding-Walk (SAW) tree [46]. Roughly speaking, fix a vertex , the SAW tree enumerates all the SAWs in graph starting from . By defining a proper 2-spin system on , one can use the total influence from the root in to upper bound the total influence from in , and thus establish the spectral independence for Gibbs distribution . In [20], a weighted version of (8) is studied to deal with general 2-spin systems. We give the following result for coupling independence.
Theorem 16 (Informal version; see [10, Lemma 39] for the formal version).
For 2-spin systems, the (weighted) total influence bound in the Self-Avoid-Walk tree implies coupling independence.
As a consequence, all the spectral independence results for 2-spin systems in [20] can be turned into coupling independence results in black-box. For the hardcore model in bipartite graphs (Theorem 5 and Theorem 6), we can also use the above result to transform the total influence bound in [13] into coupling independence result.
Theorem 16 is proved by constructing a recursive coupling in the full version ([10, Section 7]). Fix a vertex in . We build a coupling between and and show the discrepancy between and are bounded by the total influence in the SAW tree . Suppose has neighbors . We split into copies such that only has one neighbor . Define the pinning such that for takes the value and for takes the value . Then and . We couple each adjacent and , then merge them into a coupling between two endpoints and . For each adjacent pair, the only difference between and is the pinning at . Hence, we first couple the only neighbor of then construct the coupling recursively if the coupling at fails. This recursive processing essentially enumerates all SAWs from . We can relate the coupling with the SAW tree to prove the theorem.
3 Proof Overview
We give a proof overview for the relaxation time comparison result in Theorem 9. Let be a graph with maximum degree . Let and be two constant integers such that . Their specific values will be fixed later. We first partition all the vertices in into parts such that for any vertex , each does not have more than neighbors of , where is the parameter in Theorem 9. In other words, let denote the set of neighbors of in graph . For any , . The existence of the partition is guaranteed by the Lovász local lemma. However, the local lemma requires the maximum degree to be sufficiently large. That is why we require a lower bound for in our technical results. We also remark that in our proof, the degree lower bound is used solely to ensure the existence of the partition. A similar partition appeared in the previous work [30].
The input Gibbs distribution over is a joint distribution of variables , where each variable takes its value from . Now, we can view as a joint distribution of variables such that each variable takes its value from a huge domain . We define the down-up walk on . Given , the Markov chain does as follows
-
Down-walk: Sample a set of indices uniformly at random and then remove the configuration on the set : ;
-
Up-walk: Resample from conditional on and then go back to a full configuration .
A full configuration is on the level . In the down-walk, we sample a random subset of indices with size . By dropping the configuration , we move from a full configuration at level to a partial configuration at level . In the up-walk, we resample and go back to the level . The process can also be viewed as a kind of block dynamics for configuration . In every step, we pick a random subset of variables and resample conditional on .
We use local-to-global technique [35, 1, 3] to analyze the spectral gap of the down-up walk for . The local-to-global technique suggests to analyze the relaxation time of down-up walk222In [1, 3], the local walk is essentially defined as the up-down walk. Every state is for . In the up-walk, it extends to a full configuration . In the down-walk step, it samples a random index and updates to . It is well-known that up-down walk and up-down walk has the same relaxation time.. In the down walk, we pick a random of size and drop . In the up-walk, we resample and go back to level . We use coupling independence to analyze this down-up walk via path coupling. For simplicity, suppose satisfies -coupling independence with standard Hamming distance ( for all ). We can view this down-up walk on as a block dynamics on , where it updates a block in every step. Given two and that disagree only at one vertex , say , we couple the transition of down-up walk. Let two down-up walks (starting from and , respectively) select the same random subset such that .
-
If , which happens with probability , then since the value of is removed in the down-walk, and thus and can be coupled perfectly after the transition.
-
If , which happens with probability , then since , the disagreement at may percolate to other blocks in the up-walk step. We use the coupling in the -coupling independence to couple the up-walk so that the expected Hamming distance between and after the transition is at most .
Hence, the expected Hamming distance between and after transition is at most . If , the path coupling gives the mixing time and relaxation time for this down-up walk. To apply the local-to-global technique, we also need to fix a configuration , where and , and consider the down-up walk for . The same path coupling works if . By choosing and such that and using the local-to-global technique, we can show that the down-up walk for has relaxation time.
We then compare the down-up for to the Glauber dynamics for . Recall that down-up walk is a kind of block dynamics for . In every step, the block dynamics updates a subset with . The update step is to resample conditional on . This step samples from the conditional Gibbs distribution on subgraph . By the construction of the partition, the maximum degree of is at most so that we have the relaxation time bound for Glauber dynamics on . Let denote the relaxation time of down-up walk. By some standard comparison argument between block dynamics and the Glauber dynamics, we can prove Theorem 9 by showing that
Next, we explain how to get the near-linear time sampling algorithm in Theorem 13 and the mixing time in Theorem 15. Note that for down-up walk, the path coupling actually gives the mixing time. For one update step, it selects a subset with . Let denote the missing index, i.e. . The update step resamples conditional on . We can simulate this transition step using down-up walk for the conditional distribution on . This down-up walk also has the mixing time. We do this recursively until we need to sample from a conditional distribution on with . Note that the maximum degree of the graph is at most . Now, we simulate the Glauber dynamics for steps to sample from the conditional distribution. The following informal algorithm generates an approximate random sample from within TV-distance error . The formal algorithm is given in the full version ([10, Section 5]).
Algorithm Sample
Input: a subset ;
Output: the algorithm randomly updates the partial configuration so that becomes an independent approximate sample of , where .
-
1.
if : update for steps, where every step uses the update rule of the Glauber dynamics on , where .
-
2.
else: run the update of down-up walk on for steps, where and every step does as follows:
-
(a)
pick an index uniformly at random and let ;
-
(b)
call Sample to update ; (recursion step)
-
(a)
We can call Sample() with an arbitrary feasible . When the algorithm terminates, is updated to an approximate random sample from the distribution . Note that the parameter is a constant. The total number of Glauber update steps is given by . Therefore, the overall running time of the algorithm is . Alternatively, we can think of the algorithm as follows: it first generates a random sequence of vertices . In the -th step, the algorithm randomly updates the value of , conditioned on the values of other variables. This behavior is similar to standard Glauber dynamics for , except that the sequence follows a non-trivial joint distribution. For monotone systems, we can compare this algorithm with Glauber dynamics using the censoring inequality.
The results for list-coloring are consequences of general technical results. However, we need to tweak the analysis to prove the results for hardcore model in bipartite graphs (Theorem 5 and Theorem 6). The reason is that for hardcore model on , we only know but we cannot control the degree in the right part . Our technique can only prove the coupling independence for , which is the marginal distribution on projected from . To prove the relaxation time and mixing time results, we first partition into disjoint part such that for any vertex , has no more than neighbors in each . Again, the existence of the partition is guaranteed by the local lemma. Let be a partial configuration on . We can define , where . By a similar proof, we show that the down-up walk for is rapid mixing. We consider a global Markov chain over defined as follows. Let be a full configuration.
-
Drop the right part to obtain ;
-
Update using the down-up walk for , where ;
-
Sample and let .
We first compare this chain to the down-up walk and then compare the Glauber dynamics for to this Markov chain. This gives the relaxation time of Glauber dynamics. For the mixing time, we can first obtain a near-linear time sampling algorithm for , since hardcore model in bipartite graph is a monotone system, we then compare the algorithm to the Glauber dynamics for via the censoring inequality.
References
- [1] Vedat Levi Alev and Lap Chi Lau. Improved analysis of higher order random walks and applications. In STOC, pages 1198–1211, 2020. doi:10.1145/3357713.3384317.
- [2] Nima Anari, Vishesh Jain, Frederic Koehler, Huy Tuan Pham, and Thuy-Duong Vuong. Entropic independence: optimal mixing of down-up random walks. In STOC, pages 1418–1430, 2022. doi:10.1145/3519935.3520048.
- [3] Nima Anari, Kuikui Liu, and Shayan Oveis Gharan. Spectral independence in high-dimensional expanders and applications to the hardcore model. In FOCS, pages 1319–1330, 2020. doi:10.1109/FOCS46700.2020.00125.
- [4] Nima Anari, Kuikui Liu, Shayan Oveis Gharan, and Cynthia Vinzant. Log-concave polynomials II: high-dimensional walks and an FPRAS for counting bases of a matroid. In STOC, pages 1–12, 2019. doi:10.1145/3313276.3316385.
- [5] Antonio Blanca, Pietro Caputo, Zongchen Chen, Daniel Parisi, Daniel Štefankovič, and Eric Vigoda. On mixing of markov chains: Coupling, spectral independence, and entropy factorization. arXiv preprint arXiv:2103.07459, 2021. arXiv:2103.07459.
- [6] Russ Bubley and Martin Dyer. Path coupling: A technique for proving rapid mixing in markov chains. In FOCS, pages 223–231, 1997. doi:10.1109/SFCS.1997.646111.
- [7] Jin-Yi Cai, Andreas Galanis, Leslie Ann Goldberg, Heng Guo, Mark Jerrum, Daniel Štefankovič, and Eric Vigoda. #BIS-hardness for 2-spin systems on bipartite bounded degree graphs in the tree non-uniqueness region. J. Comput. System Sci., 82(5):690–711, 2016. doi:10.1016/j.jcss.2015.11.009.
- [8] Sarah Cannon and Will Perkins. Counting independent sets in unbalanced bipartite graphs. In SODA, pages 1456–1466, 2020. doi:10.1137/1.9781611975994.88.
- [9] Sitan Chen, Michelle Delcourt, Ankur Moitra, Guillem Perarnau, and Luke Postle. Improved bounds for randomly sampling colorings via linear programming. In SODA, pages 2216–2234, 2019. doi:10.1137/1.9781611975482.134.
- [10] Xiaoyu Chen and Weiming Feng. Rapid mixing via coupling independence for spin systems with unbounded degree. arXiv:2407.04672, 2024. doi:10.48550/arXiv.2407.04672.
- [11] Xiaoyu Chen, Weiming Feng, Yitong Yin, and Xinyuan Zhang. Rapid mixing of glauber dynamics via spectral independence for all degrees. In FOCS, pages 137–148, 2021. doi:10.1109/FOCS52979.2021.00022.
- [12] Xiaoyu Chen, Weiming Feng, Yitong Yin, and Xinyuan Zhang. Optimal mixing for two-state anti-ferromagnetic spin systems. In FOCS, pages 588–599, 2022. doi:10.1109/FOCS54457.2022.00062.
- [13] Xiaoyu Chen, Jingcheng Liu, and Yitong Yin. Uniqueness and rapid mixing in the bipartite hardcore model. In FOCS, pages 1991–2005. IEEE, 2023. doi:10.1109/FOCS57990.2023.00121.
- [14] Xiaoyu Chen and Xinyuan Zhang. A near-linear time sampler for the Ising model with external field. In SODA, pages 4478–4503, 2023. doi:10.1137/1.9781611977554.CH170.
- [15] Yuansi Chen and Ronen Eldan. Localization schemes: A framework for proving mixing bounds for markov chains. In FOCS, pages 110–122, 2022.
- [16] Zongchen Chen, Andreas Galanis, Daniel Štefankovič, and Eric Vigoda. Rapid mixing for colorings via spectral independence. In SODA, pages 1548–1557, 2021.
- [17] Zongchen Chen, Andreas Galanis, Daniel Štefankovič, and Eric Vigoda. Sampling colorings and independent sets of random regular bipartite graphs in the non-uniqueness region. In SODA, pages 2198–2207, 2022.
- [18] Zongchen Chen and Yuzhou Gu. Fast sampling of -matchings and -edge covers. In SODA, pages 4972–4987, 2024. doi:10.1137/1.9781611977912.178.
- [19] Zongchen Chen, Kuikui Liu, Nitya Mani, and Ankur Moitra. Strong spatial mixing for colorings on trees and its algorithmic applications. In FOCS, pages 810–845, 2023. doi:10.1109/FOCS57990.2023.00053.
- [20] Zongchen Chen, Kuikui Liu, and Eric Vigoda. Rapid mixing of Glauber dynamics up to uniqueness via contraction. In FOCS, pages 1307–1318, 2020. doi:10.1109/FOCS46700.2020.00124.
- [21] Zongchen Chen, Kuikui Liu, and Eric Vigoda. Optimal mixing of Glauber dynamics: entropy factorization via high-dimensional expansion. In STOC, pages 1537–1550, 2021. doi:10.1145/3406325.3451035.
- [22] Martin Dyer, Catherine Greenhill, and Mario Ullrich. Structure and eigenvalues of heat-bath Markov chains. Linear Algebra Appl., 454:57–71, 2014.
- [23] Charilaos Efthymiou, Thomas P. Hayes, Daniel Štefankovič, Eric Vigoda, and Yitong Yin. Convergence of MCMC and loopy BP in the tree uniqueness region for the hard-core model. SIAM J. Comput., 48(2):581–643, 2019. doi:10.1137/17M1127144.
- [24] Weiming Feng, Heng Guo, Yitong Yin, and Chihao Zhang. Rapid mixing from spectral independence beyond the boolean domain. In SODA, pages 1558–1577, 2021. doi:10.1137/1.9781611976465.95.
- [25] Andreas Galanis, Qi Ge, Daniel Štefankovič, Eric Vigoda, and Linji Yang. Improved inapproximability results for counting independent sets in the hard-core model. Random Structures Algorithms, 45(1):78–110, 2014. (conference version in RANDOM’11). doi:10.1002/RSA.20479.
- [26] David Gamarnik, Dmitriy Katz, and Sidhant Misra. Strong spatial mixing of list coloring of graphs. Random Structures Algorithms, 2013.
- [27] Leslie Ann Goldberg, Russell A. Martin, and Mike Paterson. Strong spatial mixing with fewer colors for lattice graphs. SIAM J. Comput., 35(2):486–517, 2005. doi:10.1137/S0097539704445470.
- [28] Thomas P. Hayes and Eric Vigoda. Coupling with the stationary distribution and improved sampling for colorings and independent sets. Ann. Appl. Probab., 16(3):1297–1318, 2006.
- [29] Vishesh Jain, Huy Tuan Pham, and Thuy Duong Vuong. Spectral independence, coupling with the stationary distribution, and the spectral gap of the Glauber dynamics. Inf. Process. Lett., 177:106268, 2022.
- [30] Vishesh Jain, Ashwin Sah, and Mehtaab Sawhney. Perfectly sampling k (8/3 + o(1))-colorings in graphs. In STOC, pages 1589–1600, 2021.
- [31] Matthew Jenssen, Peter Keevash, and Will Perkins. Algorithms for #BIS-hard problems on expander graphs. SIAM J. Comput., 49(4):681–710, 2020. doi:10.1137/19M1286669.
- [32] Matthew Jenssen, Will Perkins, and Aditya Potukuchi. Approximately counting independent sets in bipartite graphs via graph containers. Random Structures Algorithms, 63(1):215–241, 2023. doi:10.1002/RSA.21145.
- [33] Mark Jerrum. A very simple algorithm for estimating the number of -colorings of a low-degree graph. Random Struct. Algorithms, 7(2):157–165, 1995. doi:10.1002/RSA.3240070205.
- [34] Mark Jerrum. Glauber dynamics for the hard-core model on bounded-degree H-free graphs. CoRR, abs/2404.07615, 2024. doi:10.48550/arXiv.2404.07615.
- [35] Tali Kaufman and Izhar Oppenheim. High order random walks: beyond spectral gap. Combinatorica, 40(2):245–281, 2020. (conference version in RANDOM’18). doi:10.1007/S00493-019-3847-0.
- [36] Frank P Kelly. Stochastic models of computer communication systems. Journal of the Royal Statistical Society: Series B (Methodological), 47(3):379–395, 1985.
- [37] David A. Levin, Yuval Peres, and Elizabeth L. Wilmer. Markov chains and mixing times. American Mathematical Society, Providence, RI, 2017.
- [38] Chao Liao, Jiabao Lin, Pinyan Lu, and Zhenyu Mao. An FPTAS for the hardcore model on random regular bipartite graphs. Theoret. Comput. Sci., 929:174–190, 2022. doi:10.1016/J.TCS.2022.07.001.
- [39] Jingcheng Liu and Pinyan Lu. FPTAS for #BIS with degree bounds on one side. In STOC, pages 549–556, 2015. doi:10.1145/2746539.2746598.
- [40] Kuikui Liu. From coupling to spectral independence and blackbox comparison with the down-up walk. arXiv preprint arXiv:2103.11609, 2021. arXiv:2103.11609.
- [41] Michael Luby and Eric Vigoda. Fast convergence of the glauber dynamics for sampling independent sets. Random Structures Algorithms, 15(3-4):229–241, 1999. doi:10.1002/(SICI)1098-2418(199910/12)15:3/4\%3C229::AID-RSA3\%3E3.0.CO;2-X.
- [42] Marc Mezard and Andrea Montanari. Information, physics, and computation. Oxford University Press, 2009.
- [43] Yuval Peres and Peter Winkler. Can extra updates delay mixing? Comm. Math. Phys., 323(3):1007–1016, 2013.
- [44] Allan Sly. Computational transition at the uniqueness threshold. In FOCS, pages 287–296, 2010. doi:10.1109/FOCS.2010.34.
- [45] Eric Vigoda. Improved bounds for sampling colorings. J. Math. Phys., 41(3):1555–1569, 2000.
- [46] Dror Weitz. Counting independent sets up to the tree threshold. In STOC, pages 140–149, 2006. doi:10.1145/1132516.1132538.
