Abstract 1 Introduction 2 Technical Results 3 Proof Overview References

Rapid Mixing via Coupling Independence for Spin Systems with Unbounded Degree

Xiaoyu Chen ORCID State Key Laboratory for Novel Software Technology, New Cornerstone Science Laboratory, Nanjing University, China Weiming Feng ORCID School of Computing and Data Science, The University of Hong Kong, China
Abstract

We develop a new framework to prove the mixing or relaxation time for the Glauber dynamics on spin systems with unbounded degree. It works for general spin systems including both 2-spin and multi-spin systems. As applications for this approach:

  • We prove the optimal O(n) relaxation time for the Glauber dynamics of random q-list-coloring on an n-vertices triangle-tree graph with maximum degree Δ such that q/Δ>α, where α1.763 is the unique positive solution of the equation α=exp(1/α). This improves the n1+o(1) relaxation time for Glauber dynamics obtained by the previous work of Jain, Pham, and Vuong (2022). Besides, our framework can also give a near-linear time sampling algorithm under the same condition.

  • We prove the optimal O(n) relaxation time and near-optimal O~(n) mixing time for the Glauber dynamics on hardcore models with parameter λ in balanced bipartite graphs such that λ<λc(ΔL) for the max degree ΔL in left part and the max degree ΔR of right part satisfies ΔR=O(ΔL). This improves the previous result by Chen, Liu, and Yin (2023).

At the heart of our proof is the notion of coupling independence which allows us to consider multiple vertices as a huge single vertex with exponentially large domain and do a “coarse-grained” local-to-global argument on spin systems. The technique works for general (multi) spin systems and helps us obtain some new comparison results for Glauber dynamics.

Keywords and phrases:
coupling independence, Glauber dynamics, mixing times, relaxation times, spin systems
Category:
RANDOM
Copyright and License:
[Uncaptioned image] © Xiaoyu Chen and Weiming Feng; licensed under Creative Commons License CC-BY 4.0
2012 ACM Subject Classification:
Theory of computation Random walks and Markov chains
Related Version:
Full Version: https://arxiv.org/abs/2407.04672 [10]
Funding:
Weiming Feng acknowledges the support of Dr. Max Rössler, the Walter Haefner Foundation, and the ETH Zürich Foundation during his affiliation with ETH Zürich, where this work was done.
Acknowledgements:
We would like to thank Heng Guo for bringing the paper [30] to our attention and for the helpful discussions at the beginning of this project. We thank Eric Vigoda for suggesting a better statement of technical results. We also thank Charlie Carlson, Yitong Yin, and Xinyuan Zhang for their helpful discussions.
Editors:
Alina Ene and Eshan Chattopadhyay

1 Introduction

The spin system is a fundamental probabilistic graphical model. It is defined on a graph G=(V,E), where every vertex is a random variable and every edge models the local interactions. Each variable takes a value from a discrete domain [q]={1,2,,q}. Each vertex has a vector b0q called the external field and each edge has a symmetric matrix A0q×q called the interaction matrix. The spin system defines a Gibbs distribution over [q]V such that for any configuration σ[q]V,

μ(σ)vVb(σv)e={u,v}EA(σu,σv).

The spin system covers many important distributions including the uniform distribution of graph colorings, the Ising model, the hardcore gas model in Physics, and a broad class of undirected graphical models in machine learning [42].

Sampling from the Gibbs distribution is a central algorithmic task for spin systems. The Glauber dynamics is a fundamental Markov chain Monte Carlo (MCMC) method for sampling from high-dimensional distributions. Given a distribution μ over [q]V, it starts from an arbitrary XΩ(μ), where Ω(μ)[q]V is the support of μ. In each step, it updates the current state X as follows:

  • pick a variable vV uniformly at random;

  • resample the value of Xv from the conditional distribution μv(XVv).

It is well-known that if the state space Ω(μ) is connected through the moves of Glauber dynamics, then the distribution μ is the unique stationary distribution for the Glauber dynamics.

In this paper, we study the convergence rate of the Glauber dynamics. Let (Xt)t0 denote the random sequence generated by the Glauber dynamics. Let P:Ω(μ)×Ω(μ)[0,1] denote the transition matrix of the Glauber dynamics. Many notions capture the convergence rate. The most standard one is the mixing time, which is defined by

TmixGD(μ,ε):=maxX0Ω(μ)min{t>0𝒟TV(Pt(X0,)μ)ε}, (1)

where 𝒟TV(Pt(X0,)μ) denotes the standard total variation distance between μ and the distribution of Xt. In words, if the Glauber dynamics starts from the worst initial state X0, the mixing time is the minimum number t such that the total variation distance between Pt(X0,) and μ is below a sufficiently small constant. Another widely used notion is the relaxation time. A standard fact says that P only has non-negative real eigenvalues 1=λ1λ2λ|Ω|0 [22]. The gap λ1λ2=1λ2 is called the spectral gap of Glauber dynamics. The relaxation time is defined by

TrelGD(μ):=11λ2.

It is well known that TmixGD(μ,ε)=O(TrelGD(μ)log1εμmin), where μmin=minσΩ(μ)μ(σ).

Recently, a series of works [4, 1, 3] studied Glauber dynamics using high dimensional expanders. An important notion called spectral independence was developed during this process. Anari, Liu, and Oveis Gharan [3] first introduced spectral independence for Boolean distributions. The follow-up works [16, 24] then generalized it to non-Boolean distributions. For example, for a Boolean distribution μ over {1,+1}[n], the influence matrix Ψ0n×n is defined by Ψ(u,v):=𝐏𝐫Xμ[Xv=+Xu=+]𝐏𝐫Xμ[Xv=+Xu=]. A distribution μ is C-spectrally independent if the maximum eigenvalue of Ψ is at most C. If every conditional distribution of μ is C-spectrally independent, then by the local-to-global argument [1], both relaxation and mixing time of Glauber dynamics are bounded by nO(C), where n is the number of variables. Given this polynomial bound nO(C), many works tried to obtain an improved or even the optimal mixing/relaxation time for Glauber dynamics, especially when μ is a Gibbs distribution defined by spin systems. Chen, Liu and Vigoda [21] proved that for spin systems on bounded degree graphs, the spectral independence implies both O(nlogn) optimal mixing time and O(n) optimal relaxation time.

The next question is how to deal with spin systems on unbounded degree graphs. Many works [29, 11, 2, 15, 12] focused on this question. Significant progress was made, especially for 2-spin systems (q=2). [29] first studied coloring and weighted independent sets (hardcore model) in high-girth graphs and proved the near-optimal n1+o(1) relaxation time. [11] introduced a stronger variant of spectral independence called complete spectral independence, and proved the optimal O(n) relaxation time for anti-ferromagnetic 2-spin systems in the uniqueness regime. To obtain the optimal mixing time, [2] made the first step and defined a new notion called entropic independence. After a line of work [2, 15, 12], the optimal O(nlogn) mixing time was established for a broad class of 2-spin systems.

Most of the previous techniques [11, 2, 15, 12] for unbounded degree graphs are restricted to the 2-spin systems. We consider the following question in this paper.

How to prove the optimal mixing/relaxation time
for Glauber dynamics on (multi) spin systems with unbounded degree?

To the best of our knowledge, the only previous result beyond 2-spin systems is the n1+o(1) relaxation time for graph coloring [29]. However, [29] relies on the coupling analysis for colorings in [28], which makes it difficult to be generalized to other spin systems.

In this work, we develop a new framework for proving mixing/relaxation time for the Glauber dynamics on general spin systems including both 2-spin and multi-spin systems. Our new framework is based on a stronger variant of the spectral independence known as the coupling independence, which is already used implicitly or explicitly in many previous works [40, 5, 14, 18, 19, 34]. A spin system μ on [q]V is C-coupling independent if for any vV and a,b[q], there is a coupling (X,Y) where Xμva and Yμvb such that

𝔼[dH(X,Y)]C.

Here, dH(X,Y)=|{vVXvYv}| denotes the hamming distance between X and Y and μva the distribution induced by μ conditional on v taking the value a.

Given a spin system on a graph G with Gibbs distribution μ, we show that if μ and all the conditional distributions induced by μ satisfy the coupling independence and the maximum degree of G is greater than a large constant, then the following comparison results hold for Glauber dynamics.

  • Relaxation time comparison. The relaxation time satisfies TrelGD(μ)=O(TrelGD(μ)), where μ is a conditional distribution obtained from μ by fixing the values on a subset ΛV of variables. The set Λ is chosen intentionally such that the induced subgraph G[VΛ] on other vertices has smaller maximum degree. For many spin systems, the distribution μ is in an “easy regime” so that the mixing/relaxation time for μ is easy to analyze. We can bound the relaxation time for μ via this comparison result (see Theorem 9).

  • Mixing time comparison. If μ is a monotone spin system (Definition 14) and the Glauber dynamics starts from a specific initial configuration, then the mixing time satisfies TmixGD(μ,ε)=O~(TmixGD(μ,14e)), where O~ hides a polylog(n/ε) factor (See Theorem 15).

We obtain the relaxation/mixing time bounds via the above comparison results. In the relaxation time comparison result, the constant factor in O() is independent of the degree of the graph. In applications, the distribution μ is in an “easy regime”, hence, we can use some standard technique to show TrelGD(μ)=O(n). The comparison result gives the optimal TrelGD(μ)=O(n) relaxation time. Similarly, in the applications of monotone systems, we can obtain the near-optimal O~(n) mixing time for general graphs. Our comparison results only hold for graphs with large maximum degrees. It does not cause any issue in applications, because coupling independence implies spectral independence, and for graphs with bounded maximum degree, [21] already established the optimal relaxation/mixing time.

Our proof techniques can also give a near-linear time (in input size) sampling algorithm (see Theorem 13). Furthermore, we introduce a general technique to establish coupling independence for 2-spin systems (Theorem 16). Specifically, many spectral independence results for 2-spin systems are proved by analyzing the decay of correlation in the self-avoiding walk tree [20, 13]. We show that all of such proofs can be translated to a proof of coupling independence.

Organization of the paper

In Section 1.1, we first exhibit some concrete applications. In Section 2, we give our technical results. In Section 3, we give an overview of proof techniques.

All the detailed analysis are given in the full version [10].

1.1 Applications

Let G=(V,E) be a graph with maximum degree Δ and [q]={1,2,,q} a set of colors. Given a set of color lists Lv[q],vV, a proper list-coloring X[q]V assigns a color XvLv to each vertex vV such that adjacent vertices receive different colors. In a special case when Lv=[q] for all VV, the list coloring becomes the standard graph q-coloring. We use μ to denote the uniform distribution over all proper list-colorings in G. For the list coloring, in each step, the Glauber dynamics picks a random vertex v and update its color to a random available color. There is a long line of works studying the mixing and relaxation time of Glauber dynamics e.g. [33, 45, 9].

In the era of spectral independence, the proper list-coloring has been re-studied by a series of works [16, 24, 19]. Though the technique varies, all these works established some coupling independence results for the proper list-coloring. For list colorings on triangle-free graphs, let α1.763 denote the unique positive solution to the equation α=exp(1/α). When |Lv|>(α+δ)Δ, the Oδ(1)-coupling independence can be established by techniques in [16, 24]. Our framework gives the optimal relaxation time of Glauber dynamics even if the maximum degree of G is unbounded.

Theorem 1 (Coloring: Relaxation Time).

Let δ>0 be a constant. For any triangle-free graph G=(V,E) and color lists (Lv)vV, if |Lv|(α+δ)Δ for all vV, where Δ3 is the maximum degree of G, then relaxation time of Glauber dynamics is Oδ(n), where n is the number of vertices in G.

Under the condition of Theorem 1, the relaxation time of the Glauber dynamics has been studied by many previous works. Combining the spectral independence technique [1, 3] with the correlation decay analysis [27, 26], two independent works [16, 24] proved the polynomial relaxation time nO(1/δ) of Glauber dynamics. For graphs with bounded maximum degree Δ=O(1), Chen, Liu and Vigoda [21] established the OΔ,δ(n) relaxation time, where OΔ,δ() hides a constant factor like ΔO(Δ2/δ). For general graphs with possibly unbounded maximum degree, Jain, Pham and Vuong [29] proved the first almost linear relaxation time Oδ(ne(loglogn)2)=Oδ(n1+o(1)). Their proof combined the techniques in [21] with the coupling analysis in [28]. Compared to previous results, Theorem 1 gives the optimal linear relaxation time for general graphs.

We prove Theorem 1 by first verifying the coupling independence condition (Definition 7) and then applying our comparison result (Theorem 9). Theorem 1 requires |Lv|(α+δ)Δ because the current best coupling independence result requires this number of colors but our comparison result does not require such a strong condition. It is conjectured that Oδ(1)-coupling independence should hold for proper list-coloring in general graphs when |Lv|(1+δ)Δ+O(1).

Conjecture 2 (Folklore).

Let δ>0 be a constant. For any graph G=(V,E) with maximum degree Δ and color lists (Lv)vV such that |Lv|(1+δ)Δ+O(1) for all vV, the uniform distribution μ over all the proper list-colorings of G is Oδ(1)-coupling independent.

Our comparison framework can prove optimal relaxation time for Glauber dynamics on proper list-colorings of graphs (with potentially unbounded degree) as long as Conjecture 2 holds.

Proposition 3.

If Conjecture 2 holds with δ>0, then for any list coloring instance in Conjecture 2, the relaxation time of Glauber dynamics is Oδ(n).

The standard relation between relaxation time and mixing time implies that the Glauber dynamics mixes in time Oδ(n2logq), which yields a sampling algorithm for the uniform distribution μ of graph colorings in time Oδ(Δn2logq) because each step of Glauber dynamics can be simulated in time O(Δ). However, in terms of sampling algorithm, our technique would directly give an algorithm (not the Glauber dynamics) in time O~δ(Δn). Since the input graph G contains Δn edges, the running time is near-linear in the input size.

Theorem 4 (Coloring: Algorithm).

Let δ>0 be a constant. There exists an algorithm such that given any ε>0, any triangle-free graph G=(V,E) and color lists (Lv)vV, if |Lv|(α+δ)Δ for all vV, where Δ3 is the maximum degree of G, it returns a random sample X satisfying 𝒟TV(Xμ)ε in time Δn(lognε)C(δ), where C(δ) is a constant depending only on δ.

The next example is the hardcore model. Let G=(V,E) be a graph. Let λ>0 be the fugacity. The hardcore model defines a distribution μ over all independent sets SV in G such that μ(S)λ|S|. Let Δ3 denote the maximum degree of graph G. There is a critical threshold (a.k.a. uniqueness threshold) for the tree uniqueness phase transition [36]

λc(Δ):=(Δ1)(Δ1)(Δ2)Δ.

such that if λλc(Δ) the correlation between two vertices decays in their distance; if λ>λc(Δ), the long-range correlation exists. A computational phase transition occurs at the same threshold. If λ<λc(Δ), polynomial time sampling algorithm exists [46]; if λ>λc(Δ), the sampling problem is hard unless NP = RP [44, 25]. The mixing and relaxation time of the Glauber dynamics for hardcore model were also extensively studied [41, 28, 23]. Recent works analyzed Glauber dynamics via spectral independence [3]. The optimal Oδ(nlogn) mixing time and the optimal Oδ(n) relaxation time were established when λ(1δ)λc(Δ) for general graphs [21, 15, 12].

However, for the hardcore model on bipartite graphs, the picture is not very clear. Consider the hardcore model in a bipartite graph G=(V=VLVR,E). Let ΔL denote the maximum degree in the left part. Assume 3ΔL. It is recently known that the uniqueness threshold for the hardcore model on the bipartite graph can be refined to λc(ΔL)λc(Δ) where ΔΔL is the maximum degree of the bipartite graph [39, 13]. The Glauber dynamics is also proved to have polynomial mixing time when λ<λc(ΔL) [13].

On the other side, when λ>λc(ΔL), the lower bound in [44] does not hold for bipartite graphs and the problem is #BIS-hard [7], where #BIS is the problem of counting the independent sets in bipartite graphs. A line of works (e.g. [31, 8, 38, 17, 32]) studied various sampling algorithms in the low-temperature (large λ) regime.

Within the critical threshold λ<λc(ΔL), we consider “balanced” bipartite graphs. Let ΔR be the maximum degree in VR. We say a bipartite graph is θ-balanced if ΔRθΔL.

Theorem 5 (Bipartite Hardcore: Relaxation Time).

Let δ(0,1) and θ>1 be two constants. For any hardcore model on a θ-balanced bipartite graph G with fugacity λ, if λ(1δ)λc(ΔL), then the relaxation time of Glauber dynamics is Oδ,θ(n), where n is the number of vertices in G.

For the mixing time, again, the standard relation gives Oδ,θ(n2) mixing time of the Glauber dynamics. However, since the bipartite graph hardcore is a monotone system, our technique also implies the O~δ,θ(n) mixing time of Glauber dynamics starting from the independent set containing all vertices in the left part: X0=VL. Formally, for any SΩ(μ),

TmixGD(μ,εS)=min{t>0𝒟TV(Xtμ)εX0=S}.
Theorem 6 (Bipartite Hardcore: Mixing Time).

Let δ(0,1) and θ>1 be two constants. For any hardcore model on a θ-balanced bipartite graph G with fugacity λ, if λ(1δ)λc(ΔL), then the mixing time of Glauber dynamic starting from VL satisfies TmixGD(μ,εVL)=n(lognε)C(δ,θ), where C(δ,θ) is a constant depending only on δ and θ.

The previous work [13] established the (ΔLlognλ)O(1/δ)n2 relaxation time for the Glauber dynamics, which, by the standard relation, implies the (ΔLlognλ)O(1/δ)n3log1+λλ mixing time. The previous result holds for general bipartite graphs as long as λ(1δ)λc(ΔL). For balanced bipartite graphs, we obtained the optimal relaxation time Oδ,θ(n) and the near-optimal mixing time O~δ,θ(n), which significantly improved the dependency to n and ΔL compared to the previous result. For example, in the near critical case when λ=(1δ)λc(ΔL)=Θ(1/ΔL), previous result gives ΔLO(1/δ)n2polylog(n) relaxation time and ΔLO(1/δ)n3polylog(n) mixing time but our result gives O(n) relaxation time and npolylog(n) mixing time. However, our result works only on balanced bipartite graphs. The result in [13] is still state-of-the-art for general bipartite graphs.

Finally, we point out that our technique could also recover many previous O(n) relaxation time results for anti-ferromagnetic 2-spin systems in [11]. See Remark 10 for one example.

2 Technical Results

2.1 Coupling Independence

In this section, we give our general results for spin systems. Let G=(V,E) be a graph. Let [q]={1,2,,q} be a set of q2 spins. For each vertex vV, let vector bv0q be the external field at vertex v. For each edge eE, let symmetric matrix Ae0q×q be the interaction matrix at edge e. A spin system defines a Gibbs distribution μ over [q]V such that,

σ[q]Vμ(σ)w(σ):=uVbu(σu)e={v,w}EAe(σv,σw).

We use Ω(μ)[q]V to denote the support of the Gibbs distribution μ.

Let ΛV be a subset of vertices. Given any pinning τ[q]VΛ, we define a conditional distribution μτ by for any configuration σ[q]V,

μτ(σ)wτ(σ):=𝟏[σVΛ=τ]uΛbu(σu)e={v,w}Ev,wΛAe(σv,σw)e={v,w}EvΛwΛAe(σv,τw). (2)

In words, μτ is a Gibbs distribution obtained for μ by removing all edges eVΛ and putting a constraint that every vertex in vVΛ must take the value τv. In particular, if τ is feasible (e.g. τ belongs to the support of the marginal distribution μVΛ), then μτ is exactly the conditional distribution induced by μ given the condition τ. For all spin systems considered in this paper, it holds that σwτ(σ)>0 for all τ. The distribution in (2) is well-defined. Furthermore, for any subset S, we use μSτ to denote the marginal distribution on S projected from μτ.

The following condition plays a key role in the proof of our main results. Let ρ:V>0 be a function that maps every vertex vV to a positive integer. We call the function ρ the Hamming weight function. For any two (possibly partial) configurations σ,τ[q]Λ, where ΛV, define their weighted Hamming distance with respect to ρ by

Hρ(σ,τ):=vΛ:σ(v)τ(v)ρ(v). (3)
Definition 7 (Coupling Independence).

Let C1 be a constant. A distribution μ over [q]V is said to be C-coupling independent (C-CI) if there exists Hamming weight function ρ:V>0 such that the following holds. For any pinning σ1,σ2[q]S, where SV and σ1,σ2 disagree only at one vertex v0S, there exists a coupling (X,Y), where Xμσ1 and Yμσ2, such that

𝔼[Hρ(X,Y)]ρ(v0)C.

The notion of coupling independence was introduced explicitly in [14] to study the spectral independence property. For example, for Boolean distributions111The influence matrix and spectral independence are also defined for general distributions with q2. See [16] for the detailed definition. (q=2), given any pinning τ{,+}VΛ, the |Λ|×|Λ| influence matrix [3] is defined by

Ψμτ(v,u):=μuτv+(+)μuτv(+), (4)

where u,vΛ and τv± denotes the pinning τ together with v taking the value ±. A distribution μ is C-spectrally independent if the maximum eigenvalue of Ψμτ is at most C for any pinning τ. It is not hard to show that C-coupling independence implies C-spectral independence. Hence, recent works [14, 19, 18] utilized coupling independence to establish the spectral independence for various spin systems.

2.2 Compare Markov Chains via Coupling Independence

In this work, we find more applications for coupling independence beyond establishing spectral independence. We build some comparison results of Markov chains via coupling independence. As a by-product result, we also show that the coupling independence gives fast sampling algorithms.

Let μ be a Gibbs distribution over [q]V on graph G=(V,E). For any ΛV, we use G[Λ] to denote the induced subgraph of G on vertex set Λ.

Definition 8 (Relaxation Time with Pinning).

Let μ be a Gibbs distribution on graph G=(V,E) with maximum degree Δ. Let η[0,1]. Let D(η) denote all subsets ΛV such that the maximum degree of G[Λ] is at most ηΔ. Define

Trel(η)(μ):=max{TrelGD(μτ)ΛD(η)τ[q]VΛ}.

In the above definition, μτ is a distribution on [q]V. In every step, the Glauber dynamics picks vV uniformly at random then resamples the value on v. If vΛ, the value of v after resampling is always τv. Indeed, μτ is essentially the same as μΛτ. But, considering μτ would help us simplify some results and proofs. The following is our main comparison result.

Theorem 9 (Relaxation Time Comparison).

Let M1 and 0<η12M be two constants. There exists Δ0=Ω(M2η2logMη) such that for any Gibbs distribution μ on graph G with the maximum degree ΔΔ0, if μ satisfies M-coupling independence, then the relaxation time of Glauber dynamics on μ satisfies

TrelGD(μ)2O(M/η)Trel(η)(μ).

The theorem is proved in the full version ([10, Section 4]). See Section 3 for a proof overview.

The above theorem is a comparison result between two kinds of relaxation times. Consider the case when parameters of μ are close to the critical threshold so that the relaxation time TrelGD(μ) is hard to analyze. By choosing a sufficiently small η, suppose for any ΛD(η) and any τ[q]VΛ, the conditional distribution μτ falls into an easy regime. The relaxation time Trel(η)(μ) is easy to analyze. Theorem 9 boosts the relaxation time from an easy regime to the hard regime if μ satisfies the coupling independence and the maximum degree Δ is greater than a constant Δ0.

When applying Theorem 9 to a specific spin system with Gibbs distribution μ, we first need to show that the μ satisfies the coupling independence property. Next, we choose a small constant η to guarantee that Trel(η)(μ) is easy to analyze. Now, the constant parameter Δ0 in Theorem 9 is fixed. If the maximum degree ΔΔ0=O(1) is bounded, then since the coupling independence implies the spectral independence, the previous work [21] already established the optimal relaxation time for μ. If the maximum degree ΔΔ0, we can apply our boosting result to bound the relaxation time. We show how to prove Theorem 1 and Proposition 3 via Theorem 9.

Proof Sketch of Theorem 1

Given a triangle-free graph G=(V,E) and color lists Lv[q] with |Lv|(α+δ)Δ for all vV, let μ denote the uniform distribution over all proper list-colorings. By going through the analysis in [24], we can prove that μ satisfies O(1/δ)-coupling independence. Let η be a parameter to be fixed later. For any ΛD(η), any pinning τ[q]VΛ, the distribution μτ is essentially the same as the distribution μΛτ because the coloring outside Λ is fixed by τ. By self-reducibility, μΛτ is a list coloring on G=G[Λ] with color list Lv=Lv{τuuΛ{u,v}E}. Let deg(v) and deg(v) denote the degree of v in G and G respectively. The new instance satisfies

vΛ,|Lv||Lv|(deg(v)deg(v))|Lv|Δ|Lv|deg(v)Δ,

where Δ denotes the maximum degree of G. By the definition of D(η), deg(v)ΔηΔ. We have |Lv|deg(v)>(α1)Δα1ηΔ. If we set the parameter η110, then

vΛ,|Lv|5Δ. (5)

In this easy regime, one can use path coupling [6] to show Trel(η)(μ)=O(n). To apply Theorem 9, we pick a small η=O(δ) and η<110. If ΔΔ0=Θ(1δ4log1δ), then

Trel(η)(μ)=2O(1/δ2)n=Oδ(n).

On the other hand, if ΔΔ0=Θ(1δ4log1δ), then the maximum degree is bounded, we can use the result in [21] to obtain the relaxation time Trel(η)(μ)=Oδ(n) in the same order. This gives the proof sketch of Theorem 1. The only missing component is how to establish the coupling independence, which can be found in the full version ([10, Section 9]).

Proof of Proposition 3

To obtain (5), we only need to use the fact that α>1. If we replace α with (1+δ), then we can set ηδ5 and (5) still holds. The same analysis proves Proposition 3.

 Remark 10 (Hardcore Model in Uniqueness Regime).

Theorem 9 could also rediscover some previous results. For example, for the hardcore model on a graph G=(V,E) with fugacity λ(1δ)Δ, [11] proved the optimal Oδ(n) relaxation time. For a fixed λ, the hardcore model falls into an easy regime if we can reduce the maximum degree of the graph by a constant factor. The hardcore model in the uniqueness regime satisfies O(1/δ)-coupling independence (which can be proved by Theorem 16 in this paper). Using a similar argument as that for list coloring, one can rediscover the optimal Oδ(n) relaxation time using Theorem 9.

We remark that the relaxation time result for the bipartite graph hardcore model (Theorem 5) is not a direct consequence from Theorem 9. We need to tweak the proof of Theorem 9 to prove Theorem 5. The proof of Theorem 5 is in the full version ([10, Section 8]).See Section 3 for a proof overview.

 Remark 11 (Compare Theorem 9 to the Technique in [11]).

Another comparison result about relaxation time was given in [11]. The previous result considers general Boolean distribution (not necessarily Gibbs distribution) μ over {,+}V. Given a vector λ=(λv)vV, (λμ) denotes the distribution such that for any σ{,+}V, (λμ)μ(σ)vV:σ(v)=+λv. The result says if (λμ) is spectrally independent for all λ(0,1]V, then one can compare TrelGD(μ) to TrelGD(λθμ), where λθ is the vector with constant value 0<θ<1. When applying results to Gibbs distributions, here are some differences between Theorem 9 and the previous result in [11].

  • Theorem 9 works for general domain [q] but previous result works only for Boolean domain;

  • The condition is incomparable. Theorem 9 requires coupling independence for μ and a degree lower bound for the underlying graph but the previous result requires spectral independence for a family of distributions;

  • The easy regime is incomparable. The easy regime in Theorem 9 is the conditional distributions on a small degree subgraph but the easy regime in the previous result is λθμ;

For many spin systems, one can use Theorem 9 to establish the optimal O(n) relaxation for Glauber dynamics, where n is the number of variables in the spin system. By the standard relation between mixing and relaxation time, the mixing time of Glauber dynamics can usually be bound by O(n2). Each transition of Glauber dynamics can be simulated in time O(Δ). Hence, one can obtain a sampling algorithm in time O(Δn2). Alternatively, we can give a faster sampling algorithm in time O~(Δn) if the easy regime has near-linear mixing time.

Definition 12 (Mixing Time with Pinning).

Let μ be a Gibbs distribution on graph G=(V,E) with maximum degree Δ. Let η[0,1]. Let D(η) denote all subsets ΛV such that the maximum degree of G[Λ] is at most ηΔ. Define

Tmix(η)(μ):=max{TmixGD(μτ,14e)ΛD(η)τ[q]VΛ}.

In words, for any pinning τ on VΛ with ΛD(η), Tmix(η)(μ) is an upper bound for the mixing time T of Glauber dynamics for μτ such that starting from the worst initial X0, the total variation distance between XT and μτ is at most 14e.

Theorem 13 (Fast Sampling Algorithm).

Let M1 and 0<η12M be two constants. There exists an algorithm such that given any ε(0,1) and any Gibbs distribution μ on graph G with the maximum degree ΔΔ0=(10Mη)2log10Mη, if μ satisfies M-coupling independence such that the weighted hamming distance ρ satisfies ρmaxρmin=poly(n), then it returns a random sample X satisfying 𝒟TV(Xμ)ε in time

ΔTmix(η)(μ)(lognε)O(M/η),

where n is the number of vertices in G and we use ρmax=maxvVρ(v) and ρmin=minvVρ(v).

The theorem is proved in the full version ([10, Section 5]).See Section 3 for a proof overview.

In the above theorem, suppose ρmaxρmin=O(nd) for some universal constant d, then the running time in above theorem should be TmixGD(μ,η)Δ(lognε)C(M/η+d) for some universal constant C. We then hide the constants C and d by O() in Theorem 13.

Theorem 4 can be obtained from Theorem 13. Consider the list coloring on a triangle free graphs G=(V,E) with |Lv|(α+δ)Δ. The uniform distribution μ satisfies O(1/δ)-coupling-independence with standard Hamming weight ρ(v)=1 for all vV. Take η=O(1/δ) be a small constant with η<110. By (5), a simple path coupling [6] shows that TmixGD(μ,η)=O(nlogn). Hence, if the maximum degree Δ is greater than Δ0, we run the algorithm in Theorem 13 and the running time is Δnpolylog(nε). Otherwise, the maximum degree is bounded and the result in [21] gives the Oδ(nlogn) mixing time of the Glauber dynamics, then we can simulate Glauber dynamics to obtain a sampling algorithm. The proof of Theorem 4 is in the full version ([10, Section 9]).

The algorithm in Theorem 13 is not the Glauber dynamics. Roughly speaking, the algorithm uses some strategy to pick vertices and uses the Glauber update to resample the value of the picked vertices. However, for monotone spin systems, we can compare this algorithm to Glauber dynamics via censoring inequality [43] and then we can bound the mixing time of Glauber dynamics.

Let μ over [q]V be the Gibbs distribution. Define a partial order for [q]V as follows. For each vV, pick a total order v on [q]. For any two X,Y[q]V,

XYvV,XvvYv. (6)

For two distributions μ and ν over [q]V, we say μ is stochastic dominated by ν (i.e., μν) if there is a coupling 𝒞 between μ,ν such that 𝐏𝐫(X,Y)𝒞[XY]=1. Let P be the transition matrix of the Glauber dynamics on μ, which can be written as

P=1nvVPv, (7)

where Pv performs updates at the vV such that Pv(X,Y)=μXVv(Y), for all X,Y[q]V.

Definition 14 ([37, Chapter 22]).

We say μ is a monotone spin system if for every vV, Pv is ordering persistant, which means for any X,Y[q]V with XY, it holds that Pv(X,)Pv(Y,).

By the definition of the partial ordering in [q]V, there is a unique maximum configuration for the ordering. Denote this state as X+. Recall TmixGD(μ,εX+) denotes the mixing time of Glauber dynamics starting from X+.

Theorem 15 (Mixing Time Comparison).

Let M1 and 0<η12M be two constants. For any monotone spin system μ on graph G with the maximum degree Δ=Ω(M2η2logMη), if μ satisfies M-coupling-independence such that the Hamming weight ρ satisfies ρmaxρmin=poly(n), then the mixing time of Glauber dynamics starting from the maximum configuration satisfies

TmixGD(μ,εX+)(lognε)O(M/η)Tmix(η)(μ),

where n is the number of vertices in graph G.

The theorem is proved in the full version ([10, Section 6]).See Section 3 for a proof overview.

The above theorem is of independent interest. Suppose μ is a monotone system with coupling independence property. The parameters of μ are in the critical regime and the underlying graph has an unbounded maximum degree. If we can choose a proper constant η such that Tmix(η)(μ)=O(nlogn), then the theorem gives a linear-optimal O~(n) mixing time of Glauber dynamics for μ starting from the maximum configuration.

To obtain the near-linear mixing time for μ, some previous works [12, 15, 2] developed comparison techniques for the modified log-Sobolev (MLS) constants. Roughly speaking, if one can lower bound the MLS constant mls(μ) of the Glauber dynamics for μ, then one can obtain the optimal O(nlogn) mixing time. Previous works compared mls(μ) to mls(μ), where μ is a distribution in the easy regime, and such comparison requires μ to satisfy certain entropic independence [2] condition. In general, it is not easy to verify the entropic independence condition and analyze mls(μ) even if μ is in an easy regime. Theorem 15 only requires the coupling independence condition and directly compares the mixing time. However, Theorem 15 requires monotone systems, and the final mixing result is restricted.

We remark that although the hardcore model in bipartite graphs is a monotone system, Theorem 6 is not a direct consequence from Theorem 15. We need to tweak the proof of Theorem 15 to prove Theorem 6. The proof of Theorem 6 is in the full version ([10, Section 8]). See Section 3 for a proof overview.

2.3 Establishing Coupling Independence

The next question is how to establish the coupling independence condition for spin systems. Previously, spectral independence was known for many spin systems. The coupling independence was often a by-product result when proving spectral independence. Hence, it is known for some specific spin systems such as subgraph world [14], b-matching [18] and coloring in high girth graphs [19].

In this paper, we give a tool to turn many existing spectral independence results into coupling independence results. A large family of spin systems is 2-spin systems. Let G=(V,E) be a graph with maximum degree Δ3. Let 0βγ be the edge interactions such that γ>0. Let λ>0 be the external field. Let μ be the Gibbs distribution on G with parameters β,γ,λ such that for any σ{,+}, μ(σ)λn+(σ)βm+(σ)γm(σ), where n+(σ) is the number of vertices v with σv=+ and m±(σ) is the number of edges {u,v} with σu=σv=±. The 2-spin system is said to be ferromagnetic if βγ>1 and anti-ferromagnetic if βγ<1.

Anari, Liu, and Oveis Gharan [3] analyzed the spectral independence for the hardcore model. Chen, Liu, and Vigoda [20] extended the analysis to general 2-spin systems. Recall that the influence matrix Ψμτ is defined in (4). The maximum eigenvalue Eigmax(Ψμτ) can be upper bounded by the total influence from one vertex

Eigmax(Ψμτ)maxvuV|Ψμ(v,u)|. (8)

The RHS is called the total influence bound. For 2-spin systems, the analysis is performed on the Self-Avoiding-Walk (SAW) tree [46]. Roughly speaking, fix a vertex v, the SAW tree Tv enumerates all the SAWs in graph G starting from v. By defining a proper 2-spin system on Tv, one can use the total influence from the root in Tv to upper bound the total influence from v in G, and thus establish the spectral independence for Gibbs distribution μ. In [20], a weighted version of (8) is studied to deal with general 2-spin systems. We give the following result for coupling independence.

Theorem 16 (Informal version; see [10, Lemma 39] for the formal version).

For 2-spin systems, the (weighted) total influence bound in the Self-Avoid-Walk tree implies coupling independence.

As a consequence, all the spectral independence results for 2-spin systems in [20] can be turned into coupling independence results in black-box. For the hardcore model in bipartite graphs (Theorem 5 and Theorem 6), we can also use the above result to transform the total influence bound in [13] into coupling independence result.

Theorem 16 is proved by constructing a recursive coupling in the full version ([10, Section 7]). Fix a vertex v in G. We build a coupling (X,Y) between μv+ and μv and show the discrepancy between X and Y are bounded by the total influence in the SAW tree Tv. Suppose v has d neighbors u1,u2,,ud. We split v into d copies v1,v2,,vd such that vi only has one neighbor ui. Define the pinning σi such that vj for ji takes the value + and vj for j<i takes the value . Then μv+=μσ0 and μv=μσd. We couple each adjacent μσi1 and μσi, then merge them into a coupling between two endpoints μσ0 and μσd. For each adjacent pair, the only difference between σi and σi1 is the pinning at vi. Hence, we first couple the only neighbor ui of vi then construct the coupling recursively if the coupling at ui fails. This recursive processing essentially enumerates all SAWs from v. We can relate the coupling with the SAW tree to prove the theorem.

For multi-spin systems such as list-coloring, we can mimic the recursive coupling for 2-spin systems. Since the previous spectral independence results for list-coloring were also obtained via the SAW tree [24, 16], a similar proof gives the coupling independence.

3 Proof Overview

We give a proof overview for the relaxation time comparison result in Theorem 9. Let G=(V,E) be a graph with maximum degree Δ. Let and k be two constant integers such that <k. Their specific values will be fixed later. We first partition all the vertices in V into k parts U1,U2,,Uk such that for any vertex vV, each Ui does not have more than ηΔ neighbors of v, where 0<η<1 is the parameter in Theorem 9. In other words, let Γv={u(u,v)E} denote the set of neighbors of v in graph G. For any i[k], |ΓvUi|ηΔ. The existence of the partition is guaranteed by the Lovász local lemma. However, the local lemma requires the maximum degree Δ to be sufficiently large. That is why we require a lower bound for Δ in our technical results. We also remark that in our proof, the degree lower bound is used solely to ensure the existence of the partition. A similar partition appeared in the previous work [30].

The input Gibbs distribution μ over [q]V is a joint distribution of n variables (Xv)vV, where each variable takes its value from [q]. Now, we can view μ as a joint distribution of k variables Y=(Yi)i[k] such that each variable Yi=XUi takes its value from a huge domain [q]Ui. We define the k(k) down-up walk on Y. Given Y=(Y1,Y2,,Yk), the Markov chain does as follows

  • Down-walk: Sample a set S([k]) of indices uniformly at random and then remove the configuration on the set S: YY[k]S;

  • Up-walk: Resample YS from μ conditional on Y[k]S and then go back to a full configuration Y[k]SY[k]SYS.

A full configuration Y=(Y1,Y2,,Yk) is on the level k. In the down-walk, we sample a random subset of indices S[k] with size . By dropping the configuration YS, we move from a full configuration at level k to a partial configuration at level k. In the up-walk, we resample YS and go back to the level k. The process can also be viewed as a kind of block dynamics for configuration X[q]V. In every step, we pick a random subset US=iSUiV of variables and resample X(US) conditional on X(VUS).

We use local-to-global technique [35, 1, 3] to analyze the spectral gap of the k(k) down-up walk for Y. The local-to-global technique suggests to analyze the relaxation time of k1 down-up walk222In [1, 3], the local walk is essentially defined as the 1k up-down walk. Every state is Yi for i[k]. In the up-walk, it extends Yi to a full configuration Y. In the down-walk step, it samples a random index j[k] and updates Y to Yj. It is well-known that 1k up-down walk and k1 up-down walk has the same relaxation time.. In the down walk, we pick a random S of size |S|=k1 and drop YS. In the up-walk, we resample YS and go back to level k. We use coupling independence to analyze this k1 down-up walk via path coupling. For simplicity, suppose μ satisfies C-coupling independence with standard Hamming distance (ρ(v)=1 for all vV). We can view this k1 down-up walk on Y as a block dynamics on X[q]V, where it updates a block US in every step. Given two X[q]V and X[q]V that disagree only at one vertex vV, say vU1, we couple the transition of k1 down-up walk. Let two k1 down-up walks (starting from X and X, respectively) select the same random subset S[k] such that |S|=k1.

  • If 1S, which happens with probability k1k, then since vU1 the value of v is removed in the down-walk, and thus X and X can be coupled perfectly after the transition.

  • If 1S, which happens with probability 1k, then since vU1, the disagreement at v may percolate to other blocks in the up-walk step. We use the coupling in the M-coupling independence to couple the up-walk so that the expected Hamming distance between X and X after the transition is at most M.

Hence, the expected Hamming distance between X and X after transition is at most Mk. If k>M, the path coupling gives the O(logn) mixing time and O(1) relaxation time for this down-up walk. To apply the local-to-global technique, we also need to fix a configuration YΛ, where Λ[k] and |Λ|=t, and consider the (kt)1 down-up walk for Y[k]Λ. The same path coupling works if kt>M. By choosing k and such that k>M and using the local-to-global technique, we can show that the k(k) down-up walk for Y has O(1) relaxation time.

We then compare the k(k) down-up for Y=(Y1,Y2,,Yk) to the Glauber dynamics for X[q]V. Recall that k(k) down-up walk is a kind of block dynamics for X. In every step, the block dynamics updates a subset US=iSUi with |S|=. The update step is to resample X(US) conditional on X(VUS). This step samples from the conditional Gibbs distribution μUSX(VUS) on subgraph G[US]. By the construction of the partition, the maximum degree of G[US] is at most ηΔ so that we have the relaxation time bound Trel(η)(μ) for Glauber dynamics on μUSXVUS. Let Treldown-up denote the relaxation time of k(k) down-up walk. By some standard comparison argument between block dynamics and the Glauber dynamics, we can prove Theorem 9 by showing that

TrelGD(μ)Treldown-up×Trel(η)(μ)=O(1)×Trel(η)(μ).

Next, we explain how to get the near-linear time sampling algorithm in Theorem 13 and the mixing time in Theorem 15. Note that for k1 down-up walk, the path coupling actually gives the O(logn) mixing time. For one update step, it selects a subset S[k] with |S|=k1. Let i denote the missing index, i.e. S{i}=[k]. The update step resamples YS conditional on Yi. We can simulate this transition step using (k1)1 down-up walk for the conditional distribution on YS. This down-up walk also has the O(logn) mixing time. We do this recursively until we need to sample from a conditional distribution on YS with |S|=. Note that the maximum degree of the graph G[US] is at most ηΔ. Now, we simulate the Glauber dynamics for O(Tmix(η)(μ)logn) steps to sample from the conditional distribution. The following informal algorithm generates an approximate random sample from μ within TV-distance error ε=O(1). The formal algorithm is given in the full version ([10, Section 5]).

Algorithm Sample(X,Λ)
Input: a subset Λ[k];

Output: the algorithm randomly updates the partial configuration XUΛ so that XUΛ becomes an independent approximate sample of μUΛσ, where σ=XU[k]Λ.

  1. 1.

    if |Λ|=: update XUΛ for O(Tmix(η)(μ)logn) steps, where every step uses the update rule of the Glauber dynamics on μUΛσ, where σ=XU[k]Λ.

  2. 2.

    else: run the update of |Λ|1 down-up walk on μUΛσ for O(logn) steps, where σ=XU[k]Λ and every step does as follows:

    1. (a)

      pick an index iΛ uniformly at random and let S=Λ{i};

    2. (b)

      call Sample(X,S) to update XUS; (recursion step)

We can call Sample(X,[k]) with an arbitrary feasible X[q]V. When the algorithm terminates, X is updated to an approximate random sample from the distribution μ. Note that the parameter k is a constant. The total number of Glauber update steps is given by N=(logn)O(1)Tmix(η)(μ). Therefore, the overall running time of the algorithm is O(ΔN). Alternatively, we can think of the algorithm as follows: it first generates a random sequence of vertices v1,v2,,vNV. In the t-th step, the algorithm randomly updates the value of vt, conditioned on the values of other variables. This behavior is similar to standard Glauber dynamics for μ, except that the sequence v1,v2,,vN follows a non-trivial joint distribution. For monotone systems, we can compare this algorithm with Glauber dynamics using the censoring inequality.

The results for list-coloring are consequences of general technical results. However, we need to tweak the analysis to prove the results for hardcore model in bipartite graphs (Theorem 5 and Theorem 6). The reason is that for hardcore model on G=(VLVR,E), we only know λ<λc(ΔL) but we cannot control the degree ΔR in the right part VR. Our technique can only prove the coupling independence for μL, which is the marginal distribution on VL projected from μ. To prove the relaxation time and mixing time results, we first partition VL into disjoint part U1,U2,,Uk such that for any vertex vVR, v has no more than ηΔ neighbors in each Ui. Again, the existence of the partition is guaranteed by the local lemma. Let XμL be a partial configuration on VL. We can define Y=(Y1,Y2,,Yk), where Yi=XUi. By a similar proof, we show that the k(k) down-up walk for Y is rapid mixing. We consider a global Markov chain over {,+}VLVR defined as follows. Let X¯{,+}VLVR be a full configuration.

  • Drop the right part to obtain XX¯(VL);

  • Update X using the k(k) down-up walk for Y=(Y1,,Yk), where Yi=X(Ui);

  • Sample X(VR)μVRX and let X¯XX(VR).

We first compare this chain to the k(k) down-up walk and then compare the Glauber dynamics for μ to this Markov chain. This gives the relaxation time of Glauber dynamics. For the mixing time, we can first obtain a near-linear time sampling algorithm for μL, since hardcore model in bipartite graph is a monotone system, we then compare the algorithm to the Glauber dynamics for μ via the censoring inequality.

References

  • [1] Vedat Levi Alev and Lap Chi Lau. Improved analysis of higher order random walks and applications. In STOC, pages 1198–1211, 2020. doi:10.1145/3357713.3384317.
  • [2] Nima Anari, Vishesh Jain, Frederic Koehler, Huy Tuan Pham, and Thuy-Duong Vuong. Entropic independence: optimal mixing of down-up random walks. In STOC, pages 1418–1430, 2022. doi:10.1145/3519935.3520048.
  • [3] Nima Anari, Kuikui Liu, and Shayan Oveis Gharan. Spectral independence in high-dimensional expanders and applications to the hardcore model. In FOCS, pages 1319–1330, 2020. doi:10.1109/FOCS46700.2020.00125.
  • [4] Nima Anari, Kuikui Liu, Shayan Oveis Gharan, and Cynthia Vinzant. Log-concave polynomials II: high-dimensional walks and an FPRAS for counting bases of a matroid. In STOC, pages 1–12, 2019. doi:10.1145/3313276.3316385.
  • [5] Antonio Blanca, Pietro Caputo, Zongchen Chen, Daniel Parisi, Daniel Štefankovič, and Eric Vigoda. On mixing of markov chains: Coupling, spectral independence, and entropy factorization. arXiv preprint arXiv:2103.07459, 2021. arXiv:2103.07459.
  • [6] Russ Bubley and Martin Dyer. Path coupling: A technique for proving rapid mixing in markov chains. In FOCS, pages 223–231, 1997. doi:10.1109/SFCS.1997.646111.
  • [7] Jin-Yi Cai, Andreas Galanis, Leslie Ann Goldberg, Heng Guo, Mark Jerrum, Daniel Štefankovič, and Eric Vigoda. #BIS-hardness for 2-spin systems on bipartite bounded degree graphs in the tree non-uniqueness region. J. Comput. System Sci., 82(5):690–711, 2016. doi:10.1016/j.jcss.2015.11.009.
  • [8] Sarah Cannon and Will Perkins. Counting independent sets in unbalanced bipartite graphs. In SODA, pages 1456–1466, 2020. doi:10.1137/1.9781611975994.88.
  • [9] Sitan Chen, Michelle Delcourt, Ankur Moitra, Guillem Perarnau, and Luke Postle. Improved bounds for randomly sampling colorings via linear programming. In SODA, pages 2216–2234, 2019. doi:10.1137/1.9781611975482.134.
  • [10] Xiaoyu Chen and Weiming Feng. Rapid mixing via coupling independence for spin systems with unbounded degree. arXiv:2407.04672, 2024. doi:10.48550/arXiv.2407.04672.
  • [11] Xiaoyu Chen, Weiming Feng, Yitong Yin, and Xinyuan Zhang. Rapid mixing of glauber dynamics via spectral independence for all degrees. In FOCS, pages 137–148, 2021. doi:10.1109/FOCS52979.2021.00022.
  • [12] Xiaoyu Chen, Weiming Feng, Yitong Yin, and Xinyuan Zhang. Optimal mixing for two-state anti-ferromagnetic spin systems. In FOCS, pages 588–599, 2022. doi:10.1109/FOCS54457.2022.00062.
  • [13] Xiaoyu Chen, Jingcheng Liu, and Yitong Yin. Uniqueness and rapid mixing in the bipartite hardcore model. In FOCS, pages 1991–2005. IEEE, 2023. doi:10.1109/FOCS57990.2023.00121.
  • [14] Xiaoyu Chen and Xinyuan Zhang. A near-linear time sampler for the Ising model with external field. In SODA, pages 4478–4503, 2023. doi:10.1137/1.9781611977554.CH170.
  • [15] Yuansi Chen and Ronen Eldan. Localization schemes: A framework for proving mixing bounds for markov chains. In FOCS, pages 110–122, 2022.
  • [16] Zongchen Chen, Andreas Galanis, Daniel Štefankovič, and Eric Vigoda. Rapid mixing for colorings via spectral independence. In SODA, pages 1548–1557, 2021.
  • [17] Zongchen Chen, Andreas Galanis, Daniel Štefankovič, and Eric Vigoda. Sampling colorings and independent sets of random regular bipartite graphs in the non-uniqueness region. In SODA, pages 2198–2207, 2022.
  • [18] Zongchen Chen and Yuzhou Gu. Fast sampling of b-matchings and b-edge covers. In SODA, pages 4972–4987, 2024. doi:10.1137/1.9781611977912.178.
  • [19] Zongchen Chen, Kuikui Liu, Nitya Mani, and Ankur Moitra. Strong spatial mixing for colorings on trees and its algorithmic applications. In FOCS, pages 810–845, 2023. doi:10.1109/FOCS57990.2023.00053.
  • [20] Zongchen Chen, Kuikui Liu, and Eric Vigoda. Rapid mixing of Glauber dynamics up to uniqueness via contraction. In FOCS, pages 1307–1318, 2020. doi:10.1109/FOCS46700.2020.00124.
  • [21] Zongchen Chen, Kuikui Liu, and Eric Vigoda. Optimal mixing of Glauber dynamics: entropy factorization via high-dimensional expansion. In STOC, pages 1537–1550, 2021. doi:10.1145/3406325.3451035.
  • [22] Martin Dyer, Catherine Greenhill, and Mario Ullrich. Structure and eigenvalues of heat-bath Markov chains. Linear Algebra Appl., 454:57–71, 2014.
  • [23] Charilaos Efthymiou, Thomas P. Hayes, Daniel Štefankovič, Eric Vigoda, and Yitong Yin. Convergence of MCMC and loopy BP in the tree uniqueness region for the hard-core model. SIAM J. Comput., 48(2):581–643, 2019. doi:10.1137/17M1127144.
  • [24] Weiming Feng, Heng Guo, Yitong Yin, and Chihao Zhang. Rapid mixing from spectral independence beyond the boolean domain. In SODA, pages 1558–1577, 2021. doi:10.1137/1.9781611976465.95.
  • [25] Andreas Galanis, Qi Ge, Daniel Štefankovič, Eric Vigoda, and Linji Yang. Improved inapproximability results for counting independent sets in the hard-core model. Random Structures Algorithms, 45(1):78–110, 2014. (conference version in RANDOM’11). doi:10.1002/RSA.20479.
  • [26] David Gamarnik, Dmitriy Katz, and Sidhant Misra. Strong spatial mixing of list coloring of graphs. Random Structures Algorithms, 2013.
  • [27] Leslie Ann Goldberg, Russell A. Martin, and Mike Paterson. Strong spatial mixing with fewer colors for lattice graphs. SIAM J. Comput., 35(2):486–517, 2005. doi:10.1137/S0097539704445470.
  • [28] Thomas P. Hayes and Eric Vigoda. Coupling with the stationary distribution and improved sampling for colorings and independent sets. Ann. Appl. Probab., 16(3):1297–1318, 2006.
  • [29] Vishesh Jain, Huy Tuan Pham, and Thuy Duong Vuong. Spectral independence, coupling with the stationary distribution, and the spectral gap of the Glauber dynamics. Inf. Process. Lett., 177:106268, 2022.
  • [30] Vishesh Jain, Ashwin Sah, and Mehtaab Sawhney. Perfectly sampling k (8/3 + o(1))Δ-colorings in graphs. In STOC, pages 1589–1600, 2021.
  • [31] Matthew Jenssen, Peter Keevash, and Will Perkins. Algorithms for #BIS-hard problems on expander graphs. SIAM J. Comput., 49(4):681–710, 2020. doi:10.1137/19M1286669.
  • [32] Matthew Jenssen, Will Perkins, and Aditya Potukuchi. Approximately counting independent sets in bipartite graphs via graph containers. Random Structures Algorithms, 63(1):215–241, 2023. doi:10.1002/RSA.21145.
  • [33] Mark Jerrum. A very simple algorithm for estimating the number of k-colorings of a low-degree graph. Random Struct. Algorithms, 7(2):157–165, 1995. doi:10.1002/RSA.3240070205.
  • [34] Mark Jerrum. Glauber dynamics for the hard-core model on bounded-degree H-free graphs. CoRR, abs/2404.07615, 2024. doi:10.48550/arXiv.2404.07615.
  • [35] Tali Kaufman and Izhar Oppenheim. High order random walks: beyond spectral gap. Combinatorica, 40(2):245–281, 2020. (conference version in RANDOM’18). doi:10.1007/S00493-019-3847-0.
  • [36] Frank P Kelly. Stochastic models of computer communication systems. Journal of the Royal Statistical Society: Series B (Methodological), 47(3):379–395, 1985.
  • [37] David A. Levin, Yuval Peres, and Elizabeth L. Wilmer. Markov chains and mixing times. American Mathematical Society, Providence, RI, 2017.
  • [38] Chao Liao, Jiabao Lin, Pinyan Lu, and Zhenyu Mao. An FPTAS for the hardcore model on random regular bipartite graphs. Theoret. Comput. Sci., 929:174–190, 2022. doi:10.1016/J.TCS.2022.07.001.
  • [39] Jingcheng Liu and Pinyan Lu. FPTAS for #BIS with degree bounds on one side. In STOC, pages 549–556, 2015. doi:10.1145/2746539.2746598.
  • [40] Kuikui Liu. From coupling to spectral independence and blackbox comparison with the down-up walk. arXiv preprint arXiv:2103.11609, 2021. arXiv:2103.11609.
  • [41] Michael Luby and Eric Vigoda. Fast convergence of the glauber dynamics for sampling independent sets. Random Structures Algorithms, 15(3-4):229–241, 1999. doi:10.1002/(SICI)1098-2418(199910/12)15:3/4\%3C229::AID-RSA3\%3E3.0.CO;2-X.
  • [42] Marc Mezard and Andrea Montanari. Information, physics, and computation. Oxford University Press, 2009.
  • [43] Yuval Peres and Peter Winkler. Can extra updates delay mixing? Comm. Math. Phys., 323(3):1007–1016, 2013.
  • [44] Allan Sly. Computational transition at the uniqueness threshold. In FOCS, pages 287–296, 2010. doi:10.1109/FOCS.2010.34.
  • [45] Eric Vigoda. Improved bounds for sampling colorings. J. Math. Phys., 41(3):1555–1569, 2000.
  • [46] Dror Weitz. Counting independent sets up to the tree threshold. In STOC, pages 140–149, 2006. doi:10.1145/1132516.1132538.