Abstract 1 Introduction 2 Preliminaries 3 Mixing within the disordered and ordered phases for the RC model 4 Proof for WSM in the disordered regime 5 Proof for WSM within the ordered regime References

Planting and MCMC Sampling from the Potts Model

Andreas Galanis ORCID Department of Computer Science, University of Oxford, UK Leslie Ann Goldberg ORCID Department of Computer Science, University of Oxford, UK Paulina Smolarova ORCID Department of Computer Science, University of Oxford, UK
Abstract

We consider the problem of sampling from the ferromagnetic q-state Potts model on the random d-regular graph with parameter β>0. A key difficulty that arises in sampling from the model is the existence of a “metastability” window β(βu,βu), where roughly the distribution has two competing modes, the so-called disordered and ordered phases. This causes classical Markov-chain algorithms to be slow mixing from worst-case initialisations. Nevertheless, Helmuth, Jenssen and Perkins (SODA ’19) designed a sampling algorithm that works for all β, when d5 and q=dΩ(d), using polymers and cluster expansion methods; more recently, their analysis technique has been adapted to show that a Markov chain (random-cluster dynamics) mixes fast when initialised appropriately, in the same regime of q,d,β.

Despite these positive algorithmic results, a well-known bottleneck behind cluster-expansion arguments is that they inherently only work for large q, whereas it is widely conjectured that sampling on the random d-regular graph is possible for all q,d3. The only result so far that applies to general q,d3 is by Blanca and Gheissari who showed that the random-cluster dynamics mixes fast in the “uniqueness” regime β<βu where roughly only the disordered mode exists. For ββu however, a second subdominant mode emerges creating bottlenecks and giving rise to correlations which have been hard to handle, especially for small values of q and d.

Our main contribution is to perform a delicate analysis of the Potts distribution and the random-cluster dynamics that goes beyond the threshold βu. We use planting as the main tool, a technique used in the analysis of random CSPs to capture how the space of solutions is correlated with the structure of the random instance. While planting arguments provide only weak sampling guarantees generically, here we instead combine planting with the analysis of random-cluster dynamics to obtain significantly stronger guarantees. We are thus able to show that the random-cluster dynamics initialised from all-out mixes fast for all integers q,d3 beyond the uniqueness threshold βu, all the way to the optimal threshold βc(βu,βu) where the dominant mode switches from disordered to ordered. A more involved analysis also applies to the ordered regime β>βc where we obtain an algorithm for all d3 and q(5d)5, improving significantly upon the previous range of q,d by Carlson, Davies, Fraiman, Kolla, Potukuchi, and Yap (FOCS’22).

Keywords and phrases:
approximate sampling, Glauber dynamics, Potts model, random cluster model
Copyright and License:
[Uncaptioned image] © Andreas Galanis, Leslie Ann Goldberg, and Paulina Smolarova; licensed under Creative Commons License CC-BY 4.0
2012 ACM Subject Classification:
Mathematics of computing Gibbs sampling
; Mathematics of computing Random graphs ; Mathematics of computing Markov-chain Monte Carlo methods
Related Version:
Full version with all proofs: https://arxiv.org/abs/2410.14409 [18]
Editors:
Meena Mahajan, Florin Manea, Annabelle McIver, and Nguyễn Kim Thắng

1 Introduction

The Potts model is a weighted model assigning probabilities to (non-proper) q-colourings. The model originated in statistical physics but has since been a central object of study in various contexts; here, we focus on the computational problem of sampling from the model.

For an integer q3 and a graph G=(V,E), the q-state Potts model on G with parameter β is a probability distribution μ=μG,q,β on [q]V where [q]={1,,q}. Each configuration σ[q]V has weight μ(σ)=eβm(σ)/ZG where m(σ) is the number of monochromatic edges under σ; the normalising factor ZG is the partition function. Throughout, we consider the “ferromagnetic” case β>0, where configurations with many monochromatic edges are favoured in the distribution.

From a computational complexity perspective, sampling from the ferromagnetic Potts model has various twists relative to other similar models and the complexity of the sampling problem is widely open. On general graphs, the problem is #BIS-hard for any fixed β>0 [24]; for graphs of max degree d (where d is a fixed integer), the problem becomes #BIS-hard when β>βc [20], where βc=lnq2(q1)12/d1 is known as the ordered/disordered threshold (we will discuss this in more detail shortly). It is also conjectured that the problem admits a polynomial time algorithm when β<βc but this is open in general; in fact, many of the standard tools that are used to analyse Markov chains provably fail well below the βc threshold. The quintessential example here is the random d-regular graph, which underpins all the relevant phenomena behind this picture. For dn even, we let 𝔾=𝔾n,d be a graph chosen uniformly at random from the set of all d-regular graphs with n vertices. We use “whp” as a shorthand for “with probability 1o(1) as n grows large”.

For a typical random graph 𝔾, it is known that a sample from the Potts distribution μ𝔾 is “disordered” for β<βc, and “ordered” for β>βc; roughly, this means that the colours in the former case appear equally often whereas in the second case one of the colours strictly dominates over the rest (see Lemma 3 for the formal statement). The intricate feature of the Potts model on the random d-regular graph is that these competing modes are both present as “local maxima” throughout a window (βu,βu) containing βc even though only one of them has the vast majority of the mass (except at β=βc itself, where both modes appear with constant probability). This already poses problems to standard MCMC algorithms such as Glauber and Swendsen-Wang dynamics since the simultaneous presence of the modes causes exponentially slow mixing from worst-case initialisations, see [26, 14]. At a more conceptual level, any sampling algorithm has to take into account the presence of the other mode which obliterates from the very start standard analysis tools (e.g., correlation decay/spectral independence).

Helmuth, Jenssen and Perkins [26] introduced a cluster-expansion technique that allowed them to control more precisely how much a typical sample differs from the corresponding mode, and obtained an algorithm based on the interpolation method for all β>0 when d5 and qdΩ(d); their algorithm applies more generally to expander graphs (under some mild conditions). See also the works [16, 11, 12] for various refinements of their method. Following a series of developments [21, 22], this cluster expansion expansion argument has been converted into an MCMC algorithm (for large d and q) [19] using the random-cluster dynamics with appropriate initialisation that avoids the bottlenecks in the distribution (see Section 2.2 for details). See also [10, 8, 23] for closely related results on the random d-regular graph (and lattices) that apply for large β.

Despite these positive algorithmic developments, the cluster expansion technique that underpins the analysis of these algorithms is inherently “perturbative”, relying roughly on controlling how much a typical configuration differs from the max-energy configurations and arguing that the difference is relatively small. In the case of the Potts model, the resulting estimates are accurate only when q is quite large relative to d. By contrast, it is conjectured that sampling on the random regular graph should be possible for all q,d3 (and all β>0), and hence more precise tools are needed to settle the picture for small q,d. The only result so far that works for general q,d3 is by Blanca and Gheissari [9] who showed that random-cluster dynamics mixes in O(nlogn) time when β<βu, for arbitrary initialisation. Note that for β>βu the results of [14] give worst-case initialisations that slow down the mixing time of the Glauber dynamics to eΩ(n); more generally, for β>βu, the presence of the ordered mode imposes correlation phenomena that are hard to handle, especially for small values of q,d.

1.1 Results

Our main contribution is to do a delicate analysis of the Potts distribution and the random-cluster dynamics that goes beyond the threshold βu, using planting. Planting is a technique used in the analysis of random CSPs to capture how the space of solutions is correlated with the structure of the random instance, see [1, 15, 4] for some applications. In terms of sampling however, there is no recipe for converting planting methods to efficient sampling algorithms, and typically some extra sampling step is needed on top, see for example [2, 17] for such approaches; even so, the resulting sample usually has some accuracy limitation, i.e., the TV-distance from the target distribution cannot be made arbitrarily small (typically it is required to be at least an inverse polynomial in n). Here, we instead use planting as a means to obtain sharp estimates that are used in the analysis of random-cluster dynamics; we thus show fast convergence of the latter, yielding automatically strong sampling guarantees. A different sampling idea that uses planting in a somewhat related manner has recently been considered in [27], in the context of sampling from spherical spin glasses.

Using this planting framework, we give an algorithm to sample from the Potts distribution on the random d-regular graph 𝔾 that covers the optimal range of “high temperatures” β<βc for all integers q,d3; as we will explain in more detail later the algorithm is running random-cluster dynamics for O(nlogn) steps, initialised appropriately (from the “empty” configuration).

Theorem 1.

For integers d,q3 and real β(0,βc), the following holds whp over 𝔾=𝔾n,d.

There is an MCMC algorithm that, on input 𝔾 and εeΩ(n), outputs in O(nlognlog(1ε)) steps a sample σ^[q]n whose distribution μ^ satisfies μ^μ𝔾,q,βTVε.

Our analysis framework also applies to the ordered regime β>βc where we improve significantly upon various restrictions from previous works [11, 16, 26, 19]. For example, [26, 22] applied when qdΩ(d) (and d5), whereas the recent work by Carlson et al [11] (see also [16]) applied when dd0 and qdc for some large constants d0,c. Using planting, we obtain a sampling result for all d3 when q is roughly larger than d5. We show the following.

Theorem 2.

For integers d3,q(5d)5 and real β>βc, the following holds whp over 𝔾=𝔾n,d.

There is an MCMC algorithm that, on input 𝔾 and εeΩ(n), outputs in O(nlognlog(1ε)) steps a sample σ^[q]n whose distribution μ^ satisfies μ^μ𝔾,q,βTVε.

1.2 Proof outline

Our algorithms are based on a well-known edge representation of the Potts model, the random-cluster (RC) model, where configurations are edge subsets weighted by the number of edges present and the number of induced components. We run the random-cluster dynamics (the analogue of Glauber dynamics in this edge setting), starting from the empty set of edges in the disordered regime β<βc and the full set of edges in the ordered regime β>βc. The reason for working in the RC representation is that, in contrast to the Potts model, the RC model enjoys certain monotonicity properties that facilitate tracking the evolution of Glauber dynamics. In the next Preliminaries section we define the RC model and the relevant random-cluster chain. We also give a more detailed overview of the phases of the Potts model, and how planting gives a handle for the disordered and ordered settings. Most of these parts are largely imported from [14, 20]. In Section 3, we convert the planting for Potts into suitable planting for the RC model, giving the analogue of the ordered and disordered phases in that setting. Then, we state the main contribution of this work which is establishing weak spatial mixing within the disordered and ordered phases (a notion that originated in [21]). Sections 4 and 5 give the technical core of our arguments.

Roughly, in the disordered regime (Section 4), the key ingredient is to show that the dynamics converges to a set of configurations that do not form large components. For β<βu, Blanca and Gheissari [9] accomplished this by a very careful random-graph revealing process coupled with the evolution of random-cluster dynamics. Using planting, we are able to capture precisely the correlation between the random graph and a sample from stationarity for β<βc, reducing the study of the component structure to showing that the planted random graph is subcritical. Using the monotonicity of the RC model, this implies that the RC dynamics initialised from the empty set (“all-out”) enjoys the same component structure as the stationary distribution (for exponentially many steps). Having this in place (Lemma 14), the rest of the proof is more streamlined after some small reworking of certain correlation decay estimates up to the ordered/disordered threshold βc (Lemma 15).

In the ordered regime (Section 5), the planted model is instead supercritical and a typical configuration has a “giant” component, whose presence complicates the arguments significantly. The goal now is roughly to show that bichromatic edges do not typically form large components; more precisely, the key ingredient is to show that every vertex v is surrounded by close-by vertices (each at some distance <12logd1n) that belong to the giant component, a so-called “wired” boundary. Planting here allows us to do a head-on analysis of the probability that, in a typical sample from stationarity, there exists a path of length that does not include any vertex in the giant. We get a tight bound for this using careful first moment arguments and percolation bounds; then, for q(5d)5, a union bound over the paths starting at v gives the existence of the desired wired boundary (in a typical configuration). Relative to the earlier works [26, 16, 11, 19], bichromatic components roughly correspond to so-called polymers; our techniques show that a smaller lower bound on the size of the giant component suffices in order to ensure that the polymers are not large.

1.3 Further Discussion

The planting arguments that we use in this paper follow by careful first and second moment considerations in [20], which have been carried out for the ferromagnetic Potts model, see also [5] for similar-flavored “silent” planting results. The moment analysis has previously been used in [13] for sampling independent sets and proper colourings in random bipartite graphs, though there the approach ultimately relies on cluster-expansion type of arguments [29, 12]. Our more direct planting approach can be carried out for sampling independent sets on random bipartite graphs (that shares similar monotonicity and planting properties) and obtain sharper bounds for sampling using Glauber dynamics. Note however that, as pointed out in [13, Lemma 10], it is unlikely that current polymer-based techniques will achieve better asymptotic estimates in terms of the degree d. Our technique should also apply to the G(n,d/n) setting (for arbitrary q,d>1) provided that the analogous “silent” planting is in place; this seems more easily feasible in the disordered regime but more difficult for the ordered regime where the Poisson-tree neighbourhoods are expected to cause bigger variance.

It is also relevant to note that the results we stated earlier for the cluster-expansion technique and fast mixing of the random-cluster dynamics in the literature apply to non-integer q, in the random-cluster representation. It is conceivable that a similar planting to the one we consider here for the Potts model can be carried out for non-integer q in the random-cluster representation, though the related first and second moment computations will likely require careful arguments for the distribution of the number of components in random graph percolation; see [6] for some related developments.

2 Preliminaries

2.1 The configuration model

To obtain a handle on the random d-regular 𝔾=𝔾n,d, we work throughout in the configuration model. Formally, for integers n,d1 with dn even, let [n]×[d] be a set of “half-edges”; we denote by G𝒢n,d a (multi)graph on vertex set [n] whose edges are obtained by taking a uniformly random perfect matching of the half-edges; we add an edge between u,v[n] in G whenever the half-edges (u,i) and (v,j) are matched together for some i,j[d]. For brevity, instead of G𝒢n,d we sometimes use 𝑮 to denote the random (multi)graph chosen. Any property that holds whp over 𝑮 also holds whp over 𝔾 (see, e.g., [28]), so henceforth we focus on the configuration model 𝑮.

2.2 The random-cluster dynamics

Given a graph G=(V,E) and real parameters q>0 and p[0,1], the random-cluster model is a probability distribution π=πG,q,p on the subsets of E. Let ΩG be the set of all configurations. Each configuration FE has probability π(F)=qc(F)p|F|(1p)|EF|/ZRC,G where c(F) is the number of connected components in the graph (V,F), and ZRC,G is a normalisation factor. For integer q and p=1eβ, it is well known that sampling from this probability distribution is equivalent to sampling from the q-state Potts model with parameter β, see Section 3 for details.

Let p^:=p(1p)q+p. The random-cluster (RC) dynamics initialised from a configuration X0 is a Markov chain (Xt)t0 whose transition Xt to Xt+1 is as follows:

  1. 1.

    First choose an edge eE uniformly at random.

  2. 2.

    If e is a cut edge of the graph (V,Xt{e}), then with probability p^, set Xt+1:=Xt{e}. With all remaining probability set Xt+1:=Xt{e}

  3. 3.

    Otherwise, with probability p, set Xt+1:=Xt{e}. With all remaining probability set Xt+1:=Xt{e}.

It is a standard fact that the distribution of Xt converges to πG, regardless of the initialisation. To work around the slow convergence in the metastability window when β(βu,βu), we will later consider the RC dynamics starting from the two extreme configurations, the all-out (X0=) and the all-in (X0=E).

2.3 Phases of the Potts model

Let ν=(ν1,,νq) be a q-dimensional probability vector. For an n-vertex graph G, let

Sν(θ)={σ[q]V(G):i[q]||σ1(i)|nνi|θn},

where θ=θ(q,d,β)>0 is a small constant which we will suppress from notation. Hence, Sν(θ) consists of these configurations where the colour frequencies are given by the vector ν. Let also

Zν(G)=σSνwG(σ) and Ψ=Ψd,q,β(ν)=limθ0limn1nln𝔼[Zν(𝑮)],

where wG(σ)=eβm(σ) is the weight of σ. In short, the function Ψ(ν) captures the (logarithm of the) expected contribution to the partition function Z(𝑮) of the random graph 𝑮 given by the configurations in Sν. As we shall see shortly, the local/global maximisers of Ψ play a key role in determining the modes of the Gibbs distribution.

There are three relevant thresholds that come into play

βu(q,d)<βc(q,d)<βu(q,d).

For high temperatures (β<βu), there is a unique global maximiser of Ψ, given by the uniform vector

νd=(1q,,1q)

which corresponds to the “disordered” configurations. As the temperature decreases and β crosses the threshold βu, another local maximum of Ψ emerges at an “ordered” vector νo where one of the q colours dominates over the others (which are roughly balanced). More precisely, νo is given by111The value of βu is defined as the infinimum value of β>0 such that there exists a(1/q,1) which satisfies (2.1); then it is simple to check that such an a exists for all ββu. For certain values of β (namely when β(βu,βu)), there may be more than one values of a(1/q,1) that satisfy the equation; when this is the case, the value of a that is relevant for the ordered phase is the largest such value.

νo=(a,1aq1,,1aq1), and a(1q,1) satisfies a=tdtd+q1 where eβ1=(t1)(td1+q1)td1t. (2.1)

Interestingly, the local maximum at νo does not become global until ββc where βc=lnq2(q1)12/d1 is known as the ordered/disordered threshold; notably, the only point where both νo and νd are global maxima is at β=βc. Note also that νd continues being a local maximum till β<βu where βu=ln(1+qd1); so, in the interval (βu,βu) both νd and νo are local maxima of Ψ which causes the metastability phenomena mentioned in the introduction, see [14] for details.

For convenience, let Sd and So denote the sets of configurations Sν when ν=νd and ν=νo, respectively. For a graph G, define Zd(G) and Zo(G) analogously, and let μG,d=μG(Sd) and μG,o=μG(So) denoted the conditional Gibbs distributions. The following lemma from [20] gives some sharp concentration estimates for Zd(𝑮) and Zo(𝑮) (derived using moments and the small subgraph conditioning method) and are the key behind our planting approach.

Lemma 3 ([20, Theorem 5 & Lemma 16]).

Let d,q3 be integers, and β0 be real.222Here, we assume that the constant θ>0, used in the definition of Zd and Zo, is a sufficiently small constant depending only on d,q,β. Let f(n)>0 be any function with f(n)=o(1). Whp over 𝐆, we have:

 Zd(𝑮)f(n)𝔼[Zd(𝑮)] if ββcand Zo(𝑮)f(n)𝔼[Zo(𝑮)] if ββc.

Moreover, μ𝐆(Sd)=1eΩ(n) when β<βc, and μ𝐆(S~o)=1eΩ(n) when β>βc, where S~o is the union of So with its q1 permutations. For β=βc, it holds that μ𝐆(SdS~o)=1eΩ(n) and μ𝐆(Sd),μ𝐆(So)f(n).

2.4 Planting disordered and ordered configurations

We now introduce the planting distribution which will be the key in obtaining our results. Recall that 𝑮 denotes a graph chosen according to the configuration model 𝒢n,d; in the expressions below, expectations are taken with respect to the choice of 𝑮. For σ[q]n, define a random graph 𝑮^(σ) by letting

[𝑮^(σ)=G] =[𝑮=G]eβmG(σ)𝔼[eβm𝑮(σ)]. (2.2)

Furthermore, for θ>0, recalling the disordered and ordered partition functions Zd,Zo from the previous section, we define random configurations 𝝈^d=𝝈^d(θ)[q]n and 𝝈^o=𝝈^o(θ)[q]n with distributions

[𝝈^d=σ] =𝟏{σSd}𝔼[eβm𝑮(σ)]𝔼[Zd(𝑮)], [𝝈^o=σ] =𝟏{σSo}𝔼[eβm𝑮(σ)]𝔼[Zo(𝑮)]. (2.3)

We refer to (𝑮^(𝝈^d),𝝈^d) and (𝑮^(𝝈^o),𝝈^o) as the disordered and ordered planted distributions respectively.

The following lemma allows us to relate the planted distributions with the distributions of the (conditional) disordered 𝝈𝑮,dμ𝑮,d and ordered 𝝈𝑮,oμ𝑮,o configurations of the random graph 𝑮. Recall that 𝒢n,d denotes the set of all n-vertex regular (multi)graphs with degree d.

Lemma 4.

Let d,q3 be integers and β>0 be real. Let f(n),g(n)>0 be any functions with f(n)=o(1). Let 𝒢n,d×[q]n be an arbitrary event.

  1. (1)

    ββc: if [(𝑮^(𝝈^d),𝝈^d)]g(n), then, whp over 𝑮, [(𝑮,𝝈𝑮,d)𝑮]g(n)f(n).

  2. (2)

    ββc: if [(𝑮^(𝝈^o),𝝈^o)]g(n), then, whp over 𝑮, [(𝑮,𝝈𝑮,o)𝑮]g(n)f(n).

The proof is given in the full version [18]. To utilise Lemma 4, the key observation is that, for the planted models, once we condition on the vertex and edge statistics, the models become uniformly distributed. Formally, for a graph G𝒢n,d and a configuration σ[q]n, let νσ be vertex colour statistics under σ, i.e.,

for a colour i[q], νiσ=|σ1(i)|/n.

Similarly, let ρG,σ be the [q]×[q] matrix with the following half-edge colour statistics under σ, i.e.,

for colours i,j[q]ρijG,σ=12|E(G)|u,vV(G)𝟏{{u,v}E(G),σ(u)=i,σ(v)=j}. (2.4)

Recall that 2|E(G)|=dn. So, for a colour class i[q], out of the dnνiσ half-edges incident to it, there are dnρijG,σ “going” to a colour class j[q] (note, for i=j, there are dn2ρiiG,σ edges in G that lie inside the colour class i). We have that ρG,σ is symmetric with i,jρijG,σ=1. Observe further that, conditioned on ν𝝈^d=ν and ρ𝑮^(𝝈^d),𝝈^d=ρ the pair (𝑮^(𝝈^d),𝝈^d) is uniformly distributed over the pairs (G,σ)𝒢n,d×[q]n with νσ=ν and ρG,σ=ρ. Similarly for the pair (𝑮^(𝝈^o),𝝈^o) conditioned on ν𝝈^o and ρ𝑮^(𝝈^o),𝝈^o.

To put this observation into use, we will need more quantitative information on the vertex and edge colour statistics in the planted models. By definition of 𝝈^d,𝝈^o, we already know that ν𝝈^dνd1θ and ν𝝈^oνo1θ. Analogously, ρ𝑮^(𝝈^d),𝝈^d and ρ𝑮^(𝝈^o),𝝈^o are concentrated around the vectors ρd and ρo where

for i,j[q],ρd,ij=eβ𝟏{i=j}i,j[q]eβ𝟏{i=j},ρo,ij=eβ𝟏{i=j}(νo,iνo,j)(d1)/di,j[q]eβ𝟏{i=j}(νo,iνo,j)(d1)/d. (2.5)

We have the following concentration statement.

Lemma 5 ([14, Lemmas 3.4 & 3.5]).

Let d,q3 be integers and β>0 be real. There exists c>0 such that for any t=ω(n1/2) it holds that

  1. (i)

    For β<βu: [ν𝝈^dνd1+ρ𝑮^(𝝈^d),𝝈^dρd1t]ect2n.

  2. (ii)

    For β>βu: [ν𝝈^oνo1+ρ𝑮^(𝝈^o),𝝈^oρo1t]ect2n.

3 Mixing within the disordered and ordered phases for the RC model

3.1 Phases for the random cluster model via Potts

For a graph G𝒢n,d and σ[q]n, let 𝑭(G,σ) denote the random subset of edges obtained by keeping each monochromatic edge of G under σ with probability p. It is a standard fact (see, e.g., [25]) that, for integer q3, we can obtain a sample 𝑭πG by first sampling 𝝈μG and then keeping each monochromatic edge of G under σ with probability p=1eβ (the so-called percolation step).

We use this for the random graph 𝑮 to get a handle on the random cluster model on 𝑮, i.e., 𝑭(𝑮,𝝈) has distribution π𝑮. From Lemma 5, we can figure out in particular how many monochromatic edges we should expect from each phase, so we can translate the phases for the Potts model to the random cluster model. More precisely, set

𝔪d(β)=pd2i[q]ρd,ii and 𝔪o(β)=pd2i[q]ρo,ii.

With a bit of algebra, we can derive the following expressions for 𝔪d and 𝔪o from (2.5): for β<βu, we have 𝔪d(β)=d2eβ1eβ+q1, while, for β>βu, we have 𝔪o(β)=d2(eβ1)(x2+(1x)2q1)1+(eβ1)(x2+(1x)2q1) with x=td1td1+q1 and t>1 defined from eβ1=(t1)(td1+q1)td1t, as in (2.1).

We are now ready to define the ordered/disordered phases for the random-cluster model. Due to monotonicity properties (and in contrast to the phases of the Potts model), we can just use the values of 𝔪d and 𝔪o at the critical temperature β=βc; this will be convenient later on.

Definition 6 (The ordered/disordered phases).

Let q,d3 be integers. For a graph G𝒢n,d and ρ:=𝔪o(βc)𝔪d(βc)>0, define

ΩG,d={FE(G):|F|n𝔪d(βc)+nρ/4} and ΩG,o={FE(G):|F|n𝔪o(βc)nρ/4}.

Let also πG,d=πG(ΩG,d) and πG,o=πG(ΩG,o).

Using the translation between the Potts and random-cluster models, we have the following.

Corollary 7.

Let d,q3 be integers and β>0 be real. Then, whp over G𝒢n,d, we have πG(ΩG,d)=1eΩ(n) when β<βc and πG(ΩG,o)=1eΩ(n) when β>βc.

For β=βc, πG(ΩG,dΩG,o)=1eΩ(n) and πG(ΩG,d),πG(ΩG,o)1/lnlnn.

Proof of Corollary 7.

This follows from Lemma 3, and the correspondence between 𝝈μG and 𝑭πG stated at the start of the section, using f(n)=1/lnlnn for the lower bounds at β=βc.

To analyse the ordered/disordered phases of the random cluster model using the planted distributions, we will need the following intermediate distributions. Namely, for a graph G, let π^G,d be the distribution on FE(G) obtained by first sampling 𝝈G,dμG,d and then keeping each monochromatic edge with probability p=1eβ. Define similarly π^G,o. The proof of the following is given in the full version [18].

Lemma 8.

Let d,q3 be integers and β>0 be real. Then, whp over G𝒢n,d, for any event 𝒜2E(G), the following holds.

  1. (i)

    if ββc, then πG,d(𝒜)π^G,d(𝒜)+eΩ(n).

  2. (ii)

    if ββc, then πG,o(𝒜)π^G,o(𝒜)+eΩ(n).

3.2 Mixing and Weak Spatial Mixing (WSM) within a phase

Treading a path initiated in [21], to prove Theorems 1 and Theorem 2 it suffices to show the following results for the disordered and ordered regimes, respectively.

Theorem 9.

Let d,q3 be integers and β(0,βc]. Then, whp over G𝒢n,d, for every εeΩ(n) the RC dynamics (Xt)t0 initialised from all-out satisfies XTπG,dTVε for T=O(nlognlog1ε).

Theorem 10.

Let d3,q(5d)5 be integers and ββc be a real. Then, whp over G𝒢n,d, for every εeΩ(n) the RC dynamics (Xt)t0 initialised from all-in satisfies XTπG,oTVε for T=O(nlognlog1ε).

Proof of Theorems 1 and 2.

We first do the proof of Theorem 1. By Corollary 7, it holds that πGπG,dTV=eΩ(n) and by Theorem 9 we can obtain a sample from πG,d in T steps using random-cluster dynamics. Then, using the translation between the Potts and RC models [25], this gives a sample 𝝈^ whose distribution is within TV-distance ε from μG, as wanted. The proof of Theorem 2 is analogous using Corollary 7 and Theorem 10.

It was shown in [21] that, for the ferromagnetic Ising model, a sufficient condition for fast convergence to equilibrium within the phase is the property of weak spatial mixing (WSM) within a phase, paired with some mixing-time estimates in local neighbourhoods. The same condition applies for the random-cluster model and, more generally, for monotone models. In the case of the random regular graph, the local neighbourhoods resemble the d-regular tree and the required mixing-time results have been established in [7]. So, the main piece missing in order to obtain Theorems 9 and 10 is showing WSM within the corresponding phase. Roughly, the notion captures that, when we condition on the phase, the influence of a boundary configuration at distance is small.

More formally, for a graph G=(V,E), an integer and a vertex v, let B(v) be the graph induced by vertices at distance from v, whose vertex and edge sets are going to be denoted by V(B(v)) and E(B(v)), respectively. A boundary condition τ on B(v) is a subset of the edges in E\E(B(v)); then πG,v,τ denotes the RC distribution induced on B(v) conditioned on τ, i.e., for SE(B(v)) it holds that πG,v,τ(S)=πG(Sτ)SE(B(v))πG(Sτ). There are two extreme boundary conditions on B(v) which will be most relevant: (i) the all-in or wired condition + where all edges in E\E(B(v)) are included in the boundary condition, and (ii) the all-out or free boundary condition where none of the edges in E\E(B(v)) are included. We use πG,v,+ and πG,v, to denote the corresponding conditional RC distributions.

For an edge eE, let 1e denote the subsets of E that include e. Following [19], we say that we have WSM within the disordered phase (respectively, ordered) at distance if for each vertex v and each edge e incident to v it holds that |πG,v,(1e)πG,d(1e)|1100m (respectively, |πG,v,+(1e)πG,o(1e)|1100m) with m=|E|.

Our main technical results for establishing Theorems 9 and 10 are the following.

Theorem 11.

Let d,q3 be integers and ββc be a positive real. Then, there exists δ>0 such that, whp over G𝒢n,d, G=(V,E) has WSM within the disordered phase at a distance =(12δ)logd1n. That is, with m=|E|, for every vV and every edge e incident to v, it holds that |πG,v,(1e)πG,d(1e)|1100m.

Theorem 12.

Let d3,q(5d)5 be integers and ββc be a real. Then, there exists δ>0 such that, whp over G𝒢n,d, G has WSM within the ordered phase at a distance =(12δ)logd1n. That is, with m=|E|, for every vV and every edge e incident to v, it holds that |πG,v,+(1e)πG,o(1e)|1100m.

Using these, the proofs of Theorems 9 and 10 can be done using by now standard arguments; we give the details for completeness in the full version [18].

We therefore focus on the WSM proofs. The proof for the disordered regime ββc is given in Section 4, and the proof for the ordered regime ββc is given in Section 5.1.

4 Proof for WSM in the disordered regime

To show WSM in the disordered regime, we follow the strategy outlined in Section 1.2. We begin with some relevant definitions. Let G=(V,E) be a graph. Recall, that for a vertex v and >0, B(v) denotes the radius- ball around v in G. Also, denote by S(v) the set of vertices at distance exactly from v. For a set of vertices S, we let E(S) the set of edges with both endpoints in S.

Definition 13.

Let G=(V,E) be a graph and v be a vertex. For K,>0, we say that a subset FE is K-shattered at distance from v if all but K vertices of S(v) belong to distinct components of the graph (V,F\E(B(v)).

Lemma 14.

Let d,q3 be integers and ββc be a positive real. For any constant ϵ>0, there is K>0 so that the following holds whp over 𝐆. For every vertex vV, define the event

𝒜v={FE(𝑮)F is K-shattered at distance =(122ϵ)logd1n from v}.

Then, for the RC disordered phase of 𝐆, we have π𝐆,d(v𝒜v)11/n3.

Proof.

For a graph G𝒢n,d and σ[q]n, let 𝑭(G,σ) denote the random subset of edges obtained by keeping each monochromatic edge of G under σ with probability p. Fix ϵ>0 and let K=10/ϵ. We will consider the disordered planted graph (𝑮^(𝝈^d),𝝈^d) and show that for an arbitrary vertex v it holds that

[𝑭(𝑮^(𝝈^d),𝝈^d)𝒜v]2/n5. (4.1)

Assuming this for now, we obtain by Lemma 4 and a union bound over the vertices that [𝑭(𝑮,𝝈𝑮,d)vV𝒜v𝑮]lnn/n4 as well. Note that 𝑭(𝑮,𝝈𝑮,d) has distribution π^𝑮,d, so Lemma 8 yields that π𝑮,d(v𝒜v)π^𝑮,d(v𝒜v)eΩ(n)11/n3.

It remains to prove (4.1). By Lemma 5, we have ν𝝈^dνd1+ρ𝑮^(𝝈^d),𝝈^dρd1=O(n1/3) w.p. 1eΩ(n1/3). Consider arbitrary ν and ρ with ννd1+ρρd1=O(n1/3). Conditioned on ν𝝈^d=ν and ρ𝑮^(𝝈^d),𝝈^d=ρ, we have that (𝑮^(𝝈^d),𝝈^d) is uniformly random among all pairs (G,σ) with νσ=ν and ρG,σ=ρ.

To proceed, fix σ with νσ=ν and condition on 𝝈^d=σ. Condition further that ρ𝑮^(𝝈^d),𝝈^d=ρ and on the induced graph on B(v). For convenience, let 𝑭~ denote the (random) set of edges outside B(v) after the percolation step, i.e., the set 𝑭(𝑮^(𝝈^d),𝝈^d)\E(B(v)).

Let T:=n1/2ϵ. For t=1,,T, we explore the (union of the) components of vertices uS(v) in the graph (V,𝑭~) by revealing at each step an edge incident to a vertex belonging to any of these (chosen arbitrarily). We denote by At the set of “active vertices” in these components that still have unmatched half-edges, so that |At|=0 implies that all the components have been fully explored. Let At=ttAt denote the set of all vertices that have been active up to time t. The process is described precisely in Figure 1.

Exploration of (V,F~) starting from S(v)

Let t=0, T=n1/2ϵ, and A0=S(v).

While |At|0 or tT:

  • If |At|=0, pick any unmatched half-edge (w,r) and run Match(w,r);

    increase t by 1 and set At=

  • Otherwise, pick wAt. For r=1,,d:

    If the half-edge (w,r) is not matched:

    increase t by 1;

    match (w,r) to another half-edge, let z=Match(w,r).

    if σ(z)=σ(w), w.p. p=1eβ set AtAt1{z}; else set At=At1.

    Remove w and any other vertices from At which don’t have unmatched half-edges.

Match(w,r)
For i,j[q], let Mij,t be the number of currently unmatched half-edges from colour i to j.

  • Let i=σ(w). Choose a colour j[q] from the distribution {Mij,tjMij,t}j[q].

  • From the set of unmatched half-edges from colour j to i, choose one u.a.r., say (z,r).

  • match (z,r) with (w,r), and return the vertex z.

Figure 1: The exploration process of the percolated planted graph used in the proof of Lemma 14. This is a combination of exposing first an edge in the planted configuration model, captured by the routine Match(,), followed by a percolation step with probability p=1eβ whenever the edge is monochromatic.

Note that at any time t, for i,j[q], there is a number Mij,t of prescribed half-edges with one endpoint of colour i that needs to be matched to a vertex of colour j (note, this goes down when we reveal such an edge connecting colours i and j). The revealing of an edge at time t adjacent to an active vertex w with colour σ(w)=i has therefore two stages: we first reveal the other endpoint z in the graph 𝑮^(σ), chosen proportionally to the counts Mij,t; if σ(w)=σ(z), we further toss a coin with probability p=1eβ to determine whether the edge belongs to 𝑭~, and add z to the set of active vertices accordingly. The key point here is that we only need to reveal the components involving S(v) in the graph (V,𝑭~), rather than the components in 𝑮^(𝝈^d) (which are much bigger). Indeed, as we will see shortly the former is dominated by a subcritical branching process and therefore at most O(logn) vertices are revealed for each of the vertices in S(v).

To make this precise, for a time t=1,,T, let 𝟏t denote the indicator of the event that the t-th half edge was matched to a vertex in the set At:=ttAt consisting of vertices that have been active at some point. Let also 𝟏t be the indicator that the edge revealed was monochromatic and that it survived the percolation step. Note that the number of the remaining unmatched half-edges Mij,t from colour i to j satisfies 0dnρijMij,t2T2n1/2ϵ, so Mij,t=dnρij+o(n) for all the relevant t. Similarly, |At||S(v)|+T2n1/2ϵ. So, with t denoting the first t edges revealed, we have the crude bounds
(𝟏tt1)d|At|minijMij,t1n1/2+ϵ/2and(𝟏tt1)pmaxi[q]Mii,tjMij,teβ1eβ+q1+o(1).

It follows that the sum t=1T𝟏t is dominated above by a sum of T independent coin tosses with probability 1n1/2+ϵ/2, whereas the sum t=1T𝟏t is dominated above by a sum of T independent coin tosses with probability eβ1eβ+q1+o(1)<12κd1 for some small constant κ=κ(q,d,β)>0 (using that eβ1<q/(d2) for β<βu). Therefore, recalling that K=10/ϵ and T=n1/2ϵ, we have

(t=1T𝟏t>K)(TK)1nK(1/2+ϵ/2)1/n5and(t=1T𝟏tT(1κ)d1)=eΩ(n1/3),

where the second probability bound follows by standard Chernoff bounds (using that T=Ω(n1/3)).

Note that the event |AT|>0 implies that |At|>0 for all t=0,1,,T which in turn implies that at least one vertex gets removed from the set of active vertices every (at most) d1 time steps; indeed, when exploring the neighbours of a vertex wAt at a particular time t, there are at most d1 half-edges incident to w remaining unmatched since one of the d half-edges of w has been previously matched (to activate w). We therefore have the inequality 0|AT||S(v)|+t=1T𝟏tT(d1)d1. Since |S(v)|d(d1)dn1/22ϵκTd1, it follows that

(|AT|>0)(|S(v)|T(d1)d1+t=1T𝟏t0)(t=1T𝟏tT(1κ)d1)=eΩ(n1/3).

Combining these, we obtain that with probability 12/n5 both of |AT|=0 and t=1T𝟏t<K hold. The first implies that by time T all the components of S(v) in the graph (V,𝑭~) have been explored, and the second implies that all but (at most) K of these vertices belong to distinct components. This finishes the proof of (4.1) and hence completes the proof of the lemma.

We next use an argument of Blanca and Gheissari [9] which shows a correlation decay bound for K-shattered configurations in a treelike graph in terms of the existence of two root-to-leaf paths. The argument in [9] is technically proved for β<βu using some uniqueness estimates, but it can be extended it to the regime ββc by observing that in a treelike ball, a K-shattered configuration induces a distribution that is comparable to percolation on the tree with parameter p^ (this coincides with the cut-edge update probability in the definition of the random cluster dynamics). A key feature of the correlation decay bound in [9] is the existence of two paths which gives the p^2 term in the lemma below, and which ultimately allows us to make the probability less than 1100m that is required for WSM, while keeping the distance less than 12logd1n that is the threshold for having treelike neighbourhoods. We give the proof of the following lemma in the full version [18].

Lemma 15.

Let d,q3 be integers, and ββc, δ(0,1/2) positive reals.

There exists a constant C=C(d,q,β,δ)>0 such that whp over G𝒢n,d the following holds. For all vV(G) and an edge e incident to it, and :=(12δ)logd1n,

|πG,v,(1e)πG,d(1e)|Cp^2+1n3 where p^:=eβ1q+eβ1.

We can now give the proof of Theorem 11.

Theorem 11. [Restated, see original statement.]

Let d,q3 be integers and ββc be a positive real. Then, there exists δ>0 such that, whp over G𝒢n,d, G=(V,E) has WSM within the disordered phase at a distance =(12δ)logd1n. That is, with m=|E|, for every vV and every edge e incident to v, it holds that |πG,v,(1e)πG,d(1e)|1100m.

Proof.

Consider ββc. Since βc<βu=ln(1+qd2), we have that p^(β)<p^(βu)=1d1. Hence there is an ϵ(q,d,β)>0 such that p^(β)=1(d1)1+ϵ. Let δ=ϵ4(1+ϵ) and =(12δ)logd1n.

Lemma 15 gives that, whp over G𝒢n,d, for any edge e and vertex v incident to it, it holds that |πG,v,(1e)πG,d(1e)|Cp^2l+1n3 . By the choice of δ,ϵ and , we get that p^2ln(1+ϵ/2)/(p^2), so for large enough n we have Cp^2n(1+ϵ/2)+n31100m, concluding the proof of WSM within the disordered phase.

5 Proof for WSM within the ordered regime

To show WSM within the ordered phase for the random-cluster model on G𝒢n,d, we follow the strategy outlined in Section 1.2. The core of the argument will be to show that there are no paths of length 1/2logd1n that avoid the largest component in a typical configuration from πG,o. We will do this by considering percolation on the planted random graph model (𝑮^(𝝈^o),𝝈^o) for μG,o. It will thus be relevant to consider random graph percolation on (almost) d-regular random graphs where we have removed a path of length .

For a graph G, let 𝒞1(G) denote the largest component of G (breaking ties arbitrarily), and more generally, 𝒞i(G) denote the i-th largest component of G. It is a standard fact [3, 30, 14] that the size of the largest component in a percolated random d-regular graph is controlled by the extinction probability φ of a Galton-Watson process with distribution Bin(d1,p) on the tree. For p(1d1,1), define φ=φ(p),χ=χ(p) and φ^=φ^(p) to be the unique solutions on the interval (0,1) to the following

φ=(1φ+pφ)d1,χ:=1(pφ+1p)d,φ^:=(1φ+pφ)d2. (5.1)

In particular, the size of the largest component is roughly equal to χn+o(n); see [3, 30] for more details. We also remark that the quantity φ^ is the extinction probability of a slightly modified GW-tree where every vertex has degree d apart from the root which has degree d2; this will be relevant later for our path considerations.

We show a similar result for the size of the giant component for slightly non-regular degree sequences 𝒅, where we allow a small set S of vertices to have degree less than d; in our WSM setting, we will further be interested in the probability that a particular subset of the vertices S does not belong to the giant. A variant of percolation we will use is the exact edge model for random graphs with a given degree sequence 𝒅=(d1,,dn) and edge count m12di. Let 𝑮~n,𝐝,m be the random graph model where each i[n] is associated to di half-edges; then we choose a subset of 2m half-edges and a matching of these uniformly at random; the final (multi)graph is obtained by projecting the edges on the vertex set [n].

Lemma 16.

Let d3 be an integer and p(0,1). Let δ,ε>0 be arbitrarily small constants.

Suppose that S[n] satisfies |S|=O(n1/6) and 𝐝=(d1,,dn) is a degree sequence such that di=d for iS and di[0,d] for iS. Then, there exists a constant M(d,p)>0, such that for 𝐆~n,𝐝,m with m=12dpn+o(n), the following hold:

  1. 1.

    If p<1d1, [𝒞1(𝑮~n,𝒅,m)Mn1/2]eΩ(n).

  2. 2.

    If p>1d1, [𝒞1(𝑮~n,𝒅,m)(χδ)n]eΩ(n) and [𝒞2(𝑮~n,𝒅,m)Mn2/3]eΩ(n). Moreover, with φ=φ(p) as in (5.1), for any subset SS,

    [S𝒞1(𝑮~n,𝒅,m)=](1+ε)|S|iS(pφ+1p)di.

Intuitively, the bound in the second item follows by revealing the R-neighbourhoods of vertices in S for a suitably large constant R (these are whp trees and disjoint from each other); then, for a vertex iS, the factor (pφ+1p)di captures the probability that the percolation dies out within the neighourhood of the vertex i. The full proof of Lemma 16 is given in the full version [18].

Next, we consider the density of the edges inside the color classes. The following lemma says that in the ordered regime one colour class is supercritical, while the others subcritical. Recall that νo and ρo are the vertex and edge statistics for the ordered phase respectively, defined in (2.1) and (2.5).

Lemma 17 ([14, Lemma 5.6]).

Let q,d3 be integers. For β>βu and p=1eβ, the ordered vectors νo and ρo satisfy pρo,iiνo,i>1d1 for i=1, and pρo,iiνo,i<1d1 for i{2,,q}.

Let G=(V,E) be a graph. For a subset of edges FE and a vertex v, we let Fv be the edges in F incident to v. We will also consider the largest component 𝒞 of the graph (V,F), i.e. the component with the largest number of vertices (breaking ties arbitrarily). We say that 𝒞 avoids a path P if there is no vertex v𝒞 that belongs to P. The next lemma is the key technical step in our argument and will be used to show that, in the RC ordered phase, the largest component (which has size Ω(n) due to the supercriticality of colour 1, cf. Lemma 17) is “very close” to every vertex v.

Lemma 18.

Let d,q3 be integers, ββc be real and p=1eβ. For any constant ε>0, the following holds for any integer satisfying lnlnnn1/6, whp over G𝒢n,d.

For a vertex vV(G), consider the event

𝒜v={FE(G)|there exists a path of length  from v in G whichavoids the largest component of (V\{v},F\Fv)}.

Then, for the RC ordered phase of G, πG,o(vV(G)𝒜v)nd(d1)1(A+ε) where A=maxi[q]{1(1φ^1)ρo,i1νo,i} with ρo,νo as in (2.1) and φ^1=φ^(p1) as in (5.1) for p1=pρo,11νo,1.

To parse the bound on πG,o(vV(G)𝒜v) note that the factor nd(d1)1 is just an upper bound on the total number of paths of length . The factor A controls the probability that a particular path P avoids the largest component. The term φ^1 in A is the probability that the Galton-Watson process dies out from a vertex v of the path that belongs to the supercritical color class 1 (note that v has “remaining” degree d2 other than its neighbours on the path, and that the relevant percolation parameter is given by p1). The term ρo,i1νo,i accounts for the frequency that we encounter vertices in the supercritical color class 1 along the path P, which follows from the definition of the planting model. The exact quantitative dependence of A to φ^1 and ρo,i1νo,i follows by a more technical calculation involving eigenvalues. Later, in Lemma 19, we will show that A can be made less than 1/d5 by taking q sufficiently large.

Proof of Lemma 18 (Sketch).

From Lemmas 4 and 8, it suffices to show for the ordered planted graph (𝑮^(𝝈^o),𝝈^o) that (for all n sufficiently large)

[𝑭(𝑮^(𝝈^o),𝝈^o)vV(G)𝒜v]nd(d1)1(A+ε2). (5.2)

From this, we obtain by Lemma 4 that [𝑭(𝑮,𝝈𝑮,o)vV𝒜v𝑮]nln(n)d(d1)1(A+ε2); since 𝑭(𝑮,𝝈𝑮,o) has distribution π^𝑮,o, Lemma 8 yields that, whp over 𝑮, it holds that π𝑮,o(v𝒜v)π^𝑮,o(v𝒜v)+eΩ(n)nd(d1)1(A+ε).

By Lemma 5, we have ν𝝈^oνo1+ρ𝑮^(𝝈^o),𝝈^oρo1=O(n1/3) w.p. 1eΩ(n1/3). Condition on ν𝝈^o=ν and ρ𝑮^(𝝈^o),𝝈^o=ρ where ν and ρ satisfy ννo1+ρρo1=O(n1/3), and note that the pair (𝑮^(𝝈^o),𝝈^o) is uniformly random from the pairs having these statistics.

Let P be an arbitrary sequence of vertices (v0,v1,,v), together with a q-colouring τ=(τ0,,τ). Let also ξ be a set of paired half-edges that turn v0,,v into a path in that order. Let 𝑬 be the set of random edges in the graph before percolation. Denote by

P,τ,ξ={ξ𝑬,𝝈^o(v0)=τ0,,𝝈^o(v)=τ}

the event that v0,v1,,v form a path in 𝑮^ (in this order) using the half-edges prescribed by ξ and that they belong to the colour classes that τ prescribes. In the full version [18], we give an exact expression for (P,τ,ξ) in terms of ν and ρ which yields that

(P,τ,ξ)(dn)νo,τ0k=01ρo,τkτk+1νo,τk, (5.3)

where f(n)g(n) denotes f(n)g(n)1 as n.

Fix now an arbitrary vertex v and let P be an arbitrary sequence (v0,v1,,v) with v0=v, together with a q-colouring τ=(τ0,,τ) and pairing ξ (as above). Condition on the event P,τ,ξ. Let 𝑭 be the edge set obtained by keeping each monochromatic edge with probability p, and 𝑭v be the set of these edges that are incident to v. Then, for the graph 𝑮^v(𝑭):=(𝑽\{v},𝑭\𝑭v), we show in the full version [18] that color 1 is supercritical and the others are subcritical (using Lemma 17); by invoking then Lemma 16 to the supercritical color class 1, we further obtain that

[the largest component of 𝑮^v(𝑭)avoids the τ-coloured path P|P,τ,ξ](1+ε3)φ^11(τ), (5.4)

where 1(τ) is the number of vertices with colour 1 under τ and φ^1:=(p1φ1+1p1)d2 where p1=pρo,11νo,1 and φ1=φ(p1) (as in (5.1)). Combining (5.3) and (5.4) via a union bound (note that there are n sequences P with +1 vertices starting from v and d2(d(d1))1 ways to select half-edges ξ to turn each of these into a path) yields that

[the largest component of 𝑮^v(𝑭) avoidssome τ-coloured path from v in G](1+ε3)d(d1)1νo,τ0k=01ρo,τkτk+1νo,τkφ^1𝟏{τk=1}. (5.5)

At this stage, it remains to take a union bound over all possible colourings τ on +1 vertices, i.e., sum the right-hand-side of (5.5) over all τ. The resulting expression can be written more succinctly in the form (1+ε3)d(d1)l1νoTM1 where M is the q×q matrix whose (i,j)-entry is given by ρo,ijν^o,i and ν^o is the vector whose i-th entry is equal to νo,1/φ^1 if i=1 and νo,i if i{2,,q}. Note that M=D1/2M^D1/2 where D is the q×q diagonal matrix with ν^o on the diagonal and M^ is the symmetric matrix ρo,ij(ν^o,iν^o,j)1/2. In particular, we have that M^ and M have the same eigenvalues, which are all real. Let ν^o denote the vector whose entries are the square roots of the entries of ν^o. Then, we can write

ν^oTM1=ν^oTM^ν^o1φ^1νoTM^νo.

Now, we will show that entry-wise it holds that M^νoAνo where A:=maxi[q]{1(1φ^1)ρo,i1νo,i} is as in the lemma statement. Note first that (M^νo)i=jρo,ij(ν^o,iν^o,j)1/2(νo,j)1/2. Since ν^o,j=νo,j for j1 and jρo,ij=νo,i for i[q], we have that

(M^νo)1=φ^1ρo,11(νo,1)1/2+j1φ^1ρo,1j(νo,1)1/2=(φ^1φ^1)ρo,11(νo,1)1/2+φ^1(νo,1)1/2φ^1A(νo,1)1/2.

For i1, we have

(M^νo)i=φ^1ρo,i1(νo,i)1/2+j1ρo,ij(νo,i)1/2=(νo,i)1/2(1φ^1)ρo,i1(νo,i)1/2A(νo,i)1/2.

This finishes the proof of M^νoAνo, and therefore we can bound the left-hand-side of (5.5) with d(d1)1(A+ε2). Noting that the event in (5.5) is just [𝑭(𝑮^(𝝈^o),𝝈^o)vV(G)𝒜v], (5.2) follows by a union bound over v.

We next bound the quantity A appearing in Lemma 18 by a suitably small constant so that we can take a union bound over paths; it is for this step that we require q(5d)5. The proof is in the full version [18]

Lemma 19.

Let d3, q(5d)5 and ββc. Then, for all i[q], it holds that (1φ^1)ρo,i1νo,i>11(d1)5.

5.1 WSM within the ordered phase

Next we prove WSM within the ordered phase. We first define a wired boundary and show how it implies the WSM within the ordered phase, and then prove that it occurs with a good probability.

Definition 20.

Let G=(V,E) be a graph, vV, 1 and let FE be a RC configuration. Consider the -neighbourhood B(v) of v in G. We say that SV(B(v)){v} is a wired boundary around v in B(v), if

  • S is a cut set separating v from the outside of the ball. That is, 𝒞v, the component of v in GS, is disjoint from GB(v); and

  • in the graph (VV(𝒞v),{eF:eV(𝒞v)=}), all vertices of S are in the same component.

Lemma 21.

Let d,q3 be integers, ββc a real. Then whp over G𝒢n,d the following holds. For any vV, any edge e incident to v, and any 12logd1n,

|πG,v,+(1e)πG,o(1e)|πG,o(no wired boundary around v in B(v))+eΩ(n).

To prove the lemma, we compare with πG. When β>βc, we obtain, from Corollary 7, that πG,oπGTV=eΩ(n). Then we can bound |πG,v,+(1e)πG(1e)| by the probability (under πG) that there is no wired boundary around v in B(v): informally, this is because probability of 1e conditional on the wired boundary is the same under both πG and πG,v,+, and due to the monotonicity of the RC distribution, the probability of the wired boundary under πG,v,+ is at least that under πG. For more details as well as the case when β=βc see the full proof of the Lemma is in the full version [18].

By Lemma 18 all simple length-l paths from v intersect the largest component of Gv,F, so we can construct a wired boundary S by taking, for each length- path from v in G, the first vertex on the path belonging to the largest component of Gv,F. We thus show the following lemma in the full version [18].

Lemma 22.

Let G=(V,E) be a graph, let vV be an arbitrary vertex, and let 1 an integer. For any FE, let Fv be the set of edges in F that are incident to v and let Gv,F:=(V{v},FFv). Suppose that all simple length- paths from v in G intersect the largest component of Gv,F. Then F has a wired boundary around v in B(v).

From these two lemmas we can now obtain WSM within the ordered phase. See 12

Proof.

By Lemma 19, we have A:={maxi(1(1φ^1))ρo,i1νo,i}<(d1)5. Let ε>0 be a small constant (depending on d) such that A+ε(d1)(5+ε). Let δ:=ε4(4+ε). By Lemma 21, |πG,v,+(1e)πG,o(1e)| is at most

eΩ(n)+1πG,o( a wired boundary around v in B(v)).

By Lemma 22, this is at most

eΩ(n)+1πG,o( all simple length- paths from v in G intersect the largest component of Gv,F ).

By Lemma 18 this is at most eΩ(n)+nd(d1)1(A+ε/2). So whp over G𝒢n,d it holds for every vertex v and edge e incident to it that

|πG,o(1e)πG,v,+(1e)|eΩ(n)+ndd1(d1)(4+ε)eΩ(n)+1n1+ε/4

which is at most 1100m for all n large enough.

References

  • [1] Dimitris Achlioptas and Amin Coja-Oghlan. Algorithmic barriers from phase transitions. In 2008 49th Annual IEEE Symposium on Foundations of Computer Science, pages 793–802, 2008. doi:10.1109/FOCS.2008.11.
  • [2] Ahmed El Alaoui, Andrea Montanari, and Mark Sellke. Sampling from the Sherrington-Kirkpatrick Gibbs measure via algorithmic stochastic localization. In 63rd IEEE Annual Symposium on Foundations of Computer Science, FOCS 2022, pages 323–334, 2022. doi:10.1109/FOCS54457.2022.00038.
  • [3] Noga Alon, Itai Benjamini, and Alan Stacey. Percolation on finite graphs and isoperimetric inequalities. The Annals of Probability, 32(3):1727–1745, 2004.
  • [4] Victor Bapst, Amin Coja-Oghlan, and Charilaos Efthymiou. Planting colourings silently. Comb. Probab. Comput., 26(3):338–366, 2017. doi:10.1017/S0963548316000390.
  • [5] Victor Bapst, Amin Coja-Oghlan, and Charilaos Efthymiou. Planting colourings silently. Combinatorics, Probability and Computing, 26(3):338–366, 2017. doi:10.1017/S0963548316000390.
  • [6] Ferenc Bencs, Márton Borbényi, and Péter Csikvári. Random cluster model on regular graphs. Communications in Mathematical Physics, pages 1–46, 2022.
  • [7] Antonio Blanca, Zongchen Chen, Daniel Štefankovič, and Eric Vigoda. The Swendsen-Wang dynamics on trees. In Approximation, Randomization, and Combinatorial Optimization. Algorithms and Techniques (APPROX/RANDOM 2021), volume 207, pages 43:1–43:15, 2021. doi:10.4230/LIPIcs.APPROX/RANDOM.2021.43.
  • [8] Antonio Blanca, Andreas Galanis, Leslie Ann Goldberg, Daniel Štefankovič, Eric Vigoda, and Kuan Yang. Sampling in uniqueness from the Potts and random-cluster models on random regular graphs. SIAM Journal on Discrete Mathematics, 34(1):742–793, 2020. doi:10.1137/18M1219722.
  • [9] Antonio Blanca and Reza Gheissari. Random-cluster dynamics on random regular graphs in tree uniqueness. Communications in Mathematical Physics, 386(2):1243–1287, 2021.
  • [10] Antonio Blanca and Reza Gheissari. On the tractability of sampling from the Potts model at low temperatures via Swendsen-Wang dynamics. CoRR, abs/2304.03182, 2023. doi:10.48550/arXiv.2304.03182.
  • [11] Charlie Carlson, Ewan Davies, Nicolas Fraiman, Alexandra Kolla, Aditya Potukuchi, and Corrine Yap. Algorithms for the ferromagnetic Potts model on expanders. In 63rd IEEE Annual Symposium on Foundations of Computer Science, FOCS 2022, pages 344–355, 2022. doi:10.1109/FOCS54457.2022.00040.
  • [12] Zongchen Chen, Andreas Galanis, Leslie Ann Goldberg, Will Perkins, James Stewart, and Eric Vigoda. Fast algorithms at low remperatures via Markov chains. In Approximation, Randomization, and Combinatorial Optimization. Algorithms and Techniques, APPROX/RANDOM 2019, volume 145, pages 41:1–41:14, 2019. doi:10.4230/LIPIcs.APPROX-RANDOM.2019.41.
  • [13] Zongchen Chen, Andreas Galanis, Daniel Štefankovič, and Eric Vigoda. Sampling colorings and independent sets of random regular bipartite graphs in the non-uniqueness region. In Proceedings of the 2022 ACM-SIAM Symposium on Discrete Algorithms, SODA 2022, pages 2198–2207. SIAM, 2022. doi:10.1137/1.9781611977073.87.
  • [14] Amin Coja-Oghlan, Andreas Galanis, Leslie Ann Goldberg, Jean Bernoulli Ravelomanana, Daniel Štefankovič, and Eric Vigoda. Metastability of the Potts ferromagnet on random regular graphs. Communications in Mathematical Physics, pages 1–41, 2023.
  • [15] Amin Coja-Oghlan, Florent Krzakala, Will Perkins, and Lenka Zdeborová. Information-theoretic thresholds from the cavity method. In Proceedings of the 49th Annual ACM SIGACT Symposium on Theory of Computing, STOC 2017, pages 146–157, 2017. doi:10.1145/3055399.3055420.
  • [16] Matthew Coulson, Ewan Davies, Alexandra Kolla, Viresh Patel, and Guus Regts. Statistical physics approaches to Unique Games. In Proceedings of the 35th Computational Complexity Conference, CCC ’20, 2020.
  • [17] Charilaos Efthymiou. On Sampling Symmetric Gibbs Distributions on Sparse Random Graphs and Hypergraphs. In 49th International Colloquium on Automata, Languages, and Programming (ICALP 2022), volume 229, pages 57:1–57:16, 2022. doi:10.4230/LIPIcs.ICALP.2022.57.
  • [18] Andreas Galanis, Leslie Ann Goldberg, and Paulina Smolarova. Planting and mcmc sampling from the potts model, 2024. doi:10.48550/arXiv.2410.14409.
  • [19] Andreas Galanis, Leslie Ann Goldberg, and Paulina Smolarova. Sampling from the random cluster model on random regular graphs at all temperatures via glauber dynamics. Combinatorics, Probability and Computing, 34(3):359–391, 2025. doi:10.1017/S0963548324000403.
  • [20] Andreas Galanis, Daniel Štefankovič, Eric Vigoda, and Linji Yang. Ferromagnetic Potts model: Refined #BIS-hardness and related results. SIAM Journal on Computing, 45(6):2004–2065, 2016. doi:10.1137/140997580.
  • [21] Reza Gheissari and Alistair Sinclair. Low-temperature Ising dynamics with random initializations. In 54th Annual ACM SIGACT Symposium on Theory of Computing, (STOC ’22), pages 1445–1458, 2022. doi:10.1145/3519935.3519964.
  • [22] Reza Gheissari and Alistair Sinclair. Spatial mixing and the random-cluster dynamics on lattices. In Proceedings of the 2023 Annual ACM-SIAM Symposium on Discrete Algorithms, (SODA ’23) , pages 4606–4621, 2023. doi:10.1137/1.9781611977554.CH174.
  • [23] Reza Gheissari, Allan Sly, and Youngtak Sohn. Rapid phase ordering for Ising and potts dynamics on random regular graphs. arXiv preprint arXiv:2505.15783, 2025.
  • [24] Leslie Ann Goldberg and Mark Jerrum. Approximating the partition function of the ferromagnetic Potts model. J. ACM, 59(5):25:1–25:31, 2012. doi:10.1145/2371656.2371660.
  • [25] Geoffrey Grimmett. The random-cluster model, volume 333. Springer, 2006.
  • [26] Tyler Helmuth, Matthew Jenssen, and Will Perkins. Finite-size scaling, phase coexistence, and algorithms for the random cluster model on random graphs. Annales de l’Institut Henri Poincare (B) Probabilites et statistiques, 59(2):817–848, 2023.
  • [27] Brice Huang, Sidhanth Mohanty, Amit Rajaraman, and David X Wu. Weak poincaré inequalities, simulated annealing, and sampling from spherical spin glasses. In Proceedings of the 57th Annual ACM Symposium on Theory of Computing (STOC’25), pages 915–923, 2025. doi:10.1145/3717823.3718236.
  • [28] Svante Janson, Andrzej Rucinski, and Tomasz Łuczak. Random Graphs. John Wiley and Sons, 2011.
  • [29] Matthew Jenssen, Peter Keevash, and Will Perkins. Algorithms for #bis-hard problems on expander graphs. In Proceedings of the Thirtieth Annual ACM-SIAM Symposium on Discrete Algorithms, SODA 2019, pages 2235–2247, 2019. doi:10.1137/1.9781611975482.135.
  • [30] Michael Krivelevich, Eyal Lubetzky, and Benny Sudakov. Asymptotics in percolation on high-girth expanders. Random Structures and Algorithms, 56(4):927–947, 2020. doi:10.1002/RSA.20903.