Planting and MCMC Sampling from the Potts Model
Abstract
We consider the problem of sampling from the ferromagnetic -state Potts model on the random -regular graph with parameter . A key difficulty that arises in sampling from the model is the existence of a “metastability” window , where roughly the distribution has two competing modes, the so-called disordered and ordered phases. This causes classical Markov-chain algorithms to be slow mixing from worst-case initialisations. Nevertheless, Helmuth, Jenssen and Perkins (SODA ’19) designed a sampling algorithm that works for all , when and , using polymers and cluster expansion methods; more recently, their analysis technique has been adapted to show that a Markov chain (random-cluster dynamics) mixes fast when initialised appropriately, in the same regime of .
Despite these positive algorithmic results, a well-known bottleneck behind cluster-expansion arguments is that they inherently only work for large , whereas it is widely conjectured that sampling on the random -regular graph is possible for all . The only result so far that applies to general is by Blanca and Gheissari who showed that the random-cluster dynamics mixes fast in the “uniqueness” regime where roughly only the disordered mode exists. For however, a second subdominant mode emerges creating bottlenecks and giving rise to correlations which have been hard to handle, especially for small values of and .
Our main contribution is to perform a delicate analysis of the Potts distribution and the random-cluster dynamics that goes beyond the threshold . We use planting as the main tool, a technique used in the analysis of random CSPs to capture how the space of solutions is correlated with the structure of the random instance. While planting arguments provide only weak sampling guarantees generically, here we instead combine planting with the analysis of random-cluster dynamics to obtain significantly stronger guarantees. We are thus able to show that the random-cluster dynamics initialised from all-out mixes fast for all integers beyond the uniqueness threshold , all the way to the optimal threshold where the dominant mode switches from disordered to ordered. A more involved analysis also applies to the ordered regime where we obtain an algorithm for all and , improving significantly upon the previous range of by Carlson, Davies, Fraiman, Kolla, Potukuchi, and Yap (FOCS’22).
Keywords and phrases:
approximate sampling, Glauber dynamics, Potts model, random cluster modelCopyright and License:
2012 ACM Subject Classification:
Mathematics of computing Gibbs sampling ; Mathematics of computing Random graphs ; Mathematics of computing Markov-chain Monte Carlo methodsEditors:
Meena Mahajan, Florin Manea, Annabelle McIver, and Nguyễn Kim ThắngSeries and Publisher:
Leibniz International Proceedings in Informatics, Schloss Dagstuhl – Leibniz-Zentrum für Informatik
1 Introduction
The Potts model is a weighted model assigning probabilities to (non-proper) -colourings. The model originated in statistical physics but has since been a central object of study in various contexts; here, we focus on the computational problem of sampling from the model.
For an integer and a graph , the -state Potts model on with parameter is a probability distribution on where . Each configuration has weight where is the number of monochromatic edges under ; the normalising factor is the partition function. Throughout, we consider the “ferromagnetic” case , where configurations with many monochromatic edges are favoured in the distribution.
From a computational complexity perspective, sampling from the ferromagnetic Potts model has various twists relative to other similar models and the complexity of the sampling problem is widely open. On general graphs, the problem is #BIS-hard for any fixed [24]; for graphs of max degree (where is a fixed integer), the problem becomes #BIS-hard when [20], where is known as the ordered/disordered threshold (we will discuss this in more detail shortly). It is also conjectured that the problem admits a polynomial time algorithm when but this is open in general; in fact, many of the standard tools that are used to analyse Markov chains provably fail well below the threshold. The quintessential example here is the random -regular graph, which underpins all the relevant phenomena behind this picture. For even, we let be a graph chosen uniformly at random from the set of all -regular graphs with vertices. We use “whp” as a shorthand for “with probability as grows large”.
For a typical random graph , it is known that a sample from the Potts distribution is “disordered” for , and “ordered” for ; roughly, this means that the colours in the former case appear equally often whereas in the second case one of the colours strictly dominates over the rest (see Lemma 3 for the formal statement). The intricate feature of the Potts model on the random -regular graph is that these competing modes are both present as “local maxima” throughout a window containing even though only one of them has the vast majority of the mass (except at itself, where both modes appear with constant probability). This already poses problems to standard MCMC algorithms such as Glauber and Swendsen-Wang dynamics since the simultaneous presence of the modes causes exponentially slow mixing from worst-case initialisations, see [26, 14]. At a more conceptual level, any sampling algorithm has to take into account the presence of the other mode which obliterates from the very start standard analysis tools (e.g., correlation decay/spectral independence).
Helmuth, Jenssen and Perkins [26] introduced a cluster-expansion technique that allowed them to control more precisely how much a typical sample differs from the corresponding mode, and obtained an algorithm based on the interpolation method for all when and ; their algorithm applies more generally to expander graphs (under some mild conditions). See also the works [16, 11, 12] for various refinements of their method. Following a series of developments [21, 22], this cluster expansion expansion argument has been converted into an MCMC algorithm (for large and ) [19] using the random-cluster dynamics with appropriate initialisation that avoids the bottlenecks in the distribution (see Section 2.2 for details). See also [10, 8, 23] for closely related results on the random -regular graph (and lattices) that apply for large .
Despite these positive algorithmic developments, the cluster expansion technique that underpins the analysis of these algorithms is inherently “perturbative”, relying roughly on controlling how much a typical configuration differs from the max-energy configurations and arguing that the difference is relatively small. In the case of the Potts model, the resulting estimates are accurate only when is quite large relative to . By contrast, it is conjectured that sampling on the random regular graph should be possible for all (and all ), and hence more precise tools are needed to settle the picture for small . The only result so far that works for general is by Blanca and Gheissari [9] who showed that random-cluster dynamics mixes in time when , for arbitrary initialisation. Note that for the results of [14] give worst-case initialisations that slow down the mixing time of the Glauber dynamics to ; more generally, for , the presence of the ordered mode imposes correlation phenomena that are hard to handle, especially for small values of .
1.1 Results
Our main contribution is to do a delicate analysis of the Potts distribution and the random-cluster dynamics that goes beyond the threshold , using planting. Planting is a technique used in the analysis of random CSPs to capture how the space of solutions is correlated with the structure of the random instance, see [1, 15, 4] for some applications. In terms of sampling however, there is no recipe for converting planting methods to efficient sampling algorithms, and typically some extra sampling step is needed on top, see for example [2, 17] for such approaches; even so, the resulting sample usually has some accuracy limitation, i.e., the TV-distance from the target distribution cannot be made arbitrarily small (typically it is required to be at least an inverse polynomial in ). Here, we instead use planting as a means to obtain sharp estimates that are used in the analysis of random-cluster dynamics; we thus show fast convergence of the latter, yielding automatically strong sampling guarantees. A different sampling idea that uses planting in a somewhat related manner has recently been considered in [27], in the context of sampling from spherical spin glasses.
Using this planting framework, we give an algorithm to sample from the Potts distribution on the random -regular graph that covers the optimal range of “high temperatures” for all integers ; as we will explain in more detail later the algorithm is running random-cluster dynamics for steps, initialised appropriately (from the “empty” configuration).
Theorem 1.
For integers and real , the following holds whp over .
There is an MCMC algorithm that, on input and , outputs in steps a sample whose distribution satisfies .
Our analysis framework also applies to the ordered regime where we improve significantly upon various restrictions from previous works [11, 16, 26, 19]. For example, [26, 22] applied when (and ), whereas the recent work by Carlson et al [11] (see also [16]) applied when and for some large constants . Using planting, we obtain a sampling result for all when is roughly larger than . We show the following.
Theorem 2.
For integers and real , the following holds whp over .
There is an MCMC algorithm that, on input and , outputs in steps a sample whose distribution satisfies .
1.2 Proof outline
Our algorithms are based on a well-known edge representation of the Potts model, the random-cluster (RC) model, where configurations are edge subsets weighted by the number of edges present and the number of induced components. We run the random-cluster dynamics (the analogue of Glauber dynamics in this edge setting), starting from the empty set of edges in the disordered regime and the full set of edges in the ordered regime . The reason for working in the RC representation is that, in contrast to the Potts model, the RC model enjoys certain monotonicity properties that facilitate tracking the evolution of Glauber dynamics. In the next Preliminaries section we define the RC model and the relevant random-cluster chain. We also give a more detailed overview of the phases of the Potts model, and how planting gives a handle for the disordered and ordered settings. Most of these parts are largely imported from [14, 20]. In Section 3, we convert the planting for Potts into suitable planting for the RC model, giving the analogue of the ordered and disordered phases in that setting. Then, we state the main contribution of this work which is establishing weak spatial mixing within the disordered and ordered phases (a notion that originated in [21]). Sections 4 and 5 give the technical core of our arguments.
Roughly, in the disordered regime (Section 4), the key ingredient is to show that the dynamics converges to a set of configurations that do not form large components. For , Blanca and Gheissari [9] accomplished this by a very careful random-graph revealing process coupled with the evolution of random-cluster dynamics. Using planting, we are able to capture precisely the correlation between the random graph and a sample from stationarity for , reducing the study of the component structure to showing that the planted random graph is subcritical. Using the monotonicity of the RC model, this implies that the RC dynamics initialised from the empty set (“all-out”) enjoys the same component structure as the stationary distribution (for exponentially many steps). Having this in place (Lemma 14), the rest of the proof is more streamlined after some small reworking of certain correlation decay estimates up to the ordered/disordered threshold (Lemma 15).
In the ordered regime (Section 5), the planted model is instead supercritical and a typical configuration has a “giant” component, whose presence complicates the arguments significantly. The goal now is roughly to show that bichromatic edges do not typically form large components; more precisely, the key ingredient is to show that every vertex is surrounded by close-by vertices (each at some distance ) that belong to the giant component, a so-called “wired” boundary. Planting here allows us to do a head-on analysis of the probability that, in a typical sample from stationarity, there exists a path of length that does not include any vertex in the giant. We get a tight bound for this using careful first moment arguments and percolation bounds; then, for , a union bound over the paths starting at gives the existence of the desired wired boundary (in a typical configuration). Relative to the earlier works [26, 16, 11, 19], bichromatic components roughly correspond to so-called polymers; our techniques show that a smaller lower bound on the size of the giant component suffices in order to ensure that the polymers are not large.
1.3 Further Discussion
The planting arguments that we use in this paper follow by careful first and second moment considerations in [20], which have been carried out for the ferromagnetic Potts model, see also [5] for similar-flavored “silent” planting results. The moment analysis has previously been used in [13] for sampling independent sets and proper colourings in random bipartite graphs, though there the approach ultimately relies on cluster-expansion type of arguments [29, 12]. Our more direct planting approach can be carried out for sampling independent sets on random bipartite graphs (that shares similar monotonicity and planting properties) and obtain sharper bounds for sampling using Glauber dynamics. Note however that, as pointed out in [13, Lemma 10], it is unlikely that current polymer-based techniques will achieve better asymptotic estimates in terms of the degree . Our technique should also apply to the setting (for arbitrary ) provided that the analogous “silent” planting is in place; this seems more easily feasible in the disordered regime but more difficult for the ordered regime where the Poisson-tree neighbourhoods are expected to cause bigger variance.
It is also relevant to note that the results we stated earlier for the cluster-expansion technique and fast mixing of the random-cluster dynamics in the literature apply to non-integer , in the random-cluster representation. It is conceivable that a similar planting to the one we consider here for the Potts model can be carried out for non-integer in the random-cluster representation, though the related first and second moment computations will likely require careful arguments for the distribution of the number of components in random graph percolation; see [6] for some related developments.
2 Preliminaries
2.1 The configuration model
To obtain a handle on the random -regular , we work throughout in the configuration model. Formally, for integers with even, let be a set of “half-edges”; we denote by a (multi)graph on vertex set whose edges are obtained by taking a uniformly random perfect matching of the half-edges; we add an edge between in whenever the half-edges and are matched together for some . For brevity, instead of we sometimes use to denote the random (multi)graph chosen. Any property that holds whp over also holds whp over (see, e.g., [28]), so henceforth we focus on the configuration model .
2.2 The random-cluster dynamics
Given a graph and real parameters and , the random-cluster model is a probability distribution on the subsets of . Let be the set of all configurations. Each configuration has probability where is the number of connected components in the graph , and is a normalisation factor. For integer and , it is well known that sampling from this probability distribution is equivalent to sampling from the -state Potts model with parameter , see Section 3 for details.
Let . The random-cluster (RC) dynamics initialised from a configuration is a Markov chain whose transition to is as follows:
-
1.
First choose an edge uniformly at random.
-
2.
If is a cut edge of the graph , then with probability , set . With all remaining probability set
-
3.
Otherwise, with probability , set . With all remaining probability set .
It is a standard fact that the distribution of converges to , regardless of the initialisation. To work around the slow convergence in the metastability window when , we will later consider the RC dynamics starting from the two extreme configurations, the all-out () and the all-in .
2.3 Phases of the Potts model
Let be a -dimensional probability vector. For an -vertex graph , let
where is a small constant which we will suppress from notation. Hence, consists of these configurations where the colour frequencies are given by the vector . Let also
| and , |
where is the weight of . In short, the function captures the (logarithm of the) expected contribution to the partition function of the random graph given by the configurations in . As we shall see shortly, the local/global maximisers of play a key role in determining the modes of the Gibbs distribution.
There are three relevant thresholds that come into play
For high temperatures (), there is a unique global maximiser of , given by the uniform vector
which corresponds to the “disordered” configurations. As the temperature decreases and crosses the threshold , another local maximum of emerges at an “ordered” vector where one of the colours dominates over the others (which are roughly balanced). More precisely, is given by111The value of is defined as the infinimum value of such that there exists which satisfies (2.1); then it is simple to check that such an exists for all . For certain values of (namely when ), there may be more than one values of that satisfy the equation; when this is the case, the value of that is relevant for the ordered phase is the largest such value.
| (2.1) |
Interestingly, the local maximum at does not become global until where is known as the ordered/disordered threshold; notably, the only point where both and are global maxima is at . Note also that continues being a local maximum till where ; so, in the interval both and are local maxima of which causes the metastability phenomena mentioned in the introduction, see [14] for details.
For convenience, let and denote the sets of configurations when and , respectively. For a graph , define and analogously, and let and denoted the conditional Gibbs distributions. The following lemma from [20] gives some sharp concentration estimates for and (derived using moments and the small subgraph conditioning method) and are the key behind our planting approach.
Lemma 3 ([20, Theorem 5 & Lemma 16]).
Let be integers, and be real.222Here, we assume that the constant , used in the definition of and , is a sufficiently small constant depending only on . Let be any function with . Whp over , we have:
Moreover, when , and when , where is the union of with its permutations. For , it holds that and .
2.4 Planting disordered and ordered configurations
We now introduce the planting distribution which will be the key in obtaining our results. Recall that denotes a graph chosen according to the configuration model ; in the expressions below, expectations are taken with respect to the choice of . For , define a random graph by letting
| (2.2) |
Furthermore, for , recalling the disordered and ordered partition functions from the previous section, we define random configurations and with distributions
| (2.3) |
We refer to and as the disordered and ordered planted distributions respectively.
The following lemma allows us to relate the planted distributions with the distributions of the (conditional) disordered and ordered configurations of the random graph . Recall that denotes the set of all -vertex regular (multi)graphs with degree .
Lemma 4.
Let be integers and be real. Let be any functions with . Let be an arbitrary event.
-
(1)
: if , then, whp over , .
-
(2)
: if , then, whp over , .
The proof is given in the full version [18]. To utilise Lemma 4, the key observation is that, for the planted models, once we condition on the vertex and edge statistics, the models become uniformly distributed. Formally, for a graph and a configuration , let be vertex colour statistics under , i.e.,
| for a colour , . |
Similarly, let be the matrix with the following half-edge colour statistics under , i.e.,
| (2.4) |
Recall that . So, for a colour class , out of the half-edges incident to it, there are “going” to a colour class (note, for , there are edges in that lie inside the colour class ). We have that is symmetric with . Observe further that, conditioned on and the pair is uniformly distributed over the pairs with and . Similarly for the pair conditioned on and .
To put this observation into use, we will need more quantitative information on the vertex and edge colour statistics in the planted models. By definition of , we already know that and . Analogously, and are concentrated around the vectors and where
| (2.5) |
We have the following concentration statement.
Lemma 5 ([14, Lemmas 3.4 & 3.5]).
Let be integers and be real. There exists such that for any it holds that
-
(i)
For : .
-
(ii)
For : .
3 Mixing within the disordered and ordered phases for the RC model
3.1 Phases for the random cluster model via Potts
For a graph and , let denote the random subset of edges obtained by keeping each monochromatic edge of under with probability . It is a standard fact (see, e.g., [25]) that, for integer , we can obtain a sample by first sampling and then keeping each monochromatic edge of under with probability (the so-called percolation step).
We use this for the random graph to get a handle on the random cluster model on , i.e., has distribution . From Lemma 5, we can figure out in particular how many monochromatic edges we should expect from each phase, so we can translate the phases for the Potts model to the random cluster model. More precisely, set
With a bit of algebra, we can derive the following expressions for and from (2.5): for , we have , while, for , we have with and defined from , as in (2.1).
We are now ready to define the ordered/disordered phases for the random-cluster model. Due to monotonicity properties (and in contrast to the phases of the Potts model), we can just use the values of and at the critical temperature ; this will be convenient later on.
Definition 6 (The ordered/disordered phases).
Let be integers. For a graph and , define
Let also and .
Using the translation between the Potts and random-cluster models, we have the following.
Corollary 7.
Let be integers and be real. Then, whp over , we have when and when .
For , and .
Proof of Corollary 7.
This follows from Lemma 3, and the correspondence between and stated at the start of the section, using for the lower bounds at .
To analyse the ordered/disordered phases of the random cluster model using the planted distributions, we will need the following intermediate distributions. Namely, for a graph , let be the distribution on obtained by first sampling and then keeping each monochromatic edge with probability . Define similarly . The proof of the following is given in the full version [18].
Lemma 8.
Let be integers and be real. Then, whp over , for any event , the following holds.
-
(i)
if , then .
-
(ii)
if , then .
3.2 Mixing and Weak Spatial Mixing (WSM) within a phase
Treading a path initiated in [21], to prove Theorems 1 and Theorem 2 it suffices to show the following results for the disordered and ordered regimes, respectively.
Theorem 9.
Let be integers and . Then, whp over , for every the RC dynamics initialised from all-out satisfies for .
Theorem 10.
Let be integers and be a real. Then, whp over , for every the RC dynamics initialised from all-in satisfies for .
Proof of Theorems 1 and 2.
We first do the proof of Theorem 1. By Corollary 7, it holds that and by Theorem 9 we can obtain a sample from in steps using random-cluster dynamics. Then, using the translation between the Potts and RC models [25], this gives a sample whose distribution is within TV-distance from , as wanted. The proof of Theorem 2 is analogous using Corollary 7 and Theorem 10.
It was shown in [21] that, for the ferromagnetic Ising model, a sufficient condition for fast convergence to equilibrium within the phase is the property of weak spatial mixing (WSM) within a phase, paired with some mixing-time estimates in local neighbourhoods. The same condition applies for the random-cluster model and, more generally, for monotone models. In the case of the random regular graph, the local neighbourhoods resemble the -regular tree and the required mixing-time results have been established in [7]. So, the main piece missing in order to obtain Theorems 9 and 10 is showing WSM within the corresponding phase. Roughly, the notion captures that, when we condition on the phase, the influence of a boundary configuration at distance is small.
More formally, for a graph , an integer and a vertex , let be the graph induced by vertices at distance from , whose vertex and edge sets are going to be denoted by and , respectively. A boundary condition on is a subset of the edges in ; then denotes the RC distribution induced on conditioned on , i.e., for it holds that . There are two extreme boundary conditions on which will be most relevant: (i) the all-in or wired condition where all edges in are included in the boundary condition, and (ii) the all-out or free boundary condition where none of the edges in are included. We use and to denote the corresponding conditional RC distributions.
For an edge , let denote the subsets of that include . Following [19], we say that we have WSM within the disordered phase (respectively, ordered) at distance if for each vertex and each edge incident to it holds that (respectively, ) with .
Theorem 11.
Let be integers and be a positive real. Then, there exists such that, whp over , has WSM within the disordered phase at a distance . That is, with , for every and every edge incident to , it holds that .
Theorem 12.
Let be integers and be a real. Then, there exists such that, whp over , has WSM within the ordered phase at a distance . That is, with , for every and every edge incident to , it holds that .
4 Proof for WSM in the disordered regime
To show WSM in the disordered regime, we follow the strategy outlined in Section 1.2. We begin with some relevant definitions. Let be a graph. Recall, that for a vertex and , denotes the radius- ball around in . Also, denote by the set of vertices at distance exactly from . For a set of vertices , we let the set of edges with both endpoints in .
Definition 13.
Let be a graph and be a vertex. For , we say that a subset is -shattered at distance from if all but vertices of belong to distinct components of the graph .
Lemma 14.
Let be integers and be a positive real. For any constant , there is so that the following holds whp over . For every vertex , define the event
Then, for the RC disordered phase of , we have .
Proof.
For a graph and , let denote the random subset of edges obtained by keeping each monochromatic edge of under with probability . Fix and let . We will consider the disordered planted graph and show that for an arbitrary vertex it holds that
| (4.1) |
Assuming this for now, we obtain by Lemma 4 and a union bound over the vertices that as well. Note that has distribution , so Lemma 8 yields that .
It remains to prove (4.1). By Lemma 5, we have w.p. . Consider arbitrary and with . Conditioned on and , we have that is uniformly random among all pairs with and .
To proceed, fix with and condition on . Condition further that and on the induced graph on . For convenience, let denote the (random) set of edges outside after the percolation step, i.e., the set .
Let . For , we explore the (union of the) components of vertices in the graph by revealing at each step an edge incident to a vertex belonging to any of these (chosen arbitrarily). We denote by the set of “active vertices” in these components that still have unmatched half-edges, so that implies that all the components have been fully explored. Let denote the set of all vertices that have been active up to time . The process is described precisely in Figure 1.
Exploration of starting from
Let , , and .
While or :
-
If , pick any unmatched half-edge and run Match;
increase by 1 and set
-
Otherwise, pick . For :
-
If the half-edge is not matched:
-
increase by 1;
-
match to another half-edge, let .
-
if , w.p. set ; else set .
-
Remove and any other vertices from which don’t have unmatched half-edges.
-
Match
For , let be the number of currently unmatched half-edges from colour to .
-
Let . Choose a colour from the distribution .
-
From the set of unmatched half-edges from colour to , choose one u.a.r., say .
-
match with , and return the vertex .
Note that at any time , for , there is a number of prescribed half-edges with one endpoint of colour that needs to be matched to a vertex of colour (note, this goes down when we reveal such an edge connecting colours and ). The revealing of an edge at time adjacent to an active vertex with colour has therefore two stages: we first reveal the other endpoint in the graph , chosen proportionally to the counts ; if , we further toss a coin with probability to determine whether the edge belongs to , and add to the set of active vertices accordingly. The key point here is that we only need to reveal the components involving in the graph , rather than the components in (which are much bigger). Indeed, as we will see shortly the former is dominated by a subcritical branching process and therefore at most vertices are revealed for each of the vertices in .
To make this precise, for a time , let denote the indicator of the event that the -th half edge was matched to a vertex in the set consisting of vertices that have been active at some point. Let also be the indicator that the edge revealed was monochromatic and that it survived the percolation step. Note that the number of the remaining unmatched half-edges from colour to satisfies , so for all the relevant .
Similarly, . So, with denoting the first edges revealed, we have the crude bounds
It follows that the sum is dominated above by a sum of independent coin tosses with probability , whereas the sum is dominated above by a sum of independent coin tosses with probability for some small constant (using that for ). Therefore, recalling that and , we have
where the second probability bound follows by standard Chernoff bounds (using that ).
Note that the event implies that for all which in turn implies that at least one vertex gets removed from the set of active vertices every (at most) time steps; indeed, when exploring the neighbours of a vertex at a particular time , there are at most half-edges incident to remaining unmatched since one of the half-edges of has been previously matched (to activate ). We therefore have the inequality . Since , it follows that
Combining these, we obtain that with probability both of and hold. The first implies that by time all the components of in the graph have been explored, and the second implies that all but (at most) of these vertices belong to distinct components. This finishes the proof of (4.1) and hence completes the proof of the lemma.
We next use an argument of Blanca and Gheissari [9] which shows a correlation decay bound for -shattered configurations in a treelike graph in terms of the existence of two root-to-leaf paths. The argument in [9] is technically proved for using some uniqueness estimates, but it can be extended it to the regime by observing that in a treelike ball, a -shattered configuration induces a distribution that is comparable to percolation on the tree with parameter (this coincides with the cut-edge update probability in the definition of the random cluster dynamics). A key feature of the correlation decay bound in [9] is the existence of two paths which gives the term in the lemma below, and which ultimately allows us to make the probability less than that is required for WSM, while keeping the distance less than that is the threshold for having treelike neighbourhoods. We give the proof of the following lemma in the full version [18].
Lemma 15.
Let be integers, and , positive reals.
There exists a constant such that whp over the following holds. For all and an edge incident to it, and ,
We can now give the proof of Theorem 11.
Theorem 11. [Restated, see original statement.]
Let be integers and be a positive real. Then, there exists such that, whp over , has WSM within the disordered phase at a distance . That is, with , for every and every edge incident to , it holds that .
Proof.
Consider . Since , we have that . Hence there is an such that . Let and .
Lemma 15 gives that, whp over , for any edge and vertex incident to it, it holds that . By the choice of and , we get that , so for large enough we have , concluding the proof of WSM within the disordered phase.
5 Proof for WSM within the ordered regime
To show WSM within the ordered phase for the random-cluster model on , we follow the strategy outlined in Section 1.2. The core of the argument will be to show that there are no paths of length that avoid the largest component in a typical configuration from . We will do this by considering percolation on the planted random graph model for . It will thus be relevant to consider random graph percolation on (almost) -regular random graphs where we have removed a path of length .
For a graph , let denote the largest component of (breaking ties arbitrarily), and more generally, denote the -th largest component of . It is a standard fact [3, 30, 14] that the size of the largest component in a percolated random -regular graph is controlled by the extinction probability of a Galton-Watson process with distribution on the tree. For , define and to be the unique solutions on the interval to the following
| (5.1) |
In particular, the size of the largest component is roughly equal to ; see [3, 30] for more details. We also remark that the quantity is the extinction probability of a slightly modified GW-tree where every vertex has degree apart from the root which has degree ; this will be relevant later for our path considerations.
We show a similar result for the size of the giant component for slightly non-regular degree sequences , where we allow a small set of vertices to have degree less than ; in our WSM setting, we will further be interested in the probability that a particular subset of the vertices does not belong to the giant. A variant of percolation we will use is the exact edge model for random graphs with a given degree sequence and edge count . Let be the random graph model where each is associated to half-edges; then we choose a subset of half-edges and a matching of these uniformly at random; the final (multi)graph is obtained by projecting the edges on the vertex set .
Lemma 16.
Let be an integer and . Let be arbitrarily small constants.
Suppose that satisfies and is a degree sequence such that for and for . Then, there exists a constant , such that for with , the following hold:
-
1.
If , .
-
2.
If , and . Moreover, with as in (5.1), for any subset ,
Intuitively, the bound in the second item follows by revealing the -neighbourhoods of vertices in for a suitably large constant (these are whp trees and disjoint from each other); then, for a vertex , the factor captures the probability that the percolation dies out within the neighourhood of the vertex . The full proof of Lemma 16 is given in the full version [18].
Next, we consider the density of the edges inside the color classes. The following lemma says that in the ordered regime one colour class is supercritical, while the others subcritical. Recall that and are the vertex and edge statistics for the ordered phase respectively, defined in (2.1) and (2.5).
Lemma 17 ([14, Lemma 5.6]).
Let be integers. For and , the ordered vectors and satisfy for , and for .
Let be a graph. For a subset of edges and a vertex , we let be the edges in incident to . We will also consider the largest component of the graph , i.e. the component with the largest number of vertices (breaking ties arbitrarily). We say that avoids a path if there is no vertex that belongs to . The next lemma is the key technical step in our argument and will be used to show that, in the RC ordered phase, the largest component (which has size due to the supercriticality of colour 1, cf. Lemma 17) is “very close” to every vertex .
Lemma 18.
Let be integers, be real and . For any constant , the following holds for any integer satisfying , whp over .
To parse the bound on note that the factor is just an upper bound on the total number of paths of length . The factor controls the probability that a particular path avoids the largest component. The term in is the probability that the Galton-Watson process dies out from a vertex of the path that belongs to the supercritical color class 1 (note that has “remaining” degree other than its neighbours on the path, and that the relevant percolation parameter is given by ). The term accounts for the frequency that we encounter vertices in the supercritical color class 1 along the path , which follows from the definition of the planting model. The exact quantitative dependence of to and follows by a more technical calculation involving eigenvalues. Later, in Lemma 19, we will show that can be made less than by taking sufficiently large.
Proof of Lemma 18 (Sketch).
From Lemmas 4 and 8, it suffices to show for the ordered planted graph that (for all sufficiently large)
| (5.2) |
From this, we obtain by Lemma 4 that ; since has distribution , Lemma 8 yields that, whp over , it holds that .
By Lemma 5, we have w.p. . Condition on and where and satisfy , and note that the pair is uniformly random from the pairs having these statistics.
Let be an arbitrary sequence of vertices , together with a -colouring . Let also be a set of paired half-edges that turn into a path in that order. Let be the set of random edges in the graph before percolation. Denote by
the event that form a path in (in this order) using the half-edges prescribed by and that they belong to the colour classes that prescribes. In the full version [18], we give an exact expression for in terms of and which yields that
| (5.3) |
where denotes as .
Fix now an arbitrary vertex and let be an arbitrary sequence with , together with a -colouring and pairing (as above). Condition on the event . Let be the edge set obtained by keeping each monochromatic edge with probability , and be the set of these edges that are incident to . Then, for the graph , we show in the full version [18] that color 1 is supercritical and the others are subcritical (using Lemma 17); by invoking then Lemma 16 to the supercritical color class 1, we further obtain that
| (5.4) |
where is the number of vertices with colour 1 under and where and (as in (5.1)). Combining (5.3) and (5.4) via a union bound (note that there are sequences with vertices starting from and ways to select half-edges to turn each of these into a path) yields that
| (5.5) |
At this stage, it remains to take a union bound over all possible colourings on vertices, i.e., sum the right-hand-side of (5.5) over all . The resulting expression can be written more succinctly in the form where is the matrix whose -entry is given by and is the vector whose -th entry is equal to if and if . Note that where is the diagonal matrix with on the diagonal and is the symmetric matrix . In particular, we have that and have the same eigenvalues, which are all real. Let denote the vector whose entries are the square roots of the entries of . Then, we can write
Now, we will show that entry-wise it holds that where is as in the lemma statement. Note first that . Since for and for , we have that
For , we have
This finishes the proof of , and therefore we can bound the left-hand-side of (5.5) with . Noting that the event in (5.5) is just , (5.2) follows by a union bound over .
We next bound the quantity appearing in Lemma 18 by a suitably small constant so that we can take a union bound over paths; it is for this step that we require . The proof is in the full version [18]
Lemma 19.
Let , and . Then, for all , it holds that .
5.1 WSM within the ordered phase
Next we prove WSM within the ordered phase. We first define a wired boundary and show how it implies the WSM within the ordered phase, and then prove that it occurs with a good probability.
Definition 20.
Let be a graph, , and let be a RC configuration. Consider the -neighbourhood of in . We say that is a wired boundary around in , if
-
is a cut set separating from the outside of the ball. That is, , the component of in , is disjoint from ; and
-
in the graph , all vertices of are in the same component.
Lemma 21.
Let be integers, a real. Then whp over the following holds. For any , any edge incident to , and any ,
To prove the lemma, we compare with . When , we obtain, from Corollary 7, that . Then we can bound by the probability (under ) that there is no wired boundary around in : informally, this is because probability of conditional on the wired boundary is the same under both and , and due to the monotonicity of the RC distribution, the probability of the wired boundary under is at least that under . For more details as well as the case when see the full proof of the Lemma is in the full version [18].
By Lemma 18 all simple length- paths from intersect the largest component of , so we can construct a wired boundary by taking, for each length- path from in , the first vertex on the path belonging to the largest component of . We thus show the following lemma in the full version [18].
Lemma 22.
Let be a graph, let be an arbitrary vertex, and let an integer. For any , let be the set of edges in that are incident to and let . Suppose that all simple length- paths from in intersect the largest component of . Then has a wired boundary around in .
From these two lemmas we can now obtain WSM within the ordered phase. See 12
Proof.
By Lemma 19, we have . Let be a small constant (depending on ) such that . Let . By Lemma 21, is at most
By Lemma 22, this is at most
By Lemma 18 this is at most . So whp over it holds for every vertex and edge incident to it that
which is at most for all large enough.
References
- [1] Dimitris Achlioptas and Amin Coja-Oghlan. Algorithmic barriers from phase transitions. In 2008 49th Annual IEEE Symposium on Foundations of Computer Science, pages 793–802, 2008. doi:10.1109/FOCS.2008.11.
- [2] Ahmed El Alaoui, Andrea Montanari, and Mark Sellke. Sampling from the Sherrington-Kirkpatrick Gibbs measure via algorithmic stochastic localization. In 63rd IEEE Annual Symposium on Foundations of Computer Science, FOCS 2022, pages 323–334, 2022. doi:10.1109/FOCS54457.2022.00038.
- [3] Noga Alon, Itai Benjamini, and Alan Stacey. Percolation on finite graphs and isoperimetric inequalities. The Annals of Probability, 32(3):1727–1745, 2004.
- [4] Victor Bapst, Amin Coja-Oghlan, and Charilaos Efthymiou. Planting colourings silently. Comb. Probab. Comput., 26(3):338–366, 2017. doi:10.1017/S0963548316000390.
- [5] Victor Bapst, Amin Coja-Oghlan, and Charilaos Efthymiou. Planting colourings silently. Combinatorics, Probability and Computing, 26(3):338–366, 2017. doi:10.1017/S0963548316000390.
- [6] Ferenc Bencs, Márton Borbényi, and Péter Csikvári. Random cluster model on regular graphs. Communications in Mathematical Physics, pages 1–46, 2022.
- [7] Antonio Blanca, Zongchen Chen, Daniel Štefankovič, and Eric Vigoda. The Swendsen-Wang dynamics on trees. In Approximation, Randomization, and Combinatorial Optimization. Algorithms and Techniques (APPROX/RANDOM 2021), volume 207, pages 43:1–43:15, 2021. doi:10.4230/LIPIcs.APPROX/RANDOM.2021.43.
- [8] Antonio Blanca, Andreas Galanis, Leslie Ann Goldberg, Daniel Štefankovič, Eric Vigoda, and Kuan Yang. Sampling in uniqueness from the Potts and random-cluster models on random regular graphs. SIAM Journal on Discrete Mathematics, 34(1):742–793, 2020. doi:10.1137/18M1219722.
- [9] Antonio Blanca and Reza Gheissari. Random-cluster dynamics on random regular graphs in tree uniqueness. Communications in Mathematical Physics, 386(2):1243–1287, 2021.
- [10] Antonio Blanca and Reza Gheissari. On the tractability of sampling from the Potts model at low temperatures via Swendsen-Wang dynamics. CoRR, abs/2304.03182, 2023. doi:10.48550/arXiv.2304.03182.
- [11] Charlie Carlson, Ewan Davies, Nicolas Fraiman, Alexandra Kolla, Aditya Potukuchi, and Corrine Yap. Algorithms for the ferromagnetic Potts model on expanders. In 63rd IEEE Annual Symposium on Foundations of Computer Science, FOCS 2022, pages 344–355, 2022. doi:10.1109/FOCS54457.2022.00040.
- [12] Zongchen Chen, Andreas Galanis, Leslie Ann Goldberg, Will Perkins, James Stewart, and Eric Vigoda. Fast algorithms at low remperatures via Markov chains. In Approximation, Randomization, and Combinatorial Optimization. Algorithms and Techniques, APPROX/RANDOM 2019, volume 145, pages 41:1–41:14, 2019. doi:10.4230/LIPIcs.APPROX-RANDOM.2019.41.
- [13] Zongchen Chen, Andreas Galanis, Daniel Štefankovič, and Eric Vigoda. Sampling colorings and independent sets of random regular bipartite graphs in the non-uniqueness region. In Proceedings of the 2022 ACM-SIAM Symposium on Discrete Algorithms, SODA 2022, pages 2198–2207. SIAM, 2022. doi:10.1137/1.9781611977073.87.
- [14] Amin Coja-Oghlan, Andreas Galanis, Leslie Ann Goldberg, Jean Bernoulli Ravelomanana, Daniel Štefankovič, and Eric Vigoda. Metastability of the Potts ferromagnet on random regular graphs. Communications in Mathematical Physics, pages 1–41, 2023.
- [15] Amin Coja-Oghlan, Florent Krzakala, Will Perkins, and Lenka Zdeborová. Information-theoretic thresholds from the cavity method. In Proceedings of the 49th Annual ACM SIGACT Symposium on Theory of Computing, STOC 2017, pages 146–157, 2017. doi:10.1145/3055399.3055420.
- [16] Matthew Coulson, Ewan Davies, Alexandra Kolla, Viresh Patel, and Guus Regts. Statistical physics approaches to Unique Games. In Proceedings of the 35th Computational Complexity Conference, CCC ’20, 2020.
- [17] Charilaos Efthymiou. On Sampling Symmetric Gibbs Distributions on Sparse Random Graphs and Hypergraphs. In 49th International Colloquium on Automata, Languages, and Programming (ICALP 2022), volume 229, pages 57:1–57:16, 2022. doi:10.4230/LIPIcs.ICALP.2022.57.
- [18] Andreas Galanis, Leslie Ann Goldberg, and Paulina Smolarova. Planting and mcmc sampling from the potts model, 2024. doi:10.48550/arXiv.2410.14409.
- [19] Andreas Galanis, Leslie Ann Goldberg, and Paulina Smolarova. Sampling from the random cluster model on random regular graphs at all temperatures via glauber dynamics. Combinatorics, Probability and Computing, 34(3):359–391, 2025. doi:10.1017/S0963548324000403.
- [20] Andreas Galanis, Daniel Štefankovič, Eric Vigoda, and Linji Yang. Ferromagnetic Potts model: Refined #BIS-hardness and related results. SIAM Journal on Computing, 45(6):2004–2065, 2016. doi:10.1137/140997580.
- [21] Reza Gheissari and Alistair Sinclair. Low-temperature Ising dynamics with random initializations. In 54th Annual ACM SIGACT Symposium on Theory of Computing, (STOC ’22), pages 1445–1458, 2022. doi:10.1145/3519935.3519964.
- [22] Reza Gheissari and Alistair Sinclair. Spatial mixing and the random-cluster dynamics on lattices. In Proceedings of the 2023 Annual ACM-SIAM Symposium on Discrete Algorithms, (SODA ’23) , pages 4606–4621, 2023. doi:10.1137/1.9781611977554.CH174.
- [23] Reza Gheissari, Allan Sly, and Youngtak Sohn. Rapid phase ordering for Ising and potts dynamics on random regular graphs. arXiv preprint arXiv:2505.15783, 2025.
- [24] Leslie Ann Goldberg and Mark Jerrum. Approximating the partition function of the ferromagnetic Potts model. J. ACM, 59(5):25:1–25:31, 2012. doi:10.1145/2371656.2371660.
- [25] Geoffrey Grimmett. The random-cluster model, volume 333. Springer, 2006.
- [26] Tyler Helmuth, Matthew Jenssen, and Will Perkins. Finite-size scaling, phase coexistence, and algorithms for the random cluster model on random graphs. Annales de l’Institut Henri Poincare (B) Probabilites et statistiques, 59(2):817–848, 2023.
- [27] Brice Huang, Sidhanth Mohanty, Amit Rajaraman, and David X Wu. Weak poincaré inequalities, simulated annealing, and sampling from spherical spin glasses. In Proceedings of the 57th Annual ACM Symposium on Theory of Computing (STOC’25), pages 915–923, 2025. doi:10.1145/3717823.3718236.
- [28] Svante Janson, Andrzej Rucinski, and Tomasz Łuczak. Random Graphs. John Wiley and Sons, 2011.
- [29] Matthew Jenssen, Peter Keevash, and Will Perkins. Algorithms for #bis-hard problems on expander graphs. In Proceedings of the Thirtieth Annual ACM-SIAM Symposium on Discrete Algorithms, SODA 2019, pages 2235–2247, 2019. doi:10.1137/1.9781611975482.135.
- [30] Michael Krivelevich, Eyal Lubetzky, and Benny Sudakov. Asymptotics in percolation on high-girth expanders. Random Structures and Algorithms, 56(4):927–947, 2020. doi:10.1002/RSA.20903.
