Abstract 1 Introduction 2 Preliminaries 3 Recursion and marginals 4 FPTAS for counting proper edge colorings on general graphs 𝒒𝟑𝚫 5 Strong spatial mixing for edge colorings on trees when 𝒒>(𝟑+𝒐(𝟏))𝚫 6 Tight bound for weak spatial mixing References

Decay of Correlation for Edge Colorings When 𝒒>𝟑𝚫

Zejia Chen ORCID Shanghai Jiao Tong University, China Yulin Wang ORCID Shanghai Jiao Tong University, China Chihao Zhang ORCID Shanghai Jiao Tong University, China Zihan Zhang ORCID Graduate Institute for Advanced Studies, SOKENDAI, Tokyo, Japan
Abstract

We examine various perspectives on the decay of correlation for the uniform distribution over proper q-edge colorings of graphs with maximum degree Δ.

First, we establish the coupling independence property when q3Δ for general graphs. Together with the recent work of Chen, Feng, Guo, Zhang and Zou (2024), this result implies a fully polynomial-time approximation scheme (FPTAS) for counting the number of proper q-edge colorings.

Next, we prove the strong spatial mixing property on trees, provided that q>(3+o(1))Δ. The strong spatial mixing property is derived from the spectral independence property of a version of the weighted edge coloring distribution, which is established using the matrix trickle-down method developed in Abdolazimi, Liu and Oveis Gharan (FOCS, 2021) and Wang, Zhang and Zhang (STOC, 2024).

Finally, we show that the weak spatial mixing property holds on trees with maximum degree Δ if and only if q2Δ1.

Keywords and phrases:
Strong Spatial Mixing, Edge Coloring, Approximate Counting
Category:
Track A: Algorithms, Complexity and Games
Funding:
Chihao Zhang: The author is supported by National Natural Science Foundation of China No. 62372289.
Copyright and License:
[Uncaptioned image] © Zejia Chen, Yulin Wang, Chihao Zhang, and Zihan Zhang; licensed under Creative Commons License CC-BY 4.0
2012 ACM Subject Classification:
Theory of computation Generating random combinatorial structures
Related Version:
Full Version: https://arxiv.org/abs/2502.06586
Editors:
Keren Censor-Hillel, Fabrizio Grandoni, Joël Ouaknine, and Gabriele Puppis

1 Introduction

Sampling from the uniform distribution of proper edge colorings received lots of attention recently, with the advent of new tools in analyzing high-dimensional distributions [10, 1, 16, 3]. A proper edge coloring of an undirected graph with maximum degree Δ is an assignment of each edge with one of q colors so each adjacent edges receive different color. Clearly a proper edge coloring can be viewed as a proper vertex coloring on its line graph.The sampling problem is to draw a proper edge coloring uniformly. Most of previous work focuses on the mixing time of Glauber dynamics. The work of [16] established the spectral independence property, a notion to measure the correlation in high-dimensional distributions [2], whenever q>(2+o(1))Δ for general graphs and the work of [3] establish approximate tensorization of variance on trees when qΔ+1. Note that Glauber dynamics is known to be reducible when q<2Δ on general graphs [13] and therefore the q>(2+o(1))Δ condition for Glauber dynamics is asymptotically tight. However, it is still open whether q2Δ is the threshold for efficient sampling proper edge colorings and there are some recent attempts to design other sampling algorithm under better conditions [9]. A (almost) uniformly sampling algorithm can be turned into fully polynomial-time randomized scheme (FPRAS) for counting the number of proper colorings using standard reduction [14].

In this work we examine some other aspects for the correlation of the uniform distribution on edge colorings. We first established the coupling independence property on general graphs when q3Δ. Coupling independence [6] is a notion stronger than spectral independence, and thanks to recent work of [5], building on the machinery of Moitra [15], it (together with some other properties) implies a fully polynomial-time approximate scheme (FPTAS) for counting proper edge colorings. This is the first determinstic approximate counting algorithm in this range of parameters. Moreover, our proof of the coupling independence property differs from those directly derived from contractive couplings for establishing rapid mixing results.

Theorem 1 (Informal).

If q3Δ, then there exists an FPTAS for counting the number of proper q-edge colorings on any graph G with maximum degree Δ.

Before our work, there is no similar results tailored for counting edge colorings. The best FPTAS for counting edge coloring is the same as the one for general vertex coloring, which requires q>3.618Δ [4, 5] for sufficiently large Δ.

We then study the strong spatial mixing (SSM) property for edge colorings on trees, an important notion to measure the correlation between sites in Gibbs distributions whose definition is in Section 2.3.

Theorem 2 (Informal).

Let T be a tree with maximum degree Δ. If q(3+o(1))Δ, then the uniform distribution over q-colorings on T exhibits strong spatial mixing with exponential decay rate.

Similar strong spatial mixing bounds on trees have been thoroughly studied for vertex colorings (e.g. [11, 8]). It is conjectured that SSM holds whenever qΔ+1 and in [8], a qΔ+3 condition was established, almost resolving the conjecture. However, much less is known for edge colorings. The q>(3+o(1))Δ bound established in this work is by no means tight. We also discuss the limit of our approach and possible further improvement. On the other hand, we show that one cannot expect the strong spatial mixing property to hold when q<2Δ, as we prove that 2Δ1 is the threshold for weak spatial mixing. Therefore, we conjecture that SSM holds for edge colorings on graphs with maximum degree Δ whenever q2Δ+γ for certain constant γ. We also show that the best bound one can expect using the analysis in this paper cannot be better than q2.618Δ.

Theorem 3 (Informal).

If q2Δ1, then the uniform distribution over q-colorings on any tree with maximum degree Δ exhibits weak spatial mixing with exponential decay rate. Otherwise, there exists a tree with maximum degree Δ such that the uniform distribution over q-coloring on it does not satisfy the weak spatial mixing property.

In the following, we give an overview with our technique, with an emphasis on the novelty.

1.1 Technical contribution

A new coupling strategy

The coupling independence property is established via a new local coupling for edge colorings. Our coupling can somehow be viewed as a multi-spin version of Chen and Gu’s coupling [7] for Holant problems with boolean domain. Their coupling, using the problem of b-matching as an example, begins with two instances differing at one pendant edge, or equivalently, two instances with a single constraint discrepancy. During the coupling process, the number of discrepancies can never increase but has a nonzero probability of decreasing to zero. Therefore, the coupling process terminates in expected constant number of steps.

In the problem of edge coloring, we can design a local coupling starting from a single discrepancy so that the number of discrepancies can either increase to two, decrease to zero, or remain unchanged. We then use marginal probability bounds to control the probability of discrepancy increasing, while ensuring that the number of discrepancies decreases in expectation.

Dimension reduction

We establish the strong spatial mixing property on trees by analyzing marginal recursions, which is similar to [8]. However, unlike previous work for spin systems where the marginal on a single site is considered, we study the recursion for marginals on a “broom”, namely all edges incident to the root. For each partial coloring on the broom, we can represent its marginal as a function of marginal probabilities of partial colorings in subtrees. However, the Jacobian matrix of this system can be as large as qΔ×ΔqΔ, and is technically very hard to analyze. Our key observation is that the Jacobian matrix is of low rank, and therefore we can apply a trace trick to bound its 2-norm by the norm of a much smaller matrix. We call this step dimension reduction.

From spectral independence to strong spatial mixing

It is still challenging to directly bound the 2-norm of the small matrix. We then observe that it can be written as the product of certain covariance matrices of marginal distributions on brooms. Therefore, ideally we can apply the known bounds for these covariance matrices, or equivalently the spectral independence bound for these marginals. However, these marginal probabilities are from the distribution of certain “weighted edge colorings” and one cannot directly apply previous spectral independence result for edge colorings. As a result, we apply the machinery of matrix trickle-down developed in [1] in the way of [16] to establish the desired spectral independence result.

Top-down analysis of recursion

There is another subtle technical issue in the above approach. When analyzing the contraction of marginal recursion, one needs to analyze the gradient / Jacobian at certain “midpoint” between two boundary conditions due to the application of fundamental theorem of calculus or mean-value theorem in the analysis. These midpoints, however, are not necessarily probabilities because the recursion may involve a potential function111Theses quantities are referred to as “subdistributions” in [8]. In previous work, only certain marginal bounds are used to prove the contraction, and these bounds are also satisfied by the midpoints. However, in our case, we require these midpoints to satisfy refined properties, such as spectral independence, which does not hold in general. Therefore, we cannot apply the recursion in the traditional bottom-to-top manner, where one fixes the boundary value at level L, analyze the contraction at level L1, then fixes boundary value at level L1 and analyze the contraction at level L2, and so on. Instead, we only fix boundary value at the leaves and analyze the composition of the recursion at each level as a whole. Therefore, we need to take “midpoints” only at the leaves, which defines our “weighted edge coloring” instance. Its spectral independence property can be established by the matrix trickle-down method, as discussed earlier.

1.2 Organization of the paper

After introducing the necessary preliminaries in Section 2, we give the marginal recursions and prove useful marginal bounds in Section 3. Then we present our proof of coupling independence, which implies the FPTAS, in Section 4. Strong spatial mixing on trees is proved in Section 5 and the results of weak spatial mixing are proved in Section 6. Verification for technical lemmas are deferred to the full version of this paper.

2 Preliminaries

We use the following notations. For any a,b, let ab:=min{a,b}, ab:=max{a,b}. For any two non-negative integers ab, let ab¯ be the falling factorial, i.e. ab¯=i=ab+1ai. Let 𝖨𝖽 denote the identity matrix. For a function f:Ω defined on a finite domain Ω, we use [f(x)]xΩ to denote the corresponding (column) vector in Ω. For any set S and an element xS, we write Sx for S{x}.

For two probability measures μ,ν on the same probability space Ω, we define μνTV=12ωΩ|μ(ω)ν(ω)|=supAΩ|μ(A)ν(A)| for the total variation distance between μ,ν. If μ,ν are two probability distributions on finite state spaces Ω1,Ω2 respectively, then we say ω is a coupling of μ,ν when it is a joint distribution on Ω1×Ω2 with μ,ν as its marginals, i.e. μ(x)=yΩ2ω(x,y) and ν(y)=xΩ1ω(x,y) for every xΩ1,yΩ2.

Given a graph G=(V,E), for any vertex vV, let E(v)={eE|ve} and deg(v)=|E(v)| be the degree of v; for any edge eE, let deg(e) be the degree of e, which is the number of edges adjacent to e. Moreover, we write deg(G) for the maximum (vertex) degree in G. For two edges e,eE, we write distG(e,e) for the length of the shortest path between them in G (not containing e,e) and distG(e,e)= if e and e is disconnected. Similarly, for vertices v,vV, distG(v,v) is the shortest path between them in G and distG(v,v)= if v and v is disconnected.

2.1 List edge coloring

Fix a color set [q]={1,2,,q} where q. Let G=(V,E) be an undirected graph and ={(e)[q]:eE} be a collection of color lists associated with each edge in E. The pair (G,) is an instance of list edge coloring.

If (e)=[q] for any eE, we say (G,) is a q-edge coloring instance. If |(e)|deg(e)+β for any eE, we say (G,) is a β-extra edge coloring instance. We say σ:E[q] is a proper edge coloring if σ(e)(e) for any eE and σ(e1)σ(e2) for any e1e2. Let Ω denote the set of all proper edge colorings and μ be the uniform distribution on Ω.

Let ΛE and τ[q]Λ. We say τ is a proper partial edge coloring on Λ if it is a proper coloring on (G[Λ],|Λ) where G[Λ] is the subgraph of G induced by Λ and |Λ={(e):EΛ}. Let Ωτ be the set of all proper edge colorings on E that is compatible with τ, i.e. Ωτ={σΩ|τσ} . We also define μτ on Ω which is supported on Ωτ as μτ()=σμ[σ=|τσ]. For a subset SEΛ and a partial coloring ω on S, define ΩSτ as the set of all proper partial edge colorings on S that is compatible with τ and μSτ(ω)=σμ[ωσ|τσ]. Especially, when Λ={i} and τ(i)=c, we write the conditional distribution and the conditional marginal distribution by μic and μSic. Besides, we define the color lists after pinning τ by τ such that for any eEΛ, τ(e)={c(e)|μeτ(c)>0} and the degree after pinning by degτ(e)=|{ef|fEΛ}|.

For a given list edge coloring instance (G,), let ZG,(M) denote the number of proper colorings with the condition M satisfied, (or event M happens) and G,[M] denote the probability that the condition M is satisfied when a proper coloring is drawn uniformly at random. For an edge set FE, we usually use c(F) to denote the partial coloring on F. With a little abuse of notation, c(F) is sometimes referred to as the set of colors used on F. For a color a, we write aF,aF as shorthands for ac(F),ac(F) respectively.

2.2 The Wasserstein distance

In this work, we restrict our discussions and terminologies to finite probability spaces without invoking general measure theory.

Definition 4 (Wasserstein distance).

Let μ,ν be two distributions defined on the same finite set Ω equipped with a metric d(,). We define Γ(μ,ν) as the set of couplings of μ and ν. Then the Wasserstein (1-)distance is defined by

𝒲1(μ,ν):=infτΓ(μ,ν)𝐄(x,y)τ[d(x,y)].

In this paper, our metric d is always the Hamming distance. For two configurations σ,τ on [q]V, their Hamming distance is defined as d(σ,τ)=|{vV|σ(v)τ(v)}|. We define the notion of coupling independence for Gibbs distribution here.

Definition 5 (Coupling independence).

We say a Gibbs distribution μ over [q]V satisfies C-coupling independence if for any two partial configurations σ,τ[q]Λ on ΛV such that d(σ,τ)=1,

𝒲1(μσ,μτ)C

where μσ and μτ denote the Gibbs distribution conditional on σ and τ, respectively.

We will use the following inequality w.r.t Wasserstein distance in later proof.

Proposition 6.

Let μ,ν be arbitrary distributions on a common finite metric space (Ω,d). If there exists non-negative constants λi,1ik and distributions {μi}1ik, {νi}1ik on Ω such that

μν=i=1kλi(μiνi),

where we regard both sides as functions on Ω. Then

𝒲1(μ,ν)i=1kλi𝒲1(μi,νi).

The proof of Proposition 6 can be found in the full version.

2.3 Correlation Decay

Correlation decay refers to the phenomenon that the correlation between the color assignments of edges diminishes as their distance in the graph increases. Specifically, there are two primary notions of correlation decay: strong spatial mixing and weak spatial mixing. These two notions differ in how they measure the “distance” over which the correlation should decay.

Definition 7 (Strong spatial mixing).

The Gibbs distribution μ of the list edge coloring instance (G=(V,E),) satisfies strong spatial mixing (SSM) with exponential decay rate 1δ and constant C=C(q,Δ) if for any eE, every subset ΛE{e} and every pair of feasible pinning τ1,τ2 on Λ which differ on τ1,τ2={eΛ|τ1(e)τ2(e)}, we have that

μeσμeτTVC(1δ)K

where K=mineτ1,τ2distG(e,e).

Definition 8 (Weak spatial mixing).

The Gibbs distribution μ of the list edge coloring instance (G=(V,E),) satisfies weak spatial mixing (WSM) with exponential decay rate 1δ and constant C=C(q,Δ) if for any eE, every subset ΛE{e} and every pair of feasible pinning σ,τ on Λ, we have that

μeσμeτTVC(1δ)K

where K=mineΛdistG(e,e).

 Remark 9.

When defining Strong Spatial Mixing (SSM) and Weak Spatial Mixing (WSM) for individual problem instances, it is technically possible to choose a constant C large enough to satisfy the decay inequality since the instance is finite. For SSM/WSM to meaningfully imply algorithmic tractability, these constants must hold universally for an entire class of instances, independent of the size of graph. Therefore, when we informally state that we are establishing weak / strong spatial mixing property in this context, we refer to the existence of uniform constants C and δ for a certain class of edge coloring instances even as the graph grows arbitrarily large.

3 Recursion and marginals

In this section, we analyze the marginal bounds on the set of edges adjacent to a given vertex in a β-extra list-edge-coloring instance under various conditions.

3.1 Marginal bounds for general edge coloring

We begin by examining the marginal bounds on general graphs.

Lemma 10.

Given a list-edge-coloring instance (G,) where G=(V,E), and a vertex vV such that eEv:|(e)|deg(e)β2. Then for any vV, FE(v) and color a:

G,[ac(F)]|F|β1+|F|andG,[ac(F)]G,[ac(E(v))]|F|β1.

The proof of Lemma 10 is deferred to the full version of this paper. Especially, when |F|=1, we have the following corollary.

Corollary 11.

Let (G=(V,E),) be a β-extra list-edge-coloring instance. Then for any eE,vV,a(e),

G,[c(e)=a]G,[ac(Ev)]1β1.

3.2 Tree recursion for edge coloring

We now turn our attention to a more specialized structure: trees. Throughout this section, we fix a β-extra edge-coloring instance (T=(V,E),) with root r. For any vertex vV, let Tv be the sub-tree of T rooted at v and ETv be the edge set of Tv. Let Cv be the set of proper partial colorings on ETv(v) and 𝒟(Cv) be the set of all distributions on Cv.

Assume r has d children v1,v2,,vd. For any i[d], let ei=(r,vi) and Ti=Tvi. For brevity, since the color lists are fixed, we omit the color lists in the subscript in T,[] and ZT,(). For any i[d], we also write Ti[] for Ti,|Ti[] and ZTi() for ZTi,|Ti().

We introduce a tree recursion on the marginal distributions of partial colorings on “brooms”, where a “broom” is referred to as the edge set ETv(v) for a vertex vV. This recursion demonstrates how the marginal distributions on brooms propagate through the tree structure as in Figure 1.

Figure 1: Brooms on a tree.
Lemma 12.

Given distributions 𝐩i=(Ti[c(ETi(vi))=τ])τCvi on Cvi for each vi, we can compute the marginal distribution 𝐩r=(T[c(E(r))=π])πCr on Cr:

𝐩r(π)=fπ({𝐩i}i[d])=iτCvi:π(ei)τ𝐩i(τ)ρCriτCvi:ρ(ei)τ𝐩i(τ).

We provide the verification of Lemma 12 in the full version. We will regard f=(fπ)πCr:0Cv1×0Cv2××0Cvd𝒟(Cr) as a function taking inputs 𝐩=(𝐩1,𝐩2,,𝐩d) where 𝐩i0Cvi for i[d]. Note that in some cases, 𝐩 might not encode a distribution.

3.3 Marginal bounds propagated by recursion

Now we can give marginal bounds of probabilities that propagated by the recursion. For brevity, we define some notations of marginals for any non-zero vector 𝐩v0Cv with vV and c[q]: Let 𝐩v(c)=τCv:cτ𝐩v(τ), 𝐩v(c¯)=τCv:cτ𝐩v(τ), and 𝐩v(c1¯,c2¯)=τCv:c1,c2τ𝐩v(τ).

Especially, for 𝐩r𝒟(Cr), we define 𝐩r(i,c)=πCr:π(ei)=c𝐩r(π), and 𝐩r(i,c1,j,c2)=πCr:π(ei)=c1,π(ej)=c2𝐩r(π).

By Lemma 12, we have

𝐩r(π)=fπ({𝐩i}i[d])=i𝐩vi(π(ei)¯)ρCri𝐩vi(ρ(ei)¯). (1)

We have the following lemmas similar to Lemma 10 on trees whose proofs are also in the full version. Note that we do not require 𝐩i’s to be distributions in Lemma 13 and Lemma 14.

Lemma 13.

Given non-zero vector 𝐩i0Cvi for each i[d] and 𝐩r=f({𝐩i}i[d]), we have that for any color a[q],

𝐩r(a)dβ1+dand𝐩r(a)𝐩r(a¯)dβ1.
Lemma 14.

Given non-zero vector 𝐩i0Cvi for each i[d] and 𝐩r=f({𝐩i}i[d]), we have that for any color a[q] and j[d],

𝐩r(j,a)𝐩j(a¯)(β1)τCvj𝐩j(τ).

4 FPTAS for counting proper edge colorings on general graphs 𝒒𝟑𝚫

In this section, we prove the following main algorithmic result, which is a formal version of Theorem 1.

Theorem 15.

Assume Δ4. There exists a deterministic algorithm that outputs Z^ satisfying (1δ)ZG,Z^(1+δ)ZG, for any (Δ+2)-extra edge coloring instance (G,) with maximum degree Δ and given error bound 0<δ<1 in time (nδ)C(Δ), where n is the number of edges in G and C(Δ)=𝒪(ΔΔlogΔlogΔ) is a universal constant only depends on Δ.

Our key contribution is the following coupling independence result.

Theorem 16.

Let (G=(V,E),) be a ((1+ε)Δ+1)-extra list-edge-coloring instance. Then μE is (1+2ε)-coupling independent. That is, for any iE, a,b(i),

𝒲1(μEia,μEib)1+2ε.

We first set up our terminologies to argue about the Wasserstein distance. We define some upper bounds for the 𝒲1 distance between λΔ-extra list-colorings on Δ-degree graphs with s edges with respect to one different pinning. We will construct recursion on these upper bounds. An edge is pendant if one of its endpoints has degree exactly 1.

Definition 17 (Universal upper bounds for coupling independence).

Define

κs,Δ,λ:=sup(G=(V,E),)iE:i is pendanta,b(i),ab𝒲1(μEiia,μEiib)

where (G,) is taken over

  1. 1.

    all graph G=(V,E) such that deg(G)Δ, |E|s;

  2. 2.

    all color lists such that eE:|(e)|deg(e)+λΔ+1.

 Remark 18.

It is clear from the definition that κs+1,Δ,λκs,Δ,λ and κ1,Δ,λ=0.

Note that we only pin color on the edge i in Definition 17. If we need other pinnings, we can simply consider the pinnings as deleting the pinned edges and remove the pinned color from the lists of their adjacent edges.

The main lemma of this section is a recursion for κs,Δ,λ and leads to Theorem 16 immediately.

Lemma 19.

Let λ=1+ε for some ε>0 and s2. Then

κs,Δ,λ22+ε(κs1,Δ,λ+12).
Proof of Theorem 16.

Lemma 19 shows that supsκs,Δ,λ12+ε(1+2/ε)=1/ε. And

𝒲1(μEia,μEib)1+𝒲1(μEiia,μEiib).

To reduce to the pendant edge case, we may break i into two pendant edges i1,i2, connected to the two endpoints of i respectively, the color lists of i1,i2 are the same as i. We denote the new coloring instance by (G,). Then we have

μEi;(G,)ia=μEi1i2;(G,)i2ai1a,μEi;(G,)ib=μEi1i2;(G,)i2bi1b.

So by triangle inequality, we have

𝒲1(μE;(G,)ia,μE;(G,)ib)
1+𝒲1(μEi;(G,)ia,μEi;(G,)ib)
=1+𝒲1(μEi1i2;(G,)i2ai1a,μEi1i2;(G,)i2bi1b)
1+𝒲1(μEi1i2;(G,)i2ai1a,μEi1i2;(G,)i2bi1a)+𝒲1(μEi1i2;(G,)i2bi1a,μEi1i2;(G,)i2bi1b)
1+2ϵ.

The proof of Lemma 19 is based on a greedy one-step coupling of two marginal distributions with different pinning on a single edge.

Let (G=(V,E),) be a list-edge-coloring instance that fits into the constraints of κs,Δ,λ in Definition 17, iE be a pendant edge, and a,b(i) be colors. We do a one-step coupling to reduce the W1 distance between graphs with s edges to that between graphs with s1 edges using Proposition 6.

Denoting the two endpoints of i by u,v, we may assume u is the pendant vertex, that is deg(u)=1 without loss of generality.

We denote the non-empty set E(v)i by N, and use integers from 1 to denote the edges in N so that N={1,,d}. For every edge jN, define

γj:=μEiia(jb)μEiia(bN),δj:=μEiib(ja)μEiib(aN).

The following lemma describes the greedy coupling we use.

Lemma 20.
μEiiaμEiib= jNγjδj1+k(γkδk)(μEiiajbμEiibja)
+jN(γjδj)01+k(γkδk)(μEiiajbμEiib)
+jN(δjγj)01+k(γkδk)(μEiiaμEiibja)
+11+k(γkδk)(μEiiabNμEiibaN).

The first, second and third lines in the RHS correspond to the couplings with 1, 2 and 0 discrepancies respectively. With the marginal bounds in Section 3, the chance of 2 discrepancies is controlled by the chance of 0 discrepancy when the color lists are Δ-extra, and we have a constant rate of discrepancy decay and proves Lemma 20.

We leave the proofs of Lemmas 19 and 20 to the full version of this paper.

4.1 Proof of Theorem 15

Theorem 16 shows that any ((1+ε)Δ+1)-extra list-edge-coloring instance is (1+2ε)-coupling independent. This is because adding additional pinnings can be viewed as generating a new ((1+ε)Δ+1)-extra list-edge-coloring instance by deleting the pinned edges and removing the corresponding colors from the lists of their adjacent edges.

The work of [5] designs an FPTAS for counting the partition function of any Gibbs distribution of permissive spin systems that is marginally bounded and coupling independent. A spin system is specified by a 4-tuple S=(G=(V,E),q,AE,AV) where the state space is [q]V and the weight of a configuration is characterized by the matrices AE0q×q and AV0q. The Gibbs distribution is defined by:

μ(σ)w(σ):=u,vEAE(σ(u),σ(v))vVAV(σ(v)).

The normalizing factor of μ is called the partition function Z:=σ[q]Vw(σ).

We say S is permissive if for any partial configuration τ[q]Λ with ΛV, the conditional partition function Zτ=σ:τσw(σ)>0. For τ[q]Λ with ΛV, let μvτ be the marginal distribution on vVΛ conditional on the partial configuration τ. We say μ is b-marginally bounded if for any partial configuration τ[q]Λ with ΛV, any vertex vVΛ and c[q] such that μvτ>0, we have μvτb.

The main result of [5] that we will use is as follows.

Theorem 21 ([5]).

Let q2,b>0,C>0,Δ3 be constants. There exists a deterministic algorithm such that given a permissive spin system 𝒮=(G,q,AE,AV) and error bound 0<ε<1, if the Gibbs distribution of 𝒮 is b-marginally bounded and satisfies C-coupling independence, and the maximum degree of G is at most Δ, then it returns Z^ satisfying (1ε)ZZ^(1+ε)Z in time (nε)f(q,b,C,Δ), where f(q,b,C,Δ)=Δ𝒪(C(logb1+logC+loglogΔ))logq is a constant.

In our setting, the spin system is list-edge-coloring instance. Be careful with the parameters since the spins are on edges of the graph. In order to prove Theorem 15, we need the marginal lower bound from [12].

Lemma 22 (Corollary of Lemma 3 in [12]).

Fix a β-extra edge coloring instances (G,) of maximum degree Δ with Gibbs distribution μ. For any partial coloring τ on ΛE, any eEΛ, and cτ(e), it holds that

μeτ(c)(11|τ(e)|degτ(e))degτ(e)|τ(e)|(11β)2Δ2β+2Δ2.

Now we give the proof of our main theorem in this section.

Proof of Theorem 15.

It is easy to verify that any β-extra edge coloring instance with β1 is permissive. From Theorem 16 and Lemma 22, we know that the Gibbs distribution μ of a (Δ+2)-extra edge coloring instance (G,) is (1+2Δ)-coupling independent and b-marginally bounded where b=(11Δ+2)2Δ23Δ is Ω(1Δ). Then by Theorem 21, there exists a deterministic algorithm that outputs Z^ satisfying (1δ)ZG,Z^(1+δ)ZG, in time (nδ)C(Δ) where n is the number of edges in G and C(Δ)=𝒪(ΔΔlogΔlogΔ).

5 Strong spatial mixing for edge colorings on trees when 𝒒>(𝟑+𝒐(𝟏))𝚫

In this section, we prove our theorem for strong spatial mixing.

Theorem 23.

Given a β-extra edge coloring instance (G,) where G is a tree of maximum degree Δ, the uniform distribution on such instance exhibits strong spatial mixing with exponential decay rate 1δ and constant C=max{32qΔ+22Δ2(1δ)3,(1δ)4} if β>max{Δ+50,(1+ηΔ)Δ+1}, where

δ=(1+(β1Δ)/Δ)2(1+ηΔ)22(1+(β1Δ)/Δ)2,ηΔ=missingO(log2ΔΔ).

Specifically, if β=(1+log3ΔΔ)Δ+50, then δlog3ΔΔ.

We already introduced the recursion for marginal probabilities of edge colorings on trees and derived certain marginal bounds in Section 3. We will then analyze its Jacobian matrix in Section 5.1. Using the bounds on the norm of the Jacobian matrix, we prove Theorem 23 in Section 5.2. Finally, we discuss the limit of our approach and possible further improvement in Section 5.3. A key ingredient in our bounds for the norm of Jacobian matrix is a bound for certain covariance matrices. Due to space limitations, proofs of technical lemmas and bounding the covariance matrices is addressed in the full version.

5.1 Upper bound the 2-norm of Jacobian

5.1.1 The Jacobian

Recall the recursion f for marginals introduced in Section 3.2. We regard f=(fπ)πCr:0Cv1×0Cv2××0Cvd𝒟(Cr) as a function taking inputs 𝐩=(𝐩1,𝐩2,,𝐩d) where 𝐩iCvi for i[d]. The Jacobian of f is a matrix (𝒥f)(𝐩)Cr×i[d]Cvi. Since Cvi’s are disjoint, for every τi[d]Cvi, we will denote it by (i,τ) if τCvi for clarity. Therefore,

(𝒥f)π,(i,τ)(𝐩)=fπ𝐩i(τ).

For each i[d], define the matrix 𝒥iCr×Cvi with entries

(𝒥if)π,τ(𝐩)=(𝒥f)π,(i,τ)(𝐩)

for every πCr and τCvi.


We can write 𝒥if in a compact way. It derives from direct computation.

Proposition 24.

Let 𝐩r=f(𝐩). Then

(𝒥if)(𝐩)=c(ei)𝐚i,c𝐛i,c,

where 𝐚i,c=f(𝐩)[𝟙[π(ei)=c]πCr:π(ei)=c𝐩r(π)]πCr and
𝐛i,c=[𝟙[cτ]τCvi:cτ𝐩i(τ)]τCvi. 222We write 𝐮𝐯 for their Hadamard product (entry-wise product).

A well-known trick in the analysis of decay of correlation is to apply a potential function on the marginal recursion to amortize the contraction rate. Given an increasing potential function ϕ:[0,1], we define fϕ such that for any πCr and 𝐦Cv1×Cv2××Cvd,

fπϕ(𝐦)=ϕ(fπ((ϕ1(𝐦1),ϕ1(𝐦2),,ϕ1(𝐦d)))).

As a result, the Jacobian of fϕ can be obtained by the chain rule and the inverse function theorem as follows.

Proposition 25.

Given a smooth increasing function ϕ:[0,1] with derivative Φ=ϕ, let 𝐩=ϕ1(𝐦). Then we have

(𝒥ifϕ)(𝐦)=c(Φ(f(𝐩))𝐚i,c)(𝐛i,cΦ1(𝐩i)).

Taking Φ(x)=1x, we have

(𝒥ifϕ)(𝐦)=c𝐚i,cϕ(𝐛i,cϕ),

where 𝐚i,cϕ=f(𝐩)[𝟙[π(ei)=c]πCr:π(ei)=c𝐩r(π)]πCr and
𝐛i,cϕ=[𝟙[cτ]𝐩i(τ)τCvi:cτ𝐩i(τ)]τCvi.

5.1.2 Bounding (𝓙𝒇ϕ)(𝐩)𝟐

In this section, we aim to derive an upper bound for the 2-norm of the Jacobian of the tree recursion. For brevity, we follow some notations which is defined in Section 3.3 of marginal probabilities w.r.t 𝐩i and 𝐩r.

Also we introduce some notations of matrices used in later proof.

Definition 26.

For a distribution 𝐩 over proper colorings on a broom ETv(v)={e1,,em}, we define Xv={(i,c):i[m],c(ei)} and its (local) covariance matrix 𝖢𝗈𝗏(𝐩)Xv×Xv with entries:

𝖢𝗈𝗏(𝐩)((i,c1),(j,c2))=τCv:τ(ei)=c1τ(ej)=c2𝐩(τ)(τCv:τ(ei)=c1𝐩(τ))(τCv:τ(ej)=c2𝐩(τ)).
Definition 27.

For a distribution over colorings 𝐩 on a broom ETv(v)={e1,,em}, we define Xv={(i,c):i[m],c(ei)} and its diagonal matrix of mean vector Π(𝐩)Xv×Xv as follows,

Π(𝐩)=diag{τCv:τ(ei)=c𝐩(τ)}(i,c)Xv.

Now we define the notion of spectral independence on a broom.

Definition 28.

For any distribution over colorings 𝐩 on a broom E(v)={e1,,em}, we say 𝐩 is C-spectrally independent if it holds that

𝖢𝗈𝗏(𝐩)CΠ(𝐩).

The following is the main result in this section.

Condition 1 (marginal bound).

For any i[d], 𝐩i is a distribution on Cvi such that for any color a

𝐩i(a)|ETi(vi)|β1+|ETi(vi)|,

and for 𝐩r=(fπ(𝐩1,𝐩2,,𝐩d))πCv,

𝐩r(i,a)𝐩i(a¯)1β1,

where β(1+o(1))Δ .

Theorem 29.

For any i[d], 𝐩i=ϕ1(𝐦i) and 𝐩r=f((𝐩i)i[d]) satisfy Condition 1 and (1+η)-spectrally independent. Then β1+(1+η)Δ12δ implies that

𝒥fϕ(𝐦)21δΔ

where 𝐦=ϕ(𝐩) and ϕ(x)=2x.

We will prove the theorem in Section 5.1.4 after introducing our key reduction in Section 5.1.3.

5.1.3 Dimension reduction

By the definition of 2-norm, we have that (𝒥fϕ)(𝐩)2=λmax((𝒥fϕ)(𝐩)(𝒥fϕ)(𝐩)). Let 𝐀:=(𝒥fϕ)(𝐩)(𝒥fϕ)(𝐩). We have that

𝐀=i=1d(𝒥ifϕ)(𝐩)(𝒥ifϕ)(𝐩)=i=1dc1,c2(ei)𝐛i,c1ϕ,𝐛i,c2ϕ𝐚i,c1ϕ(𝐚i,c2ϕ).

The last equation simply follows from Proposition 25. The above calculation suggests that although the dimension of 𝐀 is exponential in d, its rank is polynomial in d. In the following, we will find a much smaller matrix which can be used to upper bound 𝐀. The idea is to use the trace method, namely to study Tr(𝐀k). We have the following lemma.

Lemma 30.

For any positive semi-definite matrix 𝐌n×n, we have that

λmax(𝐌)=limk(Tr(𝐌k))1k.

To simplify notations, we let

V(i,z1,c1,z2,c2):=𝐛i,c1ϕ,𝐛i,c2ϕ(𝟙[z1=c1]𝐩r(i,c1))(𝟙[z2=c2]𝐩r(i,c2)).

Then we can write 𝐀 explicitly.

𝐀(π,τ)=f(𝐩)(π)f(𝐩)(τ)i=1dc1,c2(ei)V(i,π(ei),c1,τ(ei),c2). (2)

Let gkπ(i,c) denote

τ1,,τk1Crτ0=πj=1k1f(𝐩)(τj)
×i1,,ik1[d]ik=ic1,1(ei1)c1,k(eik)c2,1(ei1)c2,k1(eik1)c2,k=cj=1k1V(ij,τj1(ij),c1,j,τj(ij),c2,j)
×𝐛i,c1,kϕ,𝐛i,cϕ(𝟙[τk1(i)=c1,k]𝐩r(i,c1,k)).

We omit π in gkπ for brevity. Then we have that for any πCr

𝐀k(π,π)=f(𝐩)(π)i=1dc(ei)gk(i,c)(𝟙[π(i)=c]𝐩r(i,c)). (3)

Fix π, then we will show that {gk}k1 can be computed recursively, which gives a simple representation of 𝐀k(π,π). Let X={(i,c)|i[d],c(ei)} be the set of all feasible edge-color pairs.

Lemma 31.

If 𝐁(𝐩)X×X satisfies that

𝐁(𝐩)((i,c2),(j,c4)) =c3(ej)𝐩j(c3¯,c4¯)𝐩j(c3¯)𝐩j(c4¯)×(𝐩r(j,c3,i,c2)𝐩r(j,c3)𝐩r(i,c2)).

Then we have that gk=απ𝐁k1 where

απ(i,c2) =c1(ei)𝐩i(c1¯,c2¯)𝐩i(c1¯)𝐩i(c2¯)×(𝟙[π(i)=c1]𝐩r(i,c1)).

It directly indicates that we can upper bound the maximum degree of 𝐀 by the 2-norm of 𝐁. In the following, we omit (𝐩) in 𝐁(𝐩) for brevity if there is no ambiguity. Lemma 31 directly indicates that we can use the 2-norm of 𝐁 to upper bound that of 𝐀.

Lemma 32.

Let 𝐁X×X and απ denote the matrix and the vector defined in  Lemma 31. Then we have that for any k1,

πCr𝐀k(π,π)=πCrf(𝐩)(π)απ𝐁k1βπ (4)

where βπ(j,c4)=𝟙[π(j)=c4]𝐩r(j,c4), implying that λmax(𝐀)𝐁2.

The details in dimension reduction from 𝐀 to 𝐁 including the proofs of Lemma 30, Lemma 31 and Lemma 32 are in the full version.

5.1.4 Bound the transition matrix

Let DTX×X be the diagonal matrix where DT((i,c),(i,c))=𝐩i(c¯). In this section, we give an upper bound for 𝐁2 as 𝐁 can be represented as the product of covariance matrices of 𝐩r, 𝐩i and some auxiliary diagonal matrices.

Proposition 33.

Let CiXvi×|(ei)| denote the matrix satisfying that Ci((j,c1),c2)=𝟙[c1=c2] for any i[d]. Let 𝐑X×X denote

diag{Ci𝖢𝗈𝗏(𝐩i)Ci}i[d].

Then we have that

𝐁=𝖢𝗈𝗏(𝐩r)DT1𝐑DT1.

It can be verified via direct computation. Then we can establish Theorem 29 through the above proposition.The detailed proof of Proposition 33 and Theorem 29 are in the full version.

5.2 Strong spatial mixing via contraction

As Theorem 29 gives the upper bounds on the 2-norm of the Jacobian matrix, we now proceed to demonstrate how these bounds can be used to prove strong spatial mixing via contraction. Specifically, we will quantify the decay of correlations using the derived bounds. We need an upper bound for the covariance matrix defined in Definition 26, i.e. the spectral independence property, which can be found in the full version of this paper. Let BG(u,d) denote {uV|distG(u,u)=d}.

Lemma 34.

Given a tree T rooted with r of depth +1 and weight functions 𝐩v:ETv(v)0 for any vBT(r,). If 2 and βΔ+50, then 𝐩r:=f0((𝐩v)vB(r,)) is (1+ηΔ)-spectrally independent, i.e.

𝖢𝗈𝗏(𝐩r)(1+ηΔ)Π(𝐩r),

where ηΔ is O(log2ΔΔ).

Now we are able to prove Theorem 23. Suppose there are two pinnings τ1, τ2 that differ on edges below BT(r,). The key idea for proving Theorem 23 is to leverage the bound of Jacobian matrix from Theorem 29 on brooms ETu(u) for any vertices uBT(r,k) with k<2, while for the remaining two levels of brooms, specifically those corresponding to vertices in BT(r,2) and BT(r,1), we apply trivial bounds of Jacobian. Note that we “abandon” the lowest two levels of brooms due to the constraint for spectral independence from Lemma 34. The complete proof of Theorem 23 is included in the full version.

5.3 Worst-case scenario

Though Theorem 29 establishes an upper bound on the 2-norm of the Jacobian matrix under certain conditions, in this section, we will introduce the “worst” pinning of q-edge coloring, which is the bottleneck of our analysis. In fact, with potential function ϕ(x)=2x and applying 2-norm for the correlation decay step, the best bound we expected to prove is q>(3+52+o(1))Δ2.618Δ. However what we can only prove strong spatial mixing for instances of (1+o(1))Δ-extra edge colorings, as we currently lack a better upper bound for 𝐑. Note that the pinning that maximizes eq.(10) in the full version is the same as the pinning in Theorem 36 and it can indeed achieve the upper bound Δ1(q2Δ+2)2 under this worst pinning.

Before showing the worst-case scenario, we introduce the following technical lemma to calculate the eigenvalues of some simple matrices.

Lemma 35.

Given constant k1,k20, the eigenvalues of k1𝟏𝟏+k2𝖨𝖽n are either k2 or nk1+k2 where 𝖨𝖽n is the identity matrix in n×n.

Now we show the worst pinning and corresponding norm value of 𝐁 and (𝒥fϕ)(𝐩).

Theorem 36.

Under some specific pinning, the norm of 𝐁 satisfies lower bound that

(𝒥fϕ)(𝐩)22=𝐁2=Δ1(qΔ)(q2Δ+2).

Then 𝐁2<1Δ implies that q>(3+52+o(1))Δ.

For the worst case scenario, consider the following instance of edge coloring (G,) generated from a q-coloring instance by pinning:

  1. 1.

    deg(r)=1, (e1)={Δ,,q}, that is, in original instance r has Δ children and we pin e2,,eΔ with 1,2,,Δ1.

  2. 2.

    deg(v1)=Δ and for any uN(v1) and ur, ({u,v1})={Δ,,q}, that is, we assume u has Δ1 children and we pin the children of u with color 1,2,,Δ1.

Theorem 36 indicates the limitations of our current analysis by providing a lower bound on the norm of the Jacobian matrix. This indicates the best bound one can expect to use 2-norm and the potential function ϕ(x)=2x is q2.618Δ.

The formal process for computing 𝐁 are in the full version. Theorem 36 indicates the limitations of our current analysis by providing a lower bound on the norm of the Jacobian matrix. This indicates the best bound one can expect to use 2-norm and the potential function ϕ(x)=2x is q2.618Δ.

6 Tight bound for weak spatial mixing

In this section, we prove the tight bound for weak spatial mixing of q-edge coloring instance on trees. Note that the strong spatial mixing of q-edge coloring instance is a special case of spatial mixing of list instance (for list coloring instance, weak spatial mixing is equivalent to strong spatial mixing). Therefore the theorem stated in this section is a weak version of spatial mixing conclusions and thus we can give a tight bound for trees.

The main theorem for weak spatial mixing on trees is as follows.

Theorem 37.

Given a tree T=(V,E) with n vertices, m edges and maximum degree Δ. Suppose that instance (T,) is a q-edge coloring instance (i.e. for any eE, (e)=[q]). Then we have that

  1. 1.

    If q2Δ1, the edge coloring instance satisfies weak spatial mixing with rate 11ε2Δ(1+ε), where ε=max{Δ1Δ,e1e}.

  2. 2.

    If q2Δ2, there exists an instance that does not satisfy weak spatial mixing.

Consider the following example which simply shows that hardness of weak spatial mixing:

Example 38.

Consider a (Δ1)-regular tree with depth d (the depth of the root r is 0) and d is an even number. Let Λ be the edges between vertices of depth d and d1. Let τ1 be the pinning over Λ which only uses 1,2,,Δ1 and τ2 be the pinning over Λ which only uses Δ,,2Δ2. It is easy to verify that for any {u,v}EΛ and dep(u)=dep(v)+1,

σΩτ1,,σ({u,v}){{1,,Δ1},2|dep(u){Δ,,2Δ2},2dep(u)
σΩτ2,σ({u,v}){{Δ,,2Δ2},2|dep(u){1,,Δ1},2dep(u)

where dep(u) is the depth of u. Therefore, for any eEΛ, we have that

μeσμeτTV=1

which demonstrates that the weak spatial mixing does not hold.

The proof scheme is also using the idea of correlation decay and we use the uniform distribution as bridge to prove the weak spatial mixing property. And we use another recursion which is different from that in strong spatial mixing. Instead of considering a broom of edges, we specify the marginal probability of a pendant edge and then generalize to every edge. Since the lists of feasible colors are clear, we use T[] to denote T,[] for simplicity.

Lemma 39 (One-step contraction).

Suppose (T=(V,E),) is a q-edge coloring instance, where T is a tree with a pendant edge e={r,r} on its root r (that is, deg(r)=1) and τ is the pinning over a set of edges Λ whose edges are incident to leaf vertices. If for any ei={vi,r}E, the subtree Ti with pendant edge ei satisfies that

c[q],|Ti{ei}[c(ei)=c|c(Λ)=τ]1q|δ

where δ<1q is a universal constant, then we have that

c[q],|T[c(e)=c|c(Λ)=τ]1q|2Δ2q(1δ|q2Δ+2|)δ.

The formal process for computing 𝐁 are in the full version of this paper. Theorem 36 indicates the limitations of our current analysis by providing a lower bound on the norm of the Jacobian matrix. This indicates the best bound one can expect to use 2-norm and the potential function ϕ(x)=2x is q2.618Δ.

References

  • [1] Dorna Abdolazimi, Kuikui Liu, and Shayan Oveis Gharan. A matrix trickle-down theorem on simplicial complexes and applications to sampling colorings. In 62nd IEEE Annual Symposium on Foundations of Computer Science, FOCS 2021, Denver, CO, USA, February 7-10, 2022, pages 161–172. IEEE, IEEE, 2021. doi:10.1109/FOCS52979.2021.00024.
  • [2] Nima Anari, Kuikui Liu, and Shayan Oveis Gharan. Spectral independence in high-dimensional expanders and applications to the hardcore model. SIAM Journal on Computing, 53(6):FOCS20–1, 2021.
  • [3] Charlie Carlson, Xiaoyu Chen, Weiming Feng, and Eric Vigoda. Optimal mixing for randomly sampling edge colorings on trees down to the max degree. In Proceedings of the 2025 Annual ACM-SIAM Symposium on Discrete Algorithms (SODA), pages 5418–5433. SIAM, 2025. doi:10.1137/1.9781611978322.184.
  • [4] Charlie Carlson and Eric Vigoda. Flip dynamics for sampling colorings: Improving (11/6—ε) using a simple metric. In Proceedings of the 2025 Annual ACM-SIAM Symposium on Discrete Algorithms (SODA), pages 2194–2212. SIAM, 2025. doi:10.1137/1.9781611978322.71.
  • [5] Xiaoyu Chen, Weiming Feng, Heng Guo, Xinyuan Zhang, and Zongrui Zou. Deterministic counting from coupling independence. arXiv preprint arXiv:2410.23225, 2024. doi:10.48550/arXiv.2410.23225.
  • [6] Xiaoyu Chen and Xinyuan Zhang. A near-linear time sampler for the Ising model with external field. In Proceedings of the 2023 Annual ACM-SIAM Symposium on Discrete Algorithms (SODA), pages 4478–4503. SIAM, 2023. doi:10.1137/1.9781611977554.CH170.
  • [7] Zongchen Chen and Yuzhou Gu. Fast sampling of b-matchings and b-edge covers. In Proceedings of the ACM-SIAM Symposium on Discrete Algorithms (SODA 2024), pages 4972–4987, 2024. doi:10.1137/1.9781611977912.178.
  • [8] Zongchen Chen, Kuikui Liu, Nitya Mani, and Ankur Moitra. Strong spatial mixing for colorings on trees and its algorithmic applications. In 2023 IEEE 64th Annual Symposium on Foundations of Computer Science (FOCS), pages 810–845. IEEE, 2023. doi:10.1109/FOCS57990.2023.00053.
  • [9] Lucas De Meyer, František Kardoš, Aurélie Lagoutte, and Guillem Perarnau. An algorithmic Vizing’s theorem: toward efficient edge-coloring sampling with an optimal number of colors. arXiv preprint arXiv:2501.11541, 2025.
  • [10] Michelle Delcourt, Marc Heinrich, and Guillem Perarnau. The Glauber dynamics for edge-colorings of trees. Random Structures & Algorithms, 57(4):1050–1076, 2020. doi:10.1002/RSA.20960.
  • [11] Charilaos Efthymiou, Andreas Galanis, Thomas P Hayes, Daniel Štefankovič, and Eric Vigoda. Improved strong spatial mixing for colorings on trees. Approximation, Randomization, and Combinatorial Optimization. Algorithms and Techniques, 2019.
  • [12] David Gamarnik, Dmitriy Katz, and Sidhant Misra. Strong spatial mixing of list coloring of graphs. Random Structures & Algorithms, 46(4):599–613, 2015. doi:10.1002/RSA.20518.
  • [13] Marc Heinrich, Alice Joffard, Jonathan Noel, and Aline Parreau. Unpublished manuscript, 2019. URL: https://hoanganhduc.github.io/events/CoRe2019/CoRe_2019_Open_Problems.pdf.
  • [14] Mark R Jerrum, Leslie G Valiant, and Vijay V Vazirani. Random generation of combinatorial structures from a uniform distribution. Theoretical computer science, 43:169–188, 1986. doi:10.1016/0304-3975(86)90174-X.
  • [15] Ankur Moitra. Approximate counting, the Lovász local lemma, and inference in graphical models. Journal of the ACM, 66(2):10:1–10:25, 2019. doi:10.1145/3268930.
  • [16] Yulin Wang, Chihao Zhang, and Zihan Zhang. Sampling proper colorings on line graphs using (1+ o (1)) Δ colors. In Proceedings of the 56th Annual ACM Symposium on Theory of Computing, pages 1688–1699, 2024. doi:10.1145/3618260.3649724.