Abstract 1 Introduction 2 Preliminary 3 Exposition of the Algorithm: Special Case 4 Conclusion References

Perfect Simulation of Las Vegas Algorithms via Local Computation

Xinyu Fu ORCID State Key Laboratory for Novel Software Technology, New Cornerstone Science Laboratory, Nanjing University, China Yonggang Jiang ORCID MPI-INF, Saarbrücken, Germany
Saarland University, Saarbrücken, Germany
Yitong Yin ORCID State Key Laboratory for Novel Software Technology, New Cornerstone Science Laboratory, Nanjing University, China
Abstract

The notion of Las Vegas algorithms was introduced by Babai (1979) and can be defined in two ways:

  • In Babai’s original definition, a randomized algorithm is called Las Vegas if it has a finitely bounded running time and certifiable random failure.

  • Another definition widely accepted today is that Las Vegas algorithms refer to zero-error randomized algorithms with random running times.

The equivalence between the two definitions is straightforward. Specifically, for randomized algorithms with certifiable failures, repeatedly running the algorithm until no failure is encountered allows for faithful simulation of the correct output when it executes successfully.

We show that a similar perfect simulation can also be achieved in distributed local computation. Specifically, in the 𝖫𝖮𝖢𝖠𝖫 model, with a polylogarithmic overhead in time complexity, any Las Vegas algorithm with finitely bounded running time and locally certifiable failures can be converted to a zero error Las Vegas algorithm. This transformed algorithm faithfully reproduces the correct output of the original algorithm in successful executions. This is achieved by a reduction to a distributed sampling problem under the Lovász Local Lemma (LLL), where the objective is to sample from the joint distribution of random variables avoiding all bad events. We then design the first efficient algorithm to solve this sampling problem in the 𝖫𝖮𝖢𝖠𝖫 model.

Keywords and phrases:
Las Vegas algorithms, perfect simulation, Lovász Local Lemma, sampling
Copyright and License:
[Uncaptioned image] © Xinyu Fu, Yonggang Jiang, and Yitong Yin; licensed under Creative Commons License CC-BY 4.0
2012 ACM Subject Classification:
Theory of computation Distributed algorithms
Related Version:
Full Version: https://arxiv.org/abs/2311.11679
Editor:
Shubhangi Saraf

1 Introduction

The Las Vegas algorithm, introduced by Babai [2], stands as a cornerstone in the theory of computing, defining the ZPP class for decision problems efficiently solvable by Las Vegas algorithms. Beyond decision problems, Las Vegas algorithms are pivotal in tackling optimization [24, 6], searching [28, 31], or sampling [34, 16] problems.

Las Vegas algorithms can be defined in two distinct yet related ways. In Babai’s original work [2], Las Vegas algorithms are defined as randomized algorithms whose failures are certifiable:

  • A Las Vegas algorithm produces the correct output or reports failure within a finite bounded time.

Another definition of the Las Vegas algorithm, widely accepted today and utilized in various contexts, e.g. in [27, 32, 29], defines Las Vegas algorithms as zero-error randomized algorithms:

  • A Las Vegas algorithm may exhibit a random running time but always produces the correct output.

The equivalence between these two definitions is evident. Through truncation, we can transform a zero-error Las Vegas algorithm with random running time into a Las Vegas algorithm with bounded running time and certifiable failure. Conversely, restarting the algorithm upon failure allows us to convert a Las Vegas algorithm with certifiable failure into a zero-error Las Vegas algorithm. This strategy, retrying with independent random choice until success, defines a rejection sampling procedure that faithfully simulates the random output of the Las Vegas algorithms, provided that they return successfully without failure.

This strategy for perfectly simulating the Las Vegas algorithm relies on a global coordination machinery: Every part of the algorithm must be notified if a failure occurs anywhere. However, our focus is on exploring how this could be accomplished through local computations, without the need for global coordination.

The LOCAL model.

Local computations are formally characterized by the 𝖫𝖮𝖢𝖠𝖫 model [25, 33]. An instance consists of a network G=(V,E), which is an undirected graph, and a vector 𝒙=(xv)vV of local inputs. Each node vV receives xv and n=|V| as input and can access private random bits. Communications are synchronized and proceed in rounds. In each round, each node may perform arbitrary local computation based on all information collected so far and send messages of arbitrary sizes to all its neighbors. This gives a 𝖫𝖮𝖢𝖠𝖫 algorithm. The time complexity is measured by the number of rounds that the algorithm spends until all nodes are terminated. A 𝖫𝖮𝖢𝖠𝖫 algorithm is said to be a t(n)-round 𝖫𝖮𝖢𝖠𝖫 algorithm on a class of instances, if it always terminates within t(n) rounds on every instance from that class, where n represents the number of nodes of the instance.

The (Babai’s) Las Vegas algorithm can be defined in the 𝖫𝖮𝖢𝖠𝖫 mode with locally certifiable failures. A t(n)-round 𝖫𝖮𝖢𝖠𝖫 algorithm is called Las Vegas if each node vV returns a pair (Yv,Fv), where Yv stands for the local output at v, and Fv{0,1} indicates whether the algorithm failed locally at v. The algorithm successfully returns if none of the nodes fails. Furthermore, it is guaranteed that vV𝔼[Fv]<1. This notion of Las Vegas 𝖫𝖮𝖢𝖠𝖫 algorithm was formulated in [14].

In this paper, we try to answer the following fundamental question:

Can we faithfully simulate the correct output avoiding all local failures via local computation?

Specifically, we wonder whether a fixed-round Las Vegas 𝖫𝖮𝖢𝖠𝖫 algorithm with locally certifiable failures, can be converted into a zero-error Las Vegas 𝖫𝖮𝖢𝖠𝖫 algorithm that produces the correct output (Yv)vV conditioned on vVFv=0, where the distribution of the correct output is faithfully preserved. This can be viewed as the analog of rejection sampling, carried out without any form of global coordination.

In this paper, for the first time, we answer this question affirmatively. We prove the following result for the perfect simulation of Las Vegas algorithms via local computation.

Theorem 1 (main theorem, informal).

Any t(n)-round Las Vegas 𝖫𝖮𝖢𝖠𝖫 algorithm can be converted to a zero-error Las Vegas 𝖫𝖮𝖢𝖠𝖫 algorithm, which terminates within t(n)polylog(n) rounds with probability 1nO(1), and returns the output of the t(n)-round Las Vegas 𝖫𝖮𝖢𝖠𝖫 algorithm conditioned on no failure.

In the above theorem, the output of the zero-error Las Vegas 𝖫𝖮𝖢𝖠𝖫 algorithm is identically distributed as the output of the t(n)-round Las Vegas 𝖫𝖮𝖢𝖠𝖫 algorithm conditioned on that none of the nodes fails, i.e. the zero-error algorithm perfectly simulates a successful running of the algorithm that may locally fail.

To see how nontrivial this is, consider a weakened task: to generate an assignment of random bits so that under this random choice the algorithm terminates without failure. One may think of this as “solving” for the random bits under which the algorithm successfully returns, which is weaker than our goal, where the generated random bits are further required to follow the correct distribution. However, for local computation, just solving the feasible random bits without changing their distribution is already highly nontrivial.

In a seminal work [14], Ghaffari, Harris, and Kuhn gave a systematic approach to solve the good random bits under which a Las Vegas 𝖫𝖮𝖢𝖠𝖫 algorithm successfully returns. Their derandomization based approach preserves the support of the distribution, hence was more suitable for the distributed graph problems for constructing feasible solutions on graphs. Specifically, polylog(n)-round reductions were established between the two types of Las Vegas 𝖫𝖮𝖢𝖠𝖫 algorithms for such problems.

In contrast, the perfect simulation guaranteed in Theorem 1 preserves the distribution, therefore, the result can apply to problems beyond constructing feasible solutions, for example, the sampling problems.

Consider the Gibbs distributions defined on the network G=(V,E). Each node v corresponds to a variable with finite domain Σ. Let be a class of constraints, where each f is a nonnegative-valued function f:Σ𝗏𝖻𝗅(f)0 defined on the variables in 𝗏𝖻𝗅(f)V. This defines a Gibbs distribution μ over all assignments σΣV by:

μ(σ)ff(σ𝗏𝖻𝗅(f)).

Such Gibbs distribution μ is said to be local, if: (1) for any f, the diameter of 𝗏𝖻𝗅(f) in graph G is bounded by a constant; and (2) for any partial assignment σΣΛ specified on ΛV, if σ is locally feasible, i.e. if f(σ𝗏𝖻𝗅(f))>0 for all f with 𝗏𝖻𝗅(f)Λ, then σ is (globally) feasible, which means that σ can be extended to a feasible full assignment τΣV such that τΛ=σ and μ(τ)>0.111In [13], this property of local feasibility implying global feasibility in Gibbs distribution was called “locally admissible”.

A Gibbs distribution μ is said to have strong spatial mixing with exponential decay if the discrepancy (measured in the total variation distance) between the marginal distributions μvσ,μvτ at any vV given the respective feasible boundary conditions σ,τΣΛ on ΛV that differ over an arbitrary ΔΛ, satisfies:

dTV(μvσ,μvτ)|V|O(1)exp(Ω(distG(v,Δ))).

The strong spatial mixing is a key property for sampling algorithms. Its implication for efficient sampling from general Gibbs distributions is a major open problem. In [13, Corollary 5.3], a polylog(n)-round Las Vegas 𝖫𝖮𝖢𝖠𝖫 algorithm with bounded local failures was given, for perfect sampling from local Gibbs distributions that have strong spatial mixing with exponential decay. By Theorem 1, this immediately implies the following result for the perfect simulation of Gibbs distributions via local computation.

Corollary 2.

For any class of local Gibbs distributions that have strong spatial mixing with exponential decay, there is a 𝖫𝖮𝖢𝖠𝖫 algorithm for perfect sampling from the Gibbs distribution, which terminates within polylog(n) rounds with probability 1nΩ(1).

 Remark 3.

The computational tractability of sampling from general Gibbs distributions with strong spatial mixing remains unresolved, even in the sequential and centralized setting. Corollary 2 certainly does not resolve the computational complexity of sampling. This is because the 𝖫𝖮𝖢𝖠𝖫 model does not impose any bounds on the computational costs, and generic reductions such as the one stated in Theorem 1 must apply to all 𝖫𝖮𝖢𝖠𝖫 algorithms. Such a lack of limitation on computational power was indeed common in general (black-box) reductions in the 𝖫𝖮𝖢𝖠𝖫 model and may be necessary. For instance, the derandomization of general 𝖫𝖮𝖢𝖠𝖫 algorithms in [14] also relied on a local exhaustive search of random bits. Here, our objective is even more ambitious: to preserve the distribution of the random bits of a successful run.

On a positive note, if it is known that the correct output has strong spatial mixing, as in the case of Corollary 2, then the perfect simulation in Theorem 1 can be simplified (using Algorithm 3 instead of Algorithm 4), avoiding exhaustive enumerations. Consequently, the computational cost of the algorithm in Corollary 2 is dominated by approximately sampling from the marginal distributions μvσ and estimating the marginal probabilities within an inverse-polynomial accuracy. Indeed, the non-trivial computation costs in both the sampling algorithm in [13] and our perfect simulation algorithm when there is a strong spatial mixing, are dominated by these tasks which are computationally equivalent to sampling.

In summary, the focus of Corollary 2 is the locality of Gibbs sampling. Our proof of it may also imply: locality of computation imposes no additional significant barrier to the task of Gibbs sampling with strong spatial mixing, provided the sampling problem is computationally tractable.

1.1 A sampling Lovász Local Lemma in the LOCAL model

The perfect simulation of Las Vegas 𝖫𝖮𝖢𝖠𝖫 algorithms stated in Theorem 1 is achieved by resolving a more general problem, namely, generating a random sample avoiding all bad events. This problem is formulated as a natural sampling problem in the variable-framework of the Lovász Local Lemma.

An instance for the variable-framework Lovász Local Lemma (LLL) is given by I=({Xi}iU,{Av}vV), where {Xi}iU is a set of mutually independent random variables, such that each Xi follows a distribution νi over a finite domain Σi; and {Av}vV is a set of bad events, such that for each vV, the occurrence of Av is determined by the evaluation of X𝗏𝖻𝗅(v)=(Xi)i𝗏𝖻𝗅(v), where 𝗏𝖻𝗅(v)U denotes the subset of variables on which Av is defined. The LLL instance I defines a dependency graph D=DI=(V,E), such that each vertex vV represents a bad event Av and each {u,v}E iff 𝗏𝖻𝗅(v)𝗏𝖻𝗅(u).

An LLL instance I is said to be γ-satisfiable, if the probability avoiding all bad events is bounded as:

Pr(vVAv¯)γ. (1)

The Lovász Local Lemma [8] states a sufficient condition on the dependency graph for γ to be positive.

A satisfiable LLL instance I gives rise to a natural probability distribution over satisfying assignments. Let μ=μI denote the distribution of the random vector 𝑿=(Xi)iU conditioning on that none of the bad events {Av}vV occurs. Formally, denote by Ω=ΩI{σiUΣiσ avoids Av for all vV} the space of all satisfying assignments, and ν=νIiUνi the product measure. Here, the symbol represents the Cartesian product over sets. Then

σΩ,μ(σ)Pr(𝑿=σ𝑿Ω)=ν(σ)ν(Ω). (2)

This distribution μ=μI of random satisfying assignment was referred to as the LLL distribution in [17, 18]. It is a Gibbs distribution defined by hard constraints.

The following defines a computational problem to generate a sample according to the distribution μ.

Sampling Satisfying Solution of Lovász Local Lemma

    Input: a γ-satisfiable LLL instance I with dependency graph DI=(V,E);
    Output: a random satisfying assignment 𝑿=(Xi)iU distributed as μI.

When the problem is solved in the 𝖫𝖮𝖢𝖠𝖫 model, the input is presented to the algorithm as follows:

  1. 1.

    The network G of the 𝖫𝖮𝖢𝖠𝖫 model is just the dependency graph DI.

  2. 2.

    Each node vV receives as input the values of n=|V| and γ, along with the definition of the bad event Av, and the distributions νi of the random variables {Xii𝗏𝖻𝗅(v)} on which Av is defined so that it can locally draw independent evaluations of the random variables {Xii𝗏𝖻𝗅(v)} or check the occurrence of Av on such evaluation.

Our main technical result is an efficient 𝖫𝖮𝖢𝖠𝖫 algorithm for sampling satisfying solution according to the LLL distribution, for any LLL instance that is not prohibitively hard to satisfy (i.e., γ is not too small). We call this result a “𝖫𝖮𝖢𝖠𝖫 sampling lemma” since it uses the local lemma framework to give a 𝖫𝖮𝖢𝖠𝖫 sampling algorithm.

Theorem 4 (𝖫𝖮𝖢𝖠𝖫 sampling lemma).

There is a randomized 𝖫𝖮𝖢𝖠𝖫 algorithm such that for any LLL instance I with n bad events, if I is γ-satisfiable, then the algorithm returns a random satisfying assignment drawn from μI, within O~(log6nlog41γ) rounds in expectation, and within O~(log6nlog41γlog61ϵ) rounds with probability at least 1ϵ for any 0<ϵ<1.

 Remark 5.

The sampling variant of LLL has recently drawn considerable attention [35, 20, 21, 22, 11, 30], which aims to generate a satisfying solution distributed as μI. This is harder than just finding a satisfying solution. It requires a stronger LLL condition (with polynomials of dependency degree) to ensure that the sampling is tractable in polynomial time [3, 16, 18].

Alternatively, in our setting, we do not impose any LLL-like conditions, but instead only assume that the LLL instance is γ-satisfiable for some suitably large γ (e.g. γ=1/poly(n) or γ=Ω(1)). This is because the LLL instance here arises from a Las Vegas algorithm which succeeds with probability γ. In centralized model, such assumption may trivialize the problem because a sample can be drawn using O(1/γ) trials of rejection sampling. However, the sampling problem remains highly nontrivial for local computation.

1.2 Perfect simulation via LOCAL sampling lemma

Recall that for 𝖫𝖮𝖢𝖠𝖫 algorithms, an instance =(G,𝒙) consists of a network G=(V,E) and a vector 𝒙=(xv)vV specifying the local input xv to each node vV. Let be a class of instances. A t(n)-round Las Vegas 𝖫𝖮𝖢𝖠𝖫 algorithm with success probability γ(n) on instance class , is a randomized 𝖫𝖮𝖢𝖠𝖫 algorithm such that on every instance =(G,𝒙), where G is a network with n=|V| nodes, at every node vV the algorithm terminates within t(n) rounds and outputs a pair (Yv,Fv) where Fv{0,1} indicates whether the algorithm failed locally at v, and the probability that the algorithm succeeds is

Pr(vV:𝑭v=0)γ(n).

Denote by (𝒀,𝑭)=(Yv)vV,(Fv)vV the output of the Las Vegas 𝖫𝖮𝖢𝖠𝖫 algorithm on instance =(G,𝒙) with network G=(V,E). The following theorem is a formal restatement of Theorem 1, which gives a zero-error Las Vegas 𝖫𝖮𝖢𝖠𝖫 algorithm that perfectly simulates the good output (𝒀𝑭=𝟎) when there is no failure everywhere in the network.

Theorem 6 (formal restatement of Theorem 1).

Let t: and γ:[0,1] be two functions. Let 𝒜 be a t(n)-round Las Vegas 𝖫𝖮𝖢𝖠𝖫 algorithm with success probability γ(n) on instance class . There is a 𝖫𝖮𝖢𝖠𝖫 algorithm such that on every instance of n nodes, for any 0<ϵ<1,

  • terminates in t(n)O~(log6nlog41γ(n)log61ϵ) rounds with probability at least 1ϵ;

  • upon termination, returns an output 𝒀 that is identically distributed as (𝒀𝒜𝑭𝒜=𝟎), which stands for the output of 𝒜 on the same instance conditioned on that none of the nodes fails.

Proof.

Let =(G,𝒙) be the instance of the 𝖫𝖮𝖢𝖠𝖫 algorithm, where G=(V,E) is a network with n=|V| nodes and the vector 𝒙=(xv)vV specifies the local inputs. For each vV, let Xv denote the local random bits at node v used by algorithm 𝒜.

Since 𝒜 is a t(n)-round Las Vegas 𝖫𝖮𝖢𝖠𝖫 algorithm, at any vV, the algorithm 𝒜 deterministically maps the inputs 𝒙B=(xv)vB and the random bits 𝑿B=(Xv)vB within the t(n)-ball B=Bt(n)(v), to the local output (Yv𝒜,Fv𝒜), where Fv𝒜{0,1} indicates the failure at v. This defines a bad event Av for every vV, on the random variables Xu for uBt(n)(v), i.e. 𝗏𝖻𝗅(v)=Bt(n)(v), by

Av:Fv𝒜(𝑿𝗏𝖻𝗅(v))=1.

Together, this defines an LLL instance I=({Xv}vV,{Av}vV), which is γ(n)-satisfiable because the probability that 𝒜 has no failure everywhere is at least γ(n).

We simulate the sampling algorithm in Theorem 4 (which we call the LLL sampler) on this LLL instance I. Rather than executing it on the dependency graph DI as in Theorem 4, here we simulate the LLL sampler on the network G=(V,E), where each node vV holds an independent random variable Xv and a bad event Av. Note that any 1-round communication in the dependency graph DI can be simulated by O(t(n))-round communications in this network G=(V,E). Also note that at each vV, the values of t(n) and γ(n) can be computed locally by knowing n=|V| and enumerating all network instances with n nodes. The LLL sampler can thus be simulated with O(t(n))-multiplicative overhead. In the end it outputs an 𝑿=(Xv)vVμI, which is identically distributed as (𝑿𝑭𝒜=𝟎), i.e. the random bits used in the algorithm 𝒜 conditioned on no failure. The final output 𝒀=(Yv)vV is computed by simulating 𝒜 within t(n) locality deterministically using 𝑿=(Xv)vV as random bits.

The LLL sampler in Theorem 4 is a Las Vegas algorithm with random terminations. Each node vV can continue updating Yv using the current random bits 𝑿B it has collected within its t(n)-local neighborhood B=Bt(n)(v). Once the LLL sampler for generating 𝑿 terminates at all nodes, the updating of 𝒀 will stabilize within additional t(n) rounds. And this final 𝒀 is identically distributed as (𝒀𝒜𝑭𝒜=𝟎). This gives us the zero-error Las Vegas 𝖫𝖮𝖢𝖠𝖫 algorithm as claimed in Theorem 6.

1.3 Related work

The perfect simulation of Las Vegas algorithms is a fundamental problem. In the celebrated work of Luby, Sinclaire, and Zuckerman [27], an optimal strategy was given for speeding up Las Vegas algorithms. Their approach was based on stochastic resetting, which requires global coordination and works for the Las Vegas algorithms with deterministic outputs, or the interruptible random outputs.

The distribution of satisfying solutions of the Lovász local lemma (LLL) has drawn much attention, e.g. in [16, 18]. Its perfect simulation was studied in [16, 20, 19, 9], where several key approaches for perfect sampling were applied, including partial rejection sampling (PRS) [16], “lazy depth-first” method of Anand and Jerrum (a.k.a. the AJ algorithm) [1], coupling from the past (CFTP) [34], and coupling towards the past (CTTP) [9].

The connection between LLL and distributed computing is well-established. The foundational work of Chung, Pettie, and Su [5] pioneered the application of distributed LLL for solving Locally Checkable Labeling (LCL) problems in the 𝖫𝖮𝖢𝖠𝖫 model. Then, Chang and Pettie [4] proposed that any Las Vegas randomized 𝖫𝖮𝖢𝖠𝖫 algorithm can be represented as an LLL instance. Recently, Davies-Peck [7] present a novel analysis that leads to improved constructive LLL algorithms with variable running times for the LOCAL model, further enriching this domain.

In the 𝖫𝖮𝖢𝖠𝖫 model, Ghaffari, Harris and Kuhn [14] showed that for distributed graph problems, where the goal is to construct feasible graph configurations, the fixed-round Las Vegas algorithms and the zero-error Las Vegas algorithms are equivalent to polylogarithmic rounds. Their approach was based on a derandomization by conditional expectations, and hence was especially suitable for the tasks where the support of the output distribution, instead of the output distribution itself, is concerned, such as the searching problems for constructing feasible solutions.

The 𝖫𝖮𝖢𝖠𝖫 algorithms and Gibbs distributions are intrinsically related. For example, the distributions of the random bits on which a fixed-round Las Vegas 𝖫𝖮𝖢𝖠𝖫 algorithm successfully returns are Gibbs distributions, where the certifiers of local failures are the local constraints defining the Gibbs distribution. In [13], Feng and Yin gave a 𝖫𝖮𝖢𝖠𝖫 sampler with local failures for the Gibbs distributions with strong spatial mixing by parallelizing the JVV sampler [23] using the network decomposition [26].

1.4 Technique overview

We provide an overview of the main sampling algorithm stated in Theorem 4. The design and analysis of this algorithm will be detailed in subsequent sections.

Recall that the input instance for the algorithm in Theorem 4 is an LLL instance I=({Xi}iU,{Av}vV) with n=|V| bad events. We assume that I is γ-satisfiable.

Initially, a random assignment 𝒀 is generated according to the product distribution 𝝂, which can be achieved by each node independently drawing its own variable Yvνv. If none of the bad events in {Av}vV occurs on this 𝒀, then 𝒀 is a correct sample that follows the LLL distribution μI. However, in general, some bad events in {Av}vV may occur. In this case, the crux of the sampling algorithm lies in how to locally resample the independently generated 𝒀 to make it follow the correct joint distribution μI.

Locally resample via the Bayes filter.

We first propose a resampling procedure under the idealized assumption that there is only one node vV with its bad event Av occurring on 𝒀. This algorithm serves as the principal subroutine in our main sampling algorithm. To locally resample the random assignment 𝒀 around node v, a natural attempt would be to resample Y𝗏𝖻𝗅(v) locally according to the marginal distribution induced by μI on 𝗏𝖻𝗅(v) conditioned on the outer boundary YU𝗏𝖻𝗅(v). However, this naïve resampling would introduce a bias to the sample, because YU𝗏𝖻𝗅(v) does not follow the marginal distribution induced by μI on U𝗏𝖻𝗅(v), but rather follows the conditional marginal distribution on U𝗏𝖻𝗅(v) given Y𝗏𝖻𝗅(v).

To rectify this, the idea of Bayes filter was introduced in [10]. In this approach, a (locally checkable) filter is constructed according to Bayes’ law to cancel the bias introduced by the naïve resampling. This new resampling procedure guarantees the production of a correct 𝒀 that follows the desired distribution μI upon termination. Moreover, the procedure terminates in finite steps if the distribution μI exhibits a strong enough decay of correlation property. The only problem is that such decay of correlation may not necessarily hold for general LLL instances.

De-correlate distant variables by LLL augmentation.

The decay of correlation is a key property that is widely assumed by sampling algorithms. However, in our main applications, namely simulating Las Vegas algorithms via local computation, such decay of correlation may not always hold. Instead, we assume that the LLL instance is sufficiently satisfiable (which corresponds to a Las Vegas algorithm that succeeds with a sufficiently large probability).

With this new assumption, we show that it is possible to establish suitably strong correlation decay between far-apart regions of variables by introducing a new bad event on the boundary between these two regions. Intuitively, we shows that strong correlation between distant regions can only persist due to some specific low-probability bad blocking assignments on the variables between them. We then de-correlate the regions by augmenting the LLL instance with a new locally constructed bad event that explicitly prohibits these bad blocking assignments. Crucially, because these bad blocking assignments must have small marginal probabilities, the new bad event itself is rare, preserving the overall satisfiability of the instance. A precise example illustrating this mechanism is provided in the full version of the paper for further insight. Furthermore, we show how to utilize this LLL augmentation in a carefully designed resampling procedure to guarantee both the correctness and efficiency of the resampling.

Resample in general case using the SLOCAL paradigm.

Finally, to handle the general case where multiple bad events vV may occur on 𝒀, we first cluster the bad events into small enough balls far apart using network decomposition. We then apply the resampling procedure to fix each ball one by one. This final step of sequentially resampling on balls is presented in the sequential local (𝖲𝖫𝖮𝖢𝖠𝖫) paradigm. We introduce the notion of 𝖲𝖫𝖮𝖢𝖠𝖫-𝖫𝖵 algorithm as an extension of the 𝖲𝖫𝖮𝖢𝖠𝖫 model, where 𝖲𝖫𝖮𝖢𝖠𝖫-𝖫𝖵 stands for the term sequential 𝖫𝖮𝖢𝖠𝖫 Las Vegas. Finally, we show that this 𝖲𝖫𝖮𝖢𝖠𝖫-𝖫𝖵 algorithm can be simulated in the 𝖫𝖮𝖢𝖠𝖫 model with bounded time complexity.

1.5 Organization of the paper

We begin with preliminaries in Section 2. Then, in Section 3, we illustrate the core ideas behind our main sampling algorithm (stated in Theorem 4) by analyzing a special case where only a single bad event occurs initially. The algorithm for the general case involving multiple bad events initially, along with the complete proofs, is deferred to the full version of this paper. We conclude with a discussion of our work in Section 4.

2 Preliminary

2.1 Graph and LLL notations

Let G=(V,E) be an undirected graph. The following notations are used throughout.

  • Neighborhoods: NG(v){uV{u,v}E} and inclusive neighborhood NG+(v)N(v){v}.

  • Distances: distG(u,v) represents the shortest path distance between u and v in G, and distG(S,T)minuS,vTdistG(u,v). The diameter of ΛV in G is diamG(Λ)maxuΛ,vΛdistG(u,v).

  • Balls: BrG(v){uVdistG(u,v)r} and BrG(Λ){uVdistG(u,Λ)r} for ΛV.

  • Shells/Spheres: S[,r]G(v)BrG(v)B1G(v) and S[,r]G(Λ)BrG(Λ)B1G(Λ) for ΛV.

Let I=({Xi}iU,{Av}vV) be an LLL instance with dependency graph DI. The following defines a notion of rings of variables enclosing a nonempty subset ΛV of bad events, where 𝗏𝖻𝗅(Λ)vΛ𝗏𝖻𝗅(v).

  • Rings: RrI(Λ)𝗏𝖻𝗅(BrDI(Λ))𝗏𝖻𝗅(Br1DI(Λ)) and in particular, R0I(Λ)𝗏𝖻𝗅(Λ). Furthermore, we define

    R[i,j]I(Λ)irjRrI(Λ). (3)

We also define the rings of bad events intersected or contained by a ring of variables:

V[i,j]I(Λ){vV𝗏𝖻𝗅(v)R[i,j]I(Λ)},V(i,j)I(Λ){vV𝗏𝖻𝗅(v)R[i,j]I(Λ)}. (4)

In all the above notations, we omit the graph G or the LLL instance I if they are clear in the context.

Inspired by the induced subgraph, we define the sub-instance of I=({Xi}iU,{Av}vV) induced by a subset ΛV of bad events:

I(Λ)({Xi}i𝗏𝖻𝗅(Λ),{Av}vΛ),

where I(Λ) is the LLL sub-instance induced by the bad events {Av}vΛ.

2.2 Marginal distribution

Let I=({Xi}iU,{Av}vV) be a LLL instance, where each random variable Xi follows the distribution νi over domain Σi. For ΛU, define ΣΛiΛΣi and νΛiΛνi, and write ν=νU and Σ=ΣU.

For nonempty ΛU and τΣΛ, define

Ωτ=ΩIτ{σΣσΛ=τ and σ avoids bad events Av for all vV s.t. 𝗏𝖻𝗅(v)Λ}. (5)

For disjoint S,TU, σΣS and τΣT, denote by στ the direct concatenation of σ and τ, that is, στΣST satisfying (στ)i=σ(i) for iS and (στ)i=τ(i) for iT.

The following defines a notion of marginal distribution in the LLL instance.

Definition 7 (marginal distribution).

For ΛU, a τΣΛ is said to be a feasible boundary condition if Ωτ, where Ωτ is defined in (5). Given ΛU and feasible boundary condition τΣΛ, for any nonempty SUΛ, the marginal distribution on S induced by μ=μI conditioned on τ, denoted by μSτ=μI,Sτ, is defined as:

σΣS,μSτ(σ)Pr𝑿ν(XS=σ𝑿Ωτ).

2.3 Decay of correlation in LLL

We consider the following notion of correlation decay in the LLL instance.

Definition 8 (ϵ-correlated sets).

A pair of disjoint S,TU with STU, is said to be ϵ-correlated, if one of S,T is empty, or for any σ1,σ2ΣS and τ1,τ2ΣT,

1/(1+ϵ)ν(Ωσ1τ2)ν(Ωσ2τ1)ν(Ωσ1τ1)ν(Ωσ2τ2)(1+ϵ)ν(Ωσ1τ2)ν(Ωσ2τ1).

To see that this indeed defines a decay of correlation, note that it is equivalent to the following property: For 𝑿 drawn according to the product distribution ν that avoids all bad events Av satisfying 𝗏𝖻𝗅(v)ST,

Pr(XS=σ1XT=τ1)Pr(XS=σ2XT=τ2)
(1+ϵ)Pr(XS=σ1XT=τ2)Pr(XS=σ2XT=τ1).

Recall that we want to bound the correlation between XS and XT in a random 𝑿 distributed as μ=μI. Here, Definition 8 is slightly different by ignoring the bad events Av defined on the variables within ST.

2.4 Las Vegas SLOCAL algorithm

Our main sampling algorithm is described in a sequential local (𝖲𝖫𝖮𝖢𝖠𝖫) paradigm. The 𝖲𝖫𝖮𝖢𝖠𝖫 model introduced by Ghaffari, Kuhn, and Maus [15] captures local computations where the inherent parallelism of distributed systems is intentionally suppressed, allowing nodes to be processed sequentially rather than concurrently. We extend this notion to the algorithms with random locality of computation.

SLOCAL-LV algorithms.

Here, 𝖲𝖫𝖮𝖢𝖠𝖫-𝖫𝖵 stands for the term sequential 𝖫𝖮𝖢𝖠𝖫 Las Vegas. An N-scan 𝖲𝖫𝖮𝖢𝖠𝖫-𝖫𝖵 algorithm runs on a network G=(V,E) with a subset AV of active nodes. Each node vV maintains a local memory state Mv, initially storing v’s local input, random bits, its ID, and neighbors’ IDs. An arbitrary total order is assumed over V, such that the relative order between any u,vV can be deduced from the contents of Mu and Mv.

The algorithm operates in N1 scans. Within each scan, the active nodes in A are processed one after another in the ordering. Upon each node vA being processed, for =0,1,2,, the node v tries to grow an -ball B(v) and update the memory states Mu for all uB(v) based on the information observed so far by v, until some stopping condition has been met by the information within the current ball B(v). Finally, each vV outputs a value based on its memory state Mv.

Compared to the standard (Monte Carlo) 𝖲𝖫𝖮𝖢𝖠𝖫 algorithm, whose locality is upper bounded by a fixed value, in 𝖲𝖫𝖮𝖢𝖠𝖫-𝖫𝖵 algorithm, the local neighborhoods are randomly constructed in a sequential and local fashion. The next theorem gives a simulation of 𝖲𝖫𝖮𝖢𝖠𝖫-𝖫𝖵 algorithms in the 𝖫𝖮𝖢𝖠𝖫 model.

Proposition 9 (simulation of 𝖲𝖫𝖮𝖢𝖠𝖫-𝖫𝖵 in 𝖫𝖮𝖢𝖠𝖫).

Let 𝒜 be an N-scan 𝖲𝖫𝖮𝖢𝖠𝖫-𝖫𝖵 algorithm that assumes an arbitrary ordering of nodes. Then there is a randomized 𝖫𝖮𝖢𝖠𝖫 algorithm , such that starting from the same initial memory states 𝐌=(Mv)vV, the algorithm terminates and returns the same output as 𝒜 within O(|A|maxvA,j[N]v,j) rounds, where A is the set of active nodes and v,j is the radius of the ball accessed by the active node v in the jth scan of algorithm 𝒜, both fully determined by 𝐌.

Compared to the simulation of 𝖲𝖫𝖮𝖢𝖠𝖫 Monte Carlo algorithm in the 𝖫𝖮𝖢𝖠𝖫 model proved in [15], which relies on the network decomposition to parallelize the 𝖲𝖫𝖮𝖢𝖠𝖫 procedure, Proposition 9 provides a rather straightforward simulation that does not parallelize the local computations. An advantage of such an easy simulation is that it does not require a worst-case complexity upper bound for all scan orders of nodes. This translation from 𝖲𝖫𝖮𝖢𝖠𝖫-𝖫𝖵 to 𝖫𝖮𝖢𝖠𝖫 algorithm makes it more convenient to describe 𝖫𝖮𝖢𝖠𝖫 algorithms where there are multiple randomly growing local neighborhoods interfering with each other.

A formal proof of Proposition 9 is given in the full version of the paper.

3 Exposition of the Algorithm: Special Case

In this section, we expose the key ideas of the main sampling algorithm stated in Theorem 4 within a special case. The detailed and formal construction of the algorithm will be presented in the next section.

As stated in Theorem 4, the algorithm deals with an LLL instance I=({Xi}iU,{Av}vV) with n=|V| bad events. Assume that I is γ-satisfiable, and let DI be its dependency graph.

Our sampling algorithm follows a natural two-step framework:

  • Initially, the algorithm generates a random assignment 𝒀 according to the product distribution 𝝂. This can be easily achieved by having each node independently generate its own variables.

  • If none of the bad events in {Av}vV occurs, then 𝒀 follows the correct distribution μI. Otherwise, if some bad events Av have occurred on 𝒀, the algorithm locally applies some corrections to 𝒀 to ensure that the corrected assignment 𝒀 follows the correct distribution μI.

Obviously, the nontrivial part of the algorithm is the above second step, in which the algorithm locally modifies a random assignment 𝒀𝝂 to a new assignment 𝒀μI when some bad events Av occur on 𝒀.

To simplify our exposition, we start by making the following idealized assumption.

Assumption 10.

Only one node vV has its corresponding bad event Av occurs on 𝐘.

Next, we will expose the main idea of our sampling algorithm by explaining how to resolve the sampling problem under this idealized assumption. The rest of this section is organized as follows:

  1. 1.

    We begin by approaching the sampling problem with modest objectives. Our first goal is to sample with a bounded expected complexity under Assumption 10, while also assuming a correlation decay property. With these assumptions, we present a sampling algorithm. (Section 3.1)

  2. 2.

    The correlation decay assumed above may not always hold. To address this, we introduce an augmentation of the LLL instance, inducing desirable correlation decay by properly augmenting the LLL instance. This enables us to remove the correlation decay assumption from the earlier mentioned algorithm, ensuring correct sampling from μI despite augmenting the LLL instance I. (Section 3.2)

  3. 3.

    Finally, we introduce a recursive sampling framework, which upgrades the aforementioned sampling algorithms with bounded expected complexity to algorithms with exponentially convergent running time. This proves Theorem 4 under Assumption 10. (Section 3.3)

In the full version of the paper, we eliminate the need for Assumption 10 through the introduction of a new section that addresses the general case involving multiple bad events.

3.1 Warm-up: Sampling with idealized correlation decay

In this part, we assume the correlation decay property as formally defined in Definition 8. This idealized correlation decay property is assumed for exposition purpose, and reliance on it will be eliminated later.

Assumption 11.

Any pair of disjoint S,TU are 12n3-correlated.

Let 𝒀 be generated according to the product distribution 𝝂. Under Assumption 10, there is only one node vV having Av occur on 𝒀. Let S𝗏𝖻𝗅(v) and TU𝗏𝖻𝗅(B1(v)). Notice that we have

YTμTYS, (6)

because 𝒀 is distributed as 𝝂 and all bad events except Av do not occur on 𝒀.

An attempt to correct resampling.

Our goal is to locally fix the random assignment 𝒀 around v to make it distributed as μI. A natural attempt is to resample evaluation of YU\T according to the correct marginal distribution μU\TYT. This would produce a new assignment 𝒀 which is distributed as:

σΣ,Pr[𝒀=σ] =Pr[𝒀T=σT]Pr[𝒀U\T=σU\T𝒀T=σT]
=μTYS(σT)μU\TσT(σU\T);

whereas, our goal is that each σΣ is sampled with probability μI(σ)=μT(σT)μU\TσT(σU\T).

To remedy this, we apply the Bayes filters introduced in [10]. A Bayes filter =(𝒀,S,T) is a trial whose success or failure is determined by 𝒀,S,T. The probability that succeeds satisfies

Pr[ succeeds𝒀=σ]μT(σT)μTYS(σT), (7)

where are taken over all possible σΣ with μTYS(σT)>0.

Now given a Bayes filter satisfying (7), we conduct an experiment of , and resample YUTμUTYT if succeeds. This will produce a 𝒀μI due to the Bayes law derived as follows:

Pr[𝒀=σ succeeds] =Pr[𝒀=σ]Pr[ succeeds𝒀=σ]Pr[ succeeds] (8)
(μTTS(σT)μU\TσT(σU\T))(μT(σT)μTYS(σT))μI(σ).

Otherwise, if fails, we trivially resample the entire 𝒀μI, which still ensures the correctness of sampling but uses global information. Overall, the above procedure correctly produces a 𝒀μI.

It remains to construct a good Bayes filter using local generation and succeeding with high probability.

Construction of the Bayes filter 𝓕.

Condition (7) of the Bayes filter can be expressed as

Pr[ succeeds𝒀=σ] μT(σT)μTYS(σT) (9)
(the Bayes law) =ν(ΩσT)ν(ΩYS)ν(Ω)ν(ΩYSσT)
ν(ΩσT)ν(ΩYSσT)
f(σT).

Note that f(σT)ν(ΩσT)ν(ΩYSσT) can be computed locally from B2(v). It is thus natural to define as

Pr[ succeeds]=f(YT)maxf,

where maxf denotes the maximum value of f(σT) taken over all all possible σTΣT with μTYS(σT)>0.

It is obvious to see that for the Bayes filter constructed as above, an experiment of can be conducted and observed locally from B2(v). Furthermore, due to the correlation decay assumed in Assumption 11, succeeds with high probability. Let ΣS be the set of possible assignments on the set S of variables:

ΣS{ρΣS ρ avoids bad events Av for all vV s.t. 𝗏𝖻𝗅(v)S}.

Let τ be the assignment on T that achieve the maximum f(τ). It holds that

Pr[ succeeds𝒀=σ] =f(σT)f(τ)=ν(ΩσT)ν(ΩYSτ)ν(Ωτ)ν(ΩYSσT) (10)
=ρΣSν(ΩρσT)ν(ΩYSτ)ρΣSν(ΩYSσT)ν(Ωρτ)112n3,

where the last inequality is derived from that S and T are 12n3-correlated, guaranteed by Assumption 11.

This simple sampling algorithm under Assumption 10 and Assumption 11 is described in Algorithm 1. By (10) and the locality of the Bayes filter, the algorithm terminates within O(1) rounds in expectation.

Algorithm 1 Sampling-with-decay(𝒀; I,v).

3.2 Sampling without correlation decay via LLL augmentation

The correlation decay asked by Assumption 11 may not always hold for general γ-satisfiable LLL instances. A key idea is then to properly “augment” the LLL instance by including new bad events to enforce desirable decay of correlation. This is highlighted by the following LLL augmentation lemma.

Recall the notions of balls B(), shells S[,](), and rings R[,]() formally defined in Section 2.1.

Lemma 12 (LLL augmentation).

Let I=({Xi}iU,{Av}vV) be a LLL instance. Let ϵ,γ(0,1), δ(0,γ2) and 0(ϵ,γ,δ) where 0(ϵ,γ,δ)=O~(log1ϵlog1γlog1δ). For any nonempty ΛV, if the sub-instance I(S[1,](Λ)) is γ-satisfiable, then a new bad event Aλ with λV can be constructed such that:

  1. 1.

    Aλ is defined on 𝗏𝖻𝗅(λ)=R[1,](Λ), and is constructed using local information on B+1(Λ);

  2. 2.

    Aλ has probability at most δ on independent random variables {Xi}iU, i.e. ν(Aλ)δ;

  3. 3.

    the variable sets S=𝗏𝖻𝗅(Λ) and T=U𝗏𝖻𝗅(B(Λ)) are ϵ-correlated after including Aλ into I.

Definition 13 (the augmenting event).

Let I=({Xi}iU,{Av}vV) be a LLL instance. Let ϵ,γ(0,1), δ(0,γ2), and =0(ϵ,γ,δ). Fix any nonempty ΛV that I(S[1,](Λ)) is γ-satisfiable. We use A𝛌(Λ,ϵ,γ,δ)I to denote the new bad event Aλ constructed in Lemma 12.

 Remark 14.

Lemma 12 constitutes a critical component of our sampling approach. Beyond the algorithmic implications, it provides new insights into correlation between variables constrained by local constraints.

This lemma aims to establish a decay of correlation between two distant regions S=𝗏𝖻𝗅(Λ)=R0(Λ) and T=U𝗏𝖻𝗅(B(Λ))=R[,](Λ). However, it only assumes that all local constraints sandwiched between S and T are collectively easy to satisfy. Merely knowing this fact does not prevent S and T from being strongly correlated. Remarkably, Lemma 12 shows that the only obstacle to achieving correlation decay between S and T in this case lies in rarely occurring “bad” assignments between them. Consequently, S and T can be effectively de-correlated by introducing a new locally defined bad event over the region between S and T to prohibit those bad assignments that cause significant correlation between them.

This argument will be formalized in the full version of the paper, where a formal restatement of Lemma 12 will be introduced, and the lemma will be rigorously proved.

Sampling using augmented LLL.

Now we show how to utilize this LLL augmentation stated in Lemma 12 to get rid of Assumption 11 in Algorithm 1, while still assuming Assumption 10.

As before, let 𝒀 be generated according to the product distribution 𝝂; and let S𝗏𝖻𝗅(v) and TU𝗏𝖻𝗅(B1(v)), where vV is the only node (Assumption 10) with the bad event Av occurring on 𝒀.

Our goal is to utilize Lemma 12 to induce the necessary decay of correlation required for sampling. However, augmenting LLL would inevitably alter the distribution μI. The saving grace is the following observation, suggesting that sampling may not be affected by certain local changes to the distribution.

Observation 15.

The sampling in Algorithm 1 is correct and efficient as long as:

  • (correctness) YT follows the marginal distribution μI,TYS;

  • (efficiency) S and T are 12n3-correlated.

Using this observation, we can apply the LLL augmentation described in Lemma 12 to ensure correlation decay between S and T, while preserving the marginal distribution μTYS. Consequently, the resulting sampling process is both correct and efficient without relying on Assumption 11.

Note the new bad event Aλ=A𝝀(Λ,ϵ,γ,δ)I in Definition 13, and define its complement Aλ¯:

AλA𝝀(Λ,ϵ,γ,δ)I, where λV, and Aλ¯Aλ¯, where λ¯V. (11)

Correspondingly, define the following two augmented LLL instances:

I^=({Xi}iU,{Av}vV{λ}) and I^=({Xi}iU,{Av}vV{λ¯}). (12)

Let 0<ζ0<1 be a sufficiently small constant. We define the choice of parameter

(ϵ0,γ0,δ0)(12n3,γ8,ζ0γ24n3). (13)

In the following, let Aλ,Aλ¯,I^,I^ be defined as above on Λ={v} with parameter (ϵ0,γ0,δ0). Let S=𝗏𝖻𝗅(Λ) and T=U\𝗏𝖻𝗅(B(Λ)), where =0(ϵ0,γ0,δ0)=O~(log2nlog21γ).

By the same argument for (6), we still have YTμI,TYS. Notice that I^ is identical I except for the new bad event Aλ. Then, conditioned on Aλ¯, we have YTμI^,TYS in the augmented instance I^. By Lemma 12 on our choice of parameter, S and T are 12n3-correlated in I^ . Thus, a calling to Sampling-with-decay(𝒀;I^,v) (Algorithm 1) will produce a random assignment 𝒀μI^ within O(1) rounds in expectation.

An idealized sampling algorithm.

Then an idealized sampling procedure may proceed as follows. Our goal is to modify the input 𝒀 to a new 𝒀μI. This is achieved differently depending on whether the new bad event Aλ occurs on 𝒀. If Aλ occurs on 𝒀 (which happens with a small probability), we generate an entire new 𝒀μI using global information. Otherwise, if Aλ does not occur on 𝒀, we can use the following cleverer approach to produce a 𝒀μI:

  • With probability P, generate a 𝒀μI^, where P is defined as

    PPr𝑿μI[𝑿 avoids Aλ]. (14)

    Since 𝒀 avoids Aλ, this can be achieved by calling Sampling-with-decay(𝒀;I^,v) to produce 𝒀μI^.

  • Otherwise, generate entire 𝒀μI^ using global information.

Overall, this correctly generates a 𝒀μI=PμI^+(1P)μI^ This procedure for sampling is idealized because it uses a value P as defined in (14) that is not easy to compute locally.

Bootstraping the unknown threshold 𝑷.

The probability P in (14) can be lower bounded. By Lemma 12, Aλ occurs with probability at most δ0. With the parameter specified in (13), we have

P=ν(ΩI^)ν(ΩI)=ν(ΩI)ν(ΩI^)ν(ΩI)1ν(Aλ)ν(ΩI)1δ0γ0112n3.

Therefore, when Aλ does not occur on 𝒀, the above idealized sampling procedure can be realized as: first call the subroutine Sampling-with-decay(𝒀;I^,v) defined in Algorithm 1 to produce 𝒀μI^, and then with probability 12n3, compute the value of P (which uses global information) and generate the entire 𝒀μI^ with probability 2(1P)n3. The resulting algorithm is described in Algorithm 2.

Algorithm 2 Sampling-without-decay(𝒀; I,v).

Algorithm 2 may use global information, particularly in Line 5 with probability 12n3, and in Line 7 when Aλ occurs on 𝒀, which happens with probability at most δ01n3 due to Lemma 12.222Technically, 10 may bias the probability of Aλ on 𝒀. However, Algorithm 2 is solely used for exposition purposes, and such bias caused by 10 will no longer be an issue beyond in Algorithm 2. As we discussed, the call to Sampling-with-decay(𝒀;I^,v) at Line 3 returns within O(1) rounds in expectation. At last, the construction of the augmented instance I^ takes =O~(log2nlog21γ) rounds. Overall, Algorithm 2 returns a 𝒀μI within O~(log2nlog21γ) rounds in expectation without relying on Assumption 11.

3.3 Recursive sampling with exponential convergence

Both Algorithm 1 and Algorithm 2 occasionally rely on global information with a probability of O(1/n3). While the expected time complexity remains well bounded, achieving exponential convergence, as outlined in Theorem 4, would be more desirable in randomized running time.

We introduce a recursive sampling framework to achieve such exponential convergence in the time complexity. And more importantly, this framework is also crucial for finally getting rid of Assumption 10.

First, we adapt Algorithm 1 to the recursive sampling framework. This adaptation is direct and simple. Then, we achieve the recursive sampling without Assumption 11. Although part of this has already been embodied in Algorithm 2, there are still some substantial difficulties that need to be overcome.

Recursive sampling assuming correlation decay.

At first, we still assume Assumption 11. The recursive sampling algorithm in this scenario is described in Algorithm 3. There are two differences between this algorithm and Algorithm 1: First, instead of taking just a node vV as input in Algorithm 1, now Algorithm 3 takes a subset ΛV as input. Second and more importantly, the original step of sampling using global information in Algorithm 1, is now replaced by a recursive call in Algorithm 3.

Algorithm 3 RecursiveSampling-with-decay(𝒀; I,Λ).

The correctness of recursive sampling relies on the following conditional Gibbs property, which is adapted from the same property introduced in [12, 10] for Gibbs distributions.

Definition 16 (conditional Gibbs).

Let I=({Xi}iU,{Av}vV) be a LLL instance. The random pair (𝐘,𝓡), where 𝐘Σ is an assignment and 𝓡V is a subset of events, is said to satisfy conditional Gibbs property on instance I, if for any RV and σΣ𝗏𝖻𝗅(Λ) with Pr[𝓡=RY𝗏𝖻𝗅(R)=σ]>0, conditioned on 𝓡=RY𝗏𝖻𝗅(R)=σ, it holds that YU𝗏𝖻𝗅(R)μI,U𝗏𝖻𝗅(R)σ.

Intuitively, the random pair (𝒀,𝓡) represents a “partially correct” sample 𝒀 with its problematic part covered by 𝓡. The conditional Gibbs property guarantees that except for this problematic part, the sample is always distributed correctly. This gives a key invariant condition for the correctness of recursive sampling.

The following is easy to verify for 𝒀 satisfying Assumption 10 with Λ={v}.

Condition 17.

(𝒀,Λ) satisfies conditional Gibbs property on instance I.

The following can be verified routinely by a structural induction: As long as Condition 17 is satisfied by the input, Algorithm 3 terminates with probability 1 and returns a 𝒀 that is distributed as μI. This guarantees the correctness of Algorithm 3 as a sampling algorithm.

To further bound the complexity of Algorithm 3, we need to assume Assumption 11. With such assumption, by the same analysis as in (10), the Bayes filter in Line 2 succeeds with probability at least 112n3. Hence, for any 0<ϵ<1, Algorithm 3 terminates in O(log1ϵ/logn) rounds with probability 1ϵ.

Recursive sampling with LLL augmentation.

Our goal here is to adopt Algorithm 2 for sampling without the additional assumption about correlation decay, into the recursive sampling framework. The resulting algorithm will solve the problem in Theorem 4 under Assumption 10.

As in Algorithm 2, we need LLL augmentation. The following notion of augmented conditional Gibbs property is a variant of the conditional Gibbs property in Definition 16, tailored with LLL augmentation.

Definition 18 (augmented conditional Gibbs).

Let ϵ,γ(0,1), δ(0,γ2), and =0(ϵ,γ,δ), where 0(ϵ,γ,δ)=O~(log1ϵlog1γlog1δ) is defined as in Lemma 12. Let I=({Xi}iU,{Av}vV) be a LLL instance. The random pair (𝐘,𝓡), where 𝐘Σ and 𝓡V, is said to satisfy augmented conditional Gibbs property on instance I with parameter (ϵ,γ,δ), if for any RV with Pr[𝓡=R]>0:

  1. 1.

    the sub-instance I(S[1,](R)) is γ-satisfiable;

  2. 2.

    for S𝗏𝖻𝗅(R), TU𝗏𝖻𝗅(B(R)), for any σΣ𝗏𝖻𝗅(Λ) with Pr[𝓡=RY𝗏𝖻𝗅(R)=σ]>0, conditioned on that 𝓡=RY𝗏𝖻𝗅(R)=σ, YT follows the distribution μI^,Tσ, i.e.,

    τΣT,Pr(YT=τ𝓡=YS=σ)=μI^,Tσ(τ),

    where I^ stands for the augmented LLL instance I^({Xi}iU,{Av}vV{A𝝀(ϵ,γ,δ,R)I}) with A𝝀(ϵ,γ,δ,R)I as in Definition 13.

Recall the choice of parameter (ϵ0,γ0,δ0) in (13) and let 0(ϵ0,γ0,δ0). Let r be the minimal integer that 𝒀 avoids the bad event A𝝀(ϵ0,γ0,δ0,Br(v))I. The following can be routinely verified on the instance I, the random assignment 𝒀, Λ=Br(v), along with the parameter (ϵ,γ,δ,α)=(ϵ0,γ0,δ0,γ0).

Condition 19.

The following hold:

  • 0<ϵ12, 0<αγ<1 and 0<δ<ζ0α;

  • the LLL instance I is α-satisfiable and the sub-instance I(VΛ) is γ-satisfiable;

  • (𝒀,Λ) satisfies the augmented conditional Gibbs property on instance I with parameter (ϵ,γ,δ).

Our goal is then to modify the random assignment 𝒀 around the region Λ to make it follow the correct distribution μI, as long as Condition 19 is satisfied. Adopting the definitions of Aλ,Aλ¯,I^,I^ and P as in Equations 11, 12, and 14 with Λ=Br(v) and (ϵ,γ,δ)=(ϵ0,γ0,δ0). By our choice of r, the bad event Aλ may never occur on 𝒀. Then, Algorithm 2 can be simiplied to the case where 𝒀 always aoids Aλ.

Recursive bootstrapping of marginal probabilities.

A challenge, as we mentioned in Section 3.2, is that the true value of the marginal probability P as defined in (14) is hard to compute locally. This is now resolved by a more sophisticated bootstrapping of the threshold P, by recursively estimating it within a progressively more accurate interval [L,R]. The estimation is formally stated by the following lemma, which is proved in the full version of the paper. It shows that the threshold P can be estimated exponentially more accurately as the radius of local information grows.

Lemma 20.

Let I=({Xi}iU,{Av}vV) be a LLL instance. Let 0<ϵ<12, k+, 0<α1α2<1 and =0(ϵk,α2,α1ϵk), where the function 0 is as defined in Lemma 12. For any nonempty ΛV and an arbitrary event Aλ defined on the variables in 𝗏𝖻𝗅(Λ), assuming that I is α1-satisfiable and I(VΛ) is α2-satisfiable, there is a P^(0,1) determined only by Λ, Aλ, and I(B+1(Λ)), such that

PPr𝑿μI[𝑿 avoids Aλ][P^2ϵk,P^+2ϵk].
Algorithm 4 RecursiveSampling(𝒀; I,Λ,ϵ,γ,δ,α).

The high-level strategy for sampling can be outlined as follows. Given the current estimate interval [L,R] of the threshold P[L,R], as provided by Lemma 20, the algorithm operates as follows:

  • With probability L, the algorithm is determined to enter the branch for sampling 𝒀μI^. This can be resolved efficiently as in Algorithm 3, because of the correlation decay in the augmented instance I^.

  • With probability 1R, the algorithm is determined to enter the branch for sampling 𝒀μI^. This can be resolved recursively with an appropriately enlarged neighborhood containing 𝗏𝖻𝗅(λ), ensuring that Condition 19 is still satisfied invariantly.

  • Otherwise, the algorithm enters the so-called “zone of indecision”, where it enlarges the local radius to gather more information in order to obtain a more accurate estimate [L,R] of the threshold P.

Overall, the above procedure generates a random assignment 𝒀μI. Throughout the recursive calls, we ensure to maintain the invariant Condition 19. The procedure RecursiveSampling(𝒀; I,Λ,ϵ,γ,δ,α) is detailed in Algorithm 4.

Our use of “zones of indecision” for recursive bootstrapping of marginal probabilities is inspired from the Anand-Jerrum algorithm introduced in [1] for solving perfect simulation of Gibbs distributions.

The correctness of Algorithm 4 follows from a structural induction. The complexity of Algorithm 4 is challenging to analyze due to the random recursion, nevertheless, we apply a potential method to bound it.

Lemma 21.

Assume the input of RecursiveSampling(𝐘;I,Λ,ϵ,γ,δ,α) satisfy Condition 19.

  1. 1.

    After RecursiveSampling(𝒀;I,Λ,ϵ,γ,δ,α) returns, 𝒀 follows the distribution μI.

  2. 2.

    For any 0<η<1, with probability 1η, RecursiveSampling(𝒀;I,Λ,ϵ,γ,δ,α) accesses I(Br(Λ)) and updates Y𝗏𝖻𝗅(Br(Λ)), where

    r=0(ϵ,γ,δ)+O~(log1γlog41η+log1γlog21ηlog1α).
 Remark 22.

It is important to note that Assumption 10 is not assumed in the statement of Lemma 21. Instead, the lemma holds as long as that Condition 19 is satisfied. The scenario described in Assumption 10 can in fact be incorporated into Condition 19 as a special case where Λ=Br(v).

4 Conclusion

In this paper, we show that for local computation, the successful output of any fixed-round Las Vegas computation, where failures are reported locally, can be perfectly simulated with polylogarithmic overheads.

As by-products, this gives perfect simulations, via efficient (polylogarithmic-round) local computation, for several fundamental classes of high-dimensional joint distributions, including:

  • random satisfying solutions of Lovász local lemma with non-negligible satisfiability;

  • uniform locally checkable labelings (LCLs) with non-negligible feasibility;

  • Gibbs distributions satisfying the strong spatial mixing with exponential decay.

We develop a novel approach for augmenting Lovász local lemma (LLL) instances by introducing locally-defined new bad events, to create the desirable decay of correlation. We also give a recursive local sampling procedure, which utilizes the correlation decay to accelerate the sampling process, and meanwhile still keeps the sampling result correct, without being biased by the change to the LLL instance.

At first glance, it almost looks like we are creating mixing conditions out of nothing. In particular, the approach seems to bypass the local-lemma-type conditions for sampling (e.g. the one assumed in [20]). But indeed, our augmentation of LLL instances relies on that the LLL instances are fairly satisfiable (or at least the separator between the regions that we want to de-correlate should be enough satisfiable). Sampling in such instances might already be tractable in conventional computation models, e.g. in polynomial-time Turing machine, but the problem remains highly nontrivial for local computation.

This new approach for perfect simulation works especially well in the models where the locality is the sole concern. A fundamental question is how this could be extended to the models where the computation and/or communication costs are also concerned, e.g. 𝖢𝖮𝖭𝖦𝖤𝖲𝖳 model or 𝖯𝖱𝖠𝖬 model.

References

  • [1] Konrad Anand and Mark Jerrum. Perfect sampling in infinite spin systems via strong spatial mixing. SIAM Journal on Computing, 51(4):1280–1295, 2022. doi:10.1137/21M1437433.
  • [2] László Babai. Monte-carlo algorithms in graph isomorphism testing. Université tde Montréal Technical Report, DMS, (79-10), 1979.
  • [3] Ivona Bezáková, Andreas Galanis, Leslie Ann Goldberg, Heng Guo, and Daniel Stefankovic. Approximation via correlation decay when strong spatial mixing fails. SIAM Journal on Computing, 48(2):279–349, 2019. doi:10.1137/16M1083906.
  • [4] Yi-Jun Chang and Seth Pettie. A time hierarchy theorem for the local model. SIAM J. Comput., 48(1):33–69, January 2019. doi:10.1137/17M1157957.
  • [5] Kai-Min Chung, Seth Pettie, and Hsin-Hao Su. Distributed algorithms for the Lovász local lemma and graph coloring. In Proceedings of the 33th ACM Symposium on Principles of Distributed Computing (PODC), pages 134–143, 2014. doi:10.1145/2611462.2611465.
  • [6] Kenneth L Clarkson. Las vegas algorithms for linear and integer programming when the dimension is small. Journal of the ACM (JACM), 42(2):488–499, 1995. doi:10.1145/201019.201036.
  • [7] Peter Davies-Peck. On the locality of the lovász local lemma. In Proceedings of the 57th Annual ACM Symposium on Theory of Computing, STOC ’25, pages 1271–1282, New York, NY, USA, 2025. Association for Computing Machinery. doi:10.1145/3717823.3718103.
  • [8] Paul Erdos and László Lovász. Problems and results on 3-chromatic hypergraphs and some related questions. Infinite and finite sets, 10(2):609–627, 1975.
  • [9] Weiming Feng, Heng Guo, Chunyang Wang, Jiaheng Wang, and Yitong Yin. Towards derandomising markov chain monte carlo. In FOCS, 2023.
  • [10] Weiming Feng, Heng Guo, and Yitong Yin. Perfect sampling from spatial mixing. Random Structures & Algorithms, 61(4):678–709, 2022. doi:10.1002/RSA.21079.
  • [11] Weiming Feng, Heng Guo, Yitong Yin, and Chihao Zhang. Fast sampling and counting k-sat solutions in the local lemma regime. Journal of the ACM (JACM), 68(6):1–42, 2021. doi:10.1145/3469832.
  • [12] Weiming Feng, Nisheeth K Vishnoi, and Yitong Yin. Dynamic sampling from graphical models. SIAM Journal on Computing, 50(2):350–381, 2021. doi:10.1137/20M1315099.
  • [13] Weiming Feng and Yitong Yin. On local distributed sampling and counting. In PODC, 2018.
  • [14] Mohsen Ghaffari, David G Harris, and Fabian Kuhn. On derandomizing local distributed algorithms. In FOCS, 2018.
  • [15] Mohsen Ghaffari, Fabian Kuhn, and Yannic Maus. On the complexity of local distributed graph problems. In STOC, 2017.
  • [16] Heng Guo, Mark Jerrum, and Jingcheng Liu. Uniform sampling through the Lovász local lemma. Journal of the ACM (JACM), 66(3):18:1–18:31, 2019. doi:10.1145/3310131.
  • [17] Bernhard Haeupler, Barna Saha, and Aravind Srinivasan. New constructive aspects of the Lovász local lemma. Journal of the ACM (JACM), 58(6):28, 2011.
  • [18] David G Harris. New bounds for the moser-tardos distribution. Random Structures & Algorithms, 57(1):97–131, 2020. doi:10.1002/RSA.20914.
  • [19] Kun He, Xiaoming Sun, and Kewen Wu. Perfect sampling for (atomic) lov\’asz local lemma. arXiv preprint arXiv:2107.03932, 2021.
  • [20] Kun He, Chunyang Wang, and Yitong Yin. Sampling lovász local lemma for general constraint satisfaction solutions in near-linear time. In FOCS, 2022.
  • [21] Vishesh Jain, Huy Tuan Pham, and Thuy Duong Vuong. On the sampling Lovász local lemma for atomic constraint satisfaction problems. arXiv, abs/2102.08342, 2021. arXiv:2107.03932.
  • [22] Vishesh Jain, Huy Tuan Pham, and Thuy Duong Vuong. Towards the sampling lovász local lemma. In FOCS, 2021.
  • [23] Mark R. Jerrum, Leslie G. Valiant, and Vijay V. Vazirani. Random generation of combinatorial structures from a uniform distribution. Theoret. Comput. Sci., 43:169–188, 1986. doi:10.1016/0304-3975(86)90174-X.
  • [24] Gil Kalai. A subexponential randomized simplex algorithm. In STOC, 1992.
  • [25] Nathan Linial. Locality in distributed graph algorithms. SIAM Journal on Computing (SICOMP), 21(1):193–201, 1992. doi:10.1137/0221015.
  • [26] Nathan Linial and Michael Saks. Low diameter graph decompositions. Combinatorica, 13(4):441–454, 1993. doi:10.1007/BF01303516.
  • [27] Michael Luby, Alistair Sinclair, and David Zuckerman. Optimal speedup of las vegas algorithms. Information Processing Letters, 47(4):173–180, 1993. doi:10.1016/0020-0190(93)90029-9.
  • [28] D Mitchell, B Selman, and H Leveque. A new method for solving hard satisfiability problems. In AAAI, 1992.
  • [29] Michael Mitzenmacher and Eli Upfal. Probability and Computing: Randomized Algorithms and Probabilistic Analysis. Cambridge University Press, 2005. doi:10.1017/CBO9780511813603.
  • [30] Ankur Moitra. Approximate counting, the lovász local lemma, and inference in graphical models. Journal of the ACM (JACM), 66(2):1–25, 2019. doi:10.1145/3268930.
  • [31] Robin A Moser and Gábor Tardos. A constructive proof of the general Lovász local lemma. Journal of the ACM (JACM), 57(2):11, 2010.
  • [32] Rajeev Motwani and Prabhakar Raghavan. Randomized algorithms. Cambridge university press, 1995. doi:10.1017/CBO9780511814075.
  • [33] David Peleg. Distributed computing: a locality-sensitive approach. SIAM, 2000.
  • [34] James G. Propp and David B. Wilson. Exact sampling with coupled Markov chains and applications to statistical mechanics. Random Structures Algorithms, 9(1-2):223–252, 1996. doi:10.1002/(SICI)1098-2418(199608/09)9:1/2\%3C223::AID-RSA14\%3E3.0.CO;2-O.
  • [35] Chunyang Wang and Yitong Yin. A sampling lovász local lemma for large domain sizes. In FOCS, 2024.