Abstract 1 Introduction 2 Model and definitions 3 Gap between 𝑶(𝟏) and 𝚯(𝐥𝐨𝐠𝐥𝐨𝐠𝒏) in paths 4 A property with optimal size 𝚯(𝐥𝐨𝐠𝐥𝐨𝐠𝒏) in unlabeled paths 5 Gap between 𝑶(𝟏) and 𝚯(𝐥𝐨𝐠𝒏) in cycles 6 A property with optimal size 𝚯(𝐥𝐨𝐠𝒏) in cycles References Appendix A No gap above 𝐥𝐨𝐠𝒏 in general graphs with identifiers Appendix B Missing proofs for the gap between 𝑶(𝟏) and 𝚯(𝐥𝐨𝐠𝒏) in cycles Appendix C Proof of a property with optimal size 𝚯(𝐥𝐨𝐠𝒏) in cycles

Complexity Landscape for Local Certification

Nicolas Bousquet ORCID CNRS, INSA Lyon, UCBL, LIRIS, UMR5205, F-69622 Villeurbanne, France Laurent Feuilloley ORCID CNRS, INSA Lyon, UCBL, LIRIS, UMR5205, F-69622 Villeurbanne, France Sébastien Zeitoun ORCID CNRS, INSA Lyon, UCBL, LIRIS, UMR5205, F-69622 Villeurbanne, France
Abstract

An impressive recent line of work has charted the complexity landscape of distributed graph algorithms. For many settings, it has been determined which time complexities exist, and which do not (in the sense that no local problem could have an optimal algorithm with that complexity). In this paper, we initiate the study of the landscape for space complexity of distributed graph algorithms. More precisely, we focus on the local certification setting, where a prover assigns certificates to nodes to certify a property, and where the space complexity is measured by the size of the certificates.

Already for anonymous paths and cycles, we unveil a surprising landscape:

  • There is a gap between complexity O(1) and Θ(loglogn) in paths. This is the first gap established in local certification.

  • There exists a property that has complexity Θ(loglogn) in paths, a regime that was not known to exist for a natural property.

  • There is a gap between complexity O(1) and Θ(logn) in cycles, hence a gap that is exponentially larger than for paths.

We then generalize our result for paths to the class of trees. Namely, we show that there is a gap between complexity O(1) and Θ(loglogd) in trees, where d is the diameter. We finally describe some settings where there are no gaps at all.

To prove our results we develop a new toolkit, based on various results of automata theory and arithmetic, which is of independent interest.

Keywords and phrases:
Local certification, proof-labeling schemes, locally checkable proofs, space complexity, distributed graph algorithms, complexity gap
Copyright and License:
[Uncaptioned image] © Nicolas Bousquet, Laurent Feuilloley, and Sébastien Zeitoun; licensed under Creative Commons License CC-BY 4.0
2012 ACM Subject Classification:
Theory of computation Distributed algorithms
Related Version:
Full Version: https://arxiv.org/abs/2505.20915 [14]
Acknowledgements:
The authors would like to thank Thomas Colcombet for answering our questions about Chrobak normal form, and the reviewers for useful comments.
Funding:
This work is supported by the ANR grant ENEDISC (ANR-24-CE48-7768).
Editor:
Dariusz R. Kowalski

1 Introduction

1.1 Approach

Time complexity landscapes for distributed graph algorithms

For distributed graph algorithms, the most classic measure of complexity is time, measured by the number of rounds before output. One of the most fruitful research programs on this topic has been the one of charting the complexity landscape [42]. That is, instead of considering specific problems and asking for their time complexity, a long series of papers have answered the following question: given a complexity function, is there a problem with that complexity? In this paper, we aim at starting the analogue research program for the space complexity of distributed graph algorithms.

Let us mention some elements about the time complexity landscape, in order to draw analogies with our setting later. For the sake of simplicity, we only discuss deterministic complexity of locally checkable labelings (LCLs) in bounded degree graphs in the LOCAL model. Before 2016, there were a few of classic complexities: O(1) for very simple tasks, Θ(logn) for tasks that could be reduced to coloring, and O(n) for global problems (throughout the paper, n refers to the number of nodes in the graph). Then, several seminal papers established that some problems have complexity Θ(logn) [15, 21], and that there is no problem whose complexity strictly lies between Θ(logn) and Θ(logn) [21]. This gap sharply contrasts with classic computational theory where the time hierarchy theorem prevents the existence of such gaps [38]. It was also proved that in some intervals infinitely many complexities exist [6, 7, 19, 22], but not any complexity function had a corresponding problem. A key strategy in this research area is to understand the landscape for specific classes of networks such as paths [3], grids [16], trees [4, 5, 6], minor-closed graphs [20], etc. In this paper, we will follow a similar approach: characterizing which space complexities are possible depending on the structure of the network.

New direction: A landscape approach to space

The space used by distributed algorithms is much less studied than the time, especially if we restrict to distributed graph algorithms. Some models where space is a well-studied measure are population protocols [2], massively parallel computing (MPC) [39] and models similar to Stone Age [26], but they are not relevant to our approach, either because they differ too much from distributed graph computing (having one-to-one communication or a centralized component) or because they fix the space complexity to constant. As far as we know, the only major line of research considering space complexity for distributed graph algorithms is self-stabilization, and more specifically the state model, where the nodes update their states by reading the states of their neighbors [1, 25]. In that model, the algorithm has to cope with transient faults, and consequently it is common to design algorithms (at least implicitly) with two components : one that builds a solution and one that checks the solution, and can reset the system if needed. Local certification was introduced to study specifically the space needed for the checking phase, by abstracting the computation of the solution into an oracle, and this is our focus today.

Informally, a local certification of a property is an assignment of labels to the nodes of the graph, such that the nodes can collectively check the correctness of a given property by inspecting the labels in their neighborhoods. This notion is known under different names, depending on the specific model considered: proof-labeling scheme [40], locally checkable proofs [36], non-deterministic local decision [33], etc. We will formally define our model in Section 2, and refer to [27] for an introduction to the notion. The (space) complexity of a certification is the maximum size of the label given to a node, and it is known to be basically equal to the space complexity of self-stabilizing algorithms.111There are some fine prints to this statement, that we will discuss later, in Section 1.4. Finally, since local certification is about decision problems, it is now standard to focus on checking graph properties (instead of solutions to graph problems), and to refer to the set of correct graphs as a language.

In local certification, the situation is similar to the pre-2016 situation for the time complexity as described above. That is, there are a few well-established complexity regimes – O(1), Θ(logn), Θ(n) and Θ(n2) – but no solid explanation as of why these are so common. But unlike for distributed time complexity (for LCLs on trees), we can prove that for a wide range of complexities, there exists a problem with that complexity.

Theorem 1.

For general graphs with identifiers, for any non-decreasing function f(n) in Ω(logn) and O(n2), there exists a property that can be certified with O(f(n)) bits, but not in o(f(n)) bits.

The proof (deferred to Appendix A) follows a standard construction of [36]. The graphs satisfying the property are the ones made of two copies of a graph H, linked by a path. The graph H is chosen so that it can be encoded on f(n) bits, and a certification consists in giving this encoding to all nodes, so that every node knows H. It is known that this encoding can be checked locally when identifiers are given, hence we get the O(f(n)) upper bound. A counting argument allows to prove a matching lower bound [36].

At first sight, Theorem 1 gives a trivial answer to our question about the complexity landscape of local certification. But it relies crucially on several assumptions. First, in the upper bound it is necessary to have unique identifiers to avoid being fooled by symmetries at the checking phase. Also, because of the identifiers and because the value of n needs to be certified, Ω(logn) bits are needed. Finally, to make the counting argument of the lower bound work, one cannot restrict too much the graphs in which to apply this theorem. This rises the three following questions that we tackle in this paper.

Question 2.

What happens for complexities in o(logn)?

In other words, is the Ω(logn) a limitation of the proof technique, or could there be a gap between constant and Θ(logn), as the current state of our knowledge suggests?

Question 3.

What about anonymous networks?

This is especially relevant in conjunction with the previous question, since a unique identifier cannot be encoded in a certificate of o(logn) bits. Note that even beyond that regime, the impact of the identifiers on local decision is a well-studied topic [31, 30, 32, 29].

Question 4.

What about structured graphs, like paths, cycles and trees?

In the spirit of the work made on the distributed time complexity landscape, we would like to understand local certification on restricted graph classes, in which counting arguments do not apply, or only partially.

1.2 Main results

We start with a simple setting: anonymous paths without input labels. In this setting, a path is characterized by its length, and a language is defined by a set of authorized lengths. Naturally, we assume that the nodes do not have the knowledge of n (otherwise the problem is trivial). Given O(logn) bits, we can recognize any language. This is because the prover can certify the exact length of the path, by giving to every node its distance to a chosen endpoint as a certificate. The nodes can then check locally the correctness of this counter, and the last node can check whether the length belongs to the authorized sizes. On the other hand, with a constant number of bits, we can encode properties like “the path has length kmodm”, via counters modulo m (where m is a constant). To build a language whose complexity would be strictly between these regimes, it is tempting to consider properties of the form: “the path has length k modulo f(n)” for arbitrary function f, and for which a counter would use logf(n) bits. Unfortunately, the nodes are unable to check that the f(n) is correct, since they do not have access to n. This obstacle does not seem easy to bypass and, it is reasonable to conjecture that there is a gap between O(1) and Θ(logn). We do establish the existence of a gap, but it only goes up to Θ(loglogn).

Theorem 5.

Let c>1 and N. Let 𝒫 be a property on paths that can be certified with certificates of size s(n):=loglognc for all nN. Then, 𝒫 can also be certified with constant-size certificates.

This is the first gap established in local certification. At first, this loglogn looks like an artifact of the proof but, surprisingly, it is not! We next prove that there exists a language for which the optimal certificate size is Θ(loglogn). (Given the previous theorem, it is sufficient to prove that this language can be certified with O(loglogn) bits but not with O(1) bits.)

Theorem 6.

There exist properties on paths that can be certified with certificates of size O(loglogn), but not with certificates of size O(1).

It is the first time that this regime appears in the area of local certification (here under the promise that the graph is a path). The language we use is the set of paths whose length is not a product of consecutive primes; we will come back to it.

Now, moving on to cycles, there is a second surprise. We expect to see the same landscape in paths and cycles, but the intermediate regime actually disappears in cycles, and there is a gap between O(1) and Θ(logn).

Theorem 7.

Let c>12 and N. Let 𝒫 be a property on cycles that can be certified with certificates of size s(n):=lognc for every integer nN. Then, 𝒫 can also be certified with constant-size certificates.

Finally, we study the case of trees, where the picture is more complex, as it depends on the exact setting considered. In some settings, we can prove that there is no gap (any reasonable function corresponds to the optimal certificate size for a language), and in some other cases, the gap from paths persists. These settings depend on three parameters: (1) whether we consider certificate size as a function of the number of nodes n or of the diameter d, (2) whether the nodes can only see what is at distance 1 or at larger distance (in other words the verification radius is 1 or larger), and (3) whether the maximum degree is bounded or not. We establish a classification of these different settings. In particular, we prove that the gap for paths is generalized to arbitrary trees, when parameterized by the diameter, and the verification radius is 1.

Theorem 8.

Let c>2 and D. Let 𝒫 be a property in trees (of unbounded degree) that can be certified using s(d):=loglogdc bits for all dD (where d is the diameter). Then, 𝒫 can also be certified with constant-size certificates.

We then show that two assumptions in Theorem 8 are optimal. Namely, we prove in Theorem 9 that we can not hope for a gap in n instead of d in trees (even in caterpillars, that is, paths with leafs attached), and we establish in Theorem 10 that it is essential that the vertices are able to see only at distance only 1, because there is no more gap if the verification radius r is at least 2 (again, even in caterpillars).

Theorem 9.

Let f: be a non-decreasing function such that limn+f(n)=+ and for all integers 1st we have f(t)f(s)logtlogs. Then, there exists a property on caterpillars (of unbounded degree) that can be certified with certificates of size O(f(n)) and not with certificates of size o(f(n)).

Theorem 10.

Let r>1. Let f: be a non-decreasing function such that limn+f(n)=+ and for all integers 1st we have f(t)f(s)logtlogs. Then, if the vertices can see at distance r, there exists a property on caterpillars (of unbounded degree) that can be certified with certificates of size O(f(d)) and not with certificates of size o(f(d)), where d is the diameter.

Note that the condition on f(t)f(s) only ask for a function which is sublogarithmic and whose growth is sublogarithmic, to avoid arbitrarily long plateaus followed by big jumps. This condition is satisfied by all the usual functions (logn, loglogn, logn, etc.).

All the results mentioned so far are in the anonymous setting. We also explore the impact of identifiers and of the knowledge of n on this landscape, but we delay this discussion for now.

1.3 Main techniques

Automata and arithmetic perspectives

To give an overview of our techniques, let us start by describing two perspectives on constant size certification in anonymous paths. As said earlier, a basic building block is to certify with a counter that the length is equal to imodk for some constant i and k. One can see this behavior as a run in an automaton with k states inducing a cycle whose initial state being 0 mod k and the final state being i mod k. More generally, any constant size certification can be turned into a finite state automaton. There are various ways to do it, but basically, it consists in having states describing (pairs of) certificates, and transitions that connect states that would be accepted by the local verifier. The existence of an accepting run in the automaton is equivalent to the existence of a certificate assignment making the local verifier accept (see Section 3.1 for a more detailed explanation). (Also see [28], and also [23] for a similar perspective in the context of local problems.) Another point of view is the one of arithmetic. Intuitively, a set of lengths of anonymous paths that are accepted by a constant size certification corresponds to a combination of congruence relations (in other words, such a set is equal to the union of arithmetic progressions up to some point). This point of view allows, for example, to derive that the set of lengths of anonymous paths having a given constant size certification is eventually periodic.

Now, moving on to non-constant certifications, we need to introduce a non-uniform automata model. Indeed, since larger paths imply that we can use larger certificates, it also means larger automata. For each certificate size i, we will have an automaton 𝒜i, and we will require that an incorrect instance is rejected by all these automata, while a correct instance is accepted by at least one (small enough) 𝒜i. If a property has non-constant complexity, then it means that we will need arbitrarily large automata. If there exists a certification of size s(n), then for any instance of length n, the automaton 𝒜s(n) will have to accept. Now from the arithmetic perspective, there is an equivalence between having non-constant complexity and the fact that the set of correct lengths is not eventually periodic.

Establishing gaps in paths and trees

To establish a gap between constant size and Θ(loglogn) size in paths, the reasoning is the following.222Actually this proof gives worse constants than the ones of Theorem 5, but it is the one that generalizes to other settings. Consider a language S that does not have a constant size certification. For an arbitrary i, we focus on the paths of S that are recognized by 𝒜i but not by 𝒜1,,𝒜i1 (assume this is non-empty, which is the case for infinitely many i). This set is itself recognized by an automata: the intersection of 𝒜i with the complement of the union of 𝒜1,,𝒜i1. Studying the state complexity of this new automaton (which we do not discuss here) leads us to the fact that it must accept a path of length at most 22i. Finally, this upper bound on the minimum length for which some certificate size is needed translates into a lower bound constraint for the optimal complexity, and the double exponential translates into the double logarithm of the theorem.

This proof can be generalized without much modification to labeled paths (that is, paths with inputs) and to larger verification radius (see the full version [14]). This is pretty straightforward, given the previous proof: we use classic automata instead of unary automata, and a slightly different transformation from certification to automata. On the contrary, the proof that gives the right constants for Theorem 5 utilizes Chrobak normal form for unary automata (see full version), and does not generalize easily to the labeled case.

The proof of the analogous gap in trees (Theorem 8), is based on the same insights, but is much more involved. Basically, we adapt our automata to walk in the tree, reading the pending subtrees as labels. Two challenges are that a given tree can be read in many different ways, and that the alphabet is infinite. We argue that the complexity will be captured by the walks that follow a maximum path in the tree, and that the intersection, complement and union that are used in the proof are not harmed by the infinite alphabet.

The 𝐥𝐨𝐠𝐥𝐨𝐠𝒏 language

We now turn our attention to Theorem 6, which establishes the existence of a language with optimal certification size Θ(loglogn) in anonymous unlabeled paths. Let us start by giving some intuition about how we came to this result. Intuitively, having optimal certificates of size Θ(loglogn) means that certificates of size k allow to differentiate between correct and incorrect instances of size order 22k. When we implement simple counters modulo some constant using k bits, the modulo is of order 2k, hence we can distinguish between correct and incorrect instances of size 2k but not more, in the sense that q and q+2k will be classified in the same category. Hence we need to do an “exponential jump” in terms of distinguishing capability. We now make two observations. First, given a set of pairs (ri,mi)i, the fact that the path length is not equal to rimodmi for some i can be certified with certificates of size O(maxilog(mi)). Indeed, since it is sufficient to prove that one of the modulo equation is not satisfied, we can simply give explicitly mi in all the certificates, in addition to the counter modulo mi. Second, using the Chinese remainder theorem, for any two numbers a and b, there must exist an integer m exponentially smaller than max(a,b) such that abmodm. This makes the existence of a Θ(loglogn) language believable, but making it into a real construction is another kettle of fish. We end up considering the language of the path whose length is not the product of consecutive primes. The scheme consists in certifying that either there is a gap or a repetition in the sequence of primes, via adequate counters, or there is an inconsistency between the largest prime used and what the length of the path is equal to, modulo a well-chosen (small) integer.

The case of cycles

Theorem 7 states that in cycles there is a gap between constant and logarithmic certification size. This is in sharp contrast with the case of paths, which is surprising given the similarity of the two structures. At a very intuitive level, the difference stems from the fact that the only congruence that we can check in cycles are of the form 0modm, and not arbitrary imodm. Indeed, in a path, it is easy to have one endpoint checking counter value zero and the other checking counter value i, while in cycles there are no endpoints. The natural way to simulate the endpoints is to designate a specific vertex v in the cycle to check that the counter starts at 0 from one side and reaches i from the other side, but this is not robust. Indeed there could be more than one designated vertices and still all nodes could accept if each segment is imodm. This breaks the congruence, except if i=0. This restriction in the congruences prevents us from using the arithmetic tools of the proof of Theorem 6.

Let us now sketch the proof of the gap between certificates of size O(1) and Θ(logn) in cycles. We again use an automata-like point of view, but we need to adapt it to work without starting and ending nodes. We consider a sequence of directed certificate graphs333We consider “graphs” and not “automata” here since we have no initial or accepting states, a sequence of certificate only gives us a walk in the graph of pairs of certificates. (Gi)i. Here, a cycle C with certificates of size i (c1,,cn) is accepted by Gi if and only if (c1,c2),(c2,c3),,(cn,c1) is a closed walk in Gi. For the proof, we consider the first length nk that is accepted by the k-th graph Gk (corresponding to certificates of size k), but not by smaller ones. Now, if the certificate size is in o(logn), this walk is much longer than the size of Gk, and therefore it has a lot of cycles. We define a notion of elementary cycle of the graph, and decompose this walk as a linear combination of elementary cycles. If d is the greatest common divisor of these elementary cycles, then nk=d×q, for some q. We consider a set of numbers of the form p×d, such that (1) p is prime, (2) p×d<nk and (3) p×d is still much larger than the size of Gk. By a generalization of Bézout’s identity, (3) implies that all these numbers are accepted by Gk hence they belong to the language. But since they are smaller than nk, by minimality, they must be recognized by smaller graphs too. We argue by pigeon-hole principle that there must exist one graph Gk, k<k that has two walks of lengths p1d and p2d that intersect. Then by concatenating portions of the two walks, we can prove that Gk actually accepts all the cycles of size ap1d+bp2d. Finally, using again a generalization of Bézout’s identity we can prove that the cycle of length nk is also recognized by Gk, which contradicts the minimality of k.

“No gap” results

We also establish that for several settings there is no gap in the complexity, that is, for any well-behaved function f, there exists a property that has certificates of size O(f(n)) but not o(f(n)).

Let us sketch the technique of the upper bound for the setting of Theorem 10: restricting to caterpillars, using certificates whose size is a function of d, with verification radius 2. Given a function f, we define a sequence of integers (bk), such that the k-th term is roughly f1(logk). The correct instances are of the following form: a path of length bk, for some k, such that the i-th node has i leaves, and except for the first one, which has bk leaves instead of 1. A compact way to certify these instances is to give k to every node. The nodes in the middle check the growth of the number of attached leaves (thanks to their radius 2 verification) and the endpoints check that they have bk leaves. The diameter is bk, and the number of bits used is logk which is basically f(d).

1.4 Discussions and open problems

Before we move on to the technical parts, let us discuss open problems and future directions.

Full understanding of paths.

For paths, we do not know what happens between Θ(loglogn) and Θ(logn). By sparsifying the set of primes considered in the loglogn language (Theorem 6), we can get languages for which the natural upper bound can be positioned in between these two regimes, but Theorem 5 does not provide a matching lower bound anymore, hence we cannot prove that there is no gap.

Open problem 11.

Is there a gap between Θ(loglogn) and Θ(logn) in paths?

To solve this question, it would good to get a better understanding of the Θ(loglogn) regime, or even a characterization. (For now, we just have one example of a property in this regime.)

General graphs.

For general graphs, we prove in the full version that if the radius is larger than 1, then there is no gap. For radius 1, it is unclear whether we should expect the same gap as for trees or not. Our automata-related tools seem too weak to tackle this case (the generalization from trees to graphs in automata theory is notoriously intricate).

Open problem 12.

Is there a gap between O(1) and Θ(loglogd) for general graphs? For bounded degree graphs?

The role of identifiers and of the knowledge of 𝒏.

Our main results are for the anonymous setting, where the nodes do not have the knowledge of n. In the full version, we explore several settings with identifiers or (approximate) knowledge of n. We can for example prove that a very sharp estimate of n allows to break the loglogn barrier of Theorem 5, while arbitrarily large identifiers do not help. It is still very unclear how these different assumptions affect the complexity landscape.

Extensions to self-stabilizing algorithms: back space complexity in algorithms.

We described in the introduction our wish to chart the space landscape of distributed graph algorithms, and as a first step we focused on local certification. A natural next step is to transfer our results to self-stabilizing algorithms. As mentioned earlier, the two are tightly connected, since the space used for silent self-stabilization is basically captured by the space needed to certify the solution correctness [10]. Actually, [10] implements a transformation from a local certification to a self-stabilizing algorithm, that does require an additive O(logn) for the memory of the algorithm in comparison to the local certification size. This is usually harmless, but in the setting of this paper, which focuses on sublogarithmic regime, the result is unusable. The restricted topology might allow to shave the additional logarithmic term.

Open problem 13.

Are the optimal certificate size and the optimal memory of a self-stabilizing algorithm asymptotically equal in restricted topologies (paths, cycles, trees), even in the sublogarithmic regime?

When it comes to transferring the sub-log-logarithmic gap for trees to (silent) self-stabilizing algorithms, it would actually be enough to understand whether labelings accepted by tree automata (which are equivalent to monadic second-order on trees) can be built in constant space in a self-stabilizing manner. Incidentally, understanding when one can make this transfer while keeping polynomial time is also a very intriguing question (see [9] for a discussion).

Different landscapes.

Theorem 1 establishes that for any super-logarithmic complexity f there exists a property that requires exactly that complexity. But the construction is very unnatural, since the nodes need to know the function f, and the instances are extremely specific. Hence the following question.

Open problem 14.

Are there super-logarithmic gaps in the complexity of natural properties, for a reasonable definition of “natural”? What about monadic second-order (MSO) properties?

It is known that there are properties for which the optimal certification size is Θ(n), for example having diameter 3 [18, 12] and also Θ(n2) [36]. As far as we know the only works about what happens in between are [11] and [41], that have established that for forbidden subgraphs, there are many polynomial complexities, when using verification radius 2.

We mention MSO properties in the open problem because they have received a considerable amount of attention in recent years in local certification [8, 24, 28, 35, 34], and capture many classic properties and problems through logic.

Finally, another research direction is to chart landscapes for other parameters. For example, [13] explored the certification complexity as a function of the maximum degree.

1.5 Organization of the conference version

For this conference version, we focus on the results on paths and cycles, which allows to give most of the insights within the page limit. Some of the proofs of the lemmas and claims are deferred to the appendix. The proof of our more involved results as well as discussions of the knowledge on n and the identifier assumptions only appear in the full version [14].

2 Model and definitions

2.1 Graphs

All the graphs we consider are finite, simple, loopless, connected, and undirected. For completeness, let us recall several basic graph definitions. Let G be a graph. We denote its set of vertices (resp. edges) by V(G) (resp. by E(G)), or simply by V (resp. by E) if G is clear from the context. Let u,vV(G). The distance between u and v, denoted by d(u,v), is the smallest number of edges in a path from u to v. The diameter of G is the largest distance between any two vertices. If G is a path, its length is its number of vertices (or equivalently, its diameter plus one). We say that G is a caterpillar if, when removing all the degree-1 vertices in G, the resulting graph is a path, called the central path of G (which is induced by the set of all vertices of degree at least two in G).

2.2 Local certification

Let G=(V,E) be a graph. We will sometimes assume that the vertices of G are equipped with unique identifiers and/or with inputs. An identifier assignment for G is an injective mapping from V to some set I (the set of identifiers) and an input function is a mapping from V to some set L (the set of labels). If G is equipped with identifiers, we say that we are in the locally checkable proof model, else we say that we are in the anonymous model. If the vertices of G have inputs, we say that G is labeled. Finally, let C be a non-empty set. A certificate assignment of G with certificates in C is a mapping c:VC.

Let r1 and c be a certificate assignment for G. Let uV. The view of u at distance r consists in:

  • all the vertices at distance at most r from u, and all the edges having at least one endpoint at distance at most r1,

  • the restriction of c to these vertices,

  • the restriction of the identifier assignment (if any) and of the input function (if any) to these vertices.

A verification algorithm (at distance r) is a function taking as input the view at distance r of a vertex, and outputting a decision, accept or reject. In all this paper, if r is not mentioned in a statement of a result, it is by default equal to 1. When we will consider settings where r2, it will always be explicitly written.

Let 𝒞 be a class of (possibly labeled) graphs and 𝒫 be a property on graphs in 𝒞. Note that if we consider labeled graphs, the fact that a graph G𝒞 satisfies the property 𝒫 does not depend only on the structure of G: it depends also on its input function. In other words, it is possible that a graph G𝒞 satisfies 𝒫 and that another graph G𝒞 with the same structure as G but with a different input function does not satisfy 𝒫. Let s:. We say that there exists a certification scheme for 𝒫 with certificates of size s if there exists a verification algorithm such that the two following conditions are satisfied:

  • (Completeness) For every n-vertex graph G𝒞 that satisfies 𝒫, and for every identifier assignment of G (if we are in the locally checkable proof model), there exists a certificate assignment in {0,,2s(n)1} such that the verification of every vertex accepts (we say that the graph is globally accepted).

  • (Soundness) For every graph G𝒞 that does not satisfy 𝒫, for every identifier assignment of G (if we are in the locally checkable proof model), for every k and every certificate assignment in {0,,2k1}, at least one vertex rejects.

Let us emphasize that, if G does not satisfy 𝒫, then for any assignment of certificates of any size, at least one vertex rejects. Let us also point out the fact that in a certification scheme for a property 𝒫 in some class 𝒞 (in this paper, we will for instance consider the cases where 𝒞 is the class of paths, of cycles, of trees…), the vertices have the promise that the graph belongs to 𝒞. In other words, the certification scheme depends on the property 𝒫 and on the class 𝒞, and we are not concerned by the output of the verification procedure of the vertices in graphs that do not belong to 𝒞.

3 Gap between 𝑶(𝟏) and 𝚯(𝐥𝐨𝐠𝐥𝐨𝐠𝒏) in paths

The goal of this section is to prove the following result:

Theorem 15.

Let c>2 and N. Let 𝒫 be a property on paths that can be certified with certificates of size s(n):=loglognc for all nN. Then, 𝒫 can also be certified with constant-size certificates.

Note that the constant c is larger here than in Theorem 5. We prove the stronger version in the full version.

3.1 Preliminary: automata point of view

For every property 𝒫 on paths, we can associate a subsets S of integers such that a path is accepted if and only if its length is in S. The property 𝒫 is equivalent to S and in the rest of the proof, we will completely forget the property 𝒫 and only focus on S. For every k, let us denote by Ck the set of certificates of size k, and by Sk the set of lengths of the paths that are accepted with certificates in Ck. We have: S=kSk.

The set Sk is a regular language that is accepted with the following nondeterministic finite automaton 𝒜k over a unary alphabet. The set of states is the set of pairs of certificates of size k plus two additional states, i and f, that is: Ck2{i,f}. There is a single initial state which is i and a single final state which is f. The transitions are the following:

  • for every c1,c2Ck, we put a transition between states i and (c1,c2) if a vertex of degree 1 that has certificate c1 and has a neighbor with certificate c2 accepts;

  • for every c1,c2,c3Ck, we put a transition between states (c1,c2) and (c2,c3) if a vertex of degree 2 that has certificate c2 and has two neighbors with certificates c1 and c3 accepts;

  • for every c1,c2Ck, we put a transition between states (c1,c2) and f if a vertex of degree 1 that has certificate c2 and has a neighbor with certificate c1 accepts;

  • if there exists cCk such that an isolated vertex with certificate c accepts, we put a transition from i to f.

Let us give an example to make things more concrete. Assume that we want to certify that the length of a path is divisible by 3. There is an easy way to do it by using three certificates 0, 1, and 2. The prover fixes an endpoint u and for every vertex v, the certificate it gives to v is d(u,v)mod3. Then, every vertex v checks that one of the two following conditions is satisfied:

  • v has degree 1, has certificate 0 or 2, and its neighbor has certificate 1, or

  • v has degree 2, and the set of certificates of v and its two neighbors is {0,1,2}.

The automaton corresponding to these certificates is represented on Figure 1.

Figure 1: The automaton corresponding to the certificates used to certify that the length of a path is divisible by 3. The states corresponding to the tuples (0,0), (1,1) and (2,2) are not represented because they have no incoming nor outgoing transitions. The final state is the state f.
Lemma 16.

For every t, a path on t vertices is accepted with certificates of size k if and only if there exists an accepting run of length t (i.e., going through t transitions) in 𝒜k.

The intuition behind this lemma has been given earlier and the formal proof is in the full version.

3.2 Proof via state complexity

Lemma 17.

Let Σ be a (possibly infinite) alphabet, and let L𝒜,LΣ be two languages over Σ recognized by nondeterministic finite automata 𝒜 and , having n𝒜 and n states respectively. Then:

  • L𝒜L can be recognized by an automaton having n𝒜+n states

  • L𝒜L can be recognized by an automaton having n𝒜n states

  • L𝒜¯ can be recognized by an automaton having 2n𝒜 states

These statements are folklore, see the full version for explanations.

Proof of Theorem 15..

Using the notations introduced previously, the automaton 𝒜k has Mk:=22k+2 states and recognizes Sk. Let 𝒫 be a property that can not be recognized with constant-size certificates. Then, the set X containing all the integers k such that Skik1Si is infinite (this set X contains all the integers k such that there exists a path that can be accepted with certificates of size k but not with smaller size). For kX, let nk be the smallest integer in Skik1Si. Let kX be such that nkN (such an integer kX exists because X is infinite and for all distinct k,kX we have nknk).

Since nkSs(nk) and nkSi for i<k, we have s(nk)k. By Lemma 17, ik1Si can be recognized by an automaton that has i=1k1Mi22k states. Thus, by Lemma 17, ik1Si¯ can be recognized by an automaton that has at most 222k states. Again by Lemma 17, Skik1Si can be recognized by an automaton having at most Mk222k states. Since nk is the smallest integer in Skik1Si, it is at most equal to the number of states of this automaton, so it follows that nkMk222k222k+1. Finally we get s(nk)kloglognk12, and the result follows.

4 A property with optimal size 𝚯(𝐥𝐨𝐠𝐥𝐨𝐠𝒏) in unlabeled paths

The goal of this Section is to prove the following theorem.

Theorem 6. [Restated, see original statement.]

There exist properties on paths that can be certified with certificates of size O(loglogn), but not with certificates of size O(1).

Before proving Theorem 6, let us show the following result:

Lemma 18.

Let m,t and m2. Certifying that the length a path on n vertices satisfies ntmodm can be done with certificates of size O(logm).

The proof of this statement can be found in the full version of the paper.

 Remark 19.

For every m,t with m2, we can also certify that the length n of a path satisfies ntmodm with certificates of size O(logm), with the same proof (just by replacing Distance[u]=t by Distance[u]t at the end). In particular, with certificates of size O(logm), we can certify that m divides n, or that m does not divide n.

Finally, let us introduce some notations and give some useful properties. For every k1, let us denote by pk the k-th prime number (i.e. p1=2,p2=3,p3=5…), and let ak be the product of the k first prime numbers: ak:=i=1kpi. Let S be the set {ak|k1}.

Lemma 20.

[37] We have pk=Θ(klogk) and ak=2klogk(1+o(1)).

Lemma 21.

There exists c>0 such that, for every even integer n2, there exists kclogn such that pk divides n and pk+1 does not divide n.

Proof.

Let n be an even integer and let k be the smallest integer such that pk divides n and pk+1 does not divide n (which exists because n is divisible by p1=2). Then, n is divisible by ak, so akn. Using Claim 20, we get 2klogk(1+o(1))n, so klogk(1+o(1))logn, and the result follows.

Lemma 22.

Let 1s<t. There exists mlogt such that stmodm.

Proof.

By contradiction, assume that for all mlogt, we have stmodm. Then, by the Chinese remainder theorem, we get stmodp, where p is the least common multiple of 1,2,,logt. It is well-known that this least common multiple is at least t, so we have 1s<tp. Together with stmodp, this implies s=t, which contradicts the assumption s<t.

Proposition 23.

Let n1, and c be the constant of Lemma 21. Then, nS if and only if at least one of the three following conditions is satisfied:

  1. 1.

    n is odd, or

  2. 2.

    there exists 1<kclogn such that p does not divide n and pk divides n, or

  3. 3.

    there exists 1kclogn and 1mlogn such that pk divides n, pk+1 does not divide n and nakmodm.

Proof.

First, assume that one of the three conditions is satisfied. If condition 1. holds, n is odd, so nS because S contains only even integers. If condition 2. holds, then nS because all the integers in S which are divisible by pk are also divisible by p for all k. If condition 3. holds, then nS, because the only integer in S which is divisible by pk and not by pk+1 is ak.

Conversely, assume that nS, and let us show that at least one of the three conditions is satisfied. Assume that conditions 1. and 2. are not satisfied, and let us show that condition 3. holds. Since condition 1. is not satisfied, n is even, so by Lemma 21, there exists kclogn such that pk divides n and pk+1 does not divide n. Since condition 2. is not satisfied, n is divisible by ak, so ak<n (this inequality is strict because, by assumption, nS). Finally, by Lemma 22, there exists mlogn such that nakmodm, so condition 3. is satisfied.

Note that the third item is used to avoid accepting numbers with some prime used twice. It is tempting to check this condition directly instead of using our indirect check. But in general this would use integers that are too large, for example in the case where n=q2 with q a prime number.

We are now able to prove Theorem 6.

Proof of Theorem 6.

Recall that we assume that the input graph is a path P. Let 𝒫 be the property of being a path whose length is not in S. First, observe that 𝒫 cannot be certified with constant-size certificates. Indeed, properties on paths that can be certified with constant size-certificates are paths whose length is in a set that is eventually periodic (the most simple proof for it uses Chrobak normal form, see the full version), and the set S¯ is not. Now, let us show that 𝒫 can be certified with certificates of size O(loglogn).

Let n1. If nS, the prover certifies that at least one of the three conditions of Proposition 23 is satisfied. More precisely:

  • if n is odd, the prover certifies it. By Lemma 18, this needs O(1) bits.

  • if there exists 1<kclogn such that pk divides n and p does not divide n, the prover writes k and in the certificate of each vertex, and certifies that pk divides n and that p does not divide n. Since kclogn and pk=Θ(klogk), by Lemma 18 and Remark 19, this needs O(loglogn) bits.

  • if there exists 1kclogn and 1mlogn such that pk divides n, pk+1 does not divide n and nakmodm, the prover writes k and m in the certificate, certifies that pk divides n, pk+1 does not divides n and that nakmodm. By Lemma 18 and Remark 19, this needs O(loglogn) bits.

The verification of the vertices just consists in checking that the condition given by the prover is indeed satisfied, with the verification procedure of Lemma 18. Note that the vertices do not need to check the bounds on k and m for conditions 2. and 3., because if condition 2. or 3. is satisfied with larger k or m, it also implies that nS (and then P𝒫 and P can be also accepted with certificates of size O(loglogn)). These bounds on k and m are only useful to get an upper bound on the size of the certificates.

5 Gap between 𝑶(𝟏) and 𝚯(𝐥𝐨𝐠𝒏) in cycles

In this section we prove the following theorem.

Theorem 7. [Restated, see original statement.]

Let c>12 and N. Let 𝒫 be a property on cycles that can be certified with certificates of size s(n):=lognc for every integer nN. Then, 𝒫 can also be certified with constant-size certificates.

5.1 Preliminaries on number theory and walks in graphs

Let us give some results from number theory on which we will rely. First, let us recall the prime number theorem:

Theorem 24 (Prime Number Theorem).

For n, let π(n) be the number of prime numbers in {1,,n}. Then:

π(n)nln(n)

From Theorem 24, we can deduce the immediate following corollary:

Corollary 25.

Let c>12. For every n, let πc(n) be the number of prime numbers p such that 24n+1<p2n(c/22). Then, there exists n0 such that for all nn0, πc(n)>n22n.

Proof.

For every n, we have πc(n)=π(2n(c/22))π(24n+1). Since c>12, by Theorem 24, we have π(24n+1)=o(π(2n(c/22))). Thus, πc(n)π(2n(c/22)). By applying again the prime number theorem, we get πc(n)2n(c/22)/(n(c/22)). Thus, n22n=o(πc(n)), and the result follows.

Finally, let us give the following generalization of Bézout’s identity that will be useful in the proof of Theorem 7:

Lemma 26 ([17]).

Let 1,,t be positive integers. Let m:=max(1,,t) and d:=gcd(1,,t). Then, for every integer nm2 which is divisible by d, there exists non-negative integers a1,at such that i=1taii=n.

We now move on to some definitions about walks in graphs. A closed walk in 𝒢k is a directed path that begins and ends in the same vertex (it is allowed to pass through the same vertex or the same edge multiple times). The length of a closed walk is its number of edges. A closed walk that does not pass through the same vertex twice (except for the starting and ending vertices which are the same) is called an elementary cycle. If 𝒞 and 𝒞 are two directed closed walks, we say that 𝒞 is a closed subwalk of 𝒞 if a subsequence of vertices in 𝒞 is equal to 𝒞. See Figure 2 for an example. Note that the length of an elementary closed walk in 𝒢k is at most equal to the number of vertices of 𝒢k which is 22k.

Figure 2: In this directed graph, ADEDEFA is a closed walk of length 6, which is not an elementary cycle. The closed walks FAGF and CDEFABC are elementary cycles and have length 3 and 6 respectively. Moreover, FAGF is a subwalk of DEFAGFAD.

5.2 Proof of Theorem 7

All this subsection is devoted to the proof of Theorem 7. The proofs of the claims can be found in Appendix B.

Let c>12 and N. Let S be the set of lengths of cycles in 𝒫. Assume by contradiction that there exists a property 𝒫 such that, for every integer nN satisfying 𝒫, the cycle of length n is accepted with certificates of size s(n), where s(n)lognc, and that a constant number of certificates are not sufficient to certify 𝒫.

For every k1, let Ck be the set of certificates of size k, and let Sk be the subset of S corresponding to the cycles accepted with certificates in Ck. Note that |Ck|=2k, and that we have S=kSk. Let 𝒢k=(Vk,Ek) be the directed graph where Vk:=Ck2 and for all a,b,cCk, there is an edge in Ek between (a,b) and (b,c) if and only if a degree-2 vertex with certificate b has a neighbor with certificate a and another neighbor with certificate c accepts (and there are no other edges in Ek, that is there is no edge between (a,b) and (c,d) if bc). The directed graph 𝒢k has 22k vertices.

Claim 27.

For every integer n3, nSk if and only there exists a closed walk of length n in 𝒢k.

By Corollary 25, there exists k0 such that for all kk0, πc(k)>k22k.

Since S is not accepted with a constant number of certificates, the set X of integers k such that Sk1i<kSi is infinite. For kX, let nk be the smallest integer Sk1i<kSi. Finally, let us fix an integer integer kX, such that kk0 and nkN (such an integer kX exists because X is infinite and for all distinct k,kX we have nknk).

Claim 28.

We have nk2ck.

Since nkSk, by Claim 27, there is a closed walk of length nk in 𝒢k. Let us consider the strongly connected component 𝒢k of 𝒢k containing this closed walk. Let t be the number of elementary cycles in 𝒢k that we denote by 𝒞1,,𝒞t, and let 1,,t be their lengths. Let d=gcd(1,,t). We have d22k, because we have i22k for every i{1,,t} (since 𝒢k has size 22k).

Claim 29.

Let 𝒞 be a closed walk in 𝒢k, and let be its length. There exists b1,,bt such that =i=1tbii. Thus, d divides . In particular, d divides nk.

Claim 30.

Let m be such that d divides m, and m24k+1. Then, there exists a closed walk in 𝒢k of length m. Thus, mSk.

Let us now combine these arguments to prove Theorem 7. Before giving the technical details, let us explain the intuition. Let us denote by d the d:=gcd of all the lengths of cycles in Sk. Since 𝒢k has size 22k, Lemma 30 ensures that all the cycles of length rd are in 𝒫 when r is large enough (but small compared to nk). Thus there exist many prime numbers p such that pd are in 𝒫 and pdnk. By definition of nk, at least two of them can be certified with the same set of bits and we can obtain a contradiction. Let us now formalize the argument.

Recall that k is an integer such that kk0 and nkN. Let p be a prime number such that 24k+1<p2k(c/22). By Claim 30, we have pdSk. Moreover, since p2k(c/22) and d22k, we have pd2kc/2, that is, pdnk using Claim 28. Since nk is the smallest integer in Sk1i<kSi, we have pd1i<kSi. For every i{1,,k1}, let Xi be the set of prime numbers p{24k+1+1,,2k(c/22)} such that pdSi. Since there are πc(k) prime numbers in {24k+1+1,,2k(c/22)} and we have πc(k)>k22k, by the pigeonhole principle there exists i<k such that |Xi|>22k. Let us fix this index i.

For every pXi, since pdSi, there exists a closed walk 𝒞(p) of length pd in 𝒢i. Since |Xi|>22k, and since 𝒢i has 22i vertices and i<k, again by the pigeonhole principle there exist p,qXi such that 𝒞(p) and 𝒞(q) have a vertex in common. Since 𝒞(p) has length pd, 𝒞(q) has length qd, and these two cycles have a vertex in common, for every a,b, there exists a closed walk of length apd+bqd in 𝒢i (obtained by starting from a vertex u𝒞(p)𝒞(q), taking a times 𝒞(p) and then b times 𝒞(q)). Thus, for every a,b, apd+bqdSi.

Finally, since gcd(pd,qd)=d (because p and q are two distinct prime numbers), since pd,qdnk, and since nk is divisible by d by Claim 29, we can apply Lemma 26 which states that there exists a,b such that apd+bpd=nk. So nkSi, which a contradiction, because by assumption nkSk1i<kSi. This concludes the proof of Theorem 7.

6 A property with optimal size 𝚯(𝐥𝐨𝐠𝒏) in cycles

Let us finish this short version of the paper with a property that can be certified with O(logn) bits but not with constant-size certificates, to show that the gap stated in Theorem 7 is optimal. The proof is in Appendix C

Proposition 31.

Certifying that the length of a cycle is not a power of 2 can be done with certificates of size O(logn) but not with certificates of size O(1).

References

  • [1] Karine Altisen, Stéphane Devismes, Swan Dubois, and Franck Petit. Introduction to Distributed Self-Stabilizing Algorithms. Morgan & Claypool Publishers, 2019. doi:10.2200/S00908ED1V01Y201903DCT015.
  • [2] James Aspnes and Eric Ruppert. An introduction to population protocols. Bull. EATCS, 93:98–117, 2007.
  • [3] Alkida Balliu, Sebastian Brandt, Yi-Jun Chang, Dennis Olivetti, Mikaël Rabie, and Jukka Suomela. The distributed complexity of locally checkable problems on paths is decidable. In Peter Robinson and Faith Ellen, editors, Proceedings of the 2019 ACM Symposium on Principles of Distributed Computing, PODC 2019, pages 262–271. ACM, 2019. doi:10.1145/3293611.3331606.
  • [4] Alkida Balliu, Sebastian Brandt, Yi-Jun Chang, Dennis Olivetti, Jan Studený, and Jukka Suomela. Efficient classification of locally checkable problems in regular trees. In Christian Scheideler, editor, 36th International Symposium on Distributed Computing, DISC 2022, volume 246 of LIPIcs, pages 8:1–8:19, 2022. doi:10.4230/LIPICS.DISC.2022.8.
  • [5] Alkida Balliu, Sebastian Brandt, Yi-Jun Chang, Dennis Olivetti, Jan Studený, Jukka Suomela, and Aleksandr Tereshchenko. Locally checkable problems in rooted trees. Distributed Comput., 36(3):277–311, 2023. doi:10.1007/S00446-022-00435-9.
  • [6] Alkida Balliu, Sebastian Brandt, Dennis Olivetti, and Jukka Suomela. Almost global problems in the LOCAL model. Distributed Comput., 34(4):259–281, 2021. doi:10.1007/S00446-020-00375-2.
  • [7] Alkida Balliu, Juho Hirvonen, Janne H. Korhonen, Tuomo Lempiäinen, Dennis Olivetti, and Jukka Suomela. New classes of distributed time complexity. In Ilias Diakonikolas, David Kempe, and Monika Henzinger, editors, Proceedings of the 50th Annual ACM SIGACT Symposium on Theory of Computing, STOC 2018, pages 1307–1318. ACM, 2018. doi:10.1145/3188745.3188860.
  • [8] Dan Alden Baterisna and Yi-Jun Chang. Optimal local certification on graphs of bounded pathwidth. CoRR, abs/2502.00676, 2025. doi:10.48550/arXiv.2502.00676.
  • [9] Lélia Blin, Swan Dubois, and Laurent Feuilloley. Silent MST approximation for tiny memory. In Stéphane Devismes and Neeraj Mittal, editors, Stabilization, Safety, and Security of Distributed Systems - 22nd International Symposium, SSS 2020, volume 12514, pages 118–132. Springer, 2020. doi:10.1007/978-3-030-64348-5_10.
  • [10] Lélia Blin, Pierre Fraigniaud, and Boaz Patt-Shamir. On proof-labeling schemes versus silent self-stabilizing algorithms. In Stabilization, Safety, and Security of Distributed Systems - 16th International Symposium, SSS 2014, volume 8756, pages 18–32, 2014. doi:10.1007/978-3-319-11764-5_2.
  • [11] Nicolas Bousquet, Linda Cook, Laurent Feuilloley, Théo Pierron, and Sébastien Zeitoun. Local certification of forbidden subgraphs. CoRR, abs/2402.12148, 2024. doi:10.48550/arXiv.2402.12148.
  • [12] Nicolas Bousquet, Louis Esperet, Laurent Feuilloley, and Sébastien Zeitoun. Renaming in distributed certification. CoRR, abs/2409.15404, 2024. doi:10.48550/arXiv.2409.15404.
  • [13] Nicolas Bousquet, Laurent Feuilloley, and Sébastien Zeitoun. Local certification of local properties: Tight bounds, trade-offs and new parameters. In 41st International Symposium on Theoretical Aspects of Computer Science, STACS 2024, volume 289 of LIPIcs, pages 21:1–21:18, 2024. doi:10.4230/LIPICS.STACS.2024.21.
  • [14] Nicolas Bousquet, Laurent Feuilloley, and Sébastien Zeitoun. Complexity landscape for local certification. CoRR, abs/2505.20915, 2025. doi:10.48550/arXiv.2505.20915.
  • [15] Sebastian Brandt, Orr Fischer, Juho Hirvonen, Barbara Keller, Tuomo Lempiäinen, Joel Rybicki, Jukka Suomela, and Jara Uitto. A lower bound for the distributed lovász local lemma. In Proceedings of the 48th Annual ACM SIGACT Symposium on Theory of Computing, STOC 2016, pages 479–488. ACM, 2016. doi:10.1145/2897518.2897570.
  • [16] Sebastian Brandt, Juho Hirvonen, Janne H. Korhonen, Tuomo Lempiäinen, Patric R. J. Östergård, Christopher Purcell, Joel Rybicki, Jukka Suomela, and Przemyslaw Uznanski. LCL problems on grids. In Elad Michael Schiller and Alexander A. Schwarzmann, editors, Proceedings of the ACM Symposium on Principles of Distributed Computing, PODC 2017, pages 101–110. ACM, 2017. doi:10.1145/3087801.3087833.
  • [17] Alfred Brauer. On a problem of partitions. American Journal of Mathematics, 64(1):299–312, 1942.
  • [18] Keren Censor-Hillel, Ami Paz, and Mor Perry. Approximate proof-labeling schemes. Theor. Comput. Sci., 811:112–124, 2020. doi:10.1016/J.TCS.2018.08.020.
  • [19] Yi-Jun Chang. The complexity landscape of distributed locally checkable problems on trees. In Hagit Attiya, editor, 34th International Symposium on Distributed Computing, DISC 2020, volume 179 of LIPIcs, pages 18:1–18:17, 2020. doi:10.4230/LIPICS.DISC.2020.18.
  • [20] Yi-Jun Chang. The distributed complexity of locally checkable labeling problems beyond paths and trees. In 15th Innovations in Theoretical Computer Science Conference, ITCS 2024, January 30 to February 2, 2024, Berkeley, CA, USA, volume 287 of LIPIcs, pages 26:1–26:25, 2024. doi:10.4230/LIPICS.ITCS.2024.26.
  • [21] Yi-Jun Chang, Tsvi Kopelowitz, and Seth Pettie. An exponential separation between randomized and deterministic complexity in the LOCAL model. SIAM J. Comput., 48(1):122–143, 2019. doi:10.1137/17M1117537.
  • [22] Yi-Jun Chang and Seth Pettie. A time hierarchy theorem for the LOCAL model. SIAM J. Comput., 48(1):33–69, 2019. doi:10.1137/17M1157957.
  • [23] Yi-Jun Chang, Jan Studený, and Jukka Suomela. Distributed graph problems through an automata-theoretic lens. Theor. Comput. Sci., 951:113710, 2023. doi:10.1016/J.TCS.2023.113710.
  • [24] Linda Cook, Eun Jung Kim, and Tomás Masarík. A tight meta-theorem for LOCAL certification of mso2 properties within bounded treewidth graphs. CoRR, abs/2503.19671, 2025. doi:10.48550/arXiv.2503.19671.
  • [25] Shlomi Dolev. Self-Stabilization. MIT Press, 2000. URL: http://www.cs.bgu.ac.il/%7Edolev/book/book.html.
  • [26] Yuval Emek and Roger Wattenhofer. Stone age distributed computing. In Panagiota Fatourou and Gadi Taubenfeld, editors, ACM Symposium on Principles of Distributed Computing, PODC ’13, Montreal, QC, Canada, July 22-24, 2013, pages 137–146. ACM, 2013. doi:10.1145/2484239.2484244.
  • [27] Laurent Feuilloley. Introduction to local certification. Discret. Math. Theor. Comput. Sci., 23(3), 2021. doi:10.46298/DMTCS.6280.
  • [28] Laurent Feuilloley, Nicolas Bousquet, and Théo Pierron. What can be certified compactly? compact local certification of MSO properties in tree-like graphs. In PODC ’22: ACM Symposium on Principles of Distributed Computing, pages 131–140. ACM, 2022. doi:10.1145/3519270.3538416.
  • [29] Laurent Feuilloley and Pierre Fraigniaud. Survey of distributed decision. Bull. EATCS, 119, 2016. URL: http://eatcs.org/beatcs/index.php/beatcs/article/view/411.
  • [30] Pierre Fraigniaud, Mika Göös, Amos Korman, and Jukka Suomela. What can be decided locally without identifiers? In ACM Symposium on Principles of Distributed Computing, pages 157–165. ACM, 2013. doi:10.1145/2484239.2484264.
  • [31] Pierre Fraigniaud, Magnús M. Halldórsson, and Amos Korman. On the impact of identifiers on local decision. In Principles of Distributed Systems, 16th International Conference, OPODIS 2012, volume 7702, pages 224–238. Springer, 2012. doi:10.1007/978-3-642-35476-2_16.
  • [32] Pierre Fraigniaud, Juho Hirvonen, and Jukka Suomela. Node labels in local decision. Theor. Comput. Sci., 751:61–73, 2018. doi:10.1016/J.TCS.2017.01.011.
  • [33] Pierre Fraigniaud, Amos Korman, and David Peleg. Towards a complexity theory for local distributed computing. J. ACM, 60(5):35:1–35:26, 2013. doi:10.1145/2499228.
  • [34] Pierre Fraigniaud, Frédéric Mazoit, Pedro Montealegre, Ivan Rapaport, and Ioan Todinca. Distributed certification for classes of dense graphs. In Rotem Oshman, editor, 37th International Symposium on Distributed Computing, DISC 2023, volume 281 of LIPIcs, pages 20:1–20:17, 2023. doi:10.4230/LIPICS.DISC.2023.20.
  • [35] Pierre Fraigniaud, Pedro Montealegre, Ivan Rapaport, and Ioan Todinca. A meta-theorem for distributed certification. Algorithmica, 86(2):585–612, 2024. doi:10.1007/S00453-023-01185-1.
  • [36] Mika Göös and Jukka Suomela. Locally checkable proofs in distributed computing. Theory Comput., 12(1):1–33, 2016. doi:10.4086/TOC.2016.V012A019.
  • [37] Godfrey Harold Hardy and Edward Maitland Wright. An introduction to the theory of numbers. Oxford university press, 1979.
  • [38] Juris Hartmanis and Richard E Stearns. On the computational complexity of algorithms. Transactions of the American Mathematical Society, 117:285–306, 1965.
  • [39] Sungjin Im, Ravi Kumar, Silvio Lattanzi, Benjamin Moseley, and Sergei Vassilvitskii. Massively parallel computation: Algorithms and applications. Found. Trends Optim., 5(4):340–417, 2023. doi:10.1561/2400000025.
  • [40] Amos Korman, Shay Kutten, and David Peleg. Proof labeling schemes. Distributed Comput., 22(4):215–233, 2010. doi:10.1007/S00446-010-0095-3.
  • [41] Masayuki Miyamoto. Distributed complexity of pk-freeness: Decision and certification. CoRR, abs/2410.20353, 2024. doi:10.48550/arXiv.2410.20353.
  • [42] Jukka Suomela. Landscape of locality (invited talk). In Susanne Albers, editor, 17th Scandinavian Symposium and Workshops on Algorithm Theory, SWAT, volume 162 of LIPIcs, pages 2:1–2:1. Schloss Dagstuhl – Leibniz-Zentrum für Informatik, 2020. doi:10.4230/LIPICS.SWAT.2020.2.

Appendix A No gap above 𝐥𝐨𝐠𝒏 in general graphs with identifiers

We establish that there is no gap in the local certification complexity in general graphs with identifiers in the Ω(logn) regime. The proof follows the same route as the proof of the Θ(n2) bound for non-trivial automorphism in [36] .

Theorem 1. [Restated, see original statement.]

For general graphs with identifiers, for any non-decreasing function f(n) in Ω(logn) and O(n2), there exists a property that can be certified with O(f(n)) bits, but not in o(f(n)) bits.

Since the proof is a rather direct adaptation of the one of [36] and is not central in this paper, we use a more sketchy style than for our other proofs.

Proof.

Fix some function f in Ω(logn) and O(n2) consider the following language. A graph is in the language if it is made of two copies of a graph H on f(n) nodes,where an arbitrary node is linked to its copy by a path of length n2f(n). A certification for this language is the following. On a correct instance, every node is given the following pieces of information.

  1. 1.

    Its part of the certification of the size n of the graph, via a spanning tree (see [27]).

  2. 2.

    The adjacency matrix of H.

  3. 3.

    The identifier assignment restricted to the copies of H.

  4. 4.

    Its parts of two spanning trees, pointing to the two nodes that belong both to a copy of H and to the path.

Every node checks the following.

  1. 1.

    Item 1 and 4 above are consistent (again see [27]).

  2. 2.

    If its identifier appears in the list of Item 3, it checks that its neighborhood in the graph is consistent with the neighborhood in H as described by the certificates; except if it is the root of a spanning tree of Item 4, in which case it should have exactly one additional neighbor.

  3. 3.

    If its identifier does not appear in the certificates it should have degree 2.

The correctness is follows from [36], where the same scheme is used except that the central path has constant length (and therefore the identifiers of the nodes on the path can be given to all nodes without overhead).

The spanning tree certifying the number of nodes can be encoded in O(logn) bits, the adjacency matrices use O((f(n))2)=O(f(n)) bits, and the identifier assignment uses 2f(n)logn bits. Hence in total O(f(n)) bits in the regime of the theorem.

Again the lower bound is very similar to the one of [36]. Basically, if we were to use o(f(n)) bits, by pigeon-hole principle, there would be two different correct instances of the same size n, for which the same certificates would be used on one edge of the path. Then we could consider the graph where we take the right part from one instance and the left part from the other (with their accepting certificates), gluing on the edge with identical certificates. This new graph would be accepted, but it is not in our language since the graphs at the end of the path are different. A contradiction.

Appendix B Missing proofs for the gap between 𝑶(𝟏) and 𝚯(𝐥𝐨𝐠𝒏) in cycles

In this appendix section, we restate and prove the claim of Section 5 for which the proof was missing.

Claim 27. [Restated, see original statement.]

For every integer n3, nSk if and only there exists a closed walk of length n in 𝒢k.

Proof.

If nSk, there exists an assignment of certificates c1,,cn to the vertices of a n-vertex cycle such that every vertex accepts. For every i{1,,n}, since the vertex with certificate ci accepts and has two neighbors with certificates ci1 and ci+1 (where i1 and i+1 are taken modulo n), by definition there is an edge in 𝒢k from (ci1,ci) to (ci,ci+1). So this gives an closed walk of length n in 𝒢k. Conversely, if there exists a closed walk (c1,c2),(c2,c3),,(cn,c1) in 𝒢k, by definition all the vertices of the cycle of length n with certificates c1,,cn accept, so nSk.

Claim 28. [Restated, see original statement.]

We have nk2ck.

Proof.

By definition of s(nk), we have nkSs(nk). Since nkSk1i<kSi, it follows that s(nk)k. Moreover, by assumption, s(nk)lognkc. So nk2ck.

Claim 29. [Restated, see original statement.]

Let 𝒞 be a closed walk in 𝒢k, and let be its length. There exists b1,,bt such that =i=1tbii. Thus, d divides . In particular, d divides nk.

Proof.

Let us prove this result by induction on the length of 𝒞. The base case includes all elementary cycles: if 𝒞 is an elementary cycle, there exists j{1,,t} such that =j, so the result is trivially true. Assume now that 𝒞 is not an elementary cycle, and consider the shortest closed subwalk 𝒞 of 𝒞. Then, 𝒞 is an elementary cycle (otherwise it would not be the shortest subwalk of 𝒞), so 𝒞{𝒞1,,𝒞t} and its length is equal to j for some j{1,,t}. Let us denote by 𝒞𝒞 the closed walk obtained by removing from 𝒞 the steps of 𝒞. The length of 𝒞𝒞 is j. Finally, apply the induction hypothesis to the closed walk 𝒞𝒞, to obtain integers b1,,bt such that j=i=1tbii. The result follows.

Claim 30. [Restated, see original statement.]

Let m be such that d divides m, and m24k+1. Then, there exists a closed walk in 𝒢k of length m. Thus, mSk.

Proof.

First, we construct greedily a closed walk 𝒞0 in 𝒢k that passes through all the vertices of 𝒢k (it exists, because 𝒢k is strongly connected). For every u,vV(𝒢k), the shortest directed path from u to v has length at most the number of vertices of 𝒢k which is 22k. Thus, there exists a closed walk 𝒞0 of length 0(22k)2=24k that passes through all the vertices. By Claim 29, d divides 0, so d divides m0. Furthermore, m024k. Since we have max1iti22k, we can apply Lemma 26 to get the existence of integers a1,,at such that m0=i=1taii. Finally, to construct a closed walk in 𝒢k of length m=0+i=1taii, we attach ai times the elementary cycle 𝒞i to the closed walk 𝒞0 for every i{1,,t} (this is possible, because 𝒞0 passes through all the vertices). By Claim 27, we have mSk.

Appendix C Proof of a property with optimal size 𝚯(𝐥𝐨𝐠𝒏) in cycles

Let us restate the result and prove it.

Proposition 32.

Certifying that the length of a cycle is not a power of 2 can be done with certificates of size O(logn) but not with certificates of size O(1).

Proof.

Let C be a cycle of length n, let uV(C) and let P be the path obtained from C by deleting one edge adjacent to u. To certify that n{2k,k}, the prover writes in the certificate of every vertex vV(C) the tuple (d,i) where d3 is an odd integer that divides n, and i is the distance from u to v in P modulo d. Every vertex checks that, if its certificate is (d,i) then its two neighbors have certificates (d,i1modd) and (d,i+1modd), and that d is indeed odd. Such a certificate has size O(logn). This scheme is correct: indeed, if all the vertices accept, the length of the cycle should be divisible by d (and conversely, with the certificates described above, all the vertices will accept if the cycle has length divisible by d).

Now, assume by contradiction that certifying that the length n of a cycle is not a power of 2 can be done with certificates of constant size k (or equivalently, that 2k distinct certificates are sufficient). Let p be an odd prime number such that p>2k. Let us consider an assignment of certificates to the vertices of a cycle C of length p such that all the vertices accept. Let us number the vertices of C in clockwise order starting from an arbitrary vertex, and for every i{0,,n1}, let ui be the i-th vertex in this numbering, and ci be its certificate. By the pigeonhole principle, there exists 0i<jn1 such that (ci,ci+1)=(cj,cj+1) (where j+1 is taken modulo n). If j=i+1, then a vertex with certificate ci accepts with two neighbors having certificate ci. Thus, in this case, any cycle is accepted (by giving the certificate ci to all the vertices) which is a contradiction. Else, let 1:=ji and 2:=pj+i. We have 1,2{1,,p1} and 1+2=p. Since p is prime, we get gcd(1,2)=1. Moreover, for any a1,a2, a cycle of size a11+a22 is accepted. Indeed, by cutting such a cycle in a1 portions of length 1 and a2 portions of length 2, giving the certificates ci,,cj1 to the vertices in portions of size 1 and cj,,cn1,c0,,ci1 to the vertices in portions of length 2, all the vertices accept because they have the same view as a vertex which accepts in C. Using Lemma 26, all the the cycles of length mp2 are accepted, which is a contradiction because the cycles whose length is a power of 2 greater than p2 should not be accepted.