Distributed Complexity of -Freeness:
Decision and Certification
Abstract
The class of graphs that do not contain a path on nodes as an induced subgraph (-free graphs) has rich applications in the theory of graph algorithms. This paper explores the problem of deciding -freeness from the viewpoint of distributed computing.
For specific small values of , we present the first algorithms specified for -freeness, utilizing structural properties of -free graphs in a novel way. Specifically, we show that -freeness can be decided in rounds for in the model, and in rounds for in the model, where is the number of nodes in the network and hides a factor. The main technical contribution is a novel technique used in our algorithm for -freeness to distinguish induced -paths from non-induced ones, which is potentially applicable to other induced subgraphs. This technique also enables the construction of a local certification of -freeness with certificates of size . This improves by Bousquet and Zeitoun (TCS 2025), and is nearly optimal, given our lower bound on certificate size.
For general , we establish the first lower bound, which is of the form . The factor is unavoidable, in view of the upper bound by Eden et al. (Dist. Comp. 2022). Additionally, our approach yields the first superlinear lower bound on certificate size for local certification. This partially answers the conjecture on the optimal certificate size of -freeness, asked by Bousquet et al. (arXiv:2402.12148).
Finally, we propose a novel variant of the problem called ordered detection. We show that in the model, the round complexity of ordered detection is for , and in contrast, proving any nontrivial lower bound for ordered detection implies a strong circuit lower bound. As a byproduct, we establish a circuit-complexity barrier for quantum lower bounds for induced -cycle detection. This is complemented by our quantum upper bound, which surpasses the classical lower bound by Le Gall and Miyamoto (ISAAC 2021).
Keywords and phrases:
subgraph detection, CONGEST model, local certification2012 ACM Subject Classification:
Theory of computation Distributed algorithmsAcknowledgements:
The author would like to thank François Le Gall for discussions and for commenting on an earlier version of this paper. The author also acknowledges the anonymous reviewers for their helpful comments.Editors:
Ho-Lin Chen, Wing-Kai Hon, and Meng-Tsung TsaiSeries and Publisher:
Leibniz International Proceedings in Informatics, Schloss Dagstuhl – Leibniz-Zentrum für Informatik
1 Introduction
1.1 Background
The subgraph detection problems in the distributed setting of limited communication bandwidth, including the model, are interesting and well studied problems. For a given specific small pattern graph (the description of is known to all nodes of the network), the goal of detection is to decide if the network contains as a subgraph (or an induced subgraph). If the network contains , at least one node outputs “yes”, otherwise all nodes of the network output “no”. ( detection is often referred as freeness, where at least one node outputs “no” if the network contains , and otherwise all nodes of the network output “yes”. In this paper we abuse these two problems since they are essentially equivalent.) There are several variants previously considered in the literature: listing requires to output all copies of in (each node outputs some copies of and the union of all output lists are equivalent to the list of all copies of in ). In recent years, significant progress in distributed subgraph detection has been made, and we now understand well the round complexities for detection of e.g., cliques and cycles. However, much less is known about induced paths. In the full version, we also summarize the previous results in this context.
Since it is known that in the model, detecting non-induced -paths can be done in rounds for constant [27], our focus in this paper is specifically on the induced -path detection problem. Induced -paths are particularly more important than non-induced paths because graphs that do not contain an induced -path (often called -free graphs) have numerous applications in algorithmic graph theory, particularly in the centralized setting. The absence of induced -node paths imposes structural constraints on graphs, which can be leveraged to develop efficient algorithms for problems like independent set or coloring [5, 20, 22, 31, 23, 36]. Despite its importance, few results are known for -freeness in the distributed setting. The only known result we are aware of is the result of Nikabadi and Korhonen [34], who explored the multicolored variant of the induced -path detection, where each node is colored by an integer from , and the goal is to detect an induced path on different colors. They demonstrated that multicolored induced detection requires rounds for any . However, their proof fails for -freeness. To the best of our knowledge, there is no nontrivial result111For the trivial case of , detection can be done in rounds. for -freeness in the model beyond the general result by [14]: for any subgraph on nodes, there is a randomized algorithm that solves induced detection in rounds.
Local certification of -freeness
Another line of research focuses on local certification of -freeness. In local certification, small labels called certificates are assigned to the nodes of the graph, and each node decides if the graph satisfies some property by using its local view. There are two important parameters, that is, the size of certificates and the locality of verification. In local certification of size and locality , each node receives an -bit certificate from the external entity called the prover, and each node decides its binary output (accept or reject) only using the unique identifiers and certificates of its -hop neighbors. The scheme (with the locality ) is introduced by [28] under the name of proof-labeling schemes, which is now a popular setting in the community of distributed computing:
Definition 1 (proof-labeling schemes).
A proof-labeling scheme () of size is a pair () of functions and of the input graph such that
-
for each node , the function assigns a bit string of length called the certificate of ;
-
for each node , the function , called the verification algorithm, depends on the certificates of -hop neighbors of as well as the identifiers of these nodes, and decides the output of . More precisely, let be the -hop neighbors of , then
is the output of .
Let be a graph property. We say that there is a that certifies if there exists a verification algorithm with locality (which outputs or ) satisfies the following conditions.
-
If satisfies the property , there exists a certificate function such that all nodes in output .
-
If does not satisfy the property , for any certificate function , at least one node in outputs .
When the locality of the verification algorithm is , there is a trivial way to certify -freeness: by assigning the list of IDs of neighbors as a certificate, each node try to find a by looking edges adjacent to its -hop neighbors. If there is a , then its central node can now detect it. Thus, our interest lies in cases where the path length is long relative to the locality. Recently, Bousquet, Cook, Feuilloley, Pierron, and Zeitoun [2] studied this topic by analyzing the relationship among path length, certificate size, and locality of verification. They provided various nontrivial upper and lower bounds on certificate size (we discuss details in Section 1.2). While local certification of various subgraph-related problems has been studied (e.g., [3, 11, 18, 19]), the only known result for local certification of -freeness prior to [2] was for : there is a that certifies -freeness with bits [17]. Moreover, [17] proved that every property can be certified with bits and locality 1 if all graphs satisfying have bounded clique-width. Since -freeness can be expressed by , this suggests that the difficulty gap between certifying -freeness and -freeness can be attributed to the fact that -free graphs have bounded clique-width, whereas -free graphs do not. After that, Bousquet and Zeitoun [4] constructed a that certifies -freeness with bits.
1.2 Our results
1.2.1 Result 1: -freeness for and
We first show the following upper bounds for and . Throughout the paper, we use “a randomized algorithm” as an algorithm that outputs the correct answer with probability at least 2/3.
Theorem 2.
There exists a randomized algorithm that solves -freeness in the model, running in rounds.
Theorem 3.
There exists a randomized algorithm that solves -freeness in the model, running in rounds.
Technical challenges
As mentioned above, there were no nontrivial algorithms for -freeness for in the model beyond the general upper bound [14]. Moreover, the previous algorithms for subgraph detection have been mostly focused on non-induced cases. Most techniques from these results seem to be useless for induced subgraph detecion. Typical examples are the expander decomposition applicable to cliques [8, 9], and the BFS search with threshold applicable to (non-induced) cycles [6, 18]. We thus need to exploit structural properties of /-free graphs, resulting our algorithms quite different from the other algorithms for subgraph detection. We believe that our approach hints new algorithms for other induced subgraphs.
We turn our attention to the framework of local certification. We get the following certification scheme for -freeness.
Theorem 4.
There is a that certifies -freeness with certificates of size .
To obtain this result, we directly use the algorithm of Theorem 3. This is somewhat interesting, as algorithms for subgraph freeness cannot be used for local certification of subgraph freeness in general (unless they are algorithms). We also show that the size of our certificates is optimal, up to a subpolynomial factor:
Theorem 5.
For any , any for -freeness requires certificates of size .
This result is proved via a combination of several known reduction techniques: we first reduce the nondeterministic three-party communication complexity of set-disjointness function to triangle freeness, which is then reduced to -freeness.
Independent and concurrent works
After submitting the first draft of this paper, we learned about the following independent and concurrent works: Bousquet and Zeitoun [4] proved that there is a that certifies -freeness with certificates of size . Additionally, as noted in [4], Chaniotis, Cook, Hajebi, and Spirkl (independently and concurrently) obtained lower bound for -freeness.
1.2.2 Result 2: -freeness for general
We provide the following lower bounds for -freeness in the (randomized) model using the two-party communication complexity, which is the standard framework for distributed subgraph detection problems.
Theorem 6.
Let be any positive integer.
-
For , -freeness for require rounds in the randomized model.
-
For , -freeness for requires rounds in the randomized model.
The factor is unavoidable, in light of the upper bound of established by [14]. We will show these lower bounds using the standard reduction from two-party communication complexity. Although the framework used here is classic, there are several challenges that makes the proof highly non-trivial, which we discuss in detail later in Section 5.2.
Additionally, our constructions apply to the nondeterministic two-party communication setting, leading to the following.
Theorem 7.
Let and be positive integers.
-
For , any for -freeness requires certificates of size for .
-
For , any for -freeness requires certificates of size for .
Comparison with the recent results by Bousquet et al. [2]
Let us now compare our results with those of Ref. [2], who considered local certification of -freeness and obtained the following:
Theorem 8 ([2]).
If the locality is , then:
-
-freeness for can be certified with certificates of size ;
-
-freeness for can be certified with certificates of size ;
-
-freeness for requires certificates of size .
They also conjectured the following.
Conjecture 9 ([2]).
For all , the optimal size for local certification of -free graphs with locality is of the form , for some unbounded increasing function .
We summarize our results and previous results in Table 1. For , the results of Bousquet et al. [2] provide nontrivial lower bound for -freeness, and Bousquet and Zeitoun [4] provide nontrivial upper bound for -freeness. In Theorems 4 and 5, we show the nearly optimal bound of -freeness for .
1.2.3 Result 3: Ordered path detection and applications
We then define and study the following problem called ordered detection.
Definition 10 (ordered detection).
Each node of the graph has a color from , and the goal is to detect an induced path that consists of edges on nodes where is colored by .
Motivation
The definition of ordered path detection may seem somewhat artificial; however, it can be motivated in a manner similar to that of multicolored path detection studied in [34]. Algorithms for the multicolored/ordered variants of these problems with color-coding techniques [1] are often used to address the standard version. For example, the state-of-the-art algorithm for detecting -node cycles [6, 16] actually detects the ordered variant of -cycles. Consequently, lower bounds for the ordered variant reflect the limitations of algorithms that employ color-coding. Moreover, as demonstrated in Theorem 12, we will show that ordered path detection also provides some nontrivial insight for subgraph detection in the quantum model, which is not restricted to algorithms using color-coding.
Contribution
We first prove the following lower bound for .
Theorem 11.
Any randomized algorithm that solves ordered detection for requires rounds in the model.
We then focus on ordered detection. Our next finding is that, similarly to triangle detection, proving non-trivial lower bounds for ordered detection is difficult. This result also shows a circuit complexity barrier for induced detection.
Theorem 12 (Informal).
For any constant , proving any lower bound of the form for ordered -path detection in the model or for induced detection in the quantum model implies super-linear lower bounds on circuit complexity for an explicit family of boolean functions.
Note that our circuit complexity barrier for induced detection is in the quantum model [30]. We then complement this result by showing a nontrivial upper bound for induced .
Theorem 13.
In the quantum model, induced detection can be solved in rounds with high probability.
Previously, no nontrivial upper bound for induced detection was known, and our bound for induced detection beats a classical lower bound [29].
2 Preliminaries
We write . We let be the set of neighbors of . For a graph , we say that is a subgraph of if there is an injective function such that for any pair of nodes . We say that is an induced subgraph of if there is an injective function such that for any pair of nodes . Let be a path on nodes. We say that is -free if does not contain as an induced subgraph. Throughout the paper, we assume that the input graph is connected, otherwise we can treat each connected component separately.
The model and variants
In the model [35], each node of the network has a distinct -bit identifier and can communicate with its neighbors in a synchronized manner. In each round each node (1) does some local computation and (2) sends an -bit message to each of its neighbors. In the initial state, each node only knows its own unique identifier (represented by bits) and input, and the unique identifiers of its neighbors. The model is a weaker model in the sense that each node can only broadcast a -bit message in each round. In the model [32], we allow all-to-all communication in each round. That is, the communication topology is always the -node clique, and the input graph is a subgraph of the clique. In the quantum model [30], each node can send -qubit quantum message to each of its neighbors, instead of -bit classical message.
3 Algorithm for -freeness
In this section we prove Theorem 2.
It is well known that a graph is -free iff it is a cograph [10]. A cograph is defined as a graph that is constructed using the following rules:
-
A single-node graph is a cograph;
-
For cographs and , its disjoint union is a cograph;
-
For cographs and , its join is a cograph;
Using this characterization, we can construct a low-congestion spanning tree for any cograph as follows. A similar property on the spanning tree of cographs is also used in distributed (interactive) proofs for -freeness [17, 33]. We use the following results on the balls into bins problem.
Lemma 14 (Balls into bins problem [21]).
Assume that there are bins and balls. Each ball uniform randomly selects one bin which it places into. Then, with probability at least , all bins have at most balls.
Lemma 15.
There exists a algorithm that runs in rounds and performs the following:
-
If some node rejects, then is not a connected cograph.
-
Otherwise, with high probability, it outputs a rooted spanning tree of depth 2, where each node, except the root, has at most children.
Proof.
We first reject if the diameter of the graph is not , since every connected cograph has diameter at most 2. This can be accomplished in rounds by running a BFS search from the node with minimum identifier as the depth of the BFS tree provides a -approximation of the diameter. From the definition of cographs, a connected cograph is constructed from the join operation on two distinct node sets and where and . Let be the node with the maximum degree in . Note that can be found in rounds since the diameter of the graph is constant. If , the algorithm rejects (if is a connected cograph, then the maximum degree is at least as each node in has degree at least ). Consider the tree rooted at where all neighbors of are at depth 1. As below, we can construct the desired spanning tree with high probability if the graph is a connected cograph.
-
Case 1: . Define . Observe that and . Each uniform randomly selects one node from its neighbors as its parent in the tree. Now, each depth-1 node (bin) is selected as the parent of (ball) with probability at most as . Then from Lemma 14, all depth-1 nodes have at most children with high probability.
-
Case 2: . Define . Since , we have
Each node in selects one of its neighbors as its parent uniform randomly. Now, each depth-1 node (bin) is selected as the parent of (ball) with probability at most as . Then from Lemma 14, all depth-1 nodes have at most children with high probability.
Proof of Theorem 2.
Ref. [25] demonstrated a protocol in the randomized multiparty simultaneous messages model (also known as the distributed sketching model), where each node sends a message of size to a referee. The referee can then determine if is a cograph with high probability. We can simulate this protocol in the constructed depth-2 spanning tree, with the root acting as the referee. Since each depth-1 node has at most children, sending all messages to is completed in rounds.
4 Algorithm and certification for -freeness
In this section we give an overview of the proof strategy for Theorem 3 and Theorem 4. Since these theorems share the same proof strategy, we focus on our algorithm for -freeness (Theorem 3). Full proof of Theorem 4 will be found in the full version.
Our algorithm aims to efficiently detect whether a given graph is -free, focusing on cases where the graph has a diameter of 2 or 3 (observe that if the diameter is at least 4, then it must not be -free). We treat these cases separately due to their distinct properties.
4.1 Subgraph freeness in graphs with diameter two
Here we assume that the graph with nodes and edges has diameter 2. Let be the node with maximum degree. Then . Divide the node set where is the set of nodes whose distance to is . We first consider that every node broadcasts the list of its neighbors in rounds. After that learns all the edges connected to . In the remaining part we show that, with high probability, can learn the edge set in another rounds. Therefore, can locally decide if the graph is free.
Assume that we divide into subsets for as follows (assuming is an integer): each node chooses an integer uniformly at random and joins . Next, we label the set as where is the -th smallest among IDs of nodes in . The partition of and the label of nodes with respect to the order of IDs are informed to all the nodes in the network in rounds. We use the following lemma.
Lemma 16 ([8]).
Consider a graph with edges and vertices. We generate a subset by letting each vertex join independently with probability . Suppose that the maximum degree is and . Then, with probability for sufficiently large , the number of edges in the subgraph induced by is at most .
Here we set , , . Clearly, holds. As , one can verify that holds if . We assume that holds since otherwise a simple -round algorithm suffices. Then, with high probability, for all . Consider the node collects the edge set for . To do this, assume that . Then there exists a node such that since the diameter of the graph is 2. knows the edge because of the previous round communication and thus can send it to . Since , this is done in rounds. Finally, sends to in rounds. Now we have established the following theorem.
Theorem 17.
In graphs with diameter 2, for any subgraph , (non-induced and induced) -freeness can be solved in rounds in the model with high probability.
4.2 -freeness in graphs with diameter three
A High-Level Overview of Our Approach
In the case of a graph with diameter 3, the larger diameter necessitates more tricky definitions and case analysis, resulting in a more involved proof. We outline the main ideas of our proof here before giving details.
Let be the set of nodes at distance from , for . The edges (except the ones incident to ) are partitioned as for . We employ a procedure (Algorithm 1) that runs in rounds, enabling to gather and .
: the node with maximum degree.
: the rooted spanning tree created by the BFS traversal from .
- Step 1
-
Each node broadcasts the list of its neighbors in rounds.
- Step 2
-
Consider the partition of into subsets for as follows: Each node chooses an integer uniformly at random, joins and tells it to all other nodes so that all nodes in the graph learn the partition . The node broadcasts the set of nodes that contains all nodes in in ascending order with respect to their IDs so that all nodes in the graph agree with such that is the -th smallest ID in .
- Step 3
-
Define by the set of edges satisfying at least one of the following:
-
1.
where and for the partition , and is a neighbor of such that the integer is assigned in Step 2 for ,
-
2.
.
We also define by the set of edges satisfying . Each node tells the neighbors the list of its incident edges in and .
-
1.
- Step 4
-
For each edge , sends the edge between and to if received in Step 1. After that, each sends these edges between and to . This takes rounds since the number of edges between and is w.h.p., due to Lemma 16.
- Step 5
-
computes . This is done by using the tree . then rejects if the following holds:
- Condition 1
-
There is an edge in that is not sent to .
For each edge , rejects if the following holds:
- Condition 2
-
.
For each edge , rejects if the following holds:
- Condition 3
-
There is a node such that .
For each edge , rejects if the following holds:
- Condition 4
-
.
For this procedure, we have the following.
Lemma 18 (Restated in Lemma 22).
If any of the conditions in (Conditions 1-4) are met, then contains an induced .
After the execusion of , if detects any missing edge in – where is a carefully defined subset of (defined in Step 3 of ) – it can safely conclude that the graph is not -free. This is because (i) the invalidity of Condition 2 of ensures that , and (ii) the invalidity of Condition 1 of ensures that all edges in have been sent to . Therefore, if no node rejects in , learns .
The following lemmas shows that, for any containing an edge from , detection is performed by nodes connected to the endpoints of these edges.
Lemma 19 (Restated in Lemma 23).
Assume that no node rejects in . If contains an induced with at least one edge from , then there is a node that detects an induced .
Lemma 20 (Restated in Lemma 24).
Assume that no node rejects in . If contains an induced with at least one edge from , then there is a node that detects an induced .
For any composed solely of edges from , detection is performed by node , which now has access to . The key challenge here is distinguishing between proper s (induced by the entire edge set ) and improper s (induced by but not by ). We address this by having count the improper patterns and subtract this count from the total number of s it finds. As illustrated in Figure 1, there are 17 non-isomorphic patterns for possible improper s. We introduce the concept of a “dangerous” induced subgraph. An induced copy of in the input network is dangerous iff it induces a in . The following lemma shows that the number of such dangerous patterns can be counted efficiently.
Lemma 21 (Restated in Lemma 25).
Let be a set of five-node graphs that contain as a (non-necessarily induced) subgraph as in Figure 1. Then, for any , the number of induced copies of in the graph that are dangerous can be counted (by the node ) in rounds.
This yields the correct count of proper s, since the number of s in is now obtained by subtracting the number of dangerous patterns in from the number of s in .
4.2.1 Algorithm for -freeness in graphs with diameter three
Assume that the diameter of the input graph (with nodes and edges) is 3 and . Similarly to the previous case, we divide the node set into where is the set of nodes whose distance to is . For all , , we also define by the edge set between and . We use a subprotocol described in Algorithm 1, and analyze it here.
Lemma 22.
If any of the conditions in (Conditions 1-4) are met, then contains an induced .
Proof.
- Condition 1
-
Assume that there is an edge in that is not sent to in . Then for some satisfying , there exists an edge between and such that no node in is connected to . W.l.o.g., we assume that and . Then there exists a node that is not connected to . induces a .
Assume that there is an edge in that is not sent to in . First, observe that if an edge in satisfies the first condition of , it is sent to in . Assume that satisfies the second condition of and is not sent to in . Then w.l.o.g., there are two nodes and . Furthermore, assume that , , and . induces a .
- Condition 2
-
Suppose that, for some satisfying , there exists an edge between and satisfying . Then there exist a node that is not connected to . We can conclude that is an induced .
- Condition 3
-
Let . For a node such that , if there is , then is an induced for .
- Condition 4
-
Let . Then there exists a node . is a .
Now, assuming that does not reject, the following properties hold for each edge between and .
- Property 1
-
,
- Property 2
-
for ,
- Property 3
-
,
- Property 4
-
for each , .
Lemma 23.
Assume that no node rejects in . If contains an induced with at least one edge from , then there is a node that detects an induced .
Proof.
We write be the four edges constructing some induced . Here we assume that exactly one edge of them is bad. First, consider that . Fix one node . Then from Property 1 and Property 3, is in . Moreover, from Property 4. Similarly, . Now observe that knows and can detect the path.
We then consider that . Similarly to the above case, we can assume that . We distinguish the following cases.
- Case 1: .
-
If , can detect the path since knows and thus due to Property 1. If , can detect the path due to Property 3. If , for each node , . If or , can detect the path. Otherwise is an induced . can detect this path as follows. Since knows and , can conclude that due to Property 4.
- Case 2: .
-
If , then can detect the path due to Property 1. Assume . Fix arbitrary node . If , then can detect the path. Otherwise, knows that and . can detect the path since can verify that due to Property 4.
- Case 3: .
-
If , then can detect the path since do not have a neighbor in . Assume . Fix arbitrary node . If , then can detect the path. Otherwise, knows that . From Property 4, we have . can detect the path.
Therefore, any induced that contains exactly one edge from can be detected by some node.
Now we consider a path containing at least two edges from . If there are two consecutive bad edges in the path, , there is a node from Property 1. is also connected to when or when . can detect the path. The remaining case is the path containing two bad edges that are not consecutive. We have two distinct cases.
- Case 4: .
-
From Property 1 and Property 4, there is a node who can detect the path.
- Case 5: .
-
This is similar to a subcase of Case 1. For each node , . If or , can detect the path. Otherwise is an induced . can detect this path as follows. Since knows and , can conclude that due to Property 4.
Lemma 24.
Assume that no node rejects in . If contains an induced with at least one edge from , then there is a node that detects an induced .
Proof.
We write be the four edges constructing some induced . Consider that for instance. since Condition 4 of does not hold. Similarly, since we have . Therefore there exists that can detect the path. The analysis of the other cases is done in the same way – we can show that all path edges are from . Given Lemmas 23 and 24, we can detect any induced that involves edges from .
Our focus now shifts to the detection of an induced solely composed of edges from . In this context, we need to ensure that any induced by does not actually arise as an artifact of the removal of , but is genuinely induced by . To address this, we consider all five-node induced subgraphs of . As illustrated in Figure 1, there are exactly 17 distinct five-node patterns , each containing a as a subgraph. See Figure 2 for the illustration.
We introduce the concept of a “dangerous” induced subgraph. An induced copy of in the input network is dangerous iff it induces a in .
Lemma 25.
Let be a set of five-node graphs that contain as a (non-necessarily induced) subgraph as in Figure 1. Then, for any , the number of induced copies of in the graph that are dangerous can be counted (by the node ) in rounds.
Proof.
We first note that there is no dangerous copy of including edges from both and , since two endpoints of a bad edge do not have neighbors in due to Property 3 of .
We consider the case of in Figure 1, i.e., 5-node cycles . Let
be five edges that form a dangerous where and for . Then, from Property 1 and Property 3 of , we can assume that . Moreover, from Property 4 all the nodes in are neighbors of and . Let be the node in with minimum identifier. Then can detect the cycle as it knows all the neighbors of . Each node counts the number of such cycles, and sends it to . In this way, can count the number of dangerous in the graph. Note that it is impossible to have due to the fact that Condition 4 of does not hold: If , then . It contradicts the assumption that .
For any other pattern graph from illustrated in Figure 1, we can see that at least one node of the pattern is connected to at least three other nodes of the pattern who can detect the pattern. This completes the proof.
Proof of Theorem 3.
In rounds, we can compute the diameter of the graph [24]. If the diameter exceeds 3, the network immediately rejects. If the diameter is two, we apply Theorem 17, which establishes that -freeness can be verified in rounds with high probability. So we focus on the case where the diameter is exactly 3.
First, if the number of edges in the graph satisfies , the node collects all edges in rounds and decide if is -free. Otherwise, the network runs in rounds. Assuming does not reject, knows all the edges in as .
Next, we assume that there is no induced that contains at least one edge from as such a can be detected by some node due to Lemma 23 and Lemma 24.
For each , the node counts the number of induced copies of that are dangerous as in Lemma 25. Let be the count and let be the sum of them.
Finally, counts the number of ’s induced by . If it is equal to , we can conclude that is -free as removing from increases the number of induced ’s by . Otherwise, if the count is strictly larger than , is not -free.
4.3 Local certification of -freeness
We use the following standard technique.
Lemma 26.
Let be an -node connected graph, and be a spanning tree rooted at arbitrary node . For each , let be an integer assigned to . Then, there is a that computes with certificates of size .
Proof.
Let be a subtree of rooted at . The certificate to consists of two -bit values and . If is a leaf, rejects when . If is not a leaf, rejects when . The root node rejects when . If no node rejects, .
Proof of Theorem 4.
We first describe the certificates and then explain the verification process and its correctness.
The certification comprises the following components, each represented by bits. We assume that is an integer (otherwise, we use ).
-
: For each node , consists of a set of neighbors of .
-
: consists of a BFS tree starting from .
-
: describes a spanning tree rooted at the node with maximum degree, denoted . Thus, it is identical to .
-
: provides a partition of nodes. The prover divides into subsets so that the number of edges between and is for all . This partition exists due to Lemma 16. includes the partition and a set containing all the neighbors of in ascending order with respect to their IDs. Additionally, includes encoded values in -bit: , , , , , , , . These values are used to verify the size of edge subsets. Finally, includes, for each , the numbers and . These values are used to count the number of dangerous copy of in the graph.
-
: Let be the neighbor of whose degree is the -th largest among all the neighbors of for . then contains -bit descriptions of all edges between and where .
Verification
is used to certificate the graph diameter is at most 3. This is done by checking the structure of the trees as in [7].
Certifying the partition of nodes and edges.
Each node checks that the partition of nodes and the set representing all the neighbors of in ascending order with respect to their IDs written in , and the spanning tree written in are the same as of its neighbors, so that all nodes agree the same partition, and the distance from for all nodes. Now, given a partition of nodes in and the spanning tree in , endpoints of each edge determine which subset of
the edge belongs to. Next, each node computes the sizes of edge sets , , , , , using the protocol of Lemma 26 and the values written in as follows: For example, we can compute by setting , , and be the number of edges in incident to .
Certifying the assigned edges.
Let be the neighbor of that has the -th minimum ID among all the neighbors of . Since all nodes agree on the set representing all the neighbors of in ascending order with respect to their IDs, all nodes know which is . For each , the validity of can be checked as follows. First, each checks if the following holds.
-
1.
for . can check this since all nodes agree on the partition .
-
2.
For each edge , at least one of its endpoints is at most distance 2 from . can check this since knows nodes with distance 2 from by looking at of its neighbors.
If these conditions do not hold then rejects. Each neighbor of rejects if contains non-edge that includes for some by looking at . This ensures that the edge set is indeed the set of edges between and such that at least one of the endpoints are a 2-hop neighbor of . Each node then checks whether the four conditions described in hold: checks Condition 1, and each node checks Condition 2,3, and 4 for each edge . If any condition holds, the network rejects. This part of verification ensures that learns .
Detecting an induced containing .
Detecting an induced solely composed of .
The final components of are used to count the number of dangerous for , as described in Lemma 25. This is accomplished using Lemma 26 as follows. For simplicity, let us consider the case of induced ( in Figure 1). In our algorithm, as in the proof of Lemma 25, for each such cycle, the node in with minimum identifier that is incident to the cycle detects it, and report the number of detected cycles to the node through the BFS tree to compute the entire number of dangerous ’s in the graph. Setting , , and be the number of dangerous ’s detected by in Lemma 26, we can certify it. then determines the number of ’s that increase by removing (i.e., the number of dangerous copies of for all ) and compares this with the number of induced ’s in . Based on this comparison, can decide if is -free.
5 Lower bounds for -freeness
5.1 Proof of Theorem 5
Our proof leverages a reduction from the three-party nondeterministic communication complexity of the set-disjointness problem in the number-on-forehead (NOF) model. In this model, each player sees the inputs of the other two players but not their own. The players communicate by writing bit strings on a shared blackboard, and the communication cost of the protocol is the total number of bits written.
As a first step, we use the reduction from set-disjointness to triangle-freeness established by Drucker et al.[13]. The key graph used in their reduction is a tripartite graph where (1) and for arbitrary integer ; (2) contains triangles ; (3) each edge of the graph belongs to exactly one triangle. This graph construction relates to the Ruzsa–Szemerédi problem[26] in extremal graph theory and has been well studied.
For , we define as the graph obtained by removing edges from as follows. Let be a triangle such that is an edge between and , is an edge between and , and is an edge between and . For , we remove the edge iff the -th bit of is 0.
We then construct another graph as follows. This construction is obtained by tweaking the construction in [12, 34]. Let be the complement graph of . For each node in , we have a triangle in . We add edges for all . For each edge in , we add edges for all in . Additionally, there are two extra nodes . is connected to and for all . is connected to and for all . Thus, the number of nodes in is .
Lemma 27.
contains an induced if and only if , and are not disjoint.
Proof.
Suppose that there is an index such that the -th bit of , and are all 1. Then we have a triangle in the graph and thus have an independent set of size 3 in . induces in .
Conversely, assume that contains an induced . Let us focus on an independent set of size 3 in the path. The set does not contain and at the same time, since each other node is connected at least one of and . Moreover, the set cannot contain , as otherwise the other two nodes are written and , but it is impossible since and are always connected. A similar contradiction occurs if the set contains . Therefore three nodes in the set are written , , . From the construction in is an independent set. This means that , and are not disjoint.
We are now ready to prove Theorem 5.
Proof of Theorem 5.
Suppose that -freeness can be certified with certificates of size . We construct a non-deterministic communication protocol for set-disjointness with communication complexity . Suppose that Alice, Bob, and Charlie, who receive the inputs , , and , respectively, simulate as follows:
-
For all in , Alice simulates nodes in .
-
For all in , Bob simulates nodes in .
-
For all in , Charlie simulates nodes in .
-
All players simulates and .
Since the incident edges of simulated nodes by each player are independent from its input, each player can construct the inputs of its simulated nodes locally. Then the players simulate the certification for -freeness. Since the certificate size is and each player simulates nodes, the length of the certificates for each player is . In order to determine the outputs of the simulated nodes, each player writes its certificates to the shared blackboard. From Lemma 27, the players can compute set-disjointness of size . We now obtain as desired by the nondeterministic multiparty communication complexity of set-disjointness in the number-on-forehead model [37]. To extend the lower bound for larger locality, we just replace edges incident to and by longer paths. Thus we can increase the locality by 1 at the cost of increasing the path length by 4.
5.2 Lower bounds for -freeness: general
Technical challenges and the general proof idea for Theorem 6
We use a standard reduction from two-party set-disjointness function for the proof of Theorem 6 (and Theorem 7). Alice and Bob jointly construct some graph which depends on their inputs so that the constructed graph is -free iff the output of some function (in our case, the set-disjointness function) is 1.
The main challenge in this proof is to ensure that the cut of the constructed graph (i.e., the number of edges between nodes held by Alice and those held by Bob) is as sparse as possible. This was not an issue in the proof of Theorem 5, which uses the non-deterministic communication complexity of the NOF model or in the proof of the lower bound given in [2], which uses counting arguments. In fact, the constructions used in these proofs do not achieve a sparse cut, making them unsuitable for the proof of Theorem 6 that holds for the model.
Fortunately, in the simplest case where , we can utilize the construction from Ref. [29] with a slight modification, taking advantage of the graph structure that holds for all . However, for the cases where and , the construction from Ref. [29], which is applicable to induced -cycles for every , fails for induced -paths. This is intuitively because we lose several symmetric properties of induce -cycles by removing one edge from the cycle. We thus construct two other graph families from scratch. The main challenge in establishing these constructions is that both edges and non-edges must be carefully chosen to avoid creating unexpected induced -paths. Our proof requires a detailed analysis to ensure this, as the general structure valid for does not apply for .
6 Ordered path detection and applications
In this section, we address the problem of detecting an ordered . In this problem, each node of the graph is assigned a color from , and the objective is to detect an induced path on nodes , where each node is colored with . Our first result is that detecting an ordered is already challenging, unlike the case of -freeness where we only know nontrivial lower bound for as in Theorem 6.
Theorem 28.
Any randomized algorithm that solves ordered detection for requires rounds in the model.
The proof of this theorem leverages a reduction from two-party communication complexity, with a construction that is notably simpler compared to the proof of Theorem 6. The detailed proof is provided in the full version.
Our next insight regarding ordered path detection is the connection between proving non-trivial lower bounds for ordered detection and a long-standing open problem in circuit complexity.222This kind of connection is already known for several subgraphs such as triangles and -cycles [15, 14]. Moreover, this result has implications for the detection of induced in the quantum model. Specifically, the following problem has remained unresolved for decades:
Open Problem 1.
Construct an explicit family of boolean functions such that there exist constants such that any family of circuits , where each circuit is depth and consists of gates with constant fan-in and fan-out, cannot compute .
The following theorem formalizes this relationship:
Theorem 29 (The formal version of Theorem 12).
Let be a constant. Then proving lower bound for ordered detection in the model or lower bound for induced detection in the quantum model solves Open Problem 1.
Remark 30.
One might think that ordered detection can be solved in rounds as the case of -freeness. However, it is easily shown that the round complexity of ordered detection in the model is actually at least the round complexity of triangle detection in the model: Let be the graph resulting by deleting all edges in the input graph between a node colored by 1 and a node colored by 3, and replacing non-edges between them by edges. Then any algorithm for ordered detection is simulated in the model with the input graph where each ordered in corresponds to a multicolored triangle in . Note that multicolored triangle detection can be reduced to (the standard) triangle detection in this model [34].
Proof sketch of Theorem 29.
The first part of the statement builds on the known hardness results for triangle detection (and counting) established by [14]. We demonstrate that ordered detection is at least as difficult as certain variants of triangle counting or ordered counting in high-conductance graphs. The full proof is deferred to the full version.
For the second part, we use the framework of distributed quantum search [30]. Consider a function defined on some finite set , and assume a -round algorithm allows a fixed node to output with constant probability, given input . In the quantum model, we can find an element such that (if such an element exists) with constant probability in rounds.
Now, assume that ordered detection can be solved in rounds in the model. We can show that induced detection can be solved in rounds in the quantum model. The statement then follows from the first part of the theorem, as an lower bound for induced detection implies an lower bound for ordered detection.
Specifically, let be an arbitrary node, and define . Consider the subgraph induced by and . Nodes in are colored with color 2, while nodes in randomly choose their color from 1 and 3. Note that contains an ordered if contains an induced . Since and randomly choose their colors, the probability that is properly colored is constant. Therefore, in rounds, we can determine if belongs to an induced . By using distributed quantum search for a function , which outputs if and only if belongs to an induced , we can solve induced detection in rounds.
References
- [1] Noga Alon, Raphael Yuster, and Uri Zwick. Color-coding. Journal of the ACM (JACM), 42(4):844–856, 1995. doi:10.1145/210332.210337.
- [2] Nicolas Bousquet, Linda Cook, Laurent Feuilloley, Théo Pierron, and Sébastien Zeitoun. Local certification of forbidden subgraphs. arXiv preprint arXiv:2402.12148, 2024. doi:10.48550/arXiv.2402.12148.
- [3] Nicolas Bousquet, Laurent Feuilloley, and Théo Pierron. Local Certification of Graph Decompositions and Applications to Minor-Free Classes. In 25th International Conference on Principles of Distributed Systems (OPODIS 2021), pages 22:1–22:17, 2022. doi:10.4230/LIPIcs.OPODIS.2021.22.
- [4] Nicolas Bousquet and Sébastien Zeitoun. A subquadratic certification scheme for p5-free graphs. Theoretical Computer Science, page 115091, 2025. doi:10.1016/J.TCS.2025.115091.
- [5] Christoph Brause, Ingo Schiermeyer, Přemysl Holub, Zdeněk Ryjáček, Petr Vrána, and Rastislav Krivoš-Belluš. 4-colorability of -free graphs. Electronic Notes in Discrete Mathematics, 49:37–42, 2015.
- [6] Keren Censor-Hillel, François Le Gall, and Dean Leitersdorf. On distributed listing of cliques. In Proceedings of the 39th Symposium on Principles of Distributed Computing, pages 474–482, 2020. doi:10.1145/3382734.3405742.
- [7] Keren Censor-Hillel, Ami Paz, and Mor Perry. Approximate proof-labeling schemes. Theoretical Computer Science, 811:112–124, 2020. doi:10.1016/J.TCS.2018.08.020.
- [8] Yi-Jun Chang, Seth Pettie, and Hengjie Zhang. Distributed triangle detection via expander decomposition. In Proceedings of the Thirtieth Annual ACM-SIAM Symposium on Discrete Algorithms, pages 821–840. SIAM, 2019. doi:10.1137/1.9781611975482.51.
- [9] Yi-Jun Chang and Thatchaphol Saranurak. Improved distributed expander decomposition and nearly optimal triangle enumeration. In Proceedings of the 2019 ACM Symposium on Principles of Distributed Computing, pages 66–73, 2019. doi:10.1145/3293611.3331618.
- [10] Derek G Corneil, Helmut Lerchs, and L Stewart Burlingham. Complement reducible graphs. Discrete Applied Mathematics, 3(3):163–174, 1981. doi:10.1016/0166-218X(81)90013-5.
- [11] Pierluigi Crescenzi, Pierre Fraigniaud, and Ami Paz. Trade-Offs in Distributed Interactive Proofs. In 33rd International Symposium on Distributed Computing (DISC 2019), pages 13:1–13:17, 2019. doi:10.4230/LIPICS.DISC.2019.13.
- [12] Mina Dalirrooyfard, Thuy Duong Vuong, and Virginia Vassilevska Williams. Graph pattern detection: Hardness for all induced patterns and faster non-induced cycles. In Proceedings of the 51st Annual ACM SIGACT Symposium on Theory of Computing, pages 1167–1178, 2019. doi:10.1145/3313276.3316329.
- [13] Andrew Drucker, Fabian Kuhn, and Rotem Oshman. On the Power of the Congested Clique Model. In Proceedings of the 2014 ACM symposium on Principles of distributed computing (PODC 2014), pages 367–376, 2014. doi:10.1145/2611462.2611493.
- [14] Talya Eden, Nimrod Fiat, Orr Fischer, Fabian Kuhn, and Rotem Oshman. Sublinear-time distributed algorithms for detecting small cliques and even cycles. Distributed Computing, pages 1–28, 2022.
- [15] Orr Fischer, Tzlil Gonen, Fabian Kuhn, and Rotem Oshman. Possibilities and impossibilities for distributed subgraph detection. In Proceedings of the 30th on Symposium on Parallelism in Algorithms and Architectures, pages 153–162, 2018. doi:10.1145/3210377.3210401.
- [16] Pierre Fraigniaud, Maël Luce, Frederic Magniez, and Ioan Todinca. Even-cycle detection in the randomized and quantum congest model. In Proceedings of the 43rd ACM Symposium on Principles of Distributed Computing (PODC 2024), pages 209–219, 2024.
- [17] Pierre Fraigniaud, Frédéric Mazoit, Pedro Montealegre, Ivan Rapaport, and Ioan Todinca. Distributed certification for classes of dense graphs. In Proceedings of the 37th International Symposium on Distributed Computing (DISC 2023), pages 20:1–20:17, 2023. doi:10.4230/LIPICS.DISC.2023.20.
- [18] Pierre Fraigniaud, Pedro Montealegre, Ivan Rapaport, and Ioan Todinca. A meta-theorem for distributed certification. Algorithmica, 86(2):585–612, 2024. doi:10.1007/S00453-023-01185-1.
- [19] Pierre Fraigniaud and Dennis Olivetti. Distributed detection of cycles. ACM Transactions on Parallel Computing (TOPC), 6(3):1–20, 2019. doi:10.1145/3322811.
- [20] Peter Gartland and Daniel Lokshtanov. Independent Set on -Free Graphs in Quasi-Polynomial Time. In 2020 IEEE 61st Annual Symposium on Foundations of Computer Science (FOCS), pages 613–624. IEEE, 2020.
- [21] Gaston H Gonnet. Expected length of the longest probe sequence in hash code searching. Journal of the ACM (JACM), 28(2):289–304, 1981. doi:10.1145/322248.322254.
- [22] Andrzej Grzesik, Tereza Klimošová, Marcin Pilipczuk, and Michał Pilipczuk. Polynomial-time algorithm for maximum weight independent set on -free graphs. ACM Transactions on Algorithms (TALG), 18(1):1–57, 2022. doi:10.1145/3414473.
- [23] Chính T Hoàng, Marcin Kamiński, Vadim Lozin, Joe Sawada, and Xiao Shu. Deciding -colorability of -free graphs in polynomial time. Algorithmica, 57:74–81, 2010. doi:10.1007/S00453-008-9197-8.
- [24] Stephan Holzer and Roger Wattenhofer. Optimal distributed all pairs shortest paths and applications. In Proceedings of the 2012 ACM symposium on Principles of distributed computing, pages 355–364, 2012. doi:10.1145/2332432.2332504.
- [25] Jarkko Kari, Martín Matamala, Ivan Rapaport, and Ville Salo. Solving the induced subgraph problem in the randomized multiparty simultaneous messages model. In International Colloquium on Structural Information and Communication Complexity (SIROCCO 2015), pages 370–384, 2015. doi:10.1007/978-3-319-25258-2_26.
- [26] János Komlós and Miklós Simonovits. Szemeredi”s regularity lemma and its applications in graph theory, 1995.
- [27] Janne H. Korhonen and Joel Rybicki. Deterministic Subgraph Detection in Broadcast CONGEST. In Proceedings of the 21st International Conference on Principles of Distributed Systems (OPODIS 2017), pages 4:1–4:16, 2018. doi:10.4230/LIPIcs.OPODIS.2017.4.
- [28] Amos Korman, Shay Kutten, and David Peleg. Proof labeling schemes. Distributed Computing, 22(4):215–233, 2010. doi:10.1007/S00446-010-0095-3.
- [29] François Le Gall and Masayuki Miyamoto. Lower Bounds for Induced Cycle Detection in Distributed Computing. In 32nd International Symposium on Algorithms and Computation (ISAAC 2021), pages 58:1–58:19, 2021. doi:10.4230/LIPICS.ISAAC.2021.58.
- [30] François Le Gall and Frédéric Magniez. Sublinear-time quantum computation of the diameter in CONGEST networks. In Proceedings of the 2018 ACM Symposium on Principles of Distributed Computing (PODC 2018), pages 337–346, 2018. URL: https://dl.acm.org/citation.cfm?id=3212744.
- [31] Daniel Lokshantov, Martin Vatshelle, and Yngve Villanger. Independent set in -free graphs in polynomial time. In Proceedings of the twenty-fifth annual ACM-SIAM symposium on discrete algorithms, pages 570–581. SIAM, 2014. doi:10.1137/1.9781611973402.43.
- [32] Zvi Lotker, Elan Pavlov, Boaz Patt-Shamir, and David Peleg. Mst construction in o (log log n) communication rounds. In Proceedings of the fifteenth annual ACM symposium on Parallel algorithms and architectures, pages 94–100, 2003. doi:10.1145/777412.777428.
- [33] Pedro Montealegre, Diego Ramírez-Romero, and Ivan Rapaport. Compact distributed interactive proofs for the recognition of cographs and distance-hereditary graphs. In International Symposium on Stabilizing, Safety, and Security of Distributed Systems (SSS 2021), pages 395–409, 2021. doi:10.1007/978-3-030-91081-5_26.
- [34] Amir Nikabadi and Janne Korhonen. Beyond distributed subgraph detection: Induced subgraphs, multicolored problems and graph parameters. In 25th International Conference on Principles of Distributed Systems (OPODIS 2021), volume 217, 2022. doi:10.4230/LIPIcs.OPODIS.2021.15.
- [35] David Peleg. Distributed computing: a locality-sensitive approach. SIAM, 2000.
- [36] Bert Randerath and Ingo Schiermeyer. 3-Colorability for -free graphs. Discrete Applied Mathematics, 136(2-3):299–313, 2004.
- [37] Anup Rao and Amir Yehudayoff. Simplified lower bounds on the multiparty communication complexity of disjointness. In 30th Conference on Computational Complexity (CCC 2015). Schloss-Dagstuhl-Leibniz Zentrum für Informatik, 2015. doi:10.4230/LIPIcs.CCC.2015.88.