Hardness of Clique Approximation for Monotone Circuits
Abstract
We consider a problem of approximating the size of the largest clique in a graph, using a monotone circuit. Concretely, we focus on distinguishing a random Erdős–Rényi graph , with chosen st. with high probability it does not even contain an -clique, from a random clique on vertices (where ). Using the approximation method of Razborov, Alon and Boppana showed in their influential work in 1987 that as long as , this problem requires a monotone circuit of size , implying a lower bound of for the exact version of the problem when . Recently, Cavalar, Kumar, and Rossman improved their result by showing a tight lower bound , in a limited range , implying a comparable lower bound after choosing the largest admissible .
We combine the ideas of Cavalar, Kumar and Rossman with recent breakthrough results on sunflower conjecture by Alweiss, Lovett, Wu, and Zhang to show that as long as , any monotone circuit rejecting graph while accepting a -clique needs to have size at least ; this implies a stronger lower bound for the unrestricted version of the problem.
We complement this result with a construction of an explicit monotone circuit of size which rejects , and accepts any graph containing -clique whenever . In particular, those two theorems give a precise characterization of the smallest -clique that can be distinguished from : when , there is a polynomial-size circuit that solves it, while for every circuit needs size .
Keywords and phrases:
circuit lower bounds, monotone circuits, sunflower conjectureCopyright and License:
2012 ACM Subject Classification:
Theory of computation Circuit complexityEditors:
Srikanth SrinivasanSeries and Publisher:
Leibniz International Proceedings in Informatics, Schloss Dagstuhl – Leibniz-Zentrum für Informatik
1 Introduction
Monotone circuits form a restricted computation model, where computation is performed by a directed acyclic graph, with input nodes (of in-degree ) labeled by the input variables and internal nodes (gates) labeled or (computing logical and, or logical or respectively, of input wires). It is not difficult to see that every monotone function on binary variables (and only monotone functions), can be computed by such a circuit, and by a simple counting argument one can show that a random monotone function requires a monotone circuit of size .
Remarkably, in contrast with more general Boolean circuits (where the negation gate is also allowed), since the breakthrough work of Razborov [19], there are known unconditional super-linear lower bounds for the size of monotone circuit computing explicit monotone functions. Concretely, Razborov showed that for any , the function – interpreting its input as an adjecency matrix of a graph , and outputting if and only if said graph contains a clique on vertices – requires a monotone circuit of size . Putting , this gives a quasipolynomial lower bound ; an analogous result for boolean circuits including negation would imply and seems to be far beyond the reach of current techniques, almost 40 years later.
To prove his lower bound, Razborov used the sunflower lemma of Erdős and Rado [10], bringing it to the attention of the theoretical computer science community. In the highly influential follow-up work, Alon and Boppana [1] further utilized the approximation method introduced by Razborov. They showed a lower bound for the problem when – implying a lower bound at the optimal value of – a result which has since become a landmark of circuit complexity. Interestingly and less commonly known, in the same paper, they also showed a stronger inapproximability result: Any monotone circuit that rejects every graph without even an -clique and accepts every graph that has a -clique (where ) needs to have a size at least as long as . In fact, their result holds in the average case – they show that it is hard to reject a random -partite graph while accepting a clique on random vertices.
Since then, several other techniques for showing monotone circuit lower bounds for specific problems have been introduced over almost four decades (e.g., [3, 4, 15, 11, 12, 7]), but the result of Alon and Boppana stood as essentially the best-known lower bound for the clique problem.
Very recently, the breakthrough result of Alweiss, Lovett, Wu, and Zhang [2] made a major step towards resolving the sunflower conjecture – conjecture about the “right” quantitative dependency in the sunflower lemma – and was followed in short succession by a sequence of further improvements [5, 17, 23, 14, 18]. Particularly noteworthy is the exposition by Rao [18], presenting not only the improved sunflower bounds but also the resolution of the Kahn-Kalai conjecture in just a few pages.
Following up on these results, Cavalar, Kumar, and Rossman [8], showed how to utilize the new sunflower bounds to prove the first lower bound for any explicit monotone function (improving the earlier lower bound), specifically for the Harnik-Raz function [13]. In the same paper, they also gave the first substantial improvement over the Alon-Boppana bound for the clique problem: they showed that as long as the problem requires circuit size – a tight lower bound, but in a further restricted range of parameters; choosing the optimal , this leads to the same lower bound for Clique. As it turns out, their result for the clique did not use the new sunflower bounds, and, in fact, it was unclear how to combine those techniques.
1.1 Our results
We prove a lower bound on the size of monotone circuits solving the promise problem .
Definition 1.
For integers with , define the promise problem , where the two promise subsets are defined as
and
To prove a lower bound on , we introduce an associated strengthened distributional problem to distinguish two graph distributions and .
Definition 2.
For an integer with , let be a uniformly random distribution of isolated -cliques on vertices. Let the Erdős–Rényi random graph distribution on vertices with probability parameter .
The problem is to distinguish those two distributions – i.e. given on input a graph drawn either from or we want to reject with probability at least in the former case and accept with probability at least in the latter.
Note that is chosen so that is associated with as the probability that a graph drawn from contains a clique of size is small – a simple union bound can be used to argue that for . As such, any circuit solving the problem can distinguish from , and conversely, a lower bound for implies a lower bound for .
Our main results are lower and upper bounds on the monotone complexity of , captured in the following two theorems.
Theorem 3.
Let . Then there exists no monotone circuit solving with size less than .
Theorem 4.
Let . Then there exists a monotone circuit on inputs, solving with .
We want to emphasize a special case of (or, equivalently, the negative distribution being an Erdős–Rényi graph with ). In this case, according to Theorem 4, we can choose to get a polynomial-size circuit, rejecting yet accepting a clique of size . On the other hand, Theorem 3 states that if we wanted to reject and accept a clique of size , where , we would need a circuit of size .
As such, our two theorems provide a near-tight characterization of the power of polynomial-size monotone circuits to distinguish the graph from a large clique.
Another interesting corollary of Theorem 3, is obtained by setting , and taking as large as possible while satisfying the restriction , in order to obtain a strongest possible lower bound for the clique problem.
Corollary 5.
For any , the monotone complexity of the is . In particular monotone complexity of the Clique problem is .
Note that no truly exponential lower bounds for the monotone circuit size are known for any explicit monotone problem. A lower bound of (where is the size of the input) was shown for the arguably less-natural Harnik-Raz function [13] in [8], breaking the long-standing barrier of – and using the same breakthroughs in sunflower lemmas that are crucial in our improvement.
In our case, the input is an adjacency matrix of a graph on vertices hence the input has bits; our lower bound, in terms of the input size, is then – the exponent is still by a factor of two worse than in the strongest known lower bound for any explicit monotone function.
Concurrent and independent work
Independently to our results, Suzanna de Rezende and Marc Vinyals proved a comparable monotone circuit lower bound for the clique problem [9]. Specifically, they show a lower bound of for the problem of distinguishing a -clique from a -colorable graph as long as . Their does not use the approximation method, instead leveraging a combinatorial reduction together with a new query-to-communication complexity lifting theorem, and known relationship between specific communication complexity lower bounds and monotone circuit lower bounds. Interestingly, the key step in their improved lifting theorem also leverged the [2] breakthrough in sunflower bounds.
1.2 Approximation method
Let us briefly recall the idea behind the approximation method for showing monotone circuit lower bounds, introduced by Razborov [19], and then utilized by [1, 8].
In order to show a lower bound for a size of a monotone circuit distinguishing two specific distributions and , we proceed by inductively (gate-by-gate) approximating any given circuit by a simpler circuit (from some class of simple circuits).
If we can show that
-
1.
Every simple circuit fails to distinguish between and – that is, for all simple , we have .
-
2.
In each gate, by applying the approximation, we introduce error at most on both positive and negative distribution.
That implies a lower bound for the size of the smallest monotone circuit distinguishing those two distributions.
For the clique problem (say, distinguishing random graph which likely does not even have a clique of size , from a uniformly random clique of size ), following [19, 1, 8], the “simple” circuits we use to approximate a given circuit are just small DNF formulas, specifically conjunctions of clique indicators – that is circuits of form
where
Moreover, a circuit is simple if the size of each clique indicator is bounded by (we will eventually choose ), and the number of clique indicators of each size is appropriately bounded.
At a given gate, we wish to approximate a conjunction or a disjunction of two such simple circuits again by a simple circuit. By applying de Morgan laws in the conjunction case, we can transform the circuit again into DNF (without introducing any error).
Then, we repeat the following three steps, transforming the DNF obtained this way into a simple one.
-
1.
We replace all conjunctions by clique indicators .
-
2.
As long as there is a family of cliques on sets having a specific combinatorial structure, we replace all of them by a clique on the set , where is the intersection of .
-
3.
We remove all clique indicators for larger than threshold .
As it turns out, step 1 does not introduce any error on both the positive and negative distributions.
Step 3 can introduce error only on the positive distribution, which can be bounded by union bound – any given large clique indicator is unlikely to be satisfied by a large random clique. The number of those large clique indicators we discard is bounded (otherwise, we would have been able to apply step 2).
Finally, step 2 is the crux of the argument – we need to show that if the number of indicators of a given size is too large, then there always is a subset of those indicators that can be replaced by its intersection (the core), without introducing too much error on the negative distribution (clearly we will not introduce any error on the positive distribution in this case).
In the Razborov’s proof and some presentations of the Alon-Boppana proof (see for example [16, Chapter 9]), one can focus on finding a sunflower consisting of many sets among the set system (a sunflower is a family of sets for which all pairwise intersections are the same, see Section 1.3), and show that replacing a large sunflower by its core introduces only small error on the negative distribution. This, together with the Erdős-Rado result that any large enough family of bounded sets contains a large sunflower, gives an upper bound on the number of clique indicators of size for a “simple circuit” (one on which the step 2 can no longer be applied), and hence yields a complete proof of a monotone circuit lower bound for the clique problem. Specifically, going carefully through the calculations, one could show this way a lower bound for the problem whenever , translating to a lower bound for the Clique problem after taking optimal . 111In the actual Alon-Boppana paper, they used a slightly more general combinatorial structure than sunflowers: a sequence of distinct sets , together with a set (not necessarily distinct from ), s.t. all pairwise intersections – and they replaced by in step 2 here. Clearly any sunflower with the core satifies this property. By using this more general structure, they were able to show a lower bound for – matching the result one would get insisting on using a sunflower here if the sunflower conjecture was true; yet without having to prove this conjecture.
The work of Cavalar, Kumar, and Rossman [8], introduced a notion of robust clique-sunflower, which abstracts exactly the property needed to bound the error introduced on the negative distribution by replacing the robust clique sunflower by its core. They showed better quantitative bounds on the size of the set system needed to contain such a robust clique sunflower and were able to deduce a lower bound for – matching the same for the clique problem when picking largest admissible .
1.3 Sunflowers, robust sunflowers and robust clique sunflowers
We define the robust clique sunflower (after [8]) – a notion abstracting the main property of combinatorial sunflowers used to obtain a monotone lower bound for the problem, where the negative distribution is . This property ensures that for a DNF formula
if some of those sets form a robust clique sunflower, we can replace them by a single clique indicator on the common intersection while introducing only a small error on the negative distribution (and no error on the positive distribution).
Definition 6 (Robust clique sunflower).
A family of sets is an -robust clique sunflower (with core ), if
where is a random Erdős–Rényi graph sampled by including each edge independently with probability , and is a clique on vertices , i.e. .
This notion is intimately related to a similar notion of a robust sunflower, originally introduced in [20].
Definition 7 (Robust sunflower).
A family of sets is an -robust sunflower (with core ), if
where is a random subset of chosen by including each element independently with probability .
As it turns out, any large enough family of sets of size contains a robust sunflower or a robust clique sunflower. We introduce a notation for the dependence between the size of the family and the parameters of this sunflower.
Definition 8.
We define to be the smallest number such that any -uniform222-uniform set system is just a family of subsets of a universe, each subset of size exactly . set system of size at least contains a -robust sunflower.
Similarly, we define to be the smallest number such that any -uniform set system of size at least contains a -robust clique-sunflower.
Note that a family is a robust clique sunflower (as in the Definition 6), if and only if the family of edge-sets is a robust sunflower. This observation implies a simple (yet unsatisfactory) bound
| (1) |
In Section 1.4 (for instance Theorem 14) we provide a quantitative statement for how the upper bounds on can be translated into lower bounds for the Clique problem.
For completeness, let us discuss a way to reinterpret a result similar to the Alon-Boppana (with the negative distribution being instead of random -partite graph) in this framework, by deducing a bound on the robust clique sunflowers from the sunflower bound.
Definition 9 (-Sunflower).
An family of sets is a -sunflower (with a core ), if all pairwise intersections of sets from are , i.e. for all , we have .
What we call a -sunflower is sometimes called in the literature a sunflower with -petals. We chose a slightly more concise terminology.
A sunflower lemma [10] now says that for every and , there is a finite number such that every -uniform set system of size at least has a -sunflower. Subsequent results, including the breakthrough by [2] and improvements [5, 17, 23, 14, 18] provide successively smaller upper bounds on , concluding with the best currently known bound
| (2) |
while the famous sunflower conjecture ([10]) stipulates that .
The following observation, similar in form to the core of the Razborov and Alon-Boppana results, is a simple way to connect sunflowers to robust clique sunflowers.
Lemma 10.
Every -uniform -sunflower is an -robust clique sunflower. In particular .
Proof.
Consider a -sunflower with core . By the definition of sunflower the sets of edges are disjoint, hence the indicator random variables are independent. Since , the probability that all are zero is at most .
This, combined only with the classical upper bound by Erdős-Rado yields an upper bound
which can be used to recover lower bound via the approximation method outlined in the previous section (see for instance Theorem 14).
Crucially for us, a bound on robust sunflower numbers is an essential part of the new, improved series of results on the bounds for sunflower numbers. Specifically, the following theorem was used to deduce those new bounds and is important for our application.
This theorem fairly quickly implies the breakthrough upper bound on sunflower numbers (2). For our purposes, though, the sunflower consequence of Theorem 11 is less relevant, and we will leverage the existence of robust sunflowers directly.
In Section 3, we prove a much stronger reduction than (1), showing how robust sunflower bounds can be used to give bounds on robust clique-sunflowers, which leads to our main improvement.
Theorem 12.
Combining the upper bound for robust sunflowers in Theorem 11 with our comparison theorem (Theorem 12), we get
Corollary 13.
The clique sunflower numbers are bounded as
This result is an improvement over a bound from [8, Lemma 3.2] of the same quantity. They directly showed an upper bound
| (3) |
As we will see later, reducing the factor down to allowed us to show a lower bound for the clique problem as opposed to from [1, 8].
In order to prove Theorem 12 we show that any -robust sunflower is an -robust clique sunflower. This statement is easier to interpret for sunflowers with empty core: given a uniform family of sets (with empty common intersection), if we are very likely to cover one of those sets while sampling vertices of independently at random with probability , we are also very likely to cover one of those with a clique, when sampling edges of independently with probability .
As it turns out, one can set up specific stochastic processes , and associated with both of those experiments – the expected supremum of these processes will be directly connected (respectively) with the probability of covering one of the sets of by a random set (with the inclusion probability ), or the probability of covering one of the cliques on the vertices of by a random graph (with edge probability ). The central technical part of the proof is the comparison lemma stating that – theorems of that form appeared in the literature on the theory of stochastic processes (most well known is the Slepian lemma, or Gaussian comparison principle [22, Corollary 2.10.12]). Even though we could not apply any of the known comparison lemmas black-box, we adapt the ideas that were used in [21] to show a comparison lemma for coordinate-wise contractions of canonical Bernoulli processes, and we show the desired inequality for our two specific stochastic processes in question.
1.4 Lower bounds for monotone circuits depending on sunflower bounds
We carry over the high-level structure of the proof outlined in Section 1.2 in more detail in Section 2. This part of the proof is technically very similar to known results applying the approximation method (for example [19, 1, 8]). However, in contrast with many of the previous work, we made an effort to provide a statement of the lower bound theorem parameterized by the robust clique sunflower numbers – which, in turn, can be upper bounded in several ways by the robust sunflower numbers , or sunflower numbers as discussed in Section 1.3.
Making these dependencies explicit instead of choosing optimal values of relevant parameters ahead of time based on currently best-known bounds on sunflower numbers makes the interplay between sunflower-like combinatorial statements and monotone lower bounds for the clique problem much clearer.
Moreover, our analysis allows us to show the inapproximability results (Theorem 3) – i.e., hardness of distinguishing a random clique of size , from a random graph which is unlikely to even have a clique of size for .
The analysis of [19, 8] focused on the exact case, i.e., ; whereas in [19, 1], the negative distribution was chosen to be a random complete -partite graph. However, this choice was not crucial in their reasoning – only simple modifications are needed to adapt their proofs to the being the negative distribution.
As we have been made aware recently, an even more general statement can be found [6, Theorem 5.6.4] – the following theorem can be deduced as a corollary of their more general framework, by a concrete instantiation of their notion of the notion of abstract sunflower to denote the robust clique sunflower.
Theorem 14.
Fix some and take . If for every , we have
then the size of any monotone circuit distinguishing and satisfies
In particular, choosing , if for all we have
then the monotone complexity of distinguishing and is .
It is instructive to see how to essentially recover the Alon-Boppana bounds from Theorem 14, using a simple lemma stating that large enough sunflowers are robust clique sunflowers (Lemma 10) together with the new bounds on the sunflower numbers (2). In this case, we have an upper bound on the
so taking for a small constant , such that , we end up with a complexity lower bound , as long as
recovering [1, Theorem 3.11]. Setting , gives a lower bound of form as long as .
In a similar vein, plugging in the upper bound (3) for (originally shown in [8, Lemma 3.2]), yields the lower bound , as long as , and again choosing yields an for , recovering their [8, Theorem 3.23].
Finally, Theorem 14 together with our new bounds for the robust clique numbers (Corollary 13) directly imply the lower bound claimed in Theorem 3.
2 Approximating cliques
In this chapter, we will give an extended version of the approximation method (showing Theorem 14) and take advantage of our improved robust clique sunflower bound to strengthen the lower bound for to if .
2.1 Abstract approximation method
For the sake of abstraction, we describe a general inductive procedure of converting any monotone circuit computing a distributional decision problem , gate by gate, into an approximation circuit from a set of approximation circuits . This procedure depends on a pair of “compression” functions . The error introduced by the conversion, with respect to the distributions and , will depend solely on .
Definition 15.
For any monotone circuit and a pair of compression functions with define as the result of the following procedure.
-
1.
Input variables: Let be an input variable. Define
-
2.
-gate: Assume . Define
-
3.
-gate: Assume . Define
Depending on the choice of , can be very different from . For the sake of analysis, we are interested in the maximum error that a single transformation step can introduce on either distribution.
Definition 16.
Let be the image of and define the positive approximation error
and the negative approximation error
We can also formalize the earlier intuition about proving circuit size lower bounds using the relative and absolute approximation errors.
Definition 17.
We say that a circuit distinguishes distributions and if .
We say that a circuit is -simple if it is in the image of the compression function .
Lemma 18.
If every -simple circuit satisfies , then every circuit distinguishing and has
Proof.
The transformation of into introduces at most error on and error on per gate. As distinguishes the distributions, but , the error of the -transformation must match the absolute error of the -simple circuit , so the theorem follows.
2.2 Proving clique lower bounds
In the following, we will specialize the definition of the approximation circuits and the compression functions to prove the lower bounds on .
Definition 19.
Let be a subset of vertices. Let be the clique indicator function on , such that . For any family of subsets of , , define the associated approximation circuit as
Let be the set of all approximation circuits.
We will often freely switch between the formal definition of an approximator as a Boolean circuit and its representation as a set system over the universe of graph vertices .
Definition 20.
For an “inner-compression” function , define with
We observe that for the identity , introduces no error on the graph distributions and . Indeed, the positive distribution is supported on the cliques. A -clique contains both cliques and if and only if it contains a clique . On the other hand, on the negative distribution, transforming a conjunction into the clique indicator can only reject more instances. Thus, the error only depends on the choice of . For constructing such , we will use the robust clique sunflowers.
2.3 Robust closures
For a DNF formula we define to be a formula obtained from by repeatedly replacing any -robust clique sunflower by its core , as long as there is such a robust clique sunflower, interleaved with removing all sets , s.t. for some . The will be a circuit obtained from by removing all sets .
Finally, we will choose the inner compression function (used in Definition 20) as
We say that a circuit is -closed, if there are no -robust sunflowrers among the sets . The bound on the number of sets of a given size for an -closed circuit is a direct consequence of the definition of a closed circuit and the numbers.
Fact 21.
The number of clique indicators of size of in a -closed circuit is bounded by
2.4 The lower bound
A crucial property that we will exploit both in the analysis of the approximator as well as in the construction of an explicit algorithm solving is that the positive test distribution is “well-spread” in a sense that it is unlikely for a large clique-indicator to accept many instances of .
Lemma 22.
For a clique with ,
Proof.
A graph , sampled from is an isolated -clique on vertices. Thus if and only if such that
This lemma, together with a bound on the number of clique indicators of each size (Fact 21), allows us to easily show that any -simple circuit cannot distinguish from – either a circuit like that trivially accepts every output (and hence makes a large error on the negative distribution), or it accepts only small fraction of inputs from the positive distribution.
Lemma 23.
If is an approximator obtained by the compression function , then
Proof.
Single-step Approximation Errors
We give bounds on the single-step approximation errors and , induced by .
Lemma 24.
The single-step error introduced on the positive distribution is bounded as follows
Proof.
If we have for some that but , then there was some term of of size larger than with , which was discarded during the trimming process.
We can union bound the probability of this happening, in a similar way as Lemma 23, using the upper bound (Fact 21) on the number of terms of a given size , and the upper bound on the probability that a given term of size accepts the positive distribution (Lemma 22).
We also notice that for a fixed closure factor, the negative approximation error is bounded by the probability bound given by the robust clique-sunflower.
Lemma 25.
The single-step error introduced on the negative distribution is bounded as
Proof.
Note that while applying operation, each set of size at most can be added as a core of some robust clique sunflower at most once, as the core is always a strict subset of the sunflower indicators. Hence, we will repeat the process of replacing a robust sunflower by its core at most times, in each step introducing error at most on the negative distribution (by the definition of robust clique sunflower).
Complexity
By the results of the last sections, we can proceed to prove a quantified condition on the existence of a monotone circuit lower complexity bound on , depending on the robust-clique sunflower bound . The proof will follow by a simple application of Lemma 18 together with bounds on the error introduced in each step of the process, established in the previous section. We recall the statement of the theorem from Section 1.4.
Theorem 14. [Restated, see original statement.]
Fix some and take . If for every , we have
then the size of any monotone circuit distinguishing and satisfies
In particular, choosing , if for all we have
then the monotone complexity of distinguishing and is .
Proof.
This statement follows directly from Lemma 18. First, note that by Lemma 23 any -simple circuit either accepts all negative instances or accepts a positive instance with probability
hence a -simple circuit cannot distinguish it from .
The error was shown as Lemma 25. For the we can use Lemma 24, to obtain a bound
With those two bounds, we can now directly apply Lemma 18 to deduce the first part of the theorem. The second part follows by choosing , , and simple algebraic manipulations.
We can now deduce Theorem 3 directly from this more general statement and the new upper bound on the (Corollary 13).
Proof of Theorem 3.
By Corollary 13, taking , for we have
hence if we pick we will have
Now, when , this yields
and finally, applying Theorem 14, we get a lower bound of form
3 Robust clique numbers are upper bounded by robust sunflower numbers
In this section, we will show a reduction, proving that any robust sunflower bound implies a corresponding robust-clique-sunflower bound with slightly different parameters. For our convenience, let us recall the statement of the theorem.
Theorem 12. [Restated, see original statement.]
The main technical lemma towards Theorem 12 is a comparison principle for a concrete pair of stochastic processes. The proof of this result has been inspired by [21]. Before we state the comparison lemma, let us provide a necessary definition.
Definition 26.
We say that is a proper lifting if for every , we have .
Note that since the image of any set under the proper lifting has exactly elements, and it has exactly elements in each of the rows corresponding to elements of , we have for every , that . The canonical examples of proper liftings to have in mind are or , although we will need a slightly more complicated one in the proof of Theorem 12.
The following comparison lemma shows that among the specific class of the stochastic processes associated with a lifting , the smallest process is associated with the lifting .
Lemma 27.
Let be a -uniform family of subsets of , and random variables be independent and identically distributed.
If is any proper lifting (as in Definition 26), then
| (4) |
We will prove this lemma later in Section 3.1. Before we do, let us see how it implies Theorem 12. More concretely, we will show that every -robust sunflower is also an -robust clique sunflower, and Theorem 12 will be a direct corollary.
It is easier to gain the intuition of how to prove a statement of this form from Lemma 27 if we consider a special case of a robust sunflower with an empty core – in this case, we would apply Lemma 27 with proper lifting , and being independent valued random variables with . The -th row of a matrix has a sum equal to independently with probability . Therefore, by the robust sunflower definition, with high probability there exists a square of form with sum – this implies that the expected supremum over sums over as in the left-hand side of (4) is very close to , and hence also expected supremum over all sums over for (by the aforementioned lemma). We can deduce that with high probability there is a square in the matrix with sum – a statement corresponding to the fact that some clique is covered by a random graph.
The handling of the more general case, where the core is not necessarily empty, is only slightly more technical.
Lemma 28.
If -uniform family is a -robust sunflower, it is also a -robust clique sunflower.
Proof.
Consider a matrix , and , with independent entries, each equal to with probability . We can treat the part of the matrix below the diagonal (i.e., for ) as determining the adjacency matrix for a random graph (sampled according to the Erdős–Rényi distribution ).
Let us assume without loss of generality that for some . We can look at a -uniform family . Consider also the set of all indices such that the corresponding row in the matrix is all-one: . Note that each element in is included independently with probability . Since is a robust sunflower with the core , directly by definition of robust sunflower with probability at least , there is a set covered by , i.e.
This means that with probability , the value of
and therefore
We will now apply Lemma 27 to a family together with a lifting function to deduce
| (5) |
On the other hand, denoting by the probability that there exists , s.t. all entries of over are we have
| (6) |
Finally, as discussed above, if we consider a graph on , where the edge for if , this graph has exactly the distribution of . Moreover, when there exists , s.t. all entries for , then, in particular, the clique is contained in the graph . Indeed, all edges within are already contained in , and for implies that all edges between and are present in (since consists of the first elements of ).
The probability of this happening is , as desired, finishing the proof that is indeed a robust-clique-sunflower.
Proof of Theorem 12.
This theorem follows as a direct corollary of Lemma 28 and relevant definitions.
Indeed, every -uniform family of size at least contains a -robust sunflower, which by Lemma 28 is -robust clique sunflower.
3.1 Proof of Lemma 27
We will prove this statement by induction. Specifically, let us define , and (so that forms a partition of into the first rows, and the remaining rows).
Take defined as – i.e. on the first rows agrees with , and on the remaining rows it agrees with . Note that has always elements, and moreover , and . As such, it is enough to show that for any we have
| (7) |
where and are two collections of independent and identically distributed random variables (with the same distribution).
In order to prove this statement, we will consider a coupling where for all . Let us now condition on all variables for – we will show that for a fixed values of all those other variables, the desired inequality between expectation still holds, where expectation is only taken over the variables .
Concretely, for any given , we can decompose:
and similarly for . Note that when the pair if and only if , so if we denote
and by , and similarly , we have
and
since for .
Finally, notice that if , and , whereas if , we have . To finish the induction, we only need to show the following claim.
Claim 29.
Consider a finite sequence of pairs , where , and is either or .
Let be a distribution over , and let and be two sequences of independent random variables distributed according to . Moreover, let and . Then
Proof.
If all sets are empty, the statement is trivial. Let be an index s.t. and , and similarly let be an index such that , and . Consider any permutation , s.t. . We can extend the sequence to variables , and consider a coupling between and , given by . Clearly, under this coupling, both marginal distributions of and are the right ones, all we need to show is that for any given realization of the joint process we have
Indeed, by construction, we have
This lemma completes the proof of the inequality (7), and we can conclude the proof of Lemma 27, by chaining the inequalities (7) across all , since and .
4 The upper bound
We now turn towards the upper monotone complexity bound for where we will construct an explicit circuit for distinguishing and .
A useful subroutine that can be implemented using monotone boolean circuits is the ability to “sort” the input in polynomial size, as a sorting network on a boolean input can be easily used to construct a monotone boolean sorting circuit. 333For this, note that a sorting network essentially is a sequence of binary operations on the input, swapping two inputs if they are out of order. A swap of two booleans , can be trivially implemented by an and an gate. This also implies that monotone circuits can implement the threshold function , which accepts the input if there are at least input variables set to , simply wiring the output to the -th output of the sorting network.
In order to distinguish and , we will take a random family of subsets of size , and accept the graph if the number of covered clique indicators on those subsets exceeds some threshold – by wiring the outputs of clique indicators to the inputs of the threshold function .
Probabilities of including a random clique in and
and have an important distinctive property that we will use for constructing .
Lemma 30.
For a clique indicator with , the probability of clique inclusion is
and
Proof.
The first equation has already been proven in Lemma 22. The second equation follows trivially from the definition of – each of the edges of is present in independently with probability .
The interesting property is that, for the right choice of parameters, the “clique-probability” is noticeably larger on than on . Take, for instance and , if we can pick the size of the clique such that , we will be able to construct a circuit of size that accepts and rejects , by looking at randomly placed clique indicators, accepting the input if at least of those clique indicators are accepting on .
An explicit circuit
The probabilistic 444 We construct an explicit probabilistic circuit here. However, the easy direction in the Yaos Principle [24] also directly implies a deterministic upper bound for the distributional problem. monotone circuit for an -boolean input is defined by the following construction. For parameters
-
1.
Choose -element subsets independent uniformly at random and connect the associated clique indicators to the input.
-
2.
Connect the output of the clique indicators to the threshold function .
-
3.
Connect the output of to the output of .
The size of the entire circuit like that is . We would like to see how to chose and in order for the circuit to reject and accept with probability .
We will use the following well-known fact to analyze the above circuit construction.
Fact 31.
Let be Bernoulli random variables with (not necessarily independent). Then , and by Markov Inequality, .
On the other hand, if are independent Bernoulli random variables with , then as soon as , the Chebyshev inequality implies
We will use to be a clique indicator on random vertices under the distribution , and to be a clique indicator on random vertices under .
In what follows, we will discuss how to choose in a way such that indeed , and take as a small constant to obtain a circuit distinguishing from . For random variables (defined as clique indicators on the negative distribution), we use only the Markov inequality, and we do not need to worry about the correlations between them.
Let us quickly discuss that indeed, on the positive distribution, the random variables and are indeed independent – that is the case since conditioned on any specific realization of – i.e. being a clique on random vertices, the random variables and for are independent (since they are clique indicators on subsets of size chosen independently at random), and have – the position of the original vertices in a clique does not affect the probability that a random vertices are the subset of those vertices.
This fact gives a bound on the number of clique indicators needed to differentiate the graph distributions when the individual clique probabilities are sufficiently far apart.
Circuit Analysis
All we need to do now is to choose the parameter in such a way that , where and . The resulting circuit size will be of order .
Proof of Theorem 4.
We consider circuit constructed as discussed above, with the threshold .
According to Lemma 30 and Fact 31 in order to make sure that the circuit indeed rejects and accepts for given , we would like to pick the smallest such that
leading to a circuit of size . After a sequence of simple algebraic manipulations and taking
this condition is implied by
for some universal constant , since in this case we have . Taking , and observing that by assumption we have
we obtain the circuit size
References
- [1] N. Alon and R. B. Boppana. The monotone circuit complexity of boolean functions. Combinatorica 7, 1987.
- [2] Ryan Alweiss, Shachar Lovett, Kewen Wu, and Jiapeng Zhang. Improved bounds for the sunflower lemma. Proceedings of the 52nd Annual ACM SIGACT Symposium on Theory of Computing, 2020.
- [3] Aleksandr Egorovich Andreev. A method for obtaining lower bounds on the complexity of individual monotone functions. In Doklady Akademii Nauk, volume 282(5), pages 1033–1037. Russian Academy of Sciences, 1985.
- [4] Alexander E Andreev. A method for obtaining efficient lower bounds for monotone complexity. Algebra and Logic, 26(1):1–18, 1987.
- [5] Tolson Bell, Suchakree Chueluecha, and Lutz Warnke. Note on sunflowers. Discrete Mathematics, 344(7):112367, 2021. doi:10.1016/j.disc.2021.112367.
- [6] Bruno Cavalar. Sunflower theorems in monotone circuit complexity. In Anais do XXXIV Concurso de Teses e Dissertações, pages 73–78, Porto Alegre, RS, Brasil, 2021. SBC. doi:10.5753/ctd.2021.15761.
- [7] Bruno P. Cavalar and Igor C. Oliveira. Constant-depth circuits vs. monotone circuits. In Proceedings of the Conference on Proceedings of the 38th Computational Complexity Conference, CCC ’23, Dagstuhl, DEU, 2023. Schloss Dagstuhl – Leibniz-Zentrum für Informatik. doi:10.4230/LIPIcs.CCC.2023.29.
- [8] Bruno Pasqualotto Cavalar, Mrinal Kumar, and Benjamin Rossman. Monotone circuit lower bounds from robust sunflowers. Algorithmica, 84(12):3655–3685, 2022. doi:10.1007/S00453-022-01000-3.
- [9] Susanna de Rezende and Marc Vinyals. Lifting with colorful sunflowers. Unpublished manuscript, 2025.
- [10] Paul Erdős and Richard Rado. Intersection theorems for systems of sets. Journal of the London Mathematical Society, 1960.
- [11] Ankit Garg, Mika Göös, Pritish Kamath, and Dmitry Sokolov. Monotone circuit lower bounds from resolution. In Proceedings of the 50th Annual ACM SIGACT Symposium on Theory of Computing, pages 902–911, 2018. doi:10.1145/3188745.3188838.
- [12] Mika Göös, Pritish Kamath, Robert Robere, and Dmitry Sokolov. Adventures in monotone complexity and TFNP. In 10th Innovations in Theoretical Computer Science Conference (ITCS 2019), pages 38:1–38:19. Schloss Dagstuhl – Leibniz-Zentrum für Informatik, 2019.
- [13] Danny Harnik and Ran Raz. Higher lower bounds on monotone size. In Proceedings of the Thirty-Second Annual ACM Symposium on Theory of Computing, STOC ’00, pages 378–387, New York, NY, USA, 2000. Association for Computing Machinery. doi:10.1145/335305.335349.
-
[14]
Lunjia Hu, 2021.
URL:
https://theorydish.blog/2021/05/19/entropy-estimation-via-
two-chains-streamlining-the-proof-of-the-sunflower-lemma/. - [15] Stasys Jukna. Combinatorics of monotone computations. Combinatorica, 19(1):65–85, 1999. doi:10.1007/S004930050046.
- [16] Stasys Jukna et al. Boolean function complexity: advances and frontiers, volume 27. Springer, 2012. doi:10.1007/978-3-642-24508-4.
- [17] Anup Rao. Coding for Sunflowers. Discrete Analysis, February 2020. doi:10.19086/da.11887.
- [18] Anup Rao. Sunflowers: from soil to oil. Bulletin of the American Mathematical Society, 60(1):29–38, 2023.
- [19] Alexander Razborov. Lower bounds on the monotone complexity of some boolean function. In Soviet Math. Dokl., volume 31, pages 354–357, 1985.
- [20] Benjamin Rossman. The monotone complexity of k-clique on random graphs. In 2010 IEEE 51st Annual Symposium on Foundations of Computer Science, pages 193–201, 2010. doi:10.1109/FOCS.2010.26.
- [21] Michel Talagrand. Regularity of Infinitely Divisible Processes. The Annals of Probability, 21(1):362–432, 1993. doi:10.1214/aop/1176989409.
- [22] Michel Talagrand. Upper and lower bounds for stochastic processes, volume 60. Springer, 2014.
-
[23]
Terrence Tao, 2020.
URL:
https://terrytao.wordpress.com/2020/07/20/the-sunflower-
lemma-via-shannon-entropy/. - [24] Andrew Chi-Chin Yao. Probabilistic computations: Toward a unified measure of complexity. In 18th Annual Symposium on Foundations of Computer Science (sfcs 1977), pages 222–227. IEEE Computer Society, 1977.
