Abstract 1 Introduction 2 Technique 3 Lower Bounds 4 Discussion and Open Problems References Appendix A Proof of Claim 23

Lower Bounds Beyond DNF of Parities

Artur Riazanov ORCID EPFL, Lausanne, Switzerland Anastasia Sofronova ORCID EPFL, Lausanne, Switzerland Dmitry Sokolov ORCID EPFL, Lausanne, Switzerland
Université de Montréal, Canada
Abstract

We consider a subclass of 𝖠𝖢0[2] circuits that simultaneously captures DNFXor and depth-3 𝖠𝖢0 circuits. For this class we show a technique for proving lower bounds inspired by the top-down approach. We give lower bounds for the middle slice function, inner product function, and affine dispersers.

Keywords and phrases:
boolean circuits, top-down, unpredictability
Copyright and License:
[Uncaptioned image] © Artur Riazanov, Anastasia Sofronova, and Dmitry Sokolov; licensed under Creative Commons License CC-BY 4.0
2012 ACM Subject Classification:
Theory of computation Circuit complexity
Related Version:
Full Version: https://eccc.weizmann.ac.il/report/2025/024/
Acknowledgements:
We thank Mika Göös for fruitful discussions. We also thank Pengxiang Wang and anonymous reviewers for useful comments on the text. In particular, we thank a CCC 2025 reviewer who pointed out an issue with the proof of Theorem 16 in the earlier version of the paper.
Funding:
Authors are supported by the Swiss State Secretariat for Education, Research and Innovation (SERI) under contract number MB22.00026.
Editor:
Shubhangi Saraf

1 Introduction

Constant-depth (or 𝖠𝖢0) de Morgan circuits is one of the circuit classes that is reasonably well-studied, even though strong exponential lower bounds of the form 2Ω(n) are still out of reach. This model can use unbounded , and ¬ gates, and the underlying graph of a computation is a constant-depth tree; intuitively, these circuits represent computations that can be efficiently parallelized. There are known hard examples of functions that cannot be computed with small-size 𝖠𝖢0 circuits. The most notable ones include Xor and Maj:

Xor(x1,,xn)x1xn;Maj(x1,,xn)𝟙[x1++xn>n/2].

These functions require size 2Ω(n1/(d1)) depth-d 𝖠𝖢0 circuits, and the lower bounds for them are achieved mainly with two techniques: random restrictions, or switching lemma, [13, 1, 42, 18, 19] and polynomial approximation [33, 38]. In particular, for circuits of depth 3, there is a lower bound of 2Ω(n) for both of these functions (for Xor it is known to be tight), and breaking through the n barrier in the exponent for any explicit function is a major open question. Proving strong exponential lower bounds for depth-3 circuits would essentially give a superpolynomial lower bound for general circuits [40, 14] which is a major open problem in complexity theory. Both switching lemma and polynomial approximation seem unable to give us such strong lower bounds.

Circuits with MOD Gates

The situation becomes more challenging in terms of lower bounds, when we plug-in hard functions for 𝖠𝖢0 into our computational model. One of the most natural generalisations of 𝖠𝖢0 circuits that follow this concept is 𝖠𝖢0[m] circuits, that can also utilise gates computing 𝖬𝖮𝖣m defined as

𝖬𝖮𝖣m(x1,,xn)𝟙[(x1++xn)modm=0].

On the one hand, a lower bound for this model is necessary to show lower bounds for general circuits. On the other hand, showing such lower bounds is a challenging problem. For example, techniques based on random restrictions, such as switching lemma application, do not work quite as they do in 𝖠𝖢0, since 𝖬𝖮𝖣m gates are not simplified after restriction. However, when m is a prime power, polynomial approximation achieves lower bounds of the form 2n1/2d for Maj [33, 38], as well as for computing 𝖬𝖮𝖣q for a prime power q that is relatively prime with m.

When m is not a prime power, very little is known. In fact, utilising non-prime m with many divisors, it is possible to compute any symmetric function in subexponential size even in depth 3 [8]. The “minimal example” of the non-prime regime is 𝖠𝖢0[6]. It is still an open question to prove lower bounds for 𝖠𝖢0[6], and the known techniques fail at resolving that. The reason for that is that polynomial approximation only works over fields, and there is no field with 6 elements.

Even for the simplest example of 𝖬𝖮𝖣p gates: 𝖬𝖮𝖣2, or Xor, we are very far from understanding the exact power of 𝖠𝖢0[2] circuits. Allowing the use of the Xor function in the gates of a circuit increases its computational power; for example, depth-4 𝖠𝖢0[2] circuits can compute Maj in size 2O(n1/4) [29], whereas it requires 2Ω(n1/3)-size 𝖠𝖢0 circuits.

The drawbacks of Razborov–Smolensky polynomial approximation method translate to the gaps in our understanding of the class 𝖠𝖢0[2]. In particular, we do not have strong correlation bounds against these circuits even for one of the simplest subclasses of these circuits: DNFXor (depth-3 unbounded fan-in circuits of (Xor)-type) [22]. Here, denotes the composition, and this means that Xor gates are only allowed in the bottom layer of the circuit. Moreover, the polynomial approximation method only applies to functions that require a large degree over 𝔽2. Thus, it is unknown whether the inner product IPn(x,y)x1y1x2y2xnyn requires large 𝖠𝖢0 circuits with an additional layer of parity gates in the bottom (𝖠𝖢0Xor). It is known that IP requires exponentially large DNFXor circuits [24, 10], but even for (Xor)-circuits the best known lower bound is n2o(1) [9].

1.1 Top-down Approach

Overall, there is a clear shortage of new techniques in circuit complexity and, by extension, in adjacent areas, while the well-known ones also have the well-known drawbacks. In this work, we focus on studying another circuit lower bound technique, which falls under the umbrella of top-down methods.

Top-down lower bounds start from the output gate of a candidate circuit and move down the circuit in search of a mistake. While such an approach has been known for a long time, and there is a long line of work on top-down lower bounds for depth-3 𝖠𝖢0 circuits [4, 36, 27, 20, 35, 31, 32, 23, 30, 41, 6, 28, 14, 12, 15], it still remains largely underdeveloped. So far top-down lower bounds against 𝖠𝖢0 are known only for circuits up to depth-4 [17], while bottom-up methods yield lower bounds for arbitrary constant depth (or even logn/loglogn in some cases). The motivation for studying top-down comes from the fact that this method is complete for 𝖠𝖢0 circuits (see discussion in [21, 15, 17]). In other words, there are no formal barriers that would prevent such an approach from being able to prove lower bounds in the regimes where other known methods cannot.

The main model to which top-down techniques are applicable is 𝖠𝖢0 circuits, and the main example of a hard function is Xor. In this work, we attempt to adapt such techniques to be able to prove lower bounds for circuits with parity gates. We consider a subclass of 𝖠𝖢0Xor which is strictly stronger than DNFXor and prove lower bounds for it in a top-down fashion. In particular, we prove a lower bound for an affine disperser, which does not follow from Razborov–Smolensky method.

1.2 The Model and Results

In a recent paper Huang, Ivanov, and Viola [22] give an explanation of why the class Xor resists known lower bound techniques. They show that there is a circuit of this type that computes a very strong affine extractor: a function that is almost balanced on all large enough affine subspaces. On the other hand, it is known that affine extractors are hard for 𝖠𝖢0 (by definition we know upper bounds on Fourier coefficients, which contradicts with spectrum concentration of small 𝖠𝖢0 circuits obtained from switching lemma or polynomial approximation, see, for example [39]) and DNFXor [10]. Moreover, [22] show that DNFXor can compute a one-sided affine extractor: a balanced function that is never too biased towards zero on large enough affine subspaces. They then use the latter result to separate DNFXor from 𝖠𝖢0Xor that has at most n distinct Xor gates (here n is the number of variables). In other words, such a circuit is a composition of an 𝖠𝖢0-circuit and a non-singular affine transformation over 𝔽2, or an 𝖠𝖢0𝔹 circuit, where 𝔹 denotes the set of such affine transformations.

That circuit class can already compute arbitrary linear forms, but we consider a stronger model. Our model is essentially a union of 𝖠𝖢0 and 𝖠𝖢0𝔹 within Xor.

Definition 1.

Let C1,,CN be some constant depth de Morgan circuits and A1,,AN be non-singular affine transformations. We then say that Or(𝖠𝖢0𝔹)-circuit is i[N]CiAi, i.e. the disjunction of compositions of a constant-depth circuit and an affine map. The depth of a Or(𝖠𝖢0𝔹)-circuit is the depth of C1CN. The size of a Or(𝖠𝖢0𝔹) circuit is the total number of gates in C1,,CN. If N=1 we denote the corresponding circuit type as 𝖠𝖢0𝔹. We denote a depth-3 Or(𝖠𝖢0𝔹)-circuit class by Or(Π0𝔹), this class is our main focus.

We remark that the choice of as the top gate is arbitrary, all the arguments could be handled with on top (with hard functions changing appropriately).

Our primary motivation for studying this class is to develop a line of attack on subclasses of 𝖠𝖢0Xor circuits. Along the way, we establish lower bounds for the class, making concrete progress in this direction. As a subclass of 𝖠𝖢0Xor, Or(Π0𝔹) also has a natural interpretation in a top-down framework, as it corresponds to specific assumptions allowed in the proof strategy. We discuss this in Section 2. It is worth noting that as DNFXor is a subclass of Or(Π0𝔹), Xor is a subclass of OrAnd(Σ0𝔹). So, strong average-case lower bounds against Or(Π0𝔹) would also imply Xor lower bounds. In Section 4, we propose a roadmap of intermediate open questions which aims to extend this approach to eventually achieve lower bounds for stronger subclasses of 𝖠𝖢0Xor and even 𝖠𝖢0[6].

As a main result we present a general approach for proving lower bound for this model of computation. We give the highlights of the technique in Section 2. We now show the comparison of this model with classical models and state the lower bounds that we get.

The Comparison of the Models

Let us first observe that Or(Π0𝔹) is properly larger than depth-3 𝖠𝖢0𝔹. Observe that DNFXor is a special case of Or(Π0𝔹). The strict inclusion is then implied by the following.

Theorem 2 ([22]).

A function that admits a polynomial DNFXor circuit may require an exponential 𝖠𝖢0𝔹 circuit.

Proof sketch.

Implied by the combination of Corollary 5 and Claim 21 in [22], the former shows that DNFXor can compute functions for which there is a correlation bound for (n𝗉𝗈𝗅𝗒(logn))-depth parity decision trees (PDT), while the latter observes that the switching lemma applied to a 𝖠𝖢0𝔹-circuit yields a (nlogω(1)n)-depth PDT approximating the function.

On the other hand, Or(Π0𝔹) is properly larger than DNFXor. Since Or(Π0𝔹) contains CNF this statement is implied by the following Theorem.

Theorem 3.

Let f:{0,1}n×3{0,1} defined as f(x)i[n](xi1+xi2+xi3=1) where the sum is over . Then any DNFXor circuit computing f has size Ω(1.5n).

To the best of our knowledge, this is the simplest existing lower bound for DNFXor. We include the proof in Section 3.4.

Affine Dispersers

f:{0,1}n{0,1} is a (k,ε)-affine extractor if for every affine subspace A{0,1}n (where we equate {0,1}n and 𝔽2n) of dimension at least k we have |Pr𝒂A[f(𝒂)=1]1/2|<ε. We say that f is a k-affine disperser if it is a (k,1/2) affine extractor, i.e. Pr𝒂A[f(𝒂)=1]{0,1} for every k-dimensional affine subspace A.

In our first result, we confirm that Or(Π0𝔹) is smaller than Xor. Theorem 3 in [22] shows that polynomial-size Xor circuit computes affine extractors with polylogarithmic dimension (i.e. the function is close to being balanced in every affine subspace of at least polylogarithmic dimension). On the other hand, we prove the following theorem in Section 3.2.

Theorem 4 (informal).

If a function is not constant on any n1/3o(1)-dimension affine subspace (i.e. it is an n1/3o(1)-affine disperser) then it requires Or(Π0𝔹) circuits of exponential size.

Theorem 3 in [22] implies that the polynomial approximation method can not prove Theorem 4, since it can not distinguish between a Or(Π0𝔹)-circuit and a Xor-circuit, which contains some affine extractors.

Inner Product

The inner product function IPn:{0,1}n×{0,1}n{0,1} defined as follows IPn(x,y)=i[n]xiyimod2. It is a big open problem to get a lower bound for the inner product in Xor. In Section 3.3 we show that the inner product function requires exponentially large Or(𝖠𝖢0𝔹) circuits. The technique there is a combination of random restrictions with a top-down step.

Middle Slice

The middle slice function Midn:{0,1}n{0,1} is defined as Midn(x)=𝟙[|x|=n/2], i.e. it equals 1 iff the input contains exactly n/2 ones. In Section 3.1 we prove via a top-down argument that this function requires 2Ω(n)-size Or(Π0𝔹) circuits.

2 Technique

2.1 𝗔𝗖𝟎 Top-down Lower Bounds

The general sketch of an 𝖠𝖢0 top-down proof looks as follows.

  • Consider a circuit C=i=0sCi (the case of is treated analogously). Suppose we want to prove that C cannot distinguish certain sets of inputs A and B. Assume that, on the contrary, C(A)=1 and C(B)=0.

  • Note that iCi1(1)A and also for every i it holds that Ci1(0)B. We pick some Ci such that AiCi1(1). Now, this is a circuit that separates Ai and B, and it has as the top gate.

  • We repeat the procedure until we arrive at a shallow enough circuit C that is supposed to separate some sets A and B.

  • We prove that C cannot separate these sets. Note that if the original circuit C made an error in computing A and B, there always exists a sequence of choices of subcircuits such that the error is traced until C.

A “shallow enough circuit” could be, in principle, even a variable, but it turns out that there is a convenient way to argue about circuits of depth 2 in this context. Moreover, for now we assume that these circuits of depth 2 are CNFs/DNFs of width bounded by some parameter k. At the end of this section, we discuss why this assumption is acceptable for the proper choice of k.

The central notion for analyzing k-CNFs and k-DNFs is a k-limit. The notion comes from [20], inspired by “limit vectors” from [37] as well as communication complexity techniques [25].

Definition 5 ([20]).

Let A{0,1}n. x{0,1}n is a k-limit of A, if for any subset of indices I([n]k) there is yA such that xI=yI.

Claim 6 ([20, 28, 17]).

If a k-CNF formula C accepts a set A, then it accepts every k-limit of A.

At the same time, it is known that a k-limit is a complete notion in the following sense.

Claim 7 ([20]).

Let A{0,1}n be a set such that every k-limit of A belongs to A. Then there exists a k-CNF formula C such that C1(1)=A.

Proof.

We start from an empty CNF formula C and gradually add clauses to it. Consider any yA. As it is not a k-limit of A, there exists a set I([n]k) such that yIaI for any aA. Let us add into C a clause D=iIxi1yi. This clause evaluates to 0 on y and evaluates to 1 on any aA.

We repeat the procedure for every yA. The resulting CNF evaluates to 0 on every yA and evaluates to 1 on any aA, which proves the claim.

Note that a k-CNF over n variables has at most (2n)k=2O(klogn) clauses. The standard assumption for 𝖠𝖢0 is that the bottom fan-in of the circuit is bounded by the logarithm of its size, so k-limits are a tool that helps to prove lower bounds of the form 2k. To be more precise, it follows from Claim 6 that if a set A has a k-limit outside of A, then any CNF recognising it should have size 2Ω(k), and it follows from Claim 7 that for any set A containing all of its k-limits there is a CNF of size 2O(klogn) recognising A. So there is a multiplicative gap of logn in the exponent between related lower bound and upper bound. In most cases, this is a negligible difference, but for some examples this might be important: for example, Maj function has a lower bound of 2Ω(n) in depth-3 𝖠𝖢0 and an upper bound of 2O(nlogn) in the same model. Proving a 2ω(n) lower bound for Maj would beat all state-of-the-art lower bounds for depth-3 𝖠𝖢0 circuits.

Reducing Bottom Fan-in

Bottom-up 𝖠𝖢0 lower bounds usually use the fact that bottom fan-in is bounded by a parameter k. In most cases, k can be made as small as logs, where s is the size of the circuit. This is done by random restrictions, which kill the bottom layer gates with a big fan-in. In case of 𝖠𝖢0Xor, this becomes more of a problem, since Xor survives under small restrictions.

In fact, handling the bottom fan-in can be done using top-down techniques. In [17], the lower bound is fully top-down in the sense that no random restrictions are used even for reducing the bottom fan-in, and we follow the same path. With the use of Xor gates, this becomes crucial. The idea is that instead of just one k-limit, one needs to find many, so that wide clauses in a CNF could not reject all of them. We make this intuition precise:

Lemma 8.

Let C be a CNF over n variables of size s and AC1(1). Let LC1(0) be a set of k-limits for A. Then s>|L|/2nk.

Proof.

Let C=D1Ds. Then C1(0)=i[s]Di1(0). Let i[s] be the clause with the largest size of Di1(0)L, this size is at least |L|/s since LC1(0). Now the width of Di must be larger than k, since it distinguishes all k-limits in LC1(0) from A, hence |L|/s|Di1(0)|<2nk. The claim then follows.

2.2 Extending the Approach to Parity Gates

We can define the analogous notion for 𝖠𝖢0Xor circuits.

Definition 9 (k-parity limit).

Let A{0,1}n. x is a k-parity limit of A, if for any affine subspace L{0,1}n of co-dimension k such that xL it holds that AL.

The proof of the following claim is analogous to Claim 6. We include the proof for completeness.

Claim 10.

If a k-CNFXor circuit C accepts a set A, then it accepts any k-parity limit of A.

Proof.

Let x be the k-parity limit of A. Suppose C rejects x. Then there exists a particular OrXor subcircuit of C (denote it by C) such that C(x)=0. Let L be the affine subspace of co-dimension k defined by zeroes of C. Then xL, and therefore AL. Then for some yA it holds that C(y)=0 and therefore C(y)=0, which is a contradiction with the assumption that C accepts the whole A.

As there are much more linear subspaces of co-dimension k than clauses of width k, while we can prove the analogue of Claim 7 for k-parity limits, the resulting circuit would have huge size, so this approach would not give a meaningful size upper bound. One could ask a question: is it possible to prove a reasonable upper bound from non-existence of “outer” k-parity limits?

Problem 11.

Is k-parity limit complete in the following sense: if a set A contains all of its k-limits, then there is a CNFXor circuit C of width k and size 2kO(1) such that it accepts exactly set A?

For proving lower bounds against 𝖠𝖢0Xor, it is sufficient to find k-parity limits only with respect to a fixed set of linear forms present in a circuit (subcircuit), but it would be interesting to know if the more general statement is true. Again, the existence of a k-parity limit with respect to a fixed set of linear forms is a necessary condition for a lower bound.

The proof of the next claim is, again, analogous to Claim 6.

Claim 12.

Let C=i=1sLi be a (k-CNF)Xor (here Li is such that Li1(0) is a co-dimension-k affine subspace) and let 𝒮 be a collection of affine subspaces of co-dimension k such that Li1(0)𝒮 for all i[s]. Suppose that C accepts A. Consider y such that for any L𝒮, yL implies that there exists xA such that xL. Then C accepts y.

Here 𝒮 can be a collection of all affine subspaces of co-dimension k, or it can be a smaller family of subspaces that still contains all linear systems used in a (k-CNF)Xor. In other words, as the number of linear systems used in a circuit is bounded by its size, one can relax the definition of a k-parity limit to be able to fool only these linear systems. One can also prove a variant of completeness for this weaker notion of k-parity limits analogous to Claim 7.

Exponential size Or(Π0𝔹), in particular, can use exponential number of different linear forms, with the restriction that inside each CNFXor subcircuit there are only n different linear forms. Our results can be seen as finding k-parity limits under these restrictions on the model. In top-down language, the extra Or on top symbolises that the first step down in the proof is oblivious to the actual parity gates that are used in the circuit. Now, just two such oblivious steps would imply lower bounds for Xor.

When comparing top-down lower bounds for plain 𝖠𝖢0 and Or(Π0𝔹), the most important difference (and our main technical contribution) is the following: for Or(Π0𝔹), we should be able to construct sets such that we can find their k-limits after any change of basis in 𝔽2n. Note that the hard functions in this case should be uncorrelated with affine subspaces, which, in particular, is not true for Xor function: after the appropriate change of basis, we could encode the value of the function in the first bit of the string in the new basis.

This seems like a natural step towards fully adapting the top-down approach for circuits with Xor gates. We discuss this further in Section 4.

2.3 Unpredictability and Local Limits

One of the most successful to date ways to find local limits is via unpredictability from partial information.

Let X{0,1}n and R[n]. A pair (Q,a) with Q[n]R and a{0,1}Q is a certificate for R if there exists b{0,1}R such that whenever xQ=a for xX, xRb. In this case, we say that x contains a certificate for R, and the size of such certificate is q|Q|. This notion was introduced in [28] who proved the following result for |R|=1.

Lemma 13 (Bit unpredictability [28]).

Let X{0,1}n have density |X|/2n2d. Then for any q1,

Pr(𝒙,𝒊)X×[n][𝒙 contains a size-q certificate for 𝒊 wrt X]O(dq/n).

More recently [17] generalized this result for |R|>1.

Lemma 14 (Block unpredictability [17]).

Let X{0,1}n have density |X|/2n2d. Then for any q,r1,

Pr(𝒙,𝑹)X×(nr)[𝒙 contains a size-q certificate for 𝑹 wrt X]O(dqr/n)1/6.

Bit and block unpredictability were used in top-down lower bounds for low-depth 𝖠𝖢0 circuits as a way to extract local limits (Definition 5).

Lemma 15 ([28, 17]).

Suppose xX does not contain any q-certificates for R wrt to X. Then every x such that x[n]R=x[n]R is a q-limit of X.

Proof.

Suppose that there exists x with x[n]R=x[n]R that is not a q-limit for X, i.e. there exists a set S[n] of size q such that for every yX we have xSyS. Observe that (SR,xSR) is then a certificate for R: indeed for any b{0,1}R that agrees with xSR we have that for every yX we have yRb.

3 Lower Bounds

3.1 Middle Slice

In this section we prove the following.

Theorem 16.

Let C be a circuit of the following form. CC1C2CN where Ci is a composition of a CNF Di and an affine transformation Ai of full rank. Suppose that C computes the characteristic function of the middle slice ([n]n/2). Then the total size of CNFs D1,,DN is at least 2Ω(n).

The key ingredient in our proof is a density boosting lemma for affine subspaces. Following a similar definition for boolean subcubes in [3] we say that a set XS where S is a vector space is θ-linearly spread in S if for every affine subspace AS of co-dimension d we have

Pr𝒙X[𝒙A]θd.

We remark that the range of interesting values for θ is (1,2], for θ1 all sets are spread, for θ>2 no set is spread. A similar, but different notion was used in [26]. The following is a new density boosting lemma, generalized for affine subspaces, which might be of independent interest. For boolean cubes, such a lemma was introduced in [16] and appears among others in [11].

Lemma 17.

For every set X{0,1}n of size at least 2nd there exists an affine subspace A of {0,1}n of co-dimension at most d/(1log2θ) such that XA is θ-linearly spread in A and |XA||X|θd/(1log2θ).

Proof.

Suppose that X is not θ-linearly spread in {0,1}n. Then consider the affine subspace of largest co-dimension A that witnesses the lack of θ-spreadness of X: suppose that A has co-dimension then we have |XA|/|X|>θ.

We claim then that XA is θ-lineraly spread in A. Indeed, suppose that it is not, i.e. there exists an affine subspace B of A of co-dimension (in A) such that |XB|/|XA|>θ. Then |XB|/|X|>θ which contradicts the maximality of the co-dimension of A.

Now it remains to bound the co-dimension of A. On the one hand

Pr𝒙X[𝒙A]=yAPr[𝒙=y]|A|2dn=2(n)+dn=2d.

On the other hand Pr𝒙X[𝒙A]>θ. Hence d>(1log2θ). The lower bound on |XA| follows.

Proof of Theorem 16.

Suppose for contradiction that C1(1)=([n]n/2), N2γn where γ is a constant to choose later. Since C1(1)=i[N]Ci1(1), there exists i0[N] such that |Ci01(1)||([n]n/2)|2γn.

Thus, there exists a CNF D and a full-rank linear transformation A such that X(DA)1(1) has size at least (nn/2)2γn2nγnlogn and X([n]n/2). Let us identify the linear transformation A with the matrix in {0,1}n×n defining it: A(x)Ax.

First we apply Lemma 17 to the set X with the parameter θ=2. We get that there exists an affine space B of co-dimension at most 2(γn+logn) such that XB is θ-linearly spread in B and |XB||X|/2γn+logn.

Applying Lemma 13 to the set A(XB)={AxxXB} with q=5γn we get:

Pr(𝒙,𝒊)(XB)×[n][A𝒙 does not contain a size-q certificate for 𝒊 wrt XB]1O(γnq/n).

Pick γ such that this probability is at least 0.9. That is, with probability 0.9 for a pair (𝒙,𝒊)(XB)×[n] we have by Lemma 15 that A𝒙+e𝒊 (where ei=0i110ni) is a q-limit of the set A(XB). In order to invoke Lemma 8 we need to find many q-limits in D1(0), i.e., outside of A(X).

Suppose that Ax+eiA(X) for some (x,i)(XB)×[n]. Equivalently x+A1eiX. Then in particular x+A1ei([n]n/2). Since xX([n]n/2), this is only possible if A1ei has even hamming weight and in that case implies that x,A1ei=(|{j[n](A1ei)j=1}|/2)mod2ci.

In other words, if adding A1ei to x preserves its Hamming weight n2, it means that exactly half of the indices of 1-coordinates in A1ei match up with indices of 1-coordinates in x, which fixes the value of x,A1ei to a constant ci.

Now consider the affine subspace Bi{yBy,A1ei=ci}. If A1eiB{x{0,1}nyB,x,y is fixed} (B is the span of all linear constraints defining B), then Bi=Bi or B=, otherwise Bi has co-dimension 1 in B. Let E be the event when A1e𝒊 is in B. We then have

Pr[A𝒙+e𝒊A(X)]Pr[A𝒙+e𝒊A(X)¬E]+Pr[E].

Since the co-dimension of B (equivalently the dimension of B) is at most 2(nγ+logn) and A1e1,,A1en are linearly independent, Pr[E]2(γn+logn)/n=o(1). On the other hand since A𝒙+e𝒊A(X) implies that 𝒙B𝒊 and XB is θ-linearly spread in B we get Pr[A𝒙+e𝒊A(x)¬E]1/2. Therefore Pr[A𝒙+e𝒊A(X)A𝒙+e𝒊 is a q-limit of A(XB)]0.91/2o(1), so there are Ω(|XB|) q-limits to A(XB) outside of A(X), hence by Lemma 8 we get that

|D|=Ω(|XB|)/2n5γn=Ω(2n2(γn+logn)/2n5γn)=Ω(2γn).

3.2 Affine Disperser

Theorem 18.

Let γ be any constant in (0,1/3). Let f:{0,1}n{0,1} be an affine disperser for dimension knγ such that |f1(1)|2nnγ and C be a circuit of the form CC1C2CN, where Ci is a composition of a CNF Di and a linear transformation Ai of full rank. If C computes f, then the total size of C1,,CN is at least 2Ω(nγ).

Proof.

We show that either N2nγ or one of the CNFs C1,,CN has size at least 2nγ, which yields the claim. Suppose N<2nγ. Then there exists Ci=DiAi such that |Ci1(1)||f1(1)|/2nγ2n2nγ.

Let X(DiAi)1(1). For the set Ai(X) we apply Lemma 14 with the following parameters:

  • the density loss t=2nγ;

  • the size of certificate q=αnγ;

  • the size of the unpredictable block r=βnγ.

The constants α,β are to be chosen later. It follows that with probability 1O(n3γ1)=1o(1) for 𝒙X and 𝑹([n]r) there is no certificate of size q in Ai𝒙 for 𝑹. Hence, by Lemma 15 all elements of

L𝒙,𝑹{y{0,1}ny[n]𝑹=𝒙[n]𝑹}

are q-limits of Ai(X) with probability 1o(1). Let E be the set of pairs x,RX×([n]r) for which this holds.

For (x,R)E consider the set Ai1(Lx,R). Lx,R is an r-dimensional affine subspace of {0,1}n, thus Ai1(Lx,R) is as well. As f is an affine disperser, there is an input yAi1(Lx,R)f1(0), hence Aiy is not in X and is a q-limit of X.

Now we need to count the number of q-limits we got in order to invoke Lemma 8. Let g:E{0,1}n be the function mapping (x,R) to Aiy (to an arbitrary one, if there are several). Let us upper bound |g1(z)| for an arbitrary z{0,1}n. Suppose g(x,R)=z, let y=(Ai)1z, then x[n]R=y[n]R, hence, there are at most 2r([n]r) such preimages. Thus we get |E|/(2r([n]r))=|X|/2r q-limits in total. Therefore by Lemma 8 we get that |Di|(1o(1))2n2nγ/(2r2nq)2qr2nγ12(αβ2)nγ1. Hence for any α>β+2 we get the desired bound.

3.3 Inner Product

In this section, we give an exponential lower bound for Or(𝖠𝖢0𝔹)-circuit size required to compute the inner product. Our proof is a combination of bottom-up techniques with one top-down-like step.

Theorem 19.

Let C be a circuit of the form CC1C2CN where each Ci is a composition of a d-depth circuit Di composed with a full-rank affine mapping Ai.

Suppose that C computes IPn. Then the total size of circuits D1,,DN is at least 2nΩ(1/d).

Proof.

Suppose for contradiction that the total size of all Di is at most 2nε for ε=o(1/d). Then let us pick 𝜶{0,1}n and apply the restriction y=𝜶 to C. Then C|y=𝜶 computes the function Xor𝜶i[n]:𝜶i=1xi and has the same form as before, disjunction of compositions of constant-depth circuits with affine transformations.

Let Di,Ai be such that C|y=𝜶=i[N]DiAi. Since Xor𝜶 is balanced, there exists i0[N] such that |(Di0Ai0)1(1)|2|α|1nε. On the other hand (DiAi)1(0)Xor𝜶1(0) for all i[N]. Then applying (Ai0)1 to Di0Ai0 and to Xor𝜶 on the right we get that the circuit (Di0)1(0)Xor𝜶 where 𝜶=𝜶(Ai0)1 and since Ai0 has full rank |(Di0)1(1)|=|(Di0Ai0)1(1)|2|α|1nε.

Now observe that
Pr[|𝜶|n]=Pr[𝜶 is a linear combination of n rows of (Ai0)1]N(|α|n)/2n=o(1).

Hence, there exists a depth-d de Morgan circuit D=Di0 that computes parity Xorα0 on n bits correctly on all 0-inputs and on at least 2nε-fraction of 1-inputs. Then let 𝒚1,,𝒚MXorα01(0) be independent random variables. Then the depth-(d+1) circuit E𝒚(x)i[M]D(x𝒚i) computes the value of Xorα0 correctly on an input x with probability 1(12nε)M, which exceeds 12n for M>3n2nε, hence, there exists a setting of y=y1,,yM such that Ey computes Xorα0. Thus by [18] the size of Ey is at least 2nΩ(1/d) which means that ε=Ω(1/d) which is a contradiction.

3.4 Proof of Theorem 3

A DNFXor circuit computing f is equivalent to a covering of f1(1) by affine subspaces f1(1)=i[N]Ai. Consider an arbitrary Aj{0,1}n×3. Affine spaces over 𝔽2 are closed under sums of three elements, so let a,b,cAj. Then d=abcAj. For xf1(1) we have that for every i[n] among xi,1,xi,2,xi,3 exactly one value is 1 and the other two are zeroes. For xf1(1) we can define x¯[3]n be such that for every i[n] we have xi,x¯i=1 and xi,j=0 if jx¯i. Then since dAif1(1) for every i[n] we have |{a¯i,b¯i,c¯i}|<3, since otherwise di=(1,1,1) which contradicts f(d)=1. Since this is true for any a,b,cAj we get that there exists β[3]n such that for every xAj and for every i[n] we have xi,β(i)=0. Since for every xf1(1) we have xi,1+xi,2+xi,3=1 we get that |Aj|2n. Since |f1(1)|=3n we get that N(3/2)n, which completes the proof.

4 Discussion and Open Problems

In the results above, having an extra Or on top of the circuit, and only fixing the linear transformation in the subcircuits, can be interpreted in the following way. When implementing the top-down strategy, the first choice of the subcircuit (and the subset of 1-inputs, respectively), does not depend on the specific linear forms used in the circuit.

In other words, let A=f1(1) and B=f1(0) for one of the hard functions considered in the main section. Informally, we prove that for any covering of A by no more 2nε sets A1,,A2nε (for some ε=Ω(1)) there is a choice of Ai such that for any affine map L there is a k-limit for Ai in B with respect to that map. Note that for different affine maps, we might find different k-limits. For proving lower bounds for Xor, we would need to prove a statement where the last two quantifiers are in a different order: there is a k-limit that works for any affine map. Or, at least, for any affine map in a large enough collection of such. As mentioned in Section 2, this corresponds to making two “oblivious” steps down the circuit, where we do not use the knowledge of specific parity gates, while in our lower bound, we make one “oblivious” step.

Note that this is not true for the affine extractors in general, as there is an affine extractor computable by Xor circuits [22]. The middle slice function, however, could still be a good example for honing the top-down techniques.

The first natural step could be finding the same k-limit with respect to an arbitrary pair of affine maps. This corresponds to the lower bounds in the following computational model.

Problem 20.

Prove top-down lower bounds for the class of (2-DNF)(CNF𝔹) circuits.

In fact, that would already be a step towards another elusive circuit class .Another motivation for this open question comes from the Mod6 perspective. A Mod6 gate can be seen as a conjunction of Mod2 and Mod3 gates. After appropriately expanding the brackets, DNFMod6 can be transformed to a 2-DNF of conjunctions such that in each conjunction there are only Mod2 or only Mod3 gates. So a first related problem would be to adapt the technique for Mod3.

Problem 21.

Prove top-down lower bounds for subclasses of 𝖠𝖢0Mod3.

The next step would be combining the two last problems together. Let 𝕃{f:{0,1}n{0,1}n} be the union of linear maps over 𝔽2 (𝔹) and maps x(𝟙[(Ax)i=ai])i[n] where A𝔽3n×n and a𝔽3n. In other words, we can choose a transformation of the inputs that either uses only Xor operations, or only Mod3 operations.

Problem 22.

Prove lower bounds for (2-DNF)(CNF𝕃) circuits.

Solving this problem would imply lower bounds for DNFMod6.

Claim 23.

Let f:{0,1}n{0,1} be such that it is computable by k-DNFMod6-circuit D of size s. Then it is computable by (2-DNF)(CNF𝕃) of size O(s2kk).

Proof.

See Appendix A.

When proving the results of this form, it all essentially boils down to finding a k-(parity) limit. The two known techniques for this are (robust) sunflowers or spreadness [20, 17] and unpredictability [28, 17]. They both have certain downsides. For starters, these techniques only find k-limits that are close to the set in Hamming distance (or, in the case of our result, they are close in Hamming distance after a certain affine transformation). In principle, this might not be the case.

Problem 24.

Let A{0,1}n be a subset of a code with minimum distance d=Ω(n/logn). Can you find a log2(n)-limit of A?

When looking for a k-limit, we can also ask for some structure of the considered set. Let us say that our set A is a half of a n-wise independent set. From the results of Bazzi [5], Razborov [34], and Braverman [7], we know that roughly half the points of the whole boolean cube should be nε-limits of the set A. However, current techniques do not allow us to find some explicit k-limit, assuming the knowledge of A. One of the reasons for this is that the size of such sets can be as small as 2O(nlogn) [2].

Problem 25.

Prove a top-down lower bound 2kΩ(1) for separating two disjoint k-wise independent sets by depth-3 𝖠𝖢0 circuits.

References

  • [1] Miklos Ajtai. Σ11-formulae on finite structures. Annals of Pure and Applied Logic, 24(1):1–48, 1983. doi:10.1016/0168-0072(83)90038-6.
  • [2] Noga Alon, László Babai, and Alon Itai. A fast and simple randomized parallel algorithm for the maximal independent set problem. J. Algorithms, 7(4):567–583, 1986. doi:10.1016/0196-6774(86)90019-2.
  • [3] Ryan Alweiss, Shachar Lovett, Kewen Wu, and Jiapeng Zhang. Improved bounds for the sunflower lemma. In Proceedings of the 52nd Annual ACM SIGACT Symposium on Theory of Computing, STOC 2020, pages 624–630, New York, NY, USA, 2020. Association for Computing Machinery. doi:10.1145/3357713.3384234.
  • [4] Theodore Baker and Alan Selman. A second step toward the polynomial hierarchy. Theoretical Computer Science, 8(2):177–187, 1979. doi:10.1016/0304-3975(79)90043-4.
  • [5] Louay M. J. Bazzi. Polylogarithmic independence can fool DNF formulas. SIAM J. Comput., 38(6):2220–2272, 2009. doi:10.1137/070691954.
  • [6] Elmar Böhler, Christian Glaßer, and Daniel Meister. Error-bounded probabilistic computations between MA and AM. Journal of Computer and System Sciences, 72(6):1043–1076, 2006. doi:10.1016/j.jcss.2006.05.001.
  • [7] Mark Braverman. Poly-logarithmic independence fools bounded-depth boolean circuits. Commun. ACM, 54(4):108–115, 2011. doi:10.1145/1924421.1924446.
  • [8] Brynmor Chapman and R. Ryan Williams. Smaller ACC0 circuits for symmetric functions. In Mark Braverman, editor, 13th Innovations in Theoretical Computer Science Conference, ITCS 2022, January 31 - February 3, 2022, Berkeley, CA, USA, volume 215 of LIPIcs, pages 38:1–38:19. Schloss Dagstuhl – Leibniz-Zentrum für Informatik, 2022. doi:10.4230/LIPIcs.ITCS.2022.38.
  • [9] Mahdi Cheraghchi, Elena Grigorescu, Brendan Juba, Karl Wimmer, and Ning Xie. AC0MOD2 lower bounds for the boolean inner product. J. Comput. Syst. Sci., 97:45–59, 2018. doi:10.1016/J.JCSS.2018.04.006.
  • [10] Gil Cohen and Igor Shinkar. The complexity of DNF of parities. In Madhu Sudan, editor, Proceedings of the 2016 ACM Conference on Innovations in Theoretical Computer Science, Cambridge, MA, USA, January 14-16, 2016, pages 47–58. ACM, 2016. doi:10.1145/2840728.2840734.
  • [11] Sandro Coretti, Yevgeniy Dodis, Siyao Guo, and John Steinberger. Random oracles and non-uniformity. In Jesper Buus Nielsen and Vincent Rijmen, editors, Advances in Cryptology – EUROCRYPT 2018, pages 227–258, Cham, 2018. Springer International Publishing. doi:10.1007/978-3-319-78381-9_9.
  • [12] Peter Frankl, Svyatoslav Gryaznov, and Navid Talebanfard. A variant of the VC-dimension with applications to depth-3 circuits. In Proceedings of the 13th Conference on Innovations in Theoretical Computer Science (ITCS), volume 215, pages 72:1–72:19. Schloss Dagstuhl, 2022. doi:10.4230/LIPIcs.ITCS.2022.72.
  • [13] Merrick Furst, James Saxe, and Michael Sipser. Parity, circuits, and the polynomial-time hierarchy. Mathematical Systems Theory, 17(1):13–27, 1984. doi:10.1007/bf01744431.
  • [14] Alexander Golovnev, Alexander Kulikov, and Ryan Williams. Circuit depth reductions. In Proceedings of the 12th Conference on Innovations in Theoretical Computer Science (ITCS), volume 185, pages 24:1–24:20. Schloss Dagstuhl, 2021. doi:10.4230/LIPIcs.ITCS.2021.24.
  • [15] Mika Göös, Ziyi Guan, and Tiberiu Mosnoi. Depth-3 Circuits for Inner Product. In 48th International Symposium on Mathematical Foundations of Computer Science (MFCS 2023), volume 272, pages 51:1–51:12. Schloss Dagstuhl, 2023. doi:10.4230/LIPIcs.MFCS.2023.51.
  • [16] Mika Göös, Shachar Lovett, Raghu Meka, Thomas Watson, and David Zuckerman. Rectangles are nonnegative juntas. SIAM Journal on Computing, 45(5):1835–1869, 2016. doi:10.1137/15M103145X.
  • [17] Mika Göös, Artur Riazanov, Anastasia Sofronova, and Dmitry Sokolov. Top-down lower bounds for depth-four circuits. In 2023 IEEE 64th Annual Symposium on Foundations of Computer Science (FOCS), pages 1048–1055, 2023. doi:10.1109/FOCS57990.2023.00063.
  • [18] Johan Håstad. Almost optimal lower bounds for small depth circuits. In Proceedings of the Eighteenth Annual ACM Symposium on Theory of Computing, STOC ’86, pages 6–20, New York, NY, USA, 1986. Association for Computing Machinery. doi:10.1145/12130.12132.
  • [19] Johan Håstad. Computational Limitations for Small Depth Circuits. PhD thesis, MIT, 1987.
  • [20] Johan Håstad, Stasys Jukna, and Pavel Pudlák. Top-down lower bounds for depth-three circuits. Computational Complexity, 5(2):99–112, 1995. doi:10.1007/bf01268140.
  • [21] Suichi Hirahara. A duality between depth-three formulas and approximation by depth-two. Technical report, arXiv, 2017. arXiv:1705.03588.
  • [22] Xuangui Huang, Peter Ivanov, and Emanuele Viola. Affine Extractors and AC0-Parity. In Amit Chakrabarti and Chaitanya Swamy, editors, Approximation, Randomization, and Combinatorial Optimization. Algorithms and Techniques (APPROX/RANDOM 2022), volume 245 of Leibniz International Proceedings in Informatics (LIPIcs), pages 9:1–9:14, Dagstuhl, Germany, 2022. Schloss Dagstuhl – Leibniz-Zentrum für Informatik. doi:10.4230/LIPIcs.APPROX/RANDOM.2022.9.
  • [23] Russell Impagliazzo, Ramamohan Paturi, and Francis Zane. Which problems have strongly exponential complexity? Journal of Computer and System Sciences, 63(4):512–530, December 2001. doi:10.1006/jcss.2001.1774.
  • [24] Stasys Jukna. On graph complexity. Comb. Probab. Comput., 15(6):855–876, 2006. doi:10.1017/S0963548306007620.
  • [25] Mauricio Karchmer and Avi Wigderson. Monotone circuits for connectivity require super-logarithmic depth. SIAM J. Discret. Math., 3(2):255–265, 1990. doi:10.1137/0403021.
  • [26] Zander Kelley and Raghu Meka. Strong bounds for 3-progressions. In 64th IEEE Annual Symposium on Foundations of Computer Science, FOCS 2023, Santa Cruz, CA, USA, November 6-9, 2023, pages 933–973. IEEE, 2023. doi:10.1109/FOCS57990.2023.00059.
  • [27] Ker-I Ko. Separating and collapsing results on the relativized probabilistic polynomial-time hierarchy. Journal of the ACM, 37(2):415–438, 1990. doi:10.1145/77600.77623.
  • [28] Or Meir and Avi Wigderson. Prediction from partial information and hindsight, with application to circuit lower bounds. Computational Complexity, 28(2):145–183, 2019. doi:10.1007/s00037-019-00177-4.
  • [29] Igor Carboni Oliveira, Rahul Santhanam, and Srikanth Srinivasan. Parity helps to compute majority. In Amir Shpilka, editor, 34th Computational Complexity Conference, CCC 2019, July 18-20, 2019, New Brunswick, NJ, USA, volume 137 of LIPIcs, pages 23:1–23:17. Schloss Dagstuhl – Leibniz-Zentrum für Informatik, 2019. doi:10.4230/LIPIcs.CCC.2019.23.
  • [30] Ramamohan Paturi, Pavel Pudlák, Michael Saks, and Francis Zane. An improved exponential-time algorithm for k-SAT. Journal of the ACM, 52(3):337–364, 2005. doi:10.1145/1066100.1066101.
  • [31] Ramamohan Paturi, Pavel Pudlak, and Francis Zane. Satisfiability coding lemma. Chicago Journal of Theoretical Computer Science, 5(1):1–19, 1999. doi:10.4086/cjtcs.1999.011.
  • [32] Ramamohan. Paturi, Michael Saks, and Francis Zane. Exponential lower bounds for depth three boolean circuits. computational complexity, 9(1):1–15, 2000. doi:10.1007/PL00001598.
  • [33] Alexander Razborov. Lower bounds on the size of bounded depth circuits over a complete basis with logical addition. Mathematical Notes of the Academy of Sciences of the USSR, 41(4):333–338, 1987. doi:10.1007/bf01137685.
  • [34] Alexander A. Razborov. A simple proof of bazzi’s theorem. ACM Trans. Comput. Theory, 1(1):3:1–3:5, 2009. doi:10.1145/1490270.1490273.
  • [35] Alexander Russell and Ravi Sundaram. Symmetric alternation captures BPP. Computational Complexity, 7(2):152–162, November 1998. doi:10.1007/s000370050007.
  • [36] Miklos Santha. Relativized Arthur–Merlin versus Merlin–Arthur games. Information and Computation, 80(1):44–49, 1989. doi:10.1016/0890-5401(89)90022-9.
  • [37] Michael Sipser. A topological view of some problems in complexity theory. In Michal Chytil and Václav Koubek, editors, Mathematical Foundations of Computer Science 1984, Praha, Czechoslovakia, September 3-7, 1984, Proceedings, volume 176 of Lecture Notes in Computer Science, pages 567–572. Springer, 1984. doi:10.1007/BFB0030341.
  • [38] Roman Smolensky. Algebraic methods in the theory of lower bounds for boolean circuit complexity. In Proceedings of the 19th Symposium on Theory of Computing (STOC). ACM Press, 1987. doi:10.1145/28395.28404.
  • [39] Avishay Tal. Tight bounds on the fourier spectrum of AC0. In Ryan O’Donnell, editor, 32nd Computational Complexity Conference, CCC 2017, July 6-9, 2017, Riga, Latvia, volume 79 of LIPIcs, pages 15:1–15:31. Schloss Dagstuhl – Leibniz-Zentrum für Informatik, 2017. doi:10.4230/LIPIcs.CCC.2017.15.
  • [40] Leslie G. Valiant. Graph-theoretic arguments in low-level complexity. In Jozef Gruska, editor, Mathematical Foundations of Computer Science 1977, 6th Symposium, Tatranska Lomnica, Czechoslovakia, September 5-9, 1977, Proceedings, volume 53 of Lecture Notes in Computer Science, pages 162–176. Springer, 1977. doi:10.1007/3-540-08353-7_135.
  • [41] Guy Wolfovitz. The complexity of depth-3 circuits computing symmetric boolean functions. Information Processing Letters, 100(2):41–46, October 2006. doi:10.1016/j.ipl.2006.06.008.
  • [42] Andrew Yao. Separating the polynomial-time hierarchy by oracles. In 26th Annual Symposium on Foundations of Computer Science (SFCS). IEEE, 1985. doi:10.1109/sfcs.1985.49.

Appendix A Proof of Claim 23

Consider a term of D. We rewrite it as a 2-CNF using 𝖬𝖮𝖣2 and 𝖬𝖮𝖣3 gates:

i[k][(j[n]αjixj)mod6=ai]i[k][(j[n]βjixj)mod6bi]=
i[k][(j[n]αjixj)mod2=aimod2(j[n]αjixj)mod3=aimod3]
i[k][(j[n]βjixj)mod2bimod2(j[n]βjixj)mod3bimod3]

Here α,β6k×n, a,b6k. A 2-CNF with k terms can be transformed into a DNF of size 2kk. Now, any term of that DNF has the following form:

i[k][(j[n]γjixj)mod2=ai]i[k][(j[n]δjixj)mod2bi]
i[k][(j[n]εjixj)mod3=ci]i[k][(j[n]φjixj)mod3di]=
i[k]A(x)ii[k]B(x)i

Here A and B are transformations from 𝕃, γ,δ𝔽2k×n;ε,φ𝔽3k×n and a,b𝔽3k, c,d𝔽3k. Overall, f is then computable by a circuit of the following form:

tDi[k]At(x)ii[k]Bt(x)i

This is a (2-DNF)(CNF𝕃) circuit of no greater size than O(s2kk).