Lower Bounds Beyond DNF of Parities

Riazanov, Artur; Sofronova, Anastasia; Sokolov, Dmitry

doi:10.4230/LIPIcs.ITCS.2026.112

Lower Bounds Beyond DNF of Parities

Artur Riazanov

EPFL, Lausanne, Switzerland Anastasia Sofronova

EPFL, Lausanne, Switzerland Dmitry Sokolov

EPFL, Lausanne, Switzerland
Université de Montréal, Canada

Abstract

We consider a subclass of ${\mathsf{AC}}^{0}[2]$ circuits that simultaneously captures $\textsf{DNF}\circ\textsc{Xor}$ and depth- $3$ ${\mathsf{AC}}^{0}$ circuits. For this class we show a technique for proving lower bounds inspired by the top-down approach. We give lower bounds for the middle slice function, inner product function, and affine dispersers.

Keywords and phrases:

boolean circuits, top-down, unpredictability

Copyright and License:

2012 ACM Subject Classification:

Theory of computation

\rightarrow

Circuit complexity

Acknowledgements:

We thank Mika Göös for fruitful discussions. We also thank Pengxiang Wang and anonymous reviewers for useful comments on the text. In particular, we thank a CCC 2025 reviewer who pointed out an issue with the proof of Theorem 16 in the earlier version of the paper.

Funding:

Authors are supported by the Swiss State Secretariat for Education, Research and Innovation (SERI) under contract number MB22.00026.

DOI:

10.4230/LIPIcs.ITCS.2026.112

Event:

17th Innovations in Theoretical Computer Science Conference (ITCS 2026)

Editor:

Shubhangi Saraf

Series and Publisher:

Leibniz International Proceedings in Informatics, Schloss Dagstuhl – Leibniz-Zentrum für Informatik

1 Introduction

Constant-depth (or ${\mathsf{AC}}^{0}$ ) de Morgan circuits is one of the circuit classes that is reasonably well-studied, even though strong exponential lower bounds of the form $2^{\Omega(n)}$ are still out of reach. This model can use unbounded $\wedge$ , $\vee$ and $\neg$ gates, and the underlying graph of a computation is a constant-depth tree; intuitively, these circuits represent computations that can be efficiently parallelized. There are known hard examples of functions that cannot be computed with small-size ${\mathsf{AC}}^{0}$ circuits. The most notable ones include Xor and Maj:

\textsc{Xor}(x_{1},\ldots,x_{n})\coloneqq x_{1}\oplus\dots\oplus x_{n};% \nobreak\ \nobreak\ \nobreak\ \nobreak\ \textsc{Maj}(x_{1},\ldots,x_{n})% \coloneqq\mathds{1}_{[x_{1}+\dots+x_{n}>n/2]}.

These functions require size $2^{\Omega(n^{1/(d-1)})}$ depth- $d$ ${\mathsf{AC}}^{0}$ circuits, and the lower bounds for them are achieved mainly with two techniques: random restrictions, or switching lemma, [13, 1, 42, 18, 19] and polynomial approximation [33, 38]. In particular, for circuits of depth $3$ , there is a lower bound of $2^{\Omega(\sqrt{n})}$ for both of these functions (for Xor it is known to be tight), and breaking through the $\sqrt{n}$ barrier in the exponent for any explicit function is a major open question. Proving strong exponential lower bounds for depth- $3$ circuits would essentially give a superpolynomial lower bound for general circuits [40, 14] which is a major open problem in complexity theory. Both switching lemma and polynomial approximation seem unable to give us such strong lower bounds.

Circuits with MOD Gates

The situation becomes more challenging in terms of lower bounds, when we plug-in hard functions for ${\mathsf{AC}}^{0}$ into our computational model. One of the most natural generalisations of ${\mathsf{AC}}^{0}$ circuits that follow this concept is ${\mathsf{AC}}^{0}[m]$ circuits, that can also utilise gates computing ${\mathsf{MOD}}_{m}$ defined as

{\mathsf{MOD}}_{m}(x_{1},\dots,x_{n})\coloneqq\mathds{1}_{[(x_{1}+\dots+x_{n})% \bmod m=0]}.

On the one hand, a lower bound for this model is necessary to show lower bounds for general circuits. On the other hand, showing such lower bounds is a challenging problem. For example, techniques based on random restrictions, such as switching lemma application, do not work quite as they do in ${\mathsf{AC}}^{0}$ , since ${\mathsf{MOD}}_{m}$ gates are not simplified after restriction. However, when $m$ is a prime power, polynomial approximation achieves lower bounds of the form $2^{n^{1/2d}}$ for Maj [33, 38], as well as for computing ${\mathsf{MOD}}_{q}$ for a prime power $q$ that is relatively prime with $m$ .

When $m$ is not a prime power, very little is known. In fact, utilising non-prime $m$ with many divisors, it is possible to compute any symmetric function in subexponential size even in depth $3$ [8]. The “minimal example” of the non-prime regime is ${\mathsf{AC}}^{0}[6]$ . It is still an open question to prove lower bounds for ${\mathsf{AC}}^{0}[6]$ , and the known techniques fail at resolving that. The reason for that is that polynomial approximation only works over fields, and there is no field with $6$ elements.

Even for the simplest example of ${\mathsf{MOD}}_{p}$ gates: ${\mathsf{MOD}}_{2}$ , or Xor, we are very far from understanding the exact power of ${\mathsf{AC}}^{0}[2]$ circuits. Allowing the use of the Xor function in the gates of a circuit increases its computational power; for example, depth- $4$ ${\mathsf{AC}}^{0}[2]$ circuits can compute Maj in size $2^{O(n^{1/4})}$ [29], whereas it requires $2^{\Omega(n^{1/3})}$ -size ${\mathsf{AC}}^{0}$ circuits.

The drawbacks of Razborov–Smolensky polynomial approximation method translate to the gaps in our understanding of the class ${\mathsf{AC}}^{0}[2]$ . In particular, we do not have strong correlation bounds against these circuits even for one of the simplest subclasses of these circuits: $\textsf{DNF}\circ\textsc{Xor}$ (depth- $3$ unbounded fan-in circuits of $(\land\circ\lor\circ\textsc{Xor})$ -type) [22]. Here, $\circ$ denotes the composition, and this means that Xor gates are only allowed in the bottom layer of the circuit. Moreover, the polynomial approximation method only applies to functions that require a large degree over $\mathbb{F}_{2}$ . Thus, it is unknown whether the inner product $\textsc{IP}_{n}(x,y)\coloneqq x_{1}y_{1}\oplus x_{2}y_{2}\oplus\dots\oplus x_{% n}y_{n}$ requires large ${\mathsf{AC}}^{0}$ circuits with an additional layer of parity gates in the bottom ( ${\mathsf{AC}}^{0}\circ\textsc{Xor}$ ). It is known that IP requires exponentially large $\textsf{DNF}\circ\textsc{Xor}$ circuits [24, 10], but even for $(\lor\circ\land\circ\lor\circ\textsc{Xor})$ -circuits the best known lower bound is $n^{2-o(1)}$ [9].

1.1 Top-down Approach

Overall, there is a clear shortage of new techniques in circuit complexity and, by extension, in adjacent areas, while the well-known ones also have the well-known drawbacks. In this work, we focus on studying another circuit lower bound technique, which falls under the umbrella of top-down methods.

Top-down lower bounds start from the output gate of a candidate circuit and move down the circuit in search of a mistake. While such an approach has been known for a long time, and there is a long line of work on top-down lower bounds for depth- $3$ ${\mathsf{AC}}^{0}$ circuits [4, 36, 27, 20, 35, 31, 32, 23, 30, 41, 6, 28, 14, 12, 15], it still remains largely underdeveloped. So far top-down lower bounds against ${\mathsf{AC}}^{0}$ are known only for circuits up to depth- $4$ [17], while bottom-up methods yield lower bounds for arbitrary constant depth (or even $\log n/\log\log n$ in some cases). The motivation for studying top-down comes from the fact that this method is complete for ${\mathsf{AC}}^{0}$ circuits (see discussion in [21, 15, 17]). In other words, there are no formal barriers that would prevent such an approach from being able to prove lower bounds in the regimes where other known methods cannot.

The main model to which top-down techniques are applicable is ${\mathsf{AC}}^{0}$ circuits, and the main example of a hard function is Xor. In this work, we attempt to adapt such techniques to be able to prove lower bounds for circuits with parity gates. We consider a subclass of ${\mathsf{AC}}^{0}\circ\textsc{Xor}$ which is strictly stronger than $\textsf{DNF}\circ\textsc{Xor}$ and prove lower bounds for it in a top-down fashion. In particular, we prove a lower bound for an affine disperser, which does not follow from Razborov–Smolensky method.

1.2 The Model and Results

In a recent paper Huang, Ivanov, and Viola [22] give an explanation of why the class $\lor\circ\land\circ\lor\circ\textsc{Xor}$ resists known lower bound techniques. They show that there is a circuit of this type that computes a very strong affine extractor: a function that is almost balanced on all large enough affine subspaces. On the other hand, it is known that affine extractors are hard for ${\mathsf{AC}}^{0}$ (by definition we know upper bounds on Fourier coefficients, which contradicts with spectrum concentration of small ${\mathsf{AC}}^{0}$ circuits obtained from switching lemma or polynomial approximation, see, for example [39]) and $\textsf{DNF}\circ\textsc{Xor}$ [10]. Moreover, [22] show that $\textsf{DNF}\circ\textsc{Xor}$ can compute a one-sided affine extractor: a balanced function that is never too biased towards zero on large enough affine subspaces. They then use the latter result to separate $\textsf{DNF}\circ\textsc{Xor}$ from ${\mathsf{AC}}^{0}\circ\textsc{Xor}$ that has at most $n$ distinct Xor gates (here $n$ is the number of variables). In other words, such a circuit is a composition of an ${\mathsf{AC}}^{0}$ -circuit and a non-singular affine transformation over $\mathbb{F}_{2}$ , or an ${\mathsf{AC}}^{0}\circ\mathbb{B}$ circuit, where $\mathbb{B}$ denotes the set of such affine transformations.

That circuit class can already compute arbitrary linear forms, but we consider a stronger model. Our model is essentially a union of ${\mathsf{AC}}^{0}$ and ${\mathsf{AC}}^{0}\circ\mathbb{B}$ within $\lor\circ\land\circ\lor\circ\textsc{Xor}$ .

Definition 1.

Let $C_{1},\dots,C_{N}$ be some constant depth de Morgan circuits and $A_{1},\dots,A_{N}$ be non-singular affine transformations. We then say that $\textsc{Or}\circ({\mathsf{AC}}^{0}\circ\mathbb{B})$ -circuit is $\bigvee_{i\in[N]}C_{i}\circ A_{i}$ , i.e. the disjunction of compositions of a constant-depth circuit and an affine map. The depth of a $\textsc{Or}\circ({\mathsf{AC}}^{0}\circ\mathbb{B})$ -circuit is the depth of $C_{1}\lor\dots\lor C_{N}$ . The size of a $\textsc{Or}\circ({\mathsf{AC}}^{0}\circ\mathbb{B})$ circuit is the total number of gates in $C_{1},\dots,C_{N}$ . If $N=1$ we denote the corresponding circuit type as ${\mathsf{AC}}^{0}\circ\mathbb{B}$ . We denote a depth- $3$ $\textsc{Or}\circ({\mathsf{AC}}^{0}\circ\mathbb{B})$ -circuit class by $\textsc{Or}\circ(\Pi^{0}\circ\mathbb{B})$ , this class is our main focus.

We remark that the choice of $\lor$ as the top gate is arbitrary, all the arguments could be handled with $\land$ on top (with hard functions changing appropriately).

Our primary motivation for studying this class is to develop a line of attack on subclasses of ${\mathsf{AC}}^{0}\circ\textsc{Xor}$ circuits. Along the way, we establish lower bounds for the class, making concrete progress in this direction. As a subclass of ${\mathsf{AC}}^{0}\circ\textsc{Xor}$ , $\textsc{Or}\circ(\Pi^{0}\circ\mathbb{B})$ also has a natural interpretation in a top-down framework, as it corresponds to specific assumptions allowed in the proof strategy. We discuss this in Section 2. It is worth noting that as $\textsf{DNF}\circ\textsc{Xor}$ is a subclass of $\textsc{Or}\circ(\Pi^{0}\circ\mathbb{B})$ , $\lor\circ\land\circ\lor\circ\textsc{Xor}$ is a subclass of $\textsc{Or}\circ\textsc{And}\circ(\Sigma^{0}\circ\mathbb{B})$ . So, strong average-case lower bounds against $\textsc{Or}\circ(\Pi^{0}\circ\mathbb{B})$ would also imply $\lor\circ\land\circ\lor\circ\textsc{Xor}$ lower bounds. In Section 4, we propose a roadmap of intermediate open questions which aims to extend this approach to eventually achieve lower bounds for stronger subclasses of ${\mathsf{AC}}^{0}\circ\textsc{Xor}$ and even ${\mathsf{AC}}^{0}[6]$ .

As a main result we present a general approach for proving lower bound for this model of computation. We give the highlights of the technique in Section 2. We now show the comparison of this model with classical models and state the lower bounds that we get.

The Comparison of the Models

Let us first observe that $\textsc{Or}\circ(\Pi^{0}\circ\mathbb{B})$ is properly larger than depth- $3$ ${\mathsf{AC}}^{0}\circ\mathbb{B}$ . Observe that $\textsf{DNF}\circ\textsc{Xor}$ is a special case of $\textsc{Or}\circ(\Pi^{0}\circ\mathbb{B})$ . The strict inclusion is then implied by the following.

Theorem 2 ([22]).

A function that admits a polynomial $\textsf{DNF}\circ\textsc{Xor}$ circuit may require an exponential ${\mathsf{AC}}^{0}\circ\mathbb{B}$ circuit.

Proof sketch.

Implied by the combination of Corollary 5 and Claim 21 in [22], the former shows that $\textsf{DNF}\circ\textsc{Xor}$ can compute functions for which there is a correlation bound for $(n-{\mathsf{poly}}(\log n))$ -depth parity decision trees (PDT), while the latter observes that the switching lemma applied to a ${\mathsf{AC}}^{0}\circ\mathbb{B}$ -circuit yields a $(n-\log^{\omega(1)}n)$ -depth PDT approximating the function. $\hfill\blacktriangleleft$

On the other hand, $\textsc{Or}\circ(\Pi^{0}\circ\mathbb{B})$ is properly larger than $\textsf{DNF}\circ\textsc{Xor}$ . Since $\textsc{Or}\circ(\Pi^{0}\circ\mathbb{B})$ contains CNF this statement is implied by the following Theorem.

Theorem 3.

Let $f\colon\{0,1\}^{n\times 3}\to\{0,1\}$ defined as $f(x)\coloneqq\bigwedge_{i\in[n]}(x_{i1}+x_{i2}+x_{i3}=1)$ where the sum is over $\mathbb{R}$ . Then any $\textsf{DNF}\circ\textsc{Xor}$ circuit computing $f$ has size $\Omega(1.5^{n})$ .

To the best of our knowledge, this is the simplest existing lower bound for $\textsf{DNF}\circ\textsc{Xor}$ . We include the proof in Section 3.4.

Affine Dispersers

$f\colon\{0,1\}^{n}\to\{0,1\}$ is a $(k,\varepsilon)$ -affine extractor if for every affine subspace $A\subseteq\{0,1\}^{n}$ (where we equate $\{0,1\}^{n}$ and $\mathbb{F}_{2}^{n}$ ) of dimension at least $k$ we have $|\Pr_{\bm{a}\sim A}[f(\bm{a})=1]-1/2|<\varepsilon$ . We say that $f$ is a $k$ -affine disperser if it is a $(k,1/2)$ affine extractor, i.e. $\Pr_{\bm{a}\sim A}[f(\bm{a})=1]\not\in\{0,1\}$ for every $k$ -dimensional affine subspace $A$ .

In our first result, we confirm that $\textsc{Or}\circ(\Pi^{0}\circ\mathbb{B})$ is smaller than $\lor\circ\land\circ\lor\circ\textsc{Xor}$ . Theorem 3 in [22] shows that polynomial-size $\lor\circ\land\circ\lor\circ\textsc{Xor}$ circuit computes affine extractors with polylogarithmic dimension (i.e. the function is close to being balanced in every affine subspace of at least polylogarithmic dimension). On the other hand, we prove the following theorem in Section 3.2.

Theorem 4 (informal).

If a function is not constant on any $n^{1/3-o(1)}$ -dimension affine subspace (i.e. it is an $n^{1/3-o(1)}$ -affine disperser) then it requires $\textsc{Or}\circ(\Pi^{0}\circ\mathbb{B})$ circuits of exponential size.

Theorem 3 in [22] implies that the polynomial approximation method can not prove Theorem 4, since it can not distinguish between a $\textsc{Or}\circ(\Pi^{0}\circ\mathbb{B})$ -circuit and a $\lor\circ\land\circ\lor\circ\textsc{Xor}$ -circuit, which contains some affine extractors.

Inner Product

The inner product function $\textsc{IP}_{n}\colon\{0,1\}^{n}\times\{0,1\}^{n}\to\{0,1\}$ defined as follows $\textsc{IP}_{n}(x,y)=\sum_{i\in[n]}x_{i}\cdot y_{i}\bmod 2$ . It is a big open problem to get a lower bound for the inner product in $\lor\circ\land\circ\lor\circ\textsc{Xor}$ . In Section 3.3 we show that the inner product function requires exponentially large $\textsc{Or}\circ({\mathsf{AC}}^{0}\circ\mathbb{B})$ circuits. The technique there is a combination of random restrictions with a top-down step.

Middle Slice

The middle slice function $\textsc{Mid}_{n}\colon\{0,1\}^{n}\to\{0,1\}$ is defined as $\textsc{Mid}_{n}(x)=\mathds{1}_{[|x|=n/2]}$ , i.e. it equals $1$ iff the input contains exactly $n/2$ ones. In Section 3.1 we prove via a top-down argument that this function requires $2^{\Omega(\sqrt{n})}$ -size $\textsc{Or}\circ(\Pi^{0}\circ\mathbb{B})$ circuits.

2 Technique

2.1 ${\mathsf{AC}}^{0}$ Top-down Lower Bounds

The general sketch of an ${\mathsf{AC}}^{0}$ top-down proof looks as follows.

$\blacksquare$

Consider a circuit $C=\bigvee_{i=0}^{s}C_{i}$ (the case of $\bigwedge$ is treated analogously). Suppose we want to prove that $C$ cannot distinguish certain sets of inputs $A$ and $B$ . Assume that, on the contrary, $C(A)=1$ and $C(B)=0$ .
$\blacksquare$

Note that $\bigcup_{i}C_{i}^{-1}(1)\supseteq A$ and also for every $i$ it holds that $C_{i}^{-1}(0)\supseteq B$ . We pick some $C_{i}$ such that $A_{i}\coloneqq C_{i}^{-1}(1)$ . Now, this is a circuit that separates $A_{i}$ and $B$ , and it has $\land$ as the top gate.
$\blacksquare$

We repeat the procedure until we arrive at a shallow enough circuit $C^{\prime}$ that is supposed to separate some sets $A^{\prime}$ and $B^{\prime}$ .
$\blacksquare$

We prove that $C^{\prime}$ cannot separate these sets. Note that if the original circuit $C$ made an error in computing $A$ and $B$ , there always exists a sequence of choices of subcircuits such that the error is traced until $C^{\prime}$ .

A “shallow enough circuit” could be, in principle, even a variable, but it turns out that there is a convenient way to argue about circuits of depth $2$ in this context. Moreover, for now we assume that these circuits of depth $2$ are CNFs/DNFs of width bounded by some parameter $k$ . At the end of this section, we discuss why this assumption is acceptable for the proper choice of $k$ .

The central notion for analyzing $k$ -CNFs and $k$ -DNFs is a $k$ -limit. The notion comes from [20], inspired by “limit vectors” from [37] as well as communication complexity techniques [25].

Definition 5 ([20]).

Let $A\subseteq\{0,1\}^{n}$ . $x\in\{0,1\}^{n}$ is a $k$ -limit of $A$ , if for any subset of indices $I\in\binom{[n]}{k}$ there is $y\in A$ such that $x_{I}=y_{I}$ .

Claim 6 ([20, 28, 17]).

If a $k$ -CNF formula $C$ accepts a set $A$ , then it accepts every $k$ -limit of $A$ .

At the same time, it is known that a $k$ -limit is a complete notion in the following sense.

Claim 7 ([20]).

Let $A\subseteq\{0,1\}^{n}$ be a set such that every $k$ -limit of $A$ belongs to $A$ . Then there exists a $k$ -CNF formula $C$ such that $C^{-1}(1)=A$ .

Proof.

We start from an empty CNF formula $C$ and gradually add clauses to it. Consider any $y\notin A$ . As it is not a $k$ -limit of $A$ , there exists a set $I\in\binom{[n]}{k}$ such that $y_{I}\neq a_{I}$ for any $a\in A$ . Let us add into $C$ a clause $D=\bigvee_{i\in I}x_{i}^{1-y_{i}}$ . This clause evaluates to $0$ on $y$ and evaluates to $1$ on any $a\in A$ .

We repeat the procedure for every $y\notin A$ . The resulting CNF evaluates to $0$ on every $y\notin A$ and evaluates to $1$ on any $a\in A$ , which proves the claim. $\hfill\vartriangleleft$

Note that a $k\text{-}\textsf{CNF}$ over $n$ variables has at most $(2n)^{k}=2^{O(k\log n)}$ clauses. The standard assumption for ${\mathsf{AC}}^{0}$ is that the bottom fan-in of the circuit is bounded by the logarithm of its size, so $k$ -limits are a tool that helps to prove lower bounds of the form $2^{k}$ . To be more precise, it follows from Claim 6 that if a set $A$ has a $k$ -limit outside of $A$ , then any CNF recognising it should have size $2^{\Omega(k)}$ , and it follows from Claim 7 that for any set $A$ containing all of its $k$ -limits there is a CNF of size $2^{O(k\log{n})}$ recognising $A$ . So there is a multiplicative gap of $\log{n}$ in the exponent between related lower bound and upper bound. In most cases, this is a negligible difference, but for some examples this might be important: for example, Maj function has a lower bound of $2^{\Omega\left(\sqrt{n}\right)}$ in depth- $3$ ${\mathsf{AC}}^{0}$ and an upper bound of $2^{O\left(\sqrt{n}\log{n}\right)}$ in the same model. Proving a $2^{\omega(\sqrt{n})}$ lower bound for Maj would beat all state-of-the-art lower bounds for depth- $3$ ${\mathsf{AC}}^{0}$ circuits.

Reducing Bottom Fan-in

Bottom-up ${\mathsf{AC}}^{0}$ lower bounds usually use the fact that bottom fan-in is bounded by a parameter $k$ . In most cases, $k$ can be made as small as $\log{s}$ , where $s$ is the size of the circuit. This is done by random restrictions, which kill the bottom layer gates with a big fan-in. In case of ${\mathsf{AC}}^{0}\circ\textsc{Xor}$ , this becomes more of a problem, since Xor survives under small restrictions.

In fact, handling the bottom fan-in can be done using top-down techniques. In [17], the lower bound is fully top-down in the sense that no random restrictions are used even for reducing the bottom fan-in, and we follow the same path. With the use of Xor gates, this becomes crucial. The idea is that instead of just one $k$ -limit, one needs to find many, so that wide clauses in a CNF could not reject all of them. We make this intuition precise:

Lemma 8.

Let $C$ be a CNF over $n$ variables of size $s$ and $A\subseteq C^{-1}(1)$ . Let $L\subseteq C^{-1}(0)$ be a set of $k$ -limits for $A$ . Then $s>|L|/2^{n-k}$ .

Proof.

Let $C=D_{1}\land\dots\land D_{s}$ . Then $C^{-1}(0)=\bigcup_{i\in[s]}D^{-1}_{i}(0)$ . Let $i\in[s]$ be the clause with the largest size of $D^{-1}_{i}(0)\cap L$ , this size is at least $|L|/s$ since $L\subseteq C^{-1}(0)$ . Now the width of $D_{i}$ must be larger than $k$ , since it distinguishes all $k$ -limits in $L\cap C^{-1}(0)$ from $A$ , hence $|L|/s\leq|D^{-1}_{i}(0)|<2^{n-k}$ . The claim then follows. $\hfill\blacktriangleleft$

2.2 Extending the Approach to Parity Gates

We can define the analogous notion for ${\mathsf{AC}}^{0}\circ\textsc{Xor}$ circuits.

Definition 9 ( $k$ -parity limit).

Let $A\subseteq\{0,1\}^{n}$ . $x$ is a $k$ -parity limit of $A$ , if for any affine subspace $L\subseteq\{0,1\}^{n}$ of co-dimension $k$ such that $x\in L$ it holds that $A\cap L\neq\emptyset$ .

The proof of the following claim is analogous to Claim 6. We include the proof for completeness.

Claim 10.

If a $k\text{-}\textsf{CNF}\circ\textsc{Xor}$ circuit $C$ accepts a set $A$ , then it accepts any $k$ -parity limit of $A$ .

Proof.

Let $x$ be the $k$ -parity limit of $A$ . Suppose $C$ rejects $x$ . Then there exists a particular $\textsc{Or}\circ\textsc{Xor}$ subcircuit of $C$ (denote it by $C^{\prime}$ ) such that $C^{\prime}(x)=0$ . Let $L$ be the affine subspace of co-dimension $k$ defined by zeroes of $C^{\prime}$ . Then $x\in L$ , and therefore $A\cap L\neq\emptyset$ . Then for some $y\in A$ it holds that $C^{\prime}(y)=0$ and therefore $C(y)=0$ , which is a contradiction with the assumption that $C$ accepts the whole $A$ . $\hfill\vartriangleleft$

As there are much more linear subspaces of co-dimension $k$ than clauses of width $k$ , while we can prove the analogue of Claim 7 for $k$ -parity limits, the resulting circuit would have huge size, so this approach would not give a meaningful size upper bound. One could ask a question: is it possible to prove a reasonable upper bound from non-existence of “outer” $k$ -parity limits?

Problem 11.

Is $k$ -parity limit complete in the following sense: if a set $A$ contains all of its $k$ -limits, then there is a $\textsf{CNF}\circ\textsc{Xor}$ circuit $C$ of width $k$ and size $2^{k^{O(1)}}$ such that it accepts exactly set $A$ ?

For proving lower bounds against ${\mathsf{AC}}^{0}\circ\textsc{Xor}$ , it is sufficient to find $k$ -parity limits only with respect to a fixed set of linear forms present in a circuit (subcircuit), but it would be interesting to know if the more general statement is true. Again, the existence of a $k$ -parity limit with respect to a fixed set of linear forms is a necessary condition for a lower bound.

The proof of the next claim is, again, analogous to Claim 6.

Claim 12.

Let $C=\bigwedge_{i=1}^{s}L_{i}$ be a ( $k$ - $\textsf{CNF})\circ\textsc{Xor}$ (here $L_{i}$ is such that $L_{i}^{-1}(0)$ is a co-dimension- $k$ affine subspace) and let $\mathcal{S}$ be a collection of affine subspaces of co-dimension $k$ such that $L_{i}^{-1}(0)\in\mathcal{S}$ for all $i\in[s]$ . Suppose that $C$ accepts $A$ . Consider $y$ such that for any $L\in\mathcal{S}$ , $y\in L$ implies that there exists $x\in A$ such that $x\in L$ . Then $C$ accepts $y$ .

Here $\mathcal{S}$ can be a collection of all affine subspaces of co-dimension $k$ , or it can be a smaller family of subspaces that still contains all linear systems used in a ( $k$ - $\textsf{CNF})\circ\textsc{Xor}$ . In other words, as the number of linear systems used in a circuit is bounded by its size, one can relax the definition of a $k$ -parity limit to be able to fool only these linear systems. One can also prove a variant of completeness for this weaker notion of $k$ -parity limits analogous to Claim 7.

Exponential size $\textsc{Or}\circ(\Pi^{0}\circ\mathbb{B})$ , in particular, can use exponential number of different linear forms, with the restriction that inside each $\textsf{CNF}\circ\textsc{Xor}$ subcircuit there are only $n$ different linear forms. Our results can be seen as finding $k$ -parity limits under these restrictions on the model. In top-down language, the extra Or on top symbolises that the first step down in the proof is oblivious to the actual parity gates that are used in the circuit. Now, just two such oblivious steps would imply lower bounds for $\lor\circ\land\circ\lor\circ\textsc{Xor}$ .

When comparing top-down lower bounds for plain ${\mathsf{AC}}^{0}$ and $\textsc{Or}\circ(\Pi^{0}\circ\mathbb{B})$ , the most important difference (and our main technical contribution) is the following: for $\textsc{Or}\circ(\Pi^{0}\circ\mathbb{B})$ , we should be able to construct sets such that we can find their $k$ -limits after any change of basis in $\mathbb{F}_{2}^{n}$ . Note that the hard functions in this case should be uncorrelated with affine subspaces, which, in particular, is not true for Xor function: after the appropriate change of basis, we could encode the value of the function in the first bit of the string in the new basis.

This seems like a natural step towards fully adapting the top-down approach for circuits with Xor gates. We discuss this further in Section 4.

2.3 Unpredictability and Local Limits

One of the most successful to date ways to find local limits is via unpredictability from partial information.

Let $X\subseteq\{0,1\}^{n}$ and $R\subseteq[n]$ . A pair $(Q,a)$ with $Q\subseteq[n]\setminus R$ and $a\in\{0,1\}^{Q}$ is a certificate for $R$ if there exists $b\in\{0,1\}^{R}$ such that whenever $x_{Q}=a$ for $x\in X$ , $x_{R}\neq b$ . In this case, we say that $x$ contains a certificate for $R$ , and the size of such certificate is $q\coloneqq|Q|$ . This notion was introduced in [28] who proved the following result for $|R|=1$ .

Lemma 13 (Bit unpredictability [28]).

Let $X\subseteq\{0,1\}^{n}$ have density $|X|/2^{n}\geq 2^{-d}$ . Then for any $q\geq 1$ ,

\Pr_{(\bm{x},\bm{i})\sim X\times[n]}[\,\text{$\bm{x}$ contains a size-$q$ certificate for $\bm{i}$ wrt $X$}\,]\nobreak\ \leq\nobreak\ O(dq/n).

More recently [17] generalized this result for $|R|>1$ .

Lemma 14 (Block unpredictability [17]).

Let $X\subseteq\{0,1\}^{n}$ have density $|X|/2^{n}\geq 2^{-d}$ . Then for any $q,r\geq 1$ ,

\Pr_{(\bm{x},\bm{R})\sim X\times\binom{n}{r}}[\,\text{$\bm{x}$ contains a size% -$q$ certificate for $\bm{R}$ wrt $X$}\,]\nobreak\ \leq\nobreak\ O(dqr/n)^{1/6}.

Bit and block unpredictability were used in top-down lower bounds for low-depth ${\mathsf{AC}}^{0}$ circuits as a way to extract local limits (Definition 5).

Lemma 15 ([28, 17]).

Suppose $x\in X$ does not contain any $q$ -certificates for $R$ wrt to $X$ . Then every $x^{\prime}$ such that $x_{[n]\setminus R}=x^{\prime}_{[n]\setminus R}$ is a $q$ -limit of $X$ .

Proof.

Suppose that there exists $x^{\prime}$ with $x^{\prime}_{[n]\setminus R}=x_{[n]\setminus R}$ that is not a $q$ -limit for $X$ , i.e. there exists a set $S\subseteq[n]$ of size $q$ such that for every $y\in X$ we have $x^{\prime}_{S}\neq y_{S}$ . Observe that $(S\setminus R,x_{S\setminus R})$ is then a certificate for $R$ : indeed for any $b\in\{0,1\}^{R}$ that agrees with $x^{\prime}_{S\cap R}$ we have that for every $y\in X$ we have $y_{R}\neq b$ . $\hfill\blacktriangleleft$

3 Lower Bounds

3.1 Middle Slice

In this section we prove the following.

Theorem 16.

Let $C$ be a circuit of the following form. $C\coloneqq C_{1}\lor C_{2}\lor\ldots\lor C_{N}$ where $C_{i}$ is a composition of a CNF $D_{i}$ and an affine transformation $A_{i}$ of full rank. Suppose that $C$ computes the characteristic function of the middle slice $\binom{[n]}{n/2}$ . Then the total size of CNFs $D_{1},\dots,D_{N}$ is at least $2^{\Omega(\sqrt{n})}$ .

The key ingredient in our proof is a density boosting lemma for affine subspaces. Following a similar definition for boolean subcubes in [3] we say that a set $X\subseteq S$ where $S$ is a vector space is $\theta$ -linearly spread in $S$ if for every affine subspace $A\subseteq S$ of co-dimension $d$ we have

\Pr_{\bm{x}\sim X}[\bm{x}\in A]\leq\theta^{-d}.

We remark that the range of interesting values for $\theta$ is $(1,2]$ , for $\theta\leq 1$ all sets are spread, for $\theta>2$ no set is spread. A similar, but different notion was used in [26]. The following is a new density boosting lemma, generalized for affine subspaces, which might be of independent interest. For boolean cubes, such a lemma was introduced in [16] and appears among others in [11].

Lemma 17.

For every set $X\subseteq\{0,1\}^{n}$ of size at least $2^{n-d}$ there exists an affine subspace $A$ of $\{0,1\}^{n}$ of co-dimension at most $d/(1-\log_{2}\theta)$ such that $X\cap A$ is $\theta$ -linearly spread in $A$ and $|X\cap A|\geq|X|\theta^{-d/(1-\log_{2}\theta)}$ .

Proof.

Suppose that $X$ is not $\theta$ -linearly spread in $\{0,1\}^{n}$ . Then consider the affine subspace of largest co-dimension $A$ that witnesses the lack of $\theta$ -spreadness of $X$ : suppose that $A$ has co-dimension $\ell$ then we have $|X\cap A|/|X|>\theta^{-\ell}$ .

We claim then that $X\cap A$ is $\theta$ -lineraly spread in $A$ . Indeed, suppose that it is not, i.e. there exists an affine subspace $B$ of $A$ of co-dimension $\ell^{\prime}$ (in $A$ ) such that $|X\cap B|/|X\cap A|>\theta^{-\ell^{\prime}}$ . Then $|X\cap B|/|X|>\theta^{-\ell^{\prime}-\ell}$ which contradicts the maximality of the co-dimension of $A$ .

Now it remains to bound the co-dimension of $A$ . On the one hand

\Pr_{\bm{x}\sim X}[\bm{x}\in A]=\sum_{y\in A}\Pr[\bm{x}=y]\leq|A|\cdot 2^{d-n}% =2^{(n-\ell)+d-n}=2^{d-\ell}.

On the other hand $\Pr_{\bm{x}\sim X}[\bm{x}\in A]>\theta^{-\ell}$ . Hence $d>\ell(1-\log_{2}\theta)$ . The lower bound on $|X\cap A|$ follows. $\hfill\blacktriangleleft$

Proof of Theorem 16.

Suppose for contradiction that $C^{-1}(1)=\binom{[n]}{n/2}$ , $N\leq 2^{\gamma\sqrt{n}}$ where $\gamma$ is a constant to choose later. Since $C^{-1}(1)=\bigcup_{i\in[N]}C_{i}^{-1}(1)$ , there exists $i_{0}\in[N]$ such that $|C_{i_{0}}^{-1}(1)|\geq|\binom{[n]}{n/2}|\cdot 2^{-\gamma\sqrt{n}}$ .

Thus, there exists a CNF $D$ and a full-rank linear transformation $A$ such that $X\coloneqq(D\circ A)^{-1}(1)$ has size at least $\binom{n}{n/2}\cdot 2^{-\gamma\sqrt{n}}\geq 2^{n-\gamma\sqrt{n}-\log n}$ and $X\subseteq\binom{[n]}{n/2}$ . Let us identify the linear transformation $A$ with the matrix in $\{0,1\}^{n\times n}$ defining it: $A(x)\coloneqq Ax$ .

First we apply Lemma 17 to the set $X$ with the parameter $\theta=\sqrt{2}$ . We get that there exists an affine space $B$ of co-dimension at most $2(\gamma\sqrt{n}+\log n)$ such that $X\cap B$ is $\theta$ -linearly spread in $B$ and $|X\cap B|\geq|X|/2^{\gamma\sqrt{n}+\log n}$ .

Applying Lemma 13 to the set $A(X\cap B)=\{Ax\mid x\in X\cap B\}$ with $q=5\gamma\sqrt{n}$ we get:

\Pr_{(\bm{x},\bm{i})\sim(X\cap B)\times[n]}[\text{$A\bm{x}$ does not contain a size-$q$ certificate for $\bm{i}$ wrt $X\cap B$}]\geq 1-O(\gamma% \sqrt{n}\cdot q/n).

Pick $\gamma$ such that this probability is at least $0.9$ . That is, with probability $0.9$ for a pair $(\bm{x},\bm{i})\sim(X\cap B)\times[n]$ we have by Lemma 15 that $A\bm{x}+e_{\bm{i}}$ (where $e_{i}=0^{i-1}10^{n-i}$ ) is a $q$ -limit of the set $A(X\cap B)$ . In order to invoke Lemma 8 we need to find many $q$ -limits in $D^{-1}(0)$ , i.e., outside of $A(X)$ .

Suppose that $Ax+e_{i}\in A(X)$ for some $(x,i)\in(X\cap B)\times[n]$ . Equivalently $x+A^{-1}e_{i}\in X$ . Then in particular $x+A^{-1}e_{i}\in\binom{[n]}{n/2}$ . Since $x\in X\subseteq\binom{[n]}{n/2}$ , this is only possible if $A^{-1}e_{i}$ has even hamming weight and in that case implies that $\langle x,A^{-1}e_{i}\rangle=(|\{j\in[n]\mid(A^{-1}e_{i})_{j}=1\}|/2)\bmod 2% \eqqcolon c_{i}$ .

In other words, if adding $A^{-1}e_{i}$ to $x$ preserves its Hamming weight $\frac{n}{2}$ , it means that exactly half of the indices of $1$ -coordinates in $A^{-1}e_{i}$ match up with indices of $1$ -coordinates in $x$ , which fixes the value of $\langle x,A^{-1}e_{i}\rangle$ to a constant $c_{i}$ .

Now consider the affine subspace $B^{\prime}_{i}\coloneqq\{y\in B\mid\langle y,A^{-1}e_{i}\rangle=c_{i}\}$ . If $A^{-1}e_{i}\in B^{\bot}\coloneqq\{x\in\{0,1\}^{n}\mid\forall y\in B,\langle x,% y\rangle\text{ is fixed}\}$ ( $B^{\bot}$ is the span of all linear constraints defining $B$ ), then $B^{\prime}_{i}=B_{i}$ or $B^{\prime}=\emptyset$ , otherwise $B^{\prime}_{i}$ has co-dimension $1$ in $B$ . Let $E$ be the event when $A^{-1}e_{\bm{i}}$ is in $B^{\bot}$ . We then have

\Pr[A\bm{x}+e_{\bm{i}}\in A(X)]\leq\Pr[A\bm{x}+e_{\bm{i}}\in A(X)\mid\lnot E]+% \Pr[E].

Since the co-dimension of $B$ (equivalently the dimension of $B^{\bot}$ ) is at most $2(n^{\gamma}+\log n)$ and $A^{-1}e_{1},\dots,A^{-1}e_{n}$ are linearly independent, $\Pr[E]\leq 2(\gamma\sqrt{n}+\log n)/n=o(1)$ . On the other hand since $A\bm{x}+e_{\bm{i}}\in A(X)$ implies that $\bm{x}\in B^{\prime}_{\bm{i}}$ and $X\cap B$ is $\theta$ -linearly spread in $B$ we get $\Pr[A\bm{x}+e_{\bm{i}}\in A(x)\mid\lnot E]\leq 1/\sqrt{2}.$ Therefore $\Pr[A\bm{x}+e_{\bm{i}}\not\in A(X)\land A\bm{x}+e_{\bm{i}}\text{ is a $q$-limit of }A(X\cap B)]\geq 0.9-1/\sqrt{2}-o(1),$ so there are $\Omega(|X\cap B|)$ $q$ -limits to $A(X\cap B)$ outside of $A(X)$ , hence by Lemma 8 we get that

|D|=\Omega(|X\cap B|)/2^{n-5\gamma\sqrt{n}}=\Omega(2^{n-2(\gamma\sqrt{n}+\log n% )}/2^{n-5\gamma\sqrt{n}})=\Omega(2^{\gamma\sqrt{n}})\ .

$\hfill\blacktriangleleft$

3.2 Affine Disperser

Theorem 18.

Let $\gamma$ be any constant in $(0,1/3)$ . Let $f\colon\{0,1\}^{n}\to\{0,1\}$ be an affine disperser for dimension $k\coloneqq n^{\gamma}$ such that $|f^{-1}(1)|\geq 2^{n-n^{\gamma}}$ and $C$ be a circuit of the form $C\coloneqq C_{1}\lor C_{2}\lor\ldots\lor C_{N}$ , where $C_{i}$ is a composition of a CNF $D_{i}$ and a linear transformation $A_{i}$ of full rank. If $C$ computes $f$ , then the total size of $C_{1},\dots,C_{N}$ is at least $2^{\Omega(n^{\gamma})}$ .

Proof.

We show that either $N\geq 2^{n^{\gamma}}$ or one of the CNFs $C_{1},\dots,C_{N}$ has size at least $2^{n^{\gamma}}$ , which yields the claim. Suppose $N<2^{n^{\gamma}}$ . Then there exists $C_{i}=D_{i}\circ A_{i}$ such that $|C_{i}^{-1}(1)|\geq|f^{-1}(1)|/2^{n^{\gamma}}\geq 2^{n-2n^{\gamma}}$ .

Let $X\coloneqq(D_{i}\circ A_{i})^{-1}(1)$ . For the set $A_{i}(X)$ we apply Lemma 14 with the following parameters:

$\blacksquare$

the density loss $t=2n^{\gamma}$ ;
$\blacksquare$

the size of certificate $q=\alpha n^{\gamma}$ ;
$\blacksquare$

the size of the unpredictable block $r=\beta n^{\gamma}$ .

The constants $\alpha,\beta$ are to be chosen later. It follows that with probability $1-O(n^{3\gamma-1})=1-o(1)$ for $\bm{x}\sim X$ and $\bm{R}\sim\binom{[n]}{r}$ there is no certificate of size $q$ in $A_{i}\bm{x}$ for $\bm{R}$ . Hence, by Lemma 15 all elements of

L_{\bm{x},\bm{R}}\coloneqq\{y\in\{0,1\}^{n}\mid y_{[n]\setminus\bm{R}}=\bm{x}_% {[n]\setminus\bm{R}}\}

are $q$ -limits of $A_{i}(X)$ with probability $1-o(1)$ . Let $E$ be the set of pairs $x,R\in X\times\binom{[n]}{r}$ for which this holds.

For $(x,R)\in E$ consider the set $A_{i}^{-1}(L_{x,R})$ . $L_{x,R}$ is an $r$ -dimensional affine subspace of $\{0,1\}^{n}$ , thus $A_{i}^{-1}(L_{x,R})$ is as well. As $f$ is an affine disperser, there is an input $y\in A_{i}^{-1}(L_{x,R})\cap f^{-1}(0)$ , hence $A_{i}\cdot y$ is not in $X$ and is a $q$ -limit of $X$ .

Now we need to count the number of $q$ -limits we got in order to invoke Lemma 8. Let $g\colon E\to\{0,1\}^{n}$ be the function mapping $(x,R)$ to $A_{i}\cdot y$ (to an arbitrary one, if there are several). Let us upper bound $|g^{-1}(z)|$ for an arbitrary $z\in\{0,1\}^{n}$ . Suppose $g(x,R)=z$ , let $y=(A_{i})^{-1}z$ , then $x_{[n]\setminus R}=y_{[n]\setminus R}$ , hence, there are at most $2^{r}\cdot\binom{[n]}{r}$ such preimages. Thus we get $|E|/(2^{r}\binom{[n]}{r})=|X|/2^{r}$ $q$ -limits in total. Therefore by Lemma 8 we get that $|D_{i}|\geq(1-o(1))2^{n-2n^{\gamma}}/(2^{r}\cdot 2^{n-q})\geq 2^{q-r-2n^{% \gamma}-1}\geq 2^{(\alpha-\beta-2)n^{\gamma}-1}$ . Hence for any $\alpha>\beta+2$ we get the desired bound. $\hfill\blacktriangleleft$

3.3 Inner Product

In this section, we give an exponential lower bound for $\textsc{Or}\circ({\mathsf{AC}}^{0}\circ\mathbb{B})$ -circuit size required to compute the inner product. Our proof is a combination of bottom-up techniques with one top-down-like step.

Theorem 19.

Let $C$ be a circuit of the form $C\coloneqq C_{1}\lor C_{2}\lor\dots\lor C_{N}$ where each $C_{i}$ is a composition of a $d$ -depth circuit $D_{i}$ composed with a full-rank affine mapping $A_{i}$ .

Suppose that $C$ computes $\textsc{IP}_{n}$ . Then the total size of circuits $D_{1},\dots,D_{N}$ is at least $2^{n^{\Omega(1/d)}}$ .

Proof.

Suppose for contradiction that the total size of all $D_{i}$ is at most $2^{n^{\varepsilon}}$ for $\varepsilon=o(1/d)$ . Then let us pick $\bm{\alpha}\sim\{0,1\}^{n}$ and apply the restriction $y=\bm{\alpha}$ to $C$ . Then $C|_{y=\bm{\alpha}}$ computes the function $\textsc{Xor}_{\bm{\alpha}}\coloneqq\bigoplus_{i\in[n]\colon\bm{\alpha}_{i}=1}x% _{i}$ and has the same form as before, disjunction of compositions of constant-depth circuits with affine transformations.

Let $D^{\prime}_{i},A^{\prime}_{i}$ be such that $C|_{y=\bm{\alpha}}=\bigvee_{i\in[N]}D^{\prime}_{i}\circ A^{\prime}_{i}$ . Since $\textsc{Xor}_{\bm{\alpha}}$ is balanced, there exists $i_{0}\in[N]$ such that $|(D^{\prime}_{i_{0}}\circ A^{\prime}_{i_{0}})^{-1}(1)|\geq 2^{|\alpha|-1-n^{% \varepsilon}}$ . On the other hand $(D^{\prime}_{i}\circ A^{\prime}_{i})^{-1}(0)\supseteq\textsc{Xor}_{\bm{\alpha}% }^{-1}(0)$ for all $i\in[N]$ . Then applying $(A^{\prime}_{i_{0}})^{-1}$ to $D^{\prime}_{i_{0}}\circ A^{\prime}_{i_{0}}$ and to $\textsc{Xor}_{\bm{\alpha}}$ on the right we get that the circuit $(D^{\prime}_{i_{0}})^{-1}(0)\supseteq\textsc{Xor}_{\bm{\alpha^{\prime}}}$ where $\bm{\alpha^{\prime}}=\bm{\alpha}\cdot(A^{\prime}_{i_{0}})^{-1}$ and since $A^{\prime}_{i_{0}}$ has full rank $|(D^{\prime}_{i_{0}})^{-1}(1)|=|(D^{\prime}_{i_{0}}\circ A^{\prime}_{i_{0}})^{% -1}(1)|\geq 2^{|\alpha|-1-n^{\varepsilon}}$ .

Now observe that
$\Pr[|\bm{\alpha}^{\prime}|\leq\sqrt{n}]=\Pr[\bm{\alpha}\text{ is a linear % combination of }\leq\sqrt{n}\text{ rows of }(A_{i_{0}}^{\prime})^{-1}]\leq N% \cdot\binom{|\alpha|}{\sqrt{n}}/2^{n}=o(1).$

Hence, there exists a depth- $d$ de Morgan circuit $D=D^{\prime}_{i_{0}}$ that computes parity $\textsc{Xor}_{\alpha_{0}}$ on $\sqrt{n}$ bits correctly on all $0$ -inputs and on at least $2^{-n^{\varepsilon}}$ -fraction of $1$ -inputs. Then let $\bm{y}_{1},\dots,\bm{y}_{M}\sim\textsc{Xor}_{\alpha_{0}}^{-1}(0)$ be independent random variables. Then the depth- $(d+1)$ circuit $E_{\bm{y}}(x)\coloneqq\bigvee_{i\in[M]}D(x\oplus\bm{y}_{i})$ computes the value of $\textsc{Xor}_{\alpha_{0}}$ correctly on an input $x$ with probability $1-(1-2^{-n^{\varepsilon}})^{M}$ , which exceeds $1-2^{-n}$ for $M>3n\cdot 2^{n^{\varepsilon}}$ , hence, there exists a setting of $y=y_{1},\dots,y_{M}$ such that $E_{y}$ computes $\textsc{Xor}_{\alpha_{0}}$ . Thus by [18] the size of $E_{y}$ is at least $2^{n^{\Omega(1/d)}}$ which means that $\varepsilon=\Omega(1/d)$ which is a contradiction. $\hfill\blacktriangleleft$

3.4 Proof of Theorem 3

A $\textsf{DNF}\circ\textsc{Xor}$ circuit computing $f$ is equivalent to a covering of $f^{-1}(1)$ by affine subspaces $f^{-1}(1)=\bigcup_{i\in[N]}A_{i}$ . Consider an arbitrary $A_{j}\subseteq\{0,1\}^{n\times 3}$ . Affine spaces over $\mathbb{F}_{2}$ are closed under sums of three elements, so let $a,b,c\in A_{j}$ . Then $d=a\oplus b\oplus c\in A_{j}$ . For $x\in f^{-1}(1)$ we have that for every $i\in[n]$ among $x_{i,1},x_{i,2},x_{i,3}$ exactly one value is $1$ and the other two are zeroes. For $x\in f^{-1}(1)$ we can define $\bar{x}\in[3]^{n}$ be such that for every $i\in[n]$ we have $x_{i,\bar{x}_{i}}=1$ and $x_{i,j}=0$ if $j\neq\bar{x}_{i}$ . Then since $d\in A_{i}\subseteq f^{-1}(1)$ for every $i\in[n]$ we have $|\{\bar{a}_{i},\bar{b}_{i},\bar{c}_{i}\}|<3$ , since otherwise $d_{i}=(1,1,1)$ which contradicts $f(d)=1$ . Since this is true for any $a,b,c\in A_{j}$ we get that there exists $\beta\in[3]^{n}$ such that for every $x\in A_{j}$ and for every $i\in[n]$ we have $x_{i,\beta(i)}=0$ . Since for every $x\in f^{-1}(1)$ we have $x_{i,1}+x_{i,2}+x_{i,3}=1$ we get that $|A_{j}|\leq 2^{n}$ . Since $|f^{-1}(1)|=3^{n}$ we get that $N\geq(3/2)^{n}$ , which completes the proof.

4 Discussion and Open Problems

In the results above, having an extra Or on top of the circuit, and only fixing the linear transformation in the subcircuits, can be interpreted in the following way. When implementing the top-down strategy, the first choice of the subcircuit (and the subset of $1$ -inputs, respectively), does not depend on the specific linear forms used in the circuit.

In other words, let $A=f^{-1}(1)$ and $B=f^{-1}(0)$ for one of the hard functions considered in the main section. Informally, we prove that for any covering of $A$ by no more $2^{n^{\varepsilon}}$ sets $A_{1},\dots,A_{2^{n^{\varepsilon}}}$ (for some $\varepsilon=\Omega(1)$ ) there is a choice of $A_{i}$ such that for any affine map $L$ there is a $k$ -limit for $A_{i}$ in $B$ with respect to that map. Note that for different affine maps, we might find different $k$ -limits. For proving lower bounds for $\lor\circ\land\circ\lor\circ\textsc{Xor}$ , we would need to prove a statement where the last two quantifiers are in a different order: there is a $k$ -limit that works for any affine map. Or, at least, for any affine map in a large enough collection of such. As mentioned in Section 2, this corresponds to making two “oblivious” steps down the circuit, where we do not use the knowledge of specific parity gates, while in our lower bound, we make one “oblivious” step.

Note that this is not true for the affine extractors in general, as there is an affine extractor computable by $\lor\circ\land\circ\lor\circ\textsc{Xor}$ circuits [22]. The middle slice function, however, could still be a good example for honing the top-down techniques.

The first natural step could be finding the same $k$ -limit with respect to an arbitrary pair of affine maps. This corresponds to the lower bounds in the following computational model.

Problem 20.

Prove top-down lower bounds for the class of $(2\text{-}\textsf{DNF})\circ(\textsf{CNF}\circ\mathbb{B})$ circuits.

In fact, that would already be a step towards another elusive circuit class .Another motivation for this open question comes from the $\textsc{Mod}_{6}$ perspective. A $\textsc{Mod}_{6}$ gate can be seen as a conjunction of $\textsc{Mod}_{2}$ and $\textsc{Mod}_{3}$ gates. After appropriately expanding the brackets, $\textsf{DNF}\circ\textsc{Mod}_{6}$ can be transformed to a $2$ -DNF of conjunctions such that in each conjunction there are only $\textsc{Mod}_{2}$ or only $\textsc{Mod}_{3}$ gates. So a first related problem would be to adapt the technique for $\textsc{Mod}_{3}$ .

Problem 21.

Prove top-down lower bounds for subclasses of ${\mathsf{AC}}^{0}\circ\textsc{Mod}_{3}$ .

The next step would be combining the two last problems together. Let $\mathbb{L}\subseteq\{f\colon\{0,1\}^{n}\to\{0,1\}^{n}\}$ be the union of linear maps over $\mathbb{F}_{2}$ ( $\mathbb{B}$ ) and maps $x\mapsto(\mathds{1}_{[(Ax)_{i}=a_{i}]})_{i\in[n]}$ where $A\in\mathbb{F}_{3}^{n\times n}$ and $a\in\mathbb{F}_{3}^{n}$ . In other words, we can choose a transformation of the inputs that either uses only Xor operations, or only $\textsc{Mod}_{3}$ operations.

Problem 22.

Prove lower bounds for $(2$ - $\textsf{DNF})\circ(\textsf{CNF}\circ\mathbb{L})$ circuits.

Solving this problem would imply lower bounds for $\textsf{DNF}\circ\textsc{Mod}_{6}$ .

Claim 23.

Let $f\colon\{0,1\}^{n}\rightarrow\{0,1\}$ be such that it is computable by $k\text{-}\textsf{DNF}\circ\textsc{Mod}_{6}$ -circuit $D$ of size $s$ . Then it is computable by $(2\text{-}\textsf{DNF})\circ(\textsf{CNF}\circ\mathbb{L})$ of size $O(s\cdot 2^{k}\cdot k)$ .

Proof.

See Appendix A. $\hfill\vartriangleleft$

When proving the results of this form, it all essentially boils down to finding a $k$ -(parity) limit. The two known techniques for this are (robust) sunflowers or spreadness [20, 17] and unpredictability [28, 17]. They both have certain downsides. For starters, these techniques only find $k$ -limits that are close to the set in Hamming distance (or, in the case of our result, they are close in Hamming distance after a certain affine transformation). In principle, this might not be the case.

Problem 24.

Let $A\subseteq\{0,1\}^{n}$ be a subset of a code with minimum distance $d=\Omega(n/\log{n})$ . Can you find a $\log^{2}(n)$ -limit of $A$ ?

When looking for a $k$ -limit, we can also ask for some structure of the considered set. Let us say that our set $A$ is a half of a $\sqrt{n}$ -wise independent set. From the results of Bazzi [5], Razborov [34], and Braverman [7], we know that roughly half the points of the whole boolean cube should be $n^{\varepsilon}$ -limits of the set $A$ . However, current techniques do not allow us to find some explicit $k$ -limit, assuming the knowledge of $A$ . One of the reasons for this is that the size of such sets can be as small as $2^{O(\sqrt{n}\log{n})}$ [2].

Problem 25.

Prove a top-down lower bound $2^{k^{\Omega(1)}}$ for separating two disjoint $k$ -wise independent sets by depth- $3$ ${\mathsf{AC}}^{0}$ circuits.

References

[1] Miklos Ajtai. $\Sigma^{1}_{1}$ -formulae on finite structures. Annals of Pure and Applied Logic, 24(1):1–48, 1983. doi:10.1016/0168-0072(83)90038-6.
[2] Noga Alon, László Babai, and Alon Itai. A fast and simple randomized parallel algorithm for the maximal independent set problem. J. Algorithms, 7(4):567–583, 1986. doi:10.1016/0196-6774(86)90019-2.
[3] Ryan Alweiss, Shachar Lovett, Kewen Wu, and Jiapeng Zhang. Improved bounds for the sunflower lemma. In Proceedings of the 52nd Annual ACM SIGACT Symposium on Theory of Computing, STOC 2020, pages 624–630, New York, NY, USA, 2020. Association for Computing Machinery. doi:10.1145/3357713.3384234.
[4] Theodore Baker and Alan Selman. A second step toward the polynomial hierarchy. Theoretical Computer Science, 8(2):177–187, 1979. doi:10.1016/0304-3975(79)90043-4.
[5] Louay M. J. Bazzi. Polylogarithmic independence can fool DNF formulas. SIAM J. Comput., 38(6):2220–2272, 2009. doi:10.1137/070691954.
[6] Elmar Böhler, Christian Glaßer, and Daniel Meister. Error-bounded probabilistic computations between MA and AM. Journal of Computer and System Sciences, 72(6):1043–1076, 2006. doi:10.1016/j.jcss.2006.05.001.
[7] Mark Braverman. Poly-logarithmic independence fools bounded-depth boolean circuits. Commun. ACM, 54(4):108–115, 2011. doi:10.1145/1924421.1924446.
[8] Brynmor Chapman and R. Ryan Williams. Smaller ACC0 circuits for symmetric functions. In Mark Braverman, editor, 13th Innovations in Theoretical Computer Science Conference, ITCS 2022, January 31 - February 3, 2022, Berkeley, CA, USA, volume 215 of LIPIcs, pages 38:1–38:19. Schloss Dagstuhl – Leibniz-Zentrum für Informatik, 2022. doi:10.4230/LIPIcs.ITCS.2022.38.
[9] Mahdi Cheraghchi, Elena Grigorescu, Brendan Juba, Karl Wimmer, and Ning Xie. AC ${}^{\mbox{0}}$ $\circ$ MOD ${}_{\mbox{2}}$ lower bounds for the boolean inner product. J. Comput. Syst. Sci., 97:45–59, 2018. doi:10.1016/J.JCSS.2018.04.006.
[10] Gil Cohen and Igor Shinkar. The complexity of DNF of parities. In Madhu Sudan, editor, Proceedings of the 2016 ACM Conference on Innovations in Theoretical Computer Science, Cambridge, MA, USA, January 14-16, 2016, pages 47–58. ACM, 2016. doi:10.1145/2840728.2840734.
[11] Sandro Coretti, Yevgeniy Dodis, Siyao Guo, and John Steinberger. Random oracles and non-uniformity. In Jesper Buus Nielsen and Vincent Rijmen, editors, Advances in Cryptology – EUROCRYPT 2018, pages 227–258, Cham, 2018. Springer International Publishing. doi:10.1007/978-3-319-78381-9_9.
[12] Peter Frankl, Svyatoslav Gryaznov, and Navid Talebanfard. A variant of the VC-dimension with applications to depth-3 circuits. In Proceedings of the 13th Conference on Innovations in Theoretical Computer Science (ITCS), volume 215, pages 72:1–72:19. Schloss Dagstuhl, 2022. doi:10.4230/LIPIcs.ITCS.2022.72.
[13] Merrick Furst, James Saxe, and Michael Sipser. Parity, circuits, and the polynomial-time hierarchy. Mathematical Systems Theory, 17(1):13–27, 1984. doi:10.1007/bf01744431.
[14] Alexander Golovnev, Alexander Kulikov, and Ryan Williams. Circuit depth reductions. In Proceedings of the 12th Conference on Innovations in Theoretical Computer Science (ITCS), volume 185, pages 24:1–24:20. Schloss Dagstuhl, 2021. doi:10.4230/LIPIcs.ITCS.2021.24.
[15] Mika Göös, Ziyi Guan, and Tiberiu Mosnoi. Depth-3 Circuits for Inner Product. In 48th International Symposium on Mathematical Foundations of Computer Science (MFCS 2023), volume 272, pages 51:1–51:12. Schloss Dagstuhl, 2023. doi:10.4230/LIPIcs.MFCS.2023.51.
[16] Mika Göös, Shachar Lovett, Raghu Meka, Thomas Watson, and David Zuckerman. Rectangles are nonnegative juntas. SIAM Journal on Computing, 45(5):1835–1869, 2016. doi:10.1137/15M103145X.
[17] Mika Göös, Artur Riazanov, Anastasia Sofronova, and Dmitry Sokolov. Top-down lower bounds for depth-four circuits. In 2023 IEEE 64th Annual Symposium on Foundations of Computer Science (FOCS), pages 1048–1055, 2023. doi:10.1109/FOCS57990.2023.00063.
[18] Johan Håstad. Almost optimal lower bounds for small depth circuits. In Proceedings of the Eighteenth Annual ACM Symposium on Theory of Computing, STOC ’86, pages 6–20, New York, NY, USA, 1986. Association for Computing Machinery. doi:10.1145/12130.12132.
[19] Johan Håstad. Computational Limitations for Small Depth Circuits. PhD thesis, MIT, 1987.
[20] Johan Håstad, Stasys Jukna, and Pavel Pudlák. Top-down lower bounds for depth-three circuits. Computational Complexity, 5(2):99–112, 1995. doi:10.1007/bf01268140.
[21] Suichi Hirahara. A duality between depth-three formulas and approximation by depth-two. Technical report, arXiv, 2017. arXiv:1705.03588.
[22] Xuangui Huang, Peter Ivanov, and Emanuele Viola. Affine Extractors and AC0-Parity. In Amit Chakrabarti and Chaitanya Swamy, editors, Approximation, Randomization, and Combinatorial Optimization. Algorithms and Techniques (APPROX/RANDOM 2022), volume 245 of Leibniz International Proceedings in Informatics (LIPIcs), pages 9:1–9:14, Dagstuhl, Germany, 2022. Schloss Dagstuhl – Leibniz-Zentrum für Informatik. doi:10.4230/LIPIcs.APPROX/RANDOM.2022.9.
[23] Russell Impagliazzo, Ramamohan Paturi, and Francis Zane. Which problems have strongly exponential complexity? Journal of Computer and System Sciences, 63(4):512–530, December 2001. doi:10.1006/jcss.2001.1774.
[24] Stasys Jukna. On graph complexity. Comb. Probab. Comput., 15(6):855–876, 2006. doi:10.1017/S0963548306007620.
[25] Mauricio Karchmer and Avi Wigderson. Monotone circuits for connectivity require super-logarithmic depth. SIAM J. Discret. Math., 3(2):255–265, 1990. doi:10.1137/0403021.
[26] Zander Kelley and Raghu Meka. Strong bounds for 3-progressions. In 64th IEEE Annual Symposium on Foundations of Computer Science, FOCS 2023, Santa Cruz, CA, USA, November 6-9, 2023, pages 933–973. IEEE, 2023. doi:10.1109/FOCS57990.2023.00059.
[27] Ker-I Ko. Separating and collapsing results on the relativized probabilistic polynomial-time hierarchy. Journal of the ACM, 37(2):415–438, 1990. doi:10.1145/77600.77623.
[28] Or Meir and Avi Wigderson. Prediction from partial information and hindsight, with application to circuit lower bounds. Computational Complexity, 28(2):145–183, 2019. doi:10.1007/s00037-019-00177-4.
[29] Igor Carboni Oliveira, Rahul Santhanam, and Srikanth Srinivasan. Parity helps to compute majority. In Amir Shpilka, editor, 34th Computational Complexity Conference, CCC 2019, July 18-20, 2019, New Brunswick, NJ, USA, volume 137 of LIPIcs, pages 23:1–23:17. Schloss Dagstuhl – Leibniz-Zentrum für Informatik, 2019. doi:10.4230/LIPIcs.CCC.2019.23.
[30] Ramamohan Paturi, Pavel Pudlák, Michael Saks, and Francis Zane. An improved exponential-time algorithm for $k$ -SAT. Journal of the ACM, 52(3):337–364, 2005. doi:10.1145/1066100.1066101.
[31] Ramamohan Paturi, Pavel Pudlak, and Francis Zane. Satisfiability coding lemma. Chicago Journal of Theoretical Computer Science, 5(1):1–19, 1999. doi:10.4086/cjtcs.1999.011.
[32] Ramamohan. Paturi, Michael Saks, and Francis Zane. Exponential lower bounds for depth three boolean circuits. computational complexity, 9(1):1–15, 2000. doi:10.1007/PL00001598.
[33] Alexander Razborov. Lower bounds on the size of bounded depth circuits over a complete basis with logical addition. Mathematical Notes of the Academy of Sciences of the USSR, 41(4):333–338, 1987. doi:10.1007/bf01137685.
[34] Alexander A. Razborov. A simple proof of bazzi’s theorem. ACM Trans. Comput. Theory, 1(1):3:1–3:5, 2009. doi:10.1145/1490270.1490273.
[35] Alexander Russell and Ravi Sundaram. Symmetric alternation captures BPP. Computational Complexity, 7(2):152–162, November 1998. doi:10.1007/s000370050007.
[36] Miklos Santha. Relativized Arthur–Merlin versus Merlin–Arthur games. Information and Computation, 80(1):44–49, 1989. doi:10.1016/0890-5401(89)90022-9.
[37] Michael Sipser. A topological view of some problems in complexity theory. In Michal Chytil and Václav Koubek, editors, Mathematical Foundations of Computer Science 1984, Praha, Czechoslovakia, September 3-7, 1984, Proceedings, volume 176 of Lecture Notes in Computer Science, pages 567–572. Springer, 1984. doi:10.1007/BFB0030341.
[38] Roman Smolensky. Algebraic methods in the theory of lower bounds for boolean circuit complexity. In Proceedings of the 19th Symposium on Theory of Computing (STOC). ACM Press, 1987. doi:10.1145/28395.28404.
[39] Avishay Tal. Tight bounds on the fourier spectrum of AC0. In Ryan O’Donnell, editor, 32nd Computational Complexity Conference, CCC 2017, July 6-9, 2017, Riga, Latvia, volume 79 of LIPIcs, pages 15:1–15:31. Schloss Dagstuhl – Leibniz-Zentrum für Informatik, 2017. doi:10.4230/LIPIcs.CCC.2017.15.
[40] Leslie G. Valiant. Graph-theoretic arguments in low-level complexity. In Jozef Gruska, editor, Mathematical Foundations of Computer Science 1977, 6th Symposium, Tatranska Lomnica, Czechoslovakia, September 5-9, 1977, Proceedings, volume 53 of Lecture Notes in Computer Science, pages 162–176. Springer, 1977. doi:10.1007/3-540-08353-7_135.
[41] Guy Wolfovitz. The complexity of depth-3 circuits computing symmetric boolean functions. Information Processing Letters, 100(2):41–46, October 2006. doi:10.1016/j.ipl.2006.06.008.
[42] Andrew Yao. Separating the polynomial-time hierarchy by oracles. In 26th Annual Symposium on Foundations of Computer Science (SFCS). IEEE, 1985. doi:10.1109/sfcs.1985.49.

Appendix A Proof of Claim 23

Consider a term of $D$ . We rewrite it as a $2$ -CNF using ${\mathsf{MOD}}_{2}$ and ${\mathsf{MOD}}_{3}$ gates:

	$\displaystyle\bigwedge_{i\in[k]}\left[\big(\sum_{j\in[n]}\alpha^{i}_{j}x_{j}% \big)\bmod 6=a_{i}\right]\land\bigwedge_{i\in[k]}\left[\big(\sum_{j\in[n]}% \beta^{i}_{j}x_{j}\big)\bmod 6\neq b_{i}\right]=$
	$\displaystyle\bigwedge_{i\in[k]}\left[\big(\sum_{j\in[n]}\alpha^{i}_{j}x_{j}% \big)\bmod 2=a_{i}\bmod 2\wedge\big(\sum_{j\in[n]}\alpha^{i}_{j}x_{j}\big)% \bmod 3=a_{i}\bmod 3\right]$
	$\displaystyle\land\bigwedge_{i\in[k]}\left[\big(\sum_{j\in[n]}\beta^{i}_{j}x_{% j}\big)\bmod 2\neq b_{i}\bmod 2\vee\big(\sum_{j\in[n]}\beta^{i}_{j}x_{j}\big)% \bmod 3\neq b_{i}\bmod 3\right]$

Here $\alpha,\beta\subseteq\mathbb{Z}_{6}^{k\times n}$ , $a,b\in\mathbb{Z}^{k}_{6}$ . A $2$ -CNF with $k$ terms can be transformed into a DNF of size $2^{k}\cdot k$ . Now, any term of that DNF has the following form:

	$\displaystyle\bigwedge_{i\in[k]}\left[\big(\sum_{j\in[n]}\gamma_{j}^{i}x_{j}% \big)\bmod 2=a_{i}\right]\wedge\bigwedge_{i\in[k]}\left[\big(\sum_{j\in[n]}% \delta_{j}^{i}x_{j}\big)\bmod 2\neq b_{i}\right]\wedge$
	$\displaystyle\bigwedge_{i\in[k]}\left[\big(\sum_{j\in[n]}\varepsilon_{j}^{i}x_% {j}\big)\bmod 3=c_{i}\right]\wedge\bigwedge_{i\in[k]}\left[\big(\sum_{j\in[n]}% \varphi_{j}^{i}x_{j}\big)\bmod 3\neq d_{i}\right]=$
	$\displaystyle\bigwedge_{i\in[k]}A(x)_{i}\wedge\bigwedge_{i\in[k]}B(x)_{i}$

Here $A$ and $B$ are transformations from $\mathbb{L}$ , $\gamma,\delta\subseteq\mathbb{F}_{2}^{k\times n};\varepsilon,\varphi\in\mathbb% {F}_{3}^{k\times n}$ and $a,b\in\mathbb{F}_{3}^{k}$ , $c,d\in\mathbb{F}_{3}^{k}$ . Overall, $f$ is then computable by a circuit of the following form:

\bigvee_{t\in D}\bigwedge_{i\in[k]}A_{t}(x)_{i}\wedge\bigwedge_{i\in[k]}B_{t}(% x)_{i}

This is a $(2\text{-}\textsf{DNF})\circ(\textsf{CNF}\circ\mathbb{L})$ circuit of no greater size than $O(s\cdot 2^{k}\cdot k)$ .

[bib.bib1] [1] Miklos Ajtai. $\Sigma^{1}_{1}$ -formulae on finite structures. Annals of Pure and Applied Logic, 24(1):1–48, 1983. doi:10.1016/0168-0072(83)90038-6.

[bib.bib2] [2] Noga Alon, László Babai, and Alon Itai. A fast and simple randomized parallel algorithm for the maximal independent set problem. J. Algorithms, 7(4):567–583, 1986. doi:10.1016/0196-6774(86)90019-2.

[bib.bib3] [3] Ryan Alweiss, Shachar Lovett, Kewen Wu, and Jiapeng Zhang. Improved bounds for the sunflower lemma. In Proceedings of the 52nd Annual ACM SIGACT Symposium on Theory of Computing, STOC 2020, pages 624–630, New York, NY, USA, 2020. Association for Computing Machinery. doi:10.1145/3357713.3384234.

[bib.bib4] [4] Theodore Baker and Alan Selman. A second step toward the polynomial hierarchy. Theoretical Computer Science, 8(2):177–187, 1979. doi:10.1016/0304-3975(79)90043-4.

[bib.bib5] [5] Louay M. J. Bazzi. Polylogarithmic independence can fool DNF formulas. SIAM J. Comput., 38(6):2220–2272, 2009. doi:10.1137/070691954.

[bib.bib6] [6] Elmar Böhler, Christian Glaßer, and Daniel Meister. Error-bounded probabilistic computations between MA and AM. Journal of Computer and System Sciences, 72(6):1043–1076, 2006. doi:10.1016/j.jcss.2006.05.001.

[bib.bib7] [7] Mark Braverman. Poly-logarithmic independence fools bounded-depth boolean circuits. Commun. ACM, 54(4):108–115, 2011. doi:10.1145/1924421.1924446.

[bib.bib8] [8] Brynmor Chapman and R. Ryan Williams. Smaller ACC0 circuits for symmetric functions. In Mark Braverman, editor, 13th Innovations in Theoretical Computer Science Conference, ITCS 2022, January 31 - February 3, 2022, Berkeley, CA, USA, volume 215 of LIPIcs, pages 38:1–38:19. Schloss Dagstuhl – Leibniz-Zentrum für Informatik, 2022. doi:10.4230/LIPIcs.ITCS.2022.38.

[bib.bib9] [9] Mahdi Cheraghchi, Elena Grigorescu, Brendan Juba, Karl Wimmer, and Ning Xie. AC ${}^{\mbox{0}}$ $\circ$ MOD ${}_{\mbox{2}}$ lower bounds for the boolean inner product. J. Comput. Syst. Sci., 97:45–59, 2018. doi:10.1016/J.JCSS.2018.04.006.

[bib.bib10] [10] Gil Cohen and Igor Shinkar. The complexity of DNF of parities. In Madhu Sudan, editor, Proceedings of the 2016 ACM Conference on Innovations in Theoretical Computer Science, Cambridge, MA, USA, January 14-16, 2016, pages 47–58. ACM, 2016. doi:10.1145/2840728.2840734.

[bib.bib11] [11] Sandro Coretti, Yevgeniy Dodis, Siyao Guo, and John Steinberger. Random oracles and non-uniformity. In Jesper Buus Nielsen and Vincent Rijmen, editors, Advances in Cryptology – EUROCRYPT 2018, pages 227–258, Cham, 2018. Springer International Publishing. doi:10.1007/978-3-319-78381-9_9.

[bib.bib12] [12] Peter Frankl, Svyatoslav Gryaznov, and Navid Talebanfard. A variant of the VC-dimension with applications to depth-3 circuits. In Proceedings of the 13th Conference on Innovations in Theoretical Computer Science (ITCS), volume 215, pages 72:1–72:19. Schloss Dagstuhl, 2022. doi:10.4230/LIPIcs.ITCS.2022.72.

[bib.bib13] [13] Merrick Furst, James Saxe, and Michael Sipser. Parity, circuits, and the polynomial-time hierarchy. Mathematical Systems Theory, 17(1):13–27, 1984. doi:10.1007/bf01744431.

[bib.bib14] [14] Alexander Golovnev, Alexander Kulikov, and Ryan Williams. Circuit depth reductions. In Proceedings of the 12th Conference on Innovations in Theoretical Computer Science (ITCS), volume 185, pages 24:1–24:20. Schloss Dagstuhl, 2021. doi:10.4230/LIPIcs.ITCS.2021.24.

[bib.bib15] [15] Mika Göös, Ziyi Guan, and Tiberiu Mosnoi. Depth-3 Circuits for Inner Product. In 48th International Symposium on Mathematical Foundations of Computer Science (MFCS 2023), volume 272, pages 51:1–51:12. Schloss Dagstuhl, 2023. doi:10.4230/LIPIcs.MFCS.2023.51.

[bib.bib16] [16] Mika Göös, Shachar Lovett, Raghu Meka, Thomas Watson, and David Zuckerman. Rectangles are nonnegative juntas. SIAM Journal on Computing, 45(5):1835–1869, 2016. doi:10.1137/15M103145X.

[bib.bib17] [17] Mika Göös, Artur Riazanov, Anastasia Sofronova, and Dmitry Sokolov. Top-down lower bounds for depth-four circuits. In 2023 IEEE 64th Annual Symposium on Foundations of Computer Science (FOCS), pages 1048–1055, 2023. doi:10.1109/FOCS57990.2023.00063.

[bib.bib18] [18] Johan Håstad. Almost optimal lower bounds for small depth circuits. In Proceedings of the Eighteenth Annual ACM Symposium on Theory of Computing, STOC ’86, pages 6–20, New York, NY, USA, 1986. Association for Computing Machinery. doi:10.1145/12130.12132.

[bib.bib19] [19] Johan Håstad. Computational Limitations for Small Depth Circuits. PhD thesis, MIT, 1987.

[bib.bib20] [20] Johan Håstad, Stasys Jukna, and Pavel Pudlák. Top-down lower bounds for depth-three circuits. Computational Complexity, 5(2):99–112, 1995. doi:10.1007/bf01268140.

[bib.bib21] [21] Suichi Hirahara. A duality between depth-three formulas and approximation by depth-two. Technical report, arXiv, 2017. arXiv:1705.03588.

[bib.bib22] [22] Xuangui Huang, Peter Ivanov, and Emanuele Viola. Affine Extractors and AC0-Parity. In Amit Chakrabarti and Chaitanya Swamy, editors, Approximation, Randomization, and Combinatorial Optimization. Algorithms and Techniques (APPROX/RANDOM 2022), volume 245 of Leibniz International Proceedings in Informatics (LIPIcs), pages 9:1–9:14, Dagstuhl, Germany, 2022. Schloss Dagstuhl – Leibniz-Zentrum für Informatik. doi:10.4230/LIPIcs.APPROX/RANDOM.2022.9.

[bib.bib23] [23] Russell Impagliazzo, Ramamohan Paturi, and Francis Zane. Which problems have strongly exponential complexity? Journal of Computer and System Sciences, 63(4):512–530, December 2001. doi:10.1006/jcss.2001.1774.

[bib.bib24] [24] Stasys Jukna. On graph complexity. Comb. Probab. Comput., 15(6):855–876, 2006. doi:10.1017/S0963548306007620.

[bib.bib25] [25] Mauricio Karchmer and Avi Wigderson. Monotone circuits for connectivity require super-logarithmic depth. SIAM J. Discret. Math., 3(2):255–265, 1990. doi:10.1137/0403021.

[bib.bib26] [26] Zander Kelley and Raghu Meka. Strong bounds for 3-progressions. In 64th IEEE Annual Symposium on Foundations of Computer Science, FOCS 2023, Santa Cruz, CA, USA, November 6-9, 2023, pages 933–973. IEEE, 2023. doi:10.1109/FOCS57990.2023.00059.

[bib.bib27] [27] Ker-I Ko. Separating and collapsing results on the relativized probabilistic polynomial-time hierarchy. Journal of the ACM, 37(2):415–438, 1990. doi:10.1145/77600.77623.

[bib.bib28] [28] Or Meir and Avi Wigderson. Prediction from partial information and hindsight, with application to circuit lower bounds. Computational Complexity, 28(2):145–183, 2019. doi:10.1007/s00037-019-00177-4.

[bib.bib29] [29] Igor Carboni Oliveira, Rahul Santhanam, and Srikanth Srinivasan. Parity helps to compute majority. In Amir Shpilka, editor, 34th Computational Complexity Conference, CCC 2019, July 18-20, 2019, New Brunswick, NJ, USA, volume 137 of LIPIcs, pages 23:1–23:17. Schloss Dagstuhl – Leibniz-Zentrum für Informatik, 2019. doi:10.4230/LIPIcs.CCC.2019.23.

[bib.bib30] [30] Ramamohan Paturi, Pavel Pudlák, Michael Saks, and Francis Zane. An improved exponential-time algorithm for $k$ -SAT. Journal of the ACM, 52(3):337–364, 2005. doi:10.1145/1066100.1066101.

[bib.bib31] [31] Ramamohan Paturi, Pavel Pudlak, and Francis Zane. Satisfiability coding lemma. Chicago Journal of Theoretical Computer Science, 5(1):1–19, 1999. doi:10.4086/cjtcs.1999.011.

[bib.bib32] [32] Ramamohan. Paturi, Michael Saks, and Francis Zane. Exponential lower bounds for depth three boolean circuits. computational complexity, 9(1):1–15, 2000. doi:10.1007/PL00001598.

[bib.bib33] [33] Alexander Razborov. Lower bounds on the size of bounded depth circuits over a complete basis with logical addition. Mathematical Notes of the Academy of Sciences of the USSR, 41(4):333–338, 1987. doi:10.1007/bf01137685.

[bib.bib34] [34] Alexander A. Razborov. A simple proof of bazzi’s theorem. ACM Trans. Comput. Theory, 1(1):3:1–3:5, 2009. doi:10.1145/1490270.1490273.

[bib.bib35] [35] Alexander Russell and Ravi Sundaram. Symmetric alternation captures BPP. Computational Complexity, 7(2):152–162, November 1998. doi:10.1007/s000370050007.

[bib.bib36] [36] Miklos Santha. Relativized Arthur–Merlin versus Merlin–Arthur games. Information and Computation, 80(1):44–49, 1989. doi:10.1016/0890-5401(89)90022-9.

[bib.bib37] [37] Michael Sipser. A topological view of some problems in complexity theory. In Michal Chytil and Václav Koubek, editors, Mathematical Foundations of Computer Science 1984, Praha, Czechoslovakia, September 3-7, 1984, Proceedings, volume 176 of Lecture Notes in Computer Science, pages 567–572. Springer, 1984. doi:10.1007/BFB0030341.

[bib.bib38] [38] Roman Smolensky. Algebraic methods in the theory of lower bounds for boolean circuit complexity. In Proceedings of the 19th Symposium on Theory of Computing (STOC). ACM Press, 1987. doi:10.1145/28395.28404.

[bib.bib39] [39] Avishay Tal. Tight bounds on the fourier spectrum of AC0. In Ryan O’Donnell, editor, 32nd Computational Complexity Conference, CCC 2017, July 6-9, 2017, Riga, Latvia, volume 79 of LIPIcs, pages 15:1–15:31. Schloss Dagstuhl – Leibniz-Zentrum für Informatik, 2017. doi:10.4230/LIPIcs.CCC.2017.15.

[bib.bib40] [40] Leslie G. Valiant. Graph-theoretic arguments in low-level complexity. In Jozef Gruska, editor, Mathematical Foundations of Computer Science 1977, 6th Symposium, Tatranska Lomnica, Czechoslovakia, September 5-9, 1977, Proceedings, volume 53 of Lecture Notes in Computer Science, pages 162–176. Springer, 1977. doi:10.1007/3-540-08353-7_135.

[bib.bib41] [41] Guy Wolfovitz. The complexity of depth-3 circuits computing symmetric boolean functions. Information Processing Letters, 100(2):41–46, October 2006. doi:10.1016/j.ipl.2006.06.008.

[bib.bib42] [42] Andrew Yao. Separating the polynomial-time hierarchy by oracles. In 26th Annual Symposium on Foundations of Computer Science (SFCS). IEEE, 1985. doi:10.1109/sfcs.1985.49.

Lower Bounds Beyond DNF of Parities

Abstract

Keywords and phrases:

Copyright and License:

2012 ACM Subject Classification:

Related Version:

Acknowledgements:

Funding:

DOI:

Event:

Editor:

Series and Publisher:

1 Introduction

Circuits with MOD Gates

1.1 Top-down Approach

1.2 The Model and Results

Definition 1.

The Comparison of the Models

Theorem 2 ([22]).

Proof sketch.

Theorem 3.

Affine Dispersers

Theorem 4 (informal).

Inner Product

Middle Slice

2 Technique

2.1 𝗔𝗖𝟎 Top-down Lower Bounds

Definition 5 ([20]).

Claim 6 ([20, 28, 17]).

Claim 7 ([20]).

Proof.

Reducing Bottom Fan-in

Lemma 8.

Proof.

2.2 Extending the Approach to Parity Gates

Definition 9 (k-parity limit).

Claim 10.

Proof.

Problem 11.

Claim 12.

2.3 Unpredictability and Local Limits

Lemma 13 (Bit unpredictability [28]).

Lemma 14 (Block unpredictability [17]).

Lemma 15 ([28, 17]).

Proof.

3 Lower Bounds

3.1 Middle Slice

Theorem 16.

Lemma 17.

Proof.

Proof of Theorem 16.

3.2 Affine Disperser

Theorem 18.

Proof.

3.3 Inner Product

Theorem 19.

Proof.

3.4 Proof of Theorem 3

4 Discussion and Open Problems

Problem 20.

Problem 21.

Problem 22.

Claim 23.

Proof.

Problem 24.

Problem 25.

References

Appendix A Proof of Claim 23

2.1 ${\mathsf{AC}}^{0}$ Top-down Lower Bounds

Definition 9 ( $k$ -parity limit).