On the Automatability of Tree-Like k-DNF Resolution

Carenini, Gaia; de Rezende, Susanna F.

doi:10.4230/LIPIcs.CCC.2025.14

Gaia Carenini

University of Cambridge, UK

Susanna F. de Rezende

Lund University, Sweden

Abstract

A proof system $\mathcal{P}$ is said to be automatable in time $f(N)$ if there exists an algorithm that, given as input an unsatisfiable formula $F$ , outputs a refutation of $F$ in the proof system $\mathcal{P}$ in time $f(N)$ , where $N$ is the size of the smallest $\mathcal{P}$ -refutation of $F$ plus the size of $F$ . Atserias and Bonet (ECCC 2002) observed that tree-like $k$ -DNF resolution is automatable in time $N^{c\cdot k\log N}$ for a universal constant $c$ . We show that, under the randomized exponential-time hypothesis (rETH), this is tight up to an $O(\log k)$ -factor in the exponent, i.e., we prove that tree-like $k$ -DNF resolution, for $k$ at most logarithmic in the number of variables of $F$ , is not automatable in time $N^{o\left((k/\log k)\cdot\log N\right)}$ unless rETH is false. Our proof builds on the non-automatability results for resolution by Atserias and Müller (FOCS 2019), for algebraic proof systems by de Rezende, Göös, Nordström, Pitassi, Robere and Sokolov (STOC 2021), and for tree-like resolution by de Rezende (LAGOS 2021).

Keywords and phrases:

Proof Complexity, Tree-like $k$ -DNF Resolution, Automatability

Funding:

Gaia Carenini: Trinity College CB European PhD Studentship.

Susanna F. de Rezende: Knut and Alice Wallenberg grants KAW 2018.0371 and KAW 2021.0307, ELLIIT, and the Swedish Research Council grant 2021-05104.

2012 ACM Subject Classification:

Theory of computation

Acknowledgements:

The authors wish to thank Noel Arteche and Jonas Conneryd for feedback on the presentation. We are also grateful to the anonymous reviewers for their careful reading of our manuscript and their many insightful comments and suggestions. Part of this work was done while G. Carenini was a student at the Department of Computer Science of Ecole Normale Supérieure – PSL Research University, Paris, France.

DOI:

10.4230/LIPIcs.CCC.2025.14

Event:

40th Computational Complexity Conference (CCC 2025)

Editors:

Srikanth Srinivasan

Series and Publisher:

Leibniz International Proceedings in Informatics, Schloss Dagstuhl – Leibniz-Zentrum für Informatik

1 Introduction

A central question in proof complexity concerns the difficulty of finding proofs: given a theorem, can we efficiently find a proof of it in a given proof system? This question – whether proof search is automatable – was first raised informally by Gödel in a 1956 letter to von Neumann [39] and has been studied formally since the 1990s when Bonet, Pitassi and Raz introduced the notion of automatability. A proof system $\mathcal{P}$ is said to be automatable in time $f(N)$ if there exists an algorithm that, given an unsatisfiable formula $F$ , outputs a refutation of $F$ in $\mathcal{P}$ in time $f(N)$ , where $N$ denotes the sum of the size of the smallest $\mathcal{P}$ -refutation of $F$ and the size of $F$ .

A series of results in the late 1990s and early 2000s established non-automatability of various proof systems under standard complexity-theoretic and cryptographic assumptions [7, 26, 28, 12, 1, 11, 5, 2, 21]. While cryptographic assumptions remain the only known way to prove non-automatability of Frege systems beyond $AC^{0}$ -Frege [3], recent years have seen renewed interest in understanding the automatability of weaker systems [31, 6, 23, 22, 8, 18, 16, 25, 33]. This resurgence is largely due to a breakthrough result by Atserias and Müller [6], who showed via a clever – and in hindsight natural – reduction that automating resolution is NP-hard.

A key frontier in this line of research involves understanding the automatability of tree-like proof systems. These systems correspond to natural search algorithms that explore the solution space via branching heuristics. Thus, non-automatability results for such systems translate into lower bounds for SAT solvers implementing these heuristics. A canonical example is tree-like resolution, where derived clauses may only be used once to derive new clauses. Robinson [37] showed that tree-like resolution is polynomially equivalent to non-deterministic DPLL [15, 14] algorithms.

The DPLL algorithm searches for satisfying assignments to a CNF formula $F$ by recursively branching on the assignment of a variable. A natural generalization of this algorithm is to allow branching on Boolean functions that depend only on a small number of variables. Such algorithms correspond to the tree-like $k$ -DNF resolution proof system (Res( $k$ )), a generalization of tree-like resolution where up to $k$ literals may be resolved at once.

Automating tree-like proof systems involves efficiently identifying good branching choices. This is often challenging, and many tree-like proof systems are not known to be automatable even in randomized subexponential time. The exceptions, for which non-trivial automating algorithms are known, are tree-like resolution and tree-like $k$ -DNF resolution: Beame and Pitassi [7] proposed a simple automating algorithm for tree-like resolution running in time $N^{O(\log N)}$ , which was later adapted by Atserias and Bonet [4] to show that tree-like $k$ -DNF resolution is automatable in time $N^{c\cdot k\log N}$ , for a universal constant $c$ , independent of $k$ . This latter result holds even when $k$ is a function of the number of variables of the formula.

The first evidence that tree-like resolution is not automatable in polynomial time was given by Alekhnovich and Razborov [2]. Their result, combined with a derandomization procedure from [20], implies that if tree-like resolution is automatable in polynomial time, then ${\text{W[P]}}={\text{FPT}}$ . More recently, Mertz, Pitassi and Wei [31] proved that automating tree-like resolution in time $N^{O(\log^{1/7-\epsilon}\log N)}$ would refute the Exponential Time Hypothesis (ETH). Finally, de Rezende [16, 17] improved this to show that automating tree-like resolution in time $N^{O(\log^{1-\epsilon}N)}$ contradicts ETH, and doing so in time $N^{o(\log N)}$ contradicts randomized ETH [19].

For most other tree-like systems, the picture remains unclear. Even though non-automatability results akin to those of [16] are plausible, it is not known whether they would be tight. In most cases, we do not know of any non-trivial automating algorithm. This leaves open the question of whether we can prove better non-automatability results for other tree-like proof systems.

1.1 Our result

We prove our result under the randomized exponential time hypothesis, which we introduce next. The exponential time hypothesis (ETH) is a strengthening of the ${\text{P}}\neq{\text{NP}}$ conjecture, introduced by Impagliazzo and Paturi [24], which states that 3- ${\mathsf{SAT}}$ cannot be solved in subexponential time. More formally, it states that there exists an $\varepsilon>0$ such that 3- ${\mathsf{SAT}}$ for $n$ -variate formulas cannot be decided in deterministic time $2^{\varepsilon n}$ . The randomized exponential time hypothesis (rETH), introduced by Dell et al. [19], is instead a strengthening of the ${\text{NP}}\not\subseteq{\mathsf{BPP}}$ conjecture. It asserts that there exist $\varepsilon>0$ and $c>1$ such that 3- ${\mathsf{SAT}}$ for $n$ -variate formulas with $c n$ clauses cannot be decided in randomized time $2^{\varepsilon n}$ .

Our main result is that for any function $k=k(n)\leq\log n$ , if tree-like Res( $k$ ) is automatable in time $N^{o\left(\frac{k}{\log k}\cdot\log N\right)}$ , then rETH is false. We defer the definition of the Res( $k$ ) proof system to Section 2, but note here that in this work – and also in many others – the argument $n$ of the function $k(n)$ is the number of variables in the formula (and not the size of the proof, as in Krajíček’s original definition [27]).

Theorem 1 (Informal).

For any function $k={k}(n)\leq\log n$ , if tree-like Res( $k$ ) is automatable on $n$ -variate CNF formulas in time $N^{o\left(\frac{k}{\log k}\cdot\log N\right)}$ , then rETH is false.

This implies that, under rETH, the algorithm of [4] that automates tree-like Res( $k$ ) in time $N^{O\left(k\cdot\log N\right)}$ is tight up to a $\log k$ factor in the exponent. Previously, the best known lower bound under rETH, for both tree-like resolution and tree-like Res( $k$ ), was $N^{\Omega\left(\log N\right)}$ [16].

Theorem 1 is a consequence of the following result, which is our main technical contribution.

Theorem 2 (Main result).

There exists a randomized algorithm $\mathcal{A}$ that given as input a CNF formula $F$ on $n$ variables and ${\mathsf{poly}}(n)$ clauses and an integer $k=O(n/\log^{2}n)$ , runs in time $2^{O(q)}$ , where $q=\sqrt{(n\log 2k)/k}$ , and outputs a CNF formula $\mathcal{A}(F,k)$ that, except with negligible probability, satisfies:

1. if $F$ is satisfiable, then $\mathcal{A}(F,k)$ admits a tree-like Res( $k$ ) refutation of size $2^{O(q)}$ ; and

2. if $F$ is unsatisfiable, then $\mathcal{A}(F,k)$ requires a tree-like ${\text{Res}}({k^{\prime}})$ refutation of size $2^{\Omega(n)}$ , for all $k^{\prime}=o(\exp(q)/n)$ .

The proof that Theorem 1 follows from Theorem 2 is quite standard. We refer the reader to the full version of this paper for details. We now focus on the strategy and the new ideas required to prove Theorem 2.

1.2 Prior related work

The overall structure of the proof of Theorem 2 follows the general strategy of Atserias and Müller [6], and uses ideas from [18] and [16]. In what follows, we briefly summarize these results, and after that we discuss the challenges encountered in proving our main theorem and the new ideas needed to overcome them.

The main technical theorem in [6], which yields the NP-hardness of automating resolution, is given below.

Theorem 3 ([6]).

There is a polynomial-time algorithm $\mathcal{A}$ that on input an $n$ -variate CNF formula $F$ outputs a CNF formula $\mathcal{A}(F)$ such that

$\blacksquare$ If $F$ is satisfiable, then $\mathcal{A}(F)$ admits a resolution refutation of size at most $n^{O(1)}$ .

$\blacksquare$ If $F$ is unsatisfiable, then $\mathcal{A}(F)$ requires resolution refutations of size at least $2^{n^{\Omega(1)}}$ .

The starting point of the algorithm $\mathcal{A}$ is to build the formula $\mathrm{Ref}(F,s)$ that encodes, in a natural way, that $F$ has a (DAG-like) resolution refutation of size at most $s$ . The idea of using this formula to obtain a large gap between refutation sizes, depending on whether $F$ is satisfiable or not, comes from Pudlák [34], inspired by Razborov [36]. Pudlák [34] showed that if $F$ is satisfiable then $\mathrm{Ref}(F,s)$ has a short resolution refutation. Intuitively, this refutation uses a satisfying assignment $x^{*}$ for $F$ as a guide to find an incorrect step in the purported refutation of $F$ , which must exist since $x^{*}$ satisfies all clauses in $F$ but falsifies the final empty clause.

Proving a lower bound for $\mathrm{Ref}(F,s)$ when $F$ is unsatisfiable turns out to be more challenging and, to address this, Atserias and Müller [6] define a relativized or lifted version of $\mathrm{Ref}(F,s)$ for which they can prove the lower bound and, at the same time, show that the upper bound still holds. In [18] a similar formula is defined and the result is extended to the algebraic proof systems polynomial calculus and Nullstellensatz. The main novelty of [18] is to show that the lower bound can be proven by a reduction from the pigeonhole principle. Finally, in [16], it is shown that, unless rETH is false, it is not possible to automate tree-like resolution in time $n^{o(\log n)}$ . The main idea of this result is to define a blown-up version of $\mathrm{Ref}(F,s)$ of size $\exp(\Theta(\sqrt{n}))$ such that, when $F$ is satisfiable, there is a tree-like resolution refutation of this formula of size $\exp(\Theta(\sqrt{n}))$ .

1.3 Our technical contribution

In most of the follow-up works to [6], the main technical challenge was to show that, when $F$ is unsatisfiable, there exist no small refutations of $\mathcal{A}(F)$ for some proof system $\mathcal{P}$ , which was typically stronger than resolution. For tree-like resolution, however, the main difficulty is quite different. Pudlák’s upper bound [34] for the case where $F$ is satisfiable is inherently DAG-like, and the best known tree-like resolution refutations are of exponential size. In fact, we do not expect significantly smaller tree-like resolution refutations of $\mathrm{Ref}(F,s)$ to exist. In particular, if there were such refutations of polynomial size, then the algorithm of [7] that automates tree-like resolution in quasi-polynomial time would imply that NP is in quasi-polynomial time. Overcoming this limitation requires a substantial modification to the structure of $\mathcal{A}(F)$ .

The key insight of [16] is that one can (randomly) construct a much larger formula, denoted $\mathrm{ShallowRef}(F)$ , of size $\exp(\Theta(\sqrt{n}))$ , such that, when $F$ is satisfiable, $\mathrm{ShallowRef}(F)$ admits a tree-like resolution refutation also of size $\exp(\Theta(\sqrt{n}))$ .¹ Together with an exponential lower bound in the case $F$ is unsatisfiable, this construction yields that automating tree-like resolution in time $n^{o(\log n)}$ would lead to a randomized algorithm for SAT running in time $\exp(o(n))$ .

¹ In [16], it is mistakenly stated that this can be achieved without randomness. However, the construction of $\mathrm{ShallowRef}(F)$ in [16] relies on polynomially unbalanced expander graphs with constant degree, and these are only known to be constructible in randomized polynomial time.

This non-automatability result also extends to tree-like Res( $k$ ). However, for this system, the best known automating algorithm – due to Atserias and Bonet [4] – runs in time $n^{c\cdot k\log n}$ . To show that this upper bound is (nearly) tight, we must construct a smaller formula which also admits a smaller tree-like Res( $k$ ) refutation when $F$ is satisfiable. In order to understand the challenge we face in obtaining a better non-automatability result for tree-like Res( $k$ ), we need to describe in more detail the formula $\mathrm{ShallowRef}(F)$ , which we do in Section 2. For now, we note that [16] actually shows that we can construct formulas $\mathcal{A}(F)$ with the following trade-off: for any size parameter $\log n\leq s\leq n$ , it is possible to construct a formula $\mathcal{A}_{s}(F)$ of size $2^{O(s)}$ such that the upper bound for refuting $\mathcal{A}_{s}(F)$ when $F$ is satisfiable is $2^{O(n/s)}$ . In order to obtain our non-automatability result for tree-like Res( $k$ ), however, we would need that both the formula $\mathcal{A}(F)$ and the upper bound be of size roughly $2^{O(\sqrt{n/k})}$ .

We present the intuition behind the construction of $\mathcal{A}(F)$ in the next section. One of the core technical challenges is efficiently representing and accessing objects of size $2^{\sqrt{nk}}$ within a formula whose size is only $2^{\sqrt{n/k}}$ . We would like to represent such objects via pigeonhole principle instances, but the naive encoding would produce a formula of size $2^{\sqrt{nk}}$ . The key insight is to instead use small circuits that compute the neighbor function of an unbalanced expander graph used for defining the pigeonhole principle instances. These circuits act as shortcuts for accessing the larger set of objects, allowing us to simulate the effect of having access to $2^{\sqrt{nk}}$ items within a formula of size $2^{\sqrt{n/k}}$ .

For the lower bound, our argument builds on the strategy in [18], which relies on a low-depth reduction from the pigeonhole principle. However, in our case, the formula incorporates different types of unbalanced expanders, making it difficult to reduce from a single pigeonhole principle instance, as was done previously. To address this, we introduce a generalized reduction that maps multiple formulas to a single formula. We note that even in [18], using these generalized reductions would have slightly improved the parameters, making them match those of [6].

Organization of the paper

The paper is organized as follows. Section 2 recalls the necessary concepts in proof complexity and graph theory, formally states Theorem 1, and provides the intuition guiding the construction of the CNF formula $\mathcal{A}(F)$ . Section 3 is devoted to the definition of the formula $\mathcal{A}(F)$ , which we denote $\mathrm{ShortcutRef}_{k}(F)$ . Section 4 proves the upper bound in Theorem 2. Section 5 briefly outlines the proof of the lower bound in Theorem 2. Finally, Section 6 concludes the paper with some related open problems.

2 Preliminaries and proof overview

In this section we state Theorem 1 formally and show why it follows from Theorem 2. We then present the idea behind the construction of the formula output by the algorithm $\mathcal{A}$ of Theorem 2. Before starting, we need to introduce some notation and definitions.

For a positive integer $n$ , the set $\{1,\dots,n\}$ is denoted by $[n]$ . Given a Boolean variable $x$ ranging over $\{0,1\}$ , its positive literal is $x$ (sometimes denoted as $x^{1}$ ) and its negative literal is $\overline{x}$ (sometimes denoted as $x^{0}$ ). A clause is a disjunction of literals $\ell_{1}\vee\dots\vee\ell_{t}$ and a term is a conjunction of literals $\ell_{1}\wedge\dots\wedge\ell_{t}$ . We view clauses and terms as sets: the order of the literals is irrelevant, and we can assume there are no repetitions.

A CNF formula $\{C_{1},\dots,C_{m}\}$ is a conjunction of clauses $C_{1}\wedge\dots\wedge C_{m}$ and a DNF formula is a disjunction of terms $T_{1}\vee\dots\vee T_{m}$ . A $k$ -CNF formula is a CNF formula where every clause has at most $k$ literals and analogously a $k$ -DNF formula is a DNF formula where every term has at most $k$ literals. We say that a variable $x$ appears in a clause $C$ or in a term $T$ if a literal over $x$ is an element of $C$ or of $T$ , respectively. A formula is satisfiable if there exists an assignment to the Boolean variables of the formula that sets its value to $1$ ; otherwise it is said to be unsatisfiable.

The $k$ -DNF resolution proof system, denoted by Res( $k$ ), is a proof system where each line is a $k$ -DNF and new lines can be derived from previous ones using any of the following inference rules:

Weakening: $\displaystyle\frac{A}{A\vee B}$

Cut: $\displaystyle\frac{A\vee(\overline{\ell_{1}}\vee\dots\vee\overline{\ell_{r}})\qquad B\vee(\ell_{1}\wedge\dots\wedge\ell_{r})}{A\vee B}$

$\wedge$ -introduction: $\displaystyle\frac{A\vee\bigwedge_{\ell\in L_{A}}\ell\qquad B\vee\bigwedge_{\ell\in L_{B}}\ell}{A\vee B\vee\bigwedge_{\ell\in L_{A}\cup L_{B}}\ell}$

$\wedge$ -elimination: $\displaystyle\frac{A\vee(\ell_{1}\wedge\dots\wedge\ell_{r})}{A\vee\ell_{i}}\quad i\in[r]$

where $A$ and $B$ are $k$ -DNF formulas, $L_{A}$ and $L_{B}$ are sets of literals such that $|L_{A}\cup L_{B}|\leq k$ , $\ell_{i}$ are literals, and $r\leq k$ . We recall that in this work, when $k$ is a function $k(n)$ , the argument $n$ of the function is the number of variables in the formula (and not the size of the proof, as in Krajíček’s original definition of $k$ -DNF resolution, which he denotes $R(k)$ [27]).

A Res( $k$ ) derivation from a CNF formula ${F}$ is a sequence of $k$ -DNF formulas $(D_{1},\dots,D_{s})$ such that, for each $i\in[s]$ , $D_{i}$ either belongs to ${F}$ or follows from preceding $D_{j}$ s by an application of one of the inference rules. A Res( $k$ ) refutation of ${F}$ is a Res( $k$ ) derivation from ${F}$ where the final line is the empty clause, denoted $\bot$ . The size of a Res( $k$ ) derivation $\Pi=(D_{1},\dots,D_{s})$ is $s$ and is denoted by $|\Pi|$ . A refutation $\Pi$ is said to be tree-like if every $D_{i}$ appearing in $\Pi$ is used in at most one derivation step.

The resolution proof system corresponds to ${\text{Res}}(1)$ and has only two inference rules:

Weakening: $\displaystyle\frac{A}{A\vee B}$

Cut: $\displaystyle\frac{A\vee{x}\qquad B\vee\overline{x}}{A\vee B}$

where $A$ and $B$ are clauses, and $x$ is a variable.
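The two resolution rules and the tree-like condition can be illustrated with a small sketch. The representation below (clauses as frozen sets of `(variable, polarity)` pairs, and the helper names `weaken`, `cut` and `is_tree_like`) is an assumption made purely for illustration and does not come from the paper.

```python
from typing import FrozenSet, List, Tuple

# A literal is (variable, polarity): ("x", True) is x, ("x", False) is its negation.
Literal = Tuple[str, bool]
Clause = FrozenSet[Literal]


def weaken(a: Clause, b: Clause) -> Clause:
    """Weakening: from A derive A ∨ B."""
    return a | b


def cut(left: Clause, right: Clause, var: str) -> Clause:
    """Cut: from A ∨ x and B ∨ ¬x derive A ∨ B."""
    pos, neg = (var, True), (var, False)
    if pos not in left or neg not in right:
        raise ValueError(f"cut over {var!r} does not apply to these clauses")
    return (left - {pos}) | (right - {neg})


def is_tree_like(premises: List[Tuple[int, ...]]) -> bool:
    """premises[i] holds the indices of the lines used to derive line i
    (empty for axioms). Tree-like: no line is used as a premise twice."""
    uses = [0] * len(premises)
    for used in premises:
        for j in used:
            uses[j] += 1
    return all(count <= 1 for count in uses)


# Refuting the toy formula F = {x, ¬x ∨ y, ¬y}.
d0: Clause = frozenset({("x", True)})
d1: Clause = frozenset({("x", False), ("y", True)})
d2 = cut(d0, d1, "x")  # the clause y
d3: Clause = frozenset({("y", False)})
d4 = cut(d2, d3, "y")  # the empty clause
refutation_premises = [(), (), (0, 1), (), (2, 3)]
```

In this toy refutation every line is used at most once, so it is tree-like; a derivation that reuses a line would have to rederive it (as a separate copy) to stay tree-like.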

2.1 Formal main result

We are now ready to state Theorem 1 formally.

Theorem 4.

Let ${k}:\mathbb{N}^{+}\rightarrow\mathbb{N}^{+}$ be a non-decreasing function where ${k}(x)\leq\log x$ . If there exists an algorithm $\mathbb{A}_{k}$ such that given as input an unsatisfiable $n$ -variate CNF formula $\mathcal{F}$ , the algorithm outputs in time $N^{o\left(\frac{k}{\log 2k}\cdot\log N\right)}$ a Res( $k$ ) refutation of $\mathcal{F}$ , where $k={k}(n)$ and $N$ is the size of $\mathcal{F}$ plus the size of the smallest Res( $k$ ) refutation of $\mathcal{F}$ , then rETH is false.

The proof of this theorem follows from Theorem 2 in a rather straightforward way. The only aspect that needs some care is to choose the $k$ that should be given as input to the randomized algorithm $\mathcal{A}$ of Theorem 2. We leave the details of this proof to the full-length version of this paper.
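As a rough guide to why the parameters line up, the following back-of-the-envelope calculation may help. It is only a heuristic sketch: it treats $k$ as fixed, suppresses constants, and ignores the issue mentioned above of choosing the right $k$ relative to the number of variables of the output formula.

```latex
\begin{align*}
q &= \sqrt{\frac{n\log 2k}{k}}, \qquad N = 2^{O(q)} \ \Longrightarrow\ \log N = O(q),\\
N^{o\left(\frac{k}{\log 2k}\cdot\log N\right)}
  &= 2^{o\left(\frac{k}{\log 2k}\cdot\log^{2} N\right)}
   = 2^{o\left(\frac{k}{\log 2k}\cdot q^{2}\right)}
   = 2^{o(n)}.
\end{align*}
```

So, when $F$ is satisfiable, such an automating algorithm would halt on $\mathcal{A}(F,k)$ within $2^{o(n)}$ steps, whereas when $F$ is unsatisfiable it cannot even write down a refutation in that time, since every refutation has size $2^{\Omega(n)}$ . Running the algorithm with a timeout would then decide satisfiability of $F$ in randomized time $2^{o(n)}$ .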

2.2 Expander graphs used to construct the formula $\mathcal{A}(F)$

Before we present the intuition behind the construction of the formula $\mathcal{A}(F)$ , we introduce some graph theory notation and define the expander graphs that we use to construct the formula $\mathcal{A}(F)$ . We refer the reader to, e.g., [10] for missing basic definitions in graph theory.

In this paper, graphs $G=(V,E)$ are simple and undirected. We often identify the vertex set $V$ of a graph with the subset $[|V|]$ of $\mathbb{N}$ , and assume that the vertices of the graph inherit the standard ordering of $\mathbb{N}$ . A $d$ -regular graph is a graph where every vertex has degree exactly $d$ . In the context of bipartite graphs, we say that $\Gamma=(L\mathop{\dot{\cup}}R,E)$ is a left $d$ -regular graph (resp., right $d$ -regular graph) if and only if all its vertices in $L$ (resp., $R$ ) have degree $d$ . Given a graph $G=(V,E)$ , and a subset of its vertex set $S\subseteq V$ , we define the neighborhood of $S$ , denoted as $\mathcal{N}(S)$ , to be the set of all vertices in $V$ which are adjacent to some vertex in $S$ . Formally, $\mathcal{N}(S)=\{v\in V:\exists u\in S,\{u,v\}\in E\}$ . We assume that the neighbors of a vertex are ordered according to an arbitrary yet fixed ordering. We define the following function that allows us to access the neighbors of a vertex.

Definition 5 (Neighbor function).

Let $\Gamma=([m]\mathop{\dot{\cup}}[n],E)$ be a left $d$ -regular bipartite graph. The neighbor function of $\Gamma$ , denoted as $f_{\Gamma}:[m]\times[d]\rightarrow[n]$ , is the function that takes as input a vertex $v\in[m]$ and an index $i\in[d]$ and returns as output the $i$ th neighbor of $v$ .
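For a small, explicitly given graph, the neighbor function is just an indexed lookup. The sketch below assumes the graph is stored as adjacency lists for the left vertices; the name `make_neighbor_function` and the toy graph are hypothetical and serve only to illustrate the definition.

```python
from typing import Callable, Dict, List


def make_neighbor_function(adj: Dict[int, List[int]], d: int) -> Callable[[int, int], int]:
    """Build f_Gamma from adjacency lists of a left d-regular bipartite graph.

    adj[v] lists the right-neighbors of left vertex v; the list order is taken
    as the arbitrary-but-fixed ordering of the neighbors.
    """
    for v, nbrs in adj.items():
        if len(nbrs) != d or len(set(nbrs)) != d:
            raise ValueError(f"left vertex {v} does not have exactly {d} distinct neighbors")

    def f_gamma(v: int, i: int) -> int:
        if not 1 <= i <= d:
            raise IndexError("neighbor index must lie in [d]")
        return adj[v][i - 1]  # indices are 1-based, matching [d] = {1, ..., d}

    return f_gamma


# Toy graph with m = 3 left vertices, n = 3 right vertices and d = 2.
toy_adj = {1: [1, 2], 2: [2, 3], 3: [1, 3]}
f_toy = make_neighbor_function(toy_adj, d=2)
```

An explicit table like this has size proportional to $m\cdot d$ , which is exactly what becomes unaffordable when the left side is very large and motivates computing the neighbor function by a small circuit instead.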

We now define the types of expander graphs which appear in this work.

Definition 6 (Bipartite expander graph).

A $(m,n,d,\gamma,\alpha)$ -bipartite expander graph is a left $d$ -regular bipartite graph $\Gamma=([m]\cup[n],E)$ where for every subset $S\subseteq[m]$ such that $|S|\leq\gamma$ , the neighborhood of $S$ satisfies $|\mathcal{N}(S)|\geq\alpha|S|$ . If we have the stronger condition that there are at least $\alpha|S|$ vertices in $\mathcal{N}(S)$ that are adjacent to exactly one vertex in $S$ , then we say that $\Gamma$ is a $(m,n,d,\gamma,\alpha)$ -bipartite unique-neighbor expander.
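On tiny graphs, both conditions of the definition can be checked by brute force over all small left subsets. The following sketch does exactly that; it takes exponential time in general and is meant only to make the two notions concrete. The function names and the example graph are assumptions for illustration.

```python
from collections import Counter
from itertools import combinations
from typing import Dict, List


def _neighbor_counts(adj: Dict[int, List[int]], subset) -> Counter:
    """Count, for each right vertex, how many vertices of `subset` it is adjacent to."""
    return Counter(u for v in subset for u in adj[v])


def is_expander(adj: Dict[int, List[int]], gamma: int, alpha: float) -> bool:
    """Check |N(S)| >= alpha * |S| for every left subset S with 1 <= |S| <= gamma."""
    left = sorted(adj)
    for size in range(1, min(gamma, len(left)) + 1):
        for subset in combinations(left, size):
            if len(_neighbor_counts(adj, subset)) < alpha * size:
                return False
    return True


def is_unique_neighbor_expander(adj: Dict[int, List[int]], gamma: int, alpha: float) -> bool:
    """Check that every such S has at least alpha * |S| unique neighbors,
    i.e. right vertices adjacent to exactly one vertex of S."""
    left = sorted(adj)
    for size in range(1, min(gamma, len(left)) + 1):
        for subset in combinations(left, size):
            counts = _neighbor_counts(adj, subset)
            unique = sum(1 for c in counts.values() if c == 1)
            if unique < alpha * size:
                return False
    return True


# Toy left 2-regular graph with m = 3 left and n = 4 right vertices.
small_adj = {1: [1, 2], 2: [3, 4], 3: [1, 3]}
```

On this toy graph the pair of left vertices 1 and 3 has three neighbors but only two unique ones, so the graph expands with factor 1.5 in the ordinary sense but not in the unique-neighbor sense.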

We are interested in unbalanced bipartite expander graphs with small left degree. For our purposes, we require more than the existence of such expander graphs – we need there to be small circuits that compute their neighbor functions and, moreover, these circuits must be constructible in randomized polynomial time.

Theorem 7.

There is an absolute constant $C$ and a randomized algorithm $\mathcal{A}$ that given as inputs $k,n\in\mathbb{N}$ , outputs in time $O(n\cdot(k\log n)^{C})$ a circuit $\mathcal{C}_{n,k}:[n^{k}]\times[d]\rightarrow[n]$ , for $d=12k$ , that computes the neighbor function of a graph $\Gamma_{n,k}$ that with probability at least $1-1/n^{\Omega(1)}$ is a $(n^{k},n,d,\sqrt{n},d/3)$ -bipartite unique-neighbor expander.

The proof of this theorem uses known constructions of $k$ -independent functions, and can be found in the full-length version of this paper. We are now ready to describe the main ideas needed for the construction of $\mathcal{A}(F)$ .

2.3 Intuition for construction of $\mathcal{A}(F)$

Let $F$ be a CNF formula over variables $x_{1},\ldots,x_{n}$ with ${\mathsf{poly}}(n)$ clauses. The $\mathrm{Ref}(F,s)$ formula encodes that there is a resolution refutation of $F$ of size $s$ , which is usually chosen to be a polynomial in $n$ . As mentioned earlier, Pudlák’s [34] upper bound for $\mathrm{Ref}(F,s)$ when $F$ is satisfiable, which was adapted in [6], is intrinsically DAG-like. We do not expect there to be polynomial-size tree-like $\mathrm{Res}(k)$ refutations of $\mathrm{Ref}(F,s)$ for satisfiable $F$ since, due to the quasi-polynomial algorithm that automates tree-like $\mathrm{Res}(k)$ , this would imply that it is possible to solve SAT in quasi-polynomial time. It may be the case that tree-like $\mathrm{Res}(k)$ requires size at least $2^{\Omega(n)}$ to refute $\mathrm{Ref}(F,s)$ , regardless of whether $F$ is satisfiable and, therefore, it seems that non-automatability results for tree-like $\mathrm{Res}(k)$ are unlikely to be obtained using this formula.

In order to describe the formula $\mathrm{ShortcutRef}_{k}(F)$ we first describe a simplified version of $\mathrm{ShortcutRef}_{1}(F)$ , which is inspired by the formula $\mathrm{ShallowRef}(F)$ from [16]. Consider the trivial tree-like resolution refutation of a formula $F$ , denoted by $\Pi$ , corresponding to the refutation of depth $n$ that has $2^{n}$ leaves, one for every width- $n$ clause (each of which is a weakening of some axiom), and that resolves over variables in reverse order, that is, $x_{n}$ then $x_{n-1}$ , etc., until it resolves over $x_{1}$ to derive the empty clause. We view the clauses of this refutation as arranged in layers, and we say that all clauses that are obtained by resolving over variable $x_{i}$ are in layer $i$ .
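The leaves of the trivial refutation can be enumerated explicitly for small $n$ . The sketch below lists the $2^{n}$ width- $n$ clauses and, for each, finds an axiom of $F$ that it weakens; this succeeds for every leaf exactly when $F$ is unsatisfiable. The clause encoding and the name `trivial_refutation_leaves` are illustrative assumptions, not notation from the paper.

```python
from itertools import product
from typing import Dict, FrozenSet, List, Tuple

# A literal is (variable index, polarity): (i, True) is x_i, (i, False) is its negation.
Clause = FrozenSet[Tuple[int, bool]]


def trivial_refutation_leaves(clauses: List[Clause], n: int) -> Dict[Clause, int]:
    """Map each of the 2^n width-n clauses to the index of an axiom it weakens.

    Raises ValueError if some width-n clause weakens no axiom, which happens
    exactly when the corresponding assignment satisfies F.
    """
    leaves: Dict[Clause, int] = {}
    for assignment in product([False, True], repeat=n):
        # The unique width-n clause falsified by `assignment`.
        wide = frozenset((i + 1, not assignment[i]) for i in range(n))
        # An axiom C is weakened to `wide` iff C is a subset of `wide`.
        axiom = next((j for j, c in enumerate(clauses) if c <= wide), None)
        if axiom is None:
            raise ValueError("F is satisfiable: some leaf is not a weakening of an axiom")
        leaves[wide] = axiom
    return leaves


# Toy unsatisfiable formula F = {x1, ¬x1 ∨ x2, ¬x2} on n = 2 variables.
toy_formula: List[Clause] = [
    frozenset({(1, True)}),
    frozenset({(1, False), (2, True)}),
    frozenset({(2, False)}),
]
toy_leaves = trivial_refutation_leaves(toy_formula, n=2)
```

The number of leaves doubles with each additional variable, which is why the full refutation $\Pi$ cannot be written down directly and has to be squeezed into a smaller structure.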

We can create a formula stating that this tree-like resolution refutation fits into a smaller structure as follows (see Figure 1). The variables of this formula are partitioned into blocks, one block for each clause of the purported structured derivation. The structure of the blocks is as follows. We start from the top of the refutation (the part closest to $\bot$ ) and for the first $2q$ layers we simply copy the first $2q$ layers of $\Pi$ (i.e., copy the structure and the clauses). At this point, we have $2^{2q}$ leaves and our structure is starting to get too big. We therefore create a new layer with only $2^{q}$ blocks and encode (the contradictory statement) that, actually, the $2^{2q}$ blocks of the layer above can be mapped in a one-to-one fashion to these $2^{q}$ new blocks. This mapping is encoded by enforcing that if block $B$ is mapped to block $B^{\prime}$ then both contain the exact same clause. Starting from each of the $2^{q}$ new blocks, we construct a full binary tree of blocks of depth $q$ . This gives us $2^{2q}$ blocks at the last layer and we again map these blocks to $2^{q}$ blocks. By repeating this procedure $n/q$ times, we create a structure of depth $n$ , but that has size only $2^{O(q)}$ . For reasons that will become clear soon, we encode the mapping from $2^{2q}$ to $2^{q}$ as a pigeonhole principle instance over an expander of constant degree $\Delta$ . Note that this structure has size $2^{O(q)}$ .
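A rough block count makes the saving visible. The sketch below follows the simplified description above (a copied top part of depth $2q$ , followed by rounds of $2^{q}$ binary trees of depth $q$ ) and compares it with the size of the full binary tree of depth $n$ . The exact bookkeeping, including the number of rounds and the function names, is an assumption made for illustration and is not the paper's precise construction.

```python
def trivial_refutation_clauses(n: int) -> int:
    """Number of clauses in the full binary tree of depth n."""
    return 2 ** (n + 1) - 1


def shallow_structure_blocks(n: int, q: int) -> int:
    """Approximate number of blocks in the layered structure.

    Assumes q divides n and n >= 2q: the top 2q layers are copied verbatim,
    and each later round consists of 2^q full binary trees of depth q.
    """
    if q <= 0 or n % q != 0 or n < 2 * q:
        raise ValueError("this sketch assumes q > 0, q divides n, and n >= 2q")
    top = 2 ** (2 * q + 1) - 1               # copied top part of depth 2q
    rounds = n // q - 2                      # remaining depth, q layers per round
    per_round = 2 ** q * (2 ** (q + 1) - 1)  # 2^q trees of depth q
    return top + rounds * per_round


example_trivial = trivial_refutation_clauses(12)
example_shallow = shallow_structure_blocks(12, 3)
```

Under these assumptions the count is at most about $(n/q)\cdot 2^{2q+1}$ , which is $2^{O(q)}$ whenever $q\geq\log n$ , in contrast to the $2^{n+1}-1$ clauses of the trivial refutation.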

We can define a formula, which we refer to as $\mathrm{ShortcutRef}^{\star}_{1}(F)$ (see Figure 1), of size $2^{O(q)}$ that encodes that a resolution refutation with this structure exists. For some intuition as to why $\mathrm{ShortcutRef}^{\star}_{1}(F)$ could give us a non-automatability result for tree-like resolution, observe that, if $F$ is unsatisfiable, the only reason $\mathrm{ShortcutRef}^{\star}_{1}(F)$ is unsatisfiable is that we cannot have a one-to-one map from $2^{2q}$ blocks to $2^{q}$ blocks. Since pigeonhole principle formulas over expander graphs are hard to refute for resolution, it is reasonable that $\mathrm{ShortcutRef}^{\star}_{1}(F)$ should be hard to refute for resolution (and thus also for tree-like resolution).

Figure 1: On the left, a trivial tree-like resolution refutation. On the right, the structure of the formula $\mathrm{ShortcutRef}^{\star}_{1}(F)$ with a general parameter $q$ . Triangles correspond to tree-like subderivations, and squares to blocks encoding clauses of the derivation. In between layers of triangles, we have an expander graph $\Gamma_{\mathrm{sh}}$ .

When $F$ is satisfiable, however, there is another reason $\mathrm{ShortcutRef}^{\star}_{1}(F)$ is unsatisfiable: there is an assignment $x^{*}$ that satisfies all clauses in $F$ . A proof that $\mathrm{ShortcutRef}^{\star}_{1}(F)$ is unsatisfiable corresponds to showing that, no matter what the purported refutation of $F$ is (as long as it fits in the structure), it will contain a mistake. It is too early to get into many details here, but the rough intuition is that a tree-like resolution refutation of $\mathrm{ShortcutRef}^{\star}_{1}(F)$ can be viewed as following a path from the clause $\bot$ to a clause at the bottom of the structure of $\mathrm{ShortcutRef}^{\star}_{1}(F)$ , keeping the invariant that $x^{*}$ falsifies all clauses of the purported refutation that it goes through. If we did not find a mistake in the purported refutation earlier, once we reach a clause $C$ at the bottom layer we necessarily find a mistake, since by the invariant $x^{*}$ falsifies $C$ , but it satisfies all clauses of $F$ , so $C$ cannot be a weakening of an axiom of $F$ .

This argument can be carried out in tree-like resolution in size which is proportional to the number of paths from the top block to a block at the bottom layer in $\mathrm{ShortcutRef}^{\star}_{1}(F)$ . If we did not have an expander graph between layers, but instead had a complete bipartite graph, the number of such paths would be $(2^{q})^{n/q}=2^{n}$ , which is too large. With bipartite expander graphs with maximum left-degree $\Delta$ , the number of such paths becomes $\Delta^{n/q}$ . If we define $q=\sqrt{n}$ , we have that the size of the formula $\mathrm{ShortcutRef}^{\star}_{1}(F)$ and the upper bound when $F$ is satisfiable are both $2^{O(\sqrt{n})}$ . This, together with a size lower bound of $2^{\Omega(n)}$ for refuting $\mathrm{ShortcutRef}^{\star}_{1}(F)$ when $F$ is unsatisfiable, gives the non-automatability result for tree-like resolution in [16].

This result for tree-like resolution also applies for tree-like $\mathrm{Res}(k)$ , but our goal is to prove a stronger non-automatability result for tree-like $\mathrm{Res}(k)$ . To this end, we want to design a formula that is both smaller and for which we get a smaller tree-like $\mathrm{Res}(k)$ refutation when $F$ is satisfiable. The best one could hope for, without contradicting ETH, is that both the size and the upper bound become $2^{O(\sqrt{n/k})}$ . By varying $q$ , we can make either of the parameters small, but not both at the same time. In particular, if we consider $\mathrm{ShortcutRef}^{\star}_{1}(F)$ with $q=\sqrt{nk}$ we get an upper bound of $2^{O(\sqrt{n/k})}$ , at the cost of increasing the size of the formula to $2^{\Theta(\sqrt{nk})}$ . (Note that the upper bound is smaller than the size of the formula since by definition of refutation size we only count the clauses used in the refutation.) In particular, between two “shrinking layers”, we have trees of size $2^{\sqrt{nk}}$ .
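The trade-off behind the choice of $q$ can be spelled out with the simplified counts used in this discussion. The calculation below is only a sketch: it treats the expander degree $\Delta$ as a constant and ignores lower-order factors.

```latex
\begin{align*}
\text{formula size} &= 2^{O(q)}, &
\text{upper bound} &= \Delta^{n/q} = 2^{O(n/q)},\\
q=\sqrt{n}: \quad \text{formula size} &= 2^{O(\sqrt{n})}, &
\text{upper bound} &= 2^{O(\sqrt{n})},\\
q=\sqrt{nk}: \quad \text{formula size} &= 2^{\Theta(\sqrt{nk})}, &
\text{upper bound} &= 2^{O(\sqrt{n/k})}.
\end{align*}
```

In this family the two exponents always multiply to about $q\cdot(n/q)=n$ , whereas making both of them $O(\sqrt{n/k})$ would require a product of about $n/k$ . This is why changing $q$ alone cannot reach the target and an additional idea is needed.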

This is where our modules come into play. Let $C$ be the root of one of the trees of size $2^{\sqrt{nk}}$ . Instead of encoding that there is a tree-like derivation of $C$ from the $2^{\sqrt{nk}}$ leaves, we encode that there is a size- $s$ resolution derivation of $C$ from only $m=2^{\sqrt{n/k}}$ clauses. This can be done using a formula that is structurally close to $\mathrm{Ref}(F,s)$ , except that instead of a fixed $F$ we have blocks encoding potential axioms $A_{1},\ldots,A_{m}$ , and we do not enforce that the last block is the empty clause. We denote the formula encoding this small derivation by $\mathrm{Mod}_{s,m}$ .

We can use these formulas $\mathrm{Mod}_{s,m}$ to obtain a formula similar to $\mathrm{ShortcutRef}^{\star}_{1}(F)$ but of size $2^{O(\sqrt{n/k})}$ as follows. Observe that after each layer of $2^{\sqrt{n/k}}$ modules we have $2^{\sqrt{n/k}}$ sets of $m=2^{\sqrt{n/k}}$ axioms $A_{1},\ldots,A_{m}$ . We now define the graph PHP formula over blocks that maps these $(2^{\sqrt{n/k}})^{2}$ blocks to $2^{\sqrt{n/k}}$ blocks. To build this formula we use bipartite expander graphs, the shrinking expanders, which play a role analogous to that of the expander graph in $\mathrm{ShortcutRef}^{\star}_{1}(F)$ .

While the formula obtained at this point has the correct size, we need to ensure that the upper bound can go through. To this end, we need to be able to easily access any of the $2^{\sqrt{nk}}$ extensions of $C$ . We therefore encode an additional graph PHP formula over an expander graph, the shortcut expander, that maps $2^{\sqrt{nk}}$ partial assignments to the $2^{\sqrt{n/k}}$ blocks corresponding to $A_{1},\ldots,A_{m}$ . This expander graph is too large to be hardcoded (the whole point is that we are trying to get the formula size down to $2^{\sqrt{n/k}}$ ) so we instead encode a circuit of size $2^{O(\sqrt{n/k})}$ that computes the neighbor function of this graph. With some other minor technicalities, like adding some extension variables, we can obtain an upper bound of roughly $k^{O(\sqrt{n/k})}$ . This would already give us a good non-automatability result, but we can get a bit better by balancing the size of the formula and the upper bound. This is why the precise parameters of our formula $\mathrm{ShortcutRef}_{k}(F)$ presented in the next section differ slightly from the ones presented here.

3 The formula $\mathrm{ShortcutRef}_{k}(F)$

This section is devoted to the definition of the formula $\mathrm{ShortcutRef}_{k}(F)$ . The idea is that, given a CNF formula $F$ , we would like to encode that $F$ admits a short tree-like Resolution refutation. Instead, what actually is encoded is the contradictory claim that “the trivial exponential-size refutation of $F$ fits in a structure of much smaller size.”

Let $F$ be a formula over variables $x_{1},\dots,x_{n}$ and with ${\mathsf{poly}}(n)$ clauses. We start by defining the three building blocks of $\mathrm{ShortcutRef}_{k}(F)$ .

3.1 Modules for subderivations

We start by defining the formula $\mathrm{Mod}_{s,m}$ which we refer to as a module. This formula is similar to $\mathrm{Ref}_{s}(F)$ , except that instead of a fixed $F$ we have $m$ blocks encoding potential axioms, and we do not enforce that the last block is the empty clause.

Module variables

For simplicity, let $s$ and $m$ be powers of $2$ . The variables of $\mathrm{Mod}_{s,m}$ are partitioned into $s+m+1$ blocks: one special block, $D_{s+1}$ , corresponding to the derived clause; $s$ blocks, $D_{1},D_{2},\ldots,D_{s}$ , corresponding to a purported derivation of $D_{s+1}$ ; and $m$ axiom blocks, $A_{1},A_{2},\cdots,A_{m}$ , corresponding to the axioms, or initial clauses, of the derivation. Each block $B$ (of any type) contains the following variables.

$\blacksquare$

literal variables $\textsf{lit}^{B}_{\ell}$ , one for each literal $\ell\in\{x_{1},\overline{x}_{1},\dots,x_{n},\overline{x}_{n}\}$ , such that $\textsf{lit}_{\ell}^{B}=1$ if and only if the clause in the block contains $\ell$ .
$\blacksquare$

enabled variables $\textsf{enbl}^{B}$ that encode whether a block is enabled or not.

Moreover, blocks $D=D_{i}$ for some $i\in[s+1]$ contain the following variables.

$\blacksquare$

type variables $\mathsf{type}^{D}$ with the intended meaning that $\mathsf{type}^{D}=0$ if $D$ is derived from one of the axiom blocks through a weakening step, and $\mathsf{type}^{D}=1$ if $D$ is derived from two previous blocks $D_{j}$ and $D_{j^{\prime}}$ .
$\blacksquare$

axiom pointer $\mathsf{axiom}^{D}\in[m]$ , encoded with $\log m$ Boolean variables, indicating, in the case where $\mathsf{type}^{D}=0$ , what axiom $D$ is a weakening of.
$\blacksquare$

resolved variable $\mathsf{resvar}^{D}\in[n]$ , encoded with $\log n$ Boolean variables, indicating, in the case where $\mathsf{type}^{D}=1$ , which variable $x_{j}$ was resolved on to derive $D$ .
$\blacksquare$

derived pointers $\textsf{point}^{D}_{+},\textsf{point}^{D}_{-}\in[s]$ encoded with $\log s$ Boolean variables each. These variables indicate, in the case where $\mathsf{type}^{D}=1$ , which blocks were used to derive $D$ .

Module axioms

The idea is that the axioms enforce that the blocks encode a valid derivation of the clause at $D_{s+1}$ from the clauses at $A_{1},\ldots,A_{m}$ . For all $D=D_{i}$ for $i\in[s+1]$ , and $\star\in\{+,-\}$ , we include axioms encoding that the clauses used to derive $D$ come before $D$ in the ordering.

	$\displaystyle(\textsf{enbl}^{D}\land\mathsf{type}^{D})\rightarrow(\textsf{point}^{D}_{\star}\neq j)$	$\displaystyle\forall j>i.$	(1)
Moreover, for $j\in[s+1]$ we have the following axioms
	$\displaystyle(\textsf{enbl}^{D}\land\mathsf{type}^{D}\land(\textsf{point}^{D}_{\star}=j))\rightarrow\textsf{enbl}^{D_{j}}$		(2)
	$\displaystyle(\textsf{enbl}^{D}\land\mathsf{type}^{D}\land(\textsf{point}^{D}_{\star}=j)\land(\mathsf{resvar}^{D}\neq k))\rightarrow(\textsf{lit}_{\ell}^{D_{j}}\rightarrow\textsf{lit}_{\ell}^{D})$	$\displaystyle\forall k\in[n],\forall\ell\in\{x_{k},\overline{x}_{k}\}$	(3)
	$\displaystyle(\textsf{enbl}^{D}\land\mathsf{type}^{D}\land(\textsf{point}^{D}_{+}=j)\land(\mathsf{resvar}^{D}=k))\rightarrow\lnot\textsf{lit}_{\overline{x}_{k}}^{D_{j}}$		(4)
	$\displaystyle(\textsf{enbl}^{D}\land\mathsf{type}^{D}\land(\textsf{point}^{D}_{-}=j)\land(\mathsf{resvar}^{D}=k))\rightarrow\lnot\textsf{lit}_{{x}_{k}}^{D_{j}}$		(5)
encoding that the blocks used to derive $D$ are enabled and that the resolution step is sound. Finally, for $j\in[m]$ we include axioms
	$\displaystyle(\textsf{enbl}^{D}\land\lnot\mathsf{type}^{D}\land(\textsf{axiom}^{D}=j))\rightarrow\textsf{enbl}^{A_{j}}$		(6)
	$\displaystyle(\textsf{enbl}^{D}\land\lnot\mathsf{type}^{D}\land(\textsf{axiom}^{D}=j))\rightarrow(\textsf{lit}_{\ell}^{A_{j}}\rightarrow\textsf{lit}_{\ell}^{D})$	$\displaystyle\forall k\in[n],\forall\ell\in\{x_{k},\overline{x}_{k}\}$	(7)

encoding that if $D$ is a weakening of an axiom $A_{j}$ , then $A_{j}$ is enabled and $D$ contains all literals in $A_{j}$ .
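To make the intended semantics of axioms (1)-(7) concrete, here is a minimal sketch of a checker that reads a fully assigned module and reports the first violated axiom family. The class names, the signed-integer encoding of literals ( `+v` for $x_{v}$ , `-v` for $\overline{x}_{v}$ ), the 0-based pointers and the function `first_violation` are all assumptions made for illustration; the formula itself works with Boolean variables rather than Python objects.

```python
from dataclasses import dataclass
from typing import FrozenSet, List, Optional


@dataclass
class AxiomBlock:
    enabled: bool
    lits: FrozenSet[int]          # +v encodes x_v, -v encodes its negation


@dataclass
class DerivedBlock:
    enabled: bool
    lits: FrozenSet[int]
    is_resolution: bool           # the type variable; False means weakening
    axiom: int = 0                # 0-based axiom pointer (weakening case)
    resvar: int = 0               # resolved variable (resolution case)
    plus: int = 0                 # 0-based pointer point_+
    minus: int = 0                # 0-based pointer point_-


def first_violation(axioms: List[AxiomBlock],
                    derived: List[DerivedBlock]) -> Optional[int]:
    """Return the number (1-7) of the first violated axiom family, or None."""
    for i, d in enumerate(derived):
        if not d.enabled:
            continue
        if d.is_resolution:
            # point_+ must avoid the negated variable, point_- the positive one.
            for ptr, forbidden, family in ((d.plus, -d.resvar, 4),
                                           (d.minus, d.resvar, 5)):
                if ptr > i:
                    return 1      # axiom (1) as stated: indices j > i are forbidden
                premise = derived[ptr]
                if not premise.enabled:
                    return 2
                if any(abs(l) != d.resvar and l not in d.lits
                       for l in premise.lits):
                    return 3
                if forbidden in premise.lits:
                    return family
        else:
            a = axioms[d.axiom]
            if not a.enabled:
                return 6
            if not a.lits <= d.lits:
                return 7
    return None


# Derive the clause (x2) from (x1 v x2) and (not x1 v x2) by resolving on x1.
A = [AxiomBlock(True, frozenset({1, 2})), AxiomBlock(True, frozenset({-1, 2}))]
good = [
    DerivedBlock(True, frozenset({1, 2}), is_resolution=False, axiom=0),
    DerivedBlock(True, frozenset({-1, 2}), is_resolution=False, axiom=1),
    DerivedBlock(True, frozenset({2}), is_resolution=True,
                 resvar=1, plus=0, minus=1),
]
# Claiming the empty clause instead drops the literal x2 and breaks axiom (3).
bad = good[:2] + [DerivedBlock(True, frozenset(), is_resolution=True,
                               resvar=1, plus=0, minus=1)]
```

In this sketch `first_violation(A, good)` finds no violated axiom, while the unsound last step of `bad` is caught by family (3).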

3.2 Shrinking graph-PHP formula for blocks

We now define the graph-PHP formula over blocks that will be necessary to reduce the size of the final formula. Given $M,N,\Delta\in\mathbb{N}$ , with $\Delta\leq N\leq M$ , and given a bipartite graph $\Gamma=(([M],[N]),E)$ with uniform left degree $\Delta$ , we define the formula $\mathrm{ShrinkBlocks}(\Gamma)$ which encodes the statement, which is contradictory when $M>N$ , that there is a subset of edges of $E$ that define an injective map from $M$ blocks to $N$ blocks. We assume for simplicity that $N$ is a power of $2$ .

Shrinking variables

The variables are partitioned into blocks $P_{1},P_{2},\ldots,P_{M}$ and $H_{1},H_{2},\ldots,H_{N}$ . For every block $B$ we have the same literal and enabled variables as before:

$\blacksquare$

literal variables $\textsf{lit}^{B}_{\ell}$ , one for each literal $\ell\in\{x_{1},\overline{x}_{1},\dots,x_{n},\overline{x}_{n}\}$ .
$\blacksquare$

enabled variables $\textsf{enbl}^{B}$ .

For blocks $P=P_{i}$ we also have funnel pointer variables.

$\blacksquare$

funnel variables $\textsf{fun}^{P}_{j}$ for $j\in[\Delta]$ indicating whether block $P$ is mapped to its $j$ th neighbor.

Shrinking axioms

We have axioms encoding the following two statements.

$\blacksquare$

If $P$ is enabled, it is mapped to at least one of its neighbors; that is, for $j\in[M]$ and $P=P_{j}$ we have the axiom

$\textsf{enbl}^{P}\rightarrow\bigvee_{i\in[\Delta]}\textsf{fun}^{P}_{i}\,.$ (8)
$\blacksquare$

If $P$ is mapped to its $i$ th neighbor, say block $H$ , then $H$ is enabled and its assignment is exactly the assignment of $P$ . That is, for $i\in[\Delta]$ and for $j\in[M]$ , if $P=P_{j}$ and $H$ is the $i$ th neighbor of $P$ , we have the axioms

$\displaystyle(\textsf{enbl}^{P}\wedge\textsf{fun}^{P}_{i})\rightarrow{}\textsf{enbl}^{H}$ (9)

$\displaystyle(\textsf{enbl}^{P}\wedge\textsf{fun}^{P}_{i})\rightarrow{}(\textsf{lit}_{\ell}^{H}\leftrightarrow\textsf{lit}_{\ell}^{P})$ $\displaystyle\forall\ell\in\{x_{1},\overline{x}_{1},\dots,x_{n},\overline{x}_{n}\}\,.$ (10)
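The shrinking axioms (8)-(10) can be read as a simple consistency check on a full assignment. The sketch below is an illustration only: the function name `shrink_violation` , the tuple representation of blocks and the tiny example graph are assumptions, and literals are encoded as signed integers.

```python
from typing import FrozenSet, List, Optional, Tuple

Block = Tuple[bool, FrozenSet[int]]   # (enabled, clause as a set of signed ints)


def shrink_violation(neighbors: List[List[int]],
                     P: List[Block],
                     fun: List[List[bool]],
                     H: List[Block]) -> Optional[int]:
    """Return 8, 9 or 10 for the first violated shrinking axiom, else None.

    neighbors[p][i] is the i-th right neighbor of left vertex p, and
    fun[p][i] says whether block P[p] is mapped to that neighbor.
    """
    for p, (p_enabled, p_lits) in enumerate(P):
        if not p_enabled:
            continue
        if not any(fun[p]):
            return 8                   # enabled block mapped nowhere
        for i, chosen in enumerate(fun[p]):
            if not chosen:
                continue
            h_enabled, h_lits = H[neighbors[p][i]]
            if not h_enabled:
                return 9               # target block is not enabled
            if h_lits != p_lits:
                return 10              # target holds a different clause
    return None


# Three enabled left blocks with distinct clauses, two right blocks, degree 2.
neighbors = [[0, 1], [0, 1], [1, 0]]
P = [(True, frozenset({1})), (True, frozenset({2})), (True, frozenset({3}))]
H = [(True, frozenset({1})), (True, frozenset({2}))]
fun = [[True, False], [False, True], [True, False]]
```

Here the third left block is sent to a right block that already holds a different clause, so axiom (10) is the one reported as falsified; with only the first two left blocks the same choice of `fun` is consistent.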

3.3 Shortcut graph PHP formula for blocks

Let $M,N,\Delta\in\mathbb{N}$ , with $\Delta\leq N\leq M$ , and assume for simplicity that $M$ and $N$ are powers of $2$ . Given a circuit $\mathcal{C}$ of size $S$ encoding the neighbor function of a bipartite graph $\Gamma=(([M],[N]),E)$ with uniform left degree $\Delta$ , we define the auxiliary formula $\mathrm{ShortcutBlocks}(\mathcal{C},t)$ which encodes the statement, which is contradictory when $M>N$ , that there is an injective map from $M$ blocks to $N$ blocks.

We start by considering $\Delta$ circuits $\mathcal{C}_{j}$ for $j\in[\Delta]$ , where $\mathcal{C}_{j}$ is obtained by fixing the second part of the input of $\mathcal{C}$ to be $j$ . Note that such a $\mathcal{C}_{j}$ computes the $j$ th neighbor of the input vertex. We first define a formula that encodes the circuit $\mathcal{C}_{j}$ .

Circuit variables

The circuit variables consist of the following: input variables $\textsf{inp}_{i,j}$ for $i\in[\log M]$ ; gate variables $\textsf{gate}_{g,j}$ for $g\in[S]$ ; and output variables $\textsf{out}_{i,j}$ for $i\in[\log N]$ .

Circuit axioms

The circuit axioms encode that the circuit computes correctly. For every gate $g$ in $\mathcal{C}$ , we have one of the following constraints depending on whether $g$ is a NOT, AND or OR gate.

If $g$ is a NOT gate
$\displaystyle\textsf{gate}_{g,j}\leftrightarrow\neg\textsf{gate}_{g^{\prime},j}$	where $g^{\prime}$ is the input gate of $g$ .	(11)
If $g$ is an AND gate
$\displaystyle\textsf{gate}_{g,j}\leftrightarrow\textsf{gate}_{g^{\prime},j}\land\textsf{gate}_{g^{\prime\prime},j}$	where $g^{\prime},$ $g^{\prime\prime}$ are the input gates of $g$ .	(12)
If $g$ is an OR gate
$\displaystyle\textsf{gate}_{g,j}\leftrightarrow\textsf{gate}_{g^{\prime},j}\lor\textsf{gate}_{g^{\prime\prime},j}$	where $g^{\prime},$ $g^{\prime\prime}$ are the input gates of $g$ .	(13)

Moreover, if $g$ corresponds to the $i$ th input gate then $\textsf{gate}_{g,j}=\textsf{inp}_{i,j}$ , and if $g$ corresponds to the $i$ th output gate we have $\textsf{gate}_{g,j}=\textsf{out}_{i,j}$ . Note that each of these $S$ constraints can be encoded by at most $3$ clauses, each of width at most $3$ .
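The bound of at most $3$ clauses of width at most $3$ per gate can be checked with the standard clausal encoding of an equivalence. The sketch below is illustrative: the function names and the representation of clauses as lists of signed integers are assumptions, not notation from the construction.

```python
from typing import Dict, List, Optional


def gate_clauses(kind: str, g: int, a: int,
                 b: Optional[int] = None) -> List[List[int]]:
    """Clauses equivalent to g <-> NOT a, g <-> (a AND b) or g <-> (a OR b).

    Variables are positive integers; a negative integer is a negated variable.
    """
    if kind == "NOT":
        return [[g, a], [-g, -a]]
    if kind == "AND":
        return [[-g, a], [-g, b], [g, -a, -b]]
    if kind == "OR":
        return [[g, -a], [g, -b], [-g, a, b]]
    raise ValueError(f"unknown gate kind: {kind}")


def satisfied(clauses: List[List[int]], assignment: Dict[int, bool]) -> bool:
    """True if every clause contains a literal made true by the assignment."""
    return all(any(assignment[abs(l)] == (l > 0) for l in clause)
               for clause in clauses)
```

A NOT gate needs two clauses of width $2$ , while AND and OR gates need three clauses of width at most $3$ ; equating a gate variable with an input or output variable takes two clauses of width $2$ .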

We are now ready to define the auxiliary formula $\mathrm{ShortcutBlocks}(\mathcal{C},t)$ .

Shortcut variables

The variables are partitioned into blocks ${\widetilde{H}}_{1},{\widetilde{H}}_{2},\ldots,{\widetilde{H}}_{N}$ . For every block ${\widetilde{H}}={\widetilde{H}}_{i}$ for $i\in[N]$ we have

$\blacksquare$

literal variables $\textsf{lit}^{{\widetilde{H}}}_{\ell}$ , one for each literal $\ell\in\{x_{1},\overline{x}_{1},\dots,x_{n},\overline{x}_{n}\}$ ;
$\blacksquare$

enabled variables $\textsf{enbl}^{{\widetilde{H}}}$ ; and
$\blacksquare$

circuit variables: for $j\in[\Delta]$ we have $\textsf{inp}^{{\widetilde{H}}}_{i,j}$ for $i\in[\log M]$ , $\textsf{gate}^{{\widetilde{H}}}_{g,j}$ for $g\in[S]$ , and $\textsf{out}^{{\widetilde{H}}}_{i,j}$ for $i\in[\log N]$ .

These latter variables are meant to encode the circuits necessary to compute the neighbor function of the graph $\Gamma$ .

Shortcut axioms

For $i\in[N]$ , we include the circuit axioms for the circuit variables of ${{\widetilde{H}}}_{i}$ , encoding that the computation is correct.

Moreover, for all $i\in[N]$ we have axioms encoding that the input to each of the $\Delta$ circuits of block ${{\widetilde{H}}}_{i}$ is given by the polarity of $\log M$ literal variables (starting at $x_{1+t}$ ); and that one of these $\Delta$ circuits outputs the index $i$ . That is, for $i\in[N]$ we have

	$\displaystyle\textsf{enbl}^{{\widetilde{H}}_{i}}\rightarrow{}(\textsf{inp}^{{\widetilde{H}}_{i}}_{i^{\prime},j}\leftrightarrow\textsf{lit}_{{x}_{i^{\prime}+t}}^{{\widetilde{H}}_{i}})$	$\displaystyle\forall i^{\prime}\in[\log M],\forall j\in[\Delta]$		(14)
	$\displaystyle\textsf{enbl}^{{\widetilde{H}}_{i}}\rightarrow{}\left(\bigvee_{j\in[\Delta]}\textsf{out}_{j}^{{\widetilde{H}}_{i}}=i\right)\,,$			(15)

where $\textsf{out}_{j}^{{\widetilde{H}}_{i}}=i$ is shorthand for the fact that the output variables generate the binary encoding of $i$ . Note that (14) is the only place where $t$ appears.

3.4 Final formula

Given a CNF formula $F$ over variables $x_{1},\ldots,x_{n}$ with $m={\mathsf{poly}}(n)$ clauses, and an integer $k\in[n]$ , we can now explain how to put these building blocks together to construct $\mathrm{ShortcutRef}_{k}(F)$ .

Let $r=\sqrt{n/(k\log k)}$ , $q=\sqrt{(n\log k)/k}$ , $c=2^{q}$ and $s=nc$ . We consider $rc$ copies of the module $\mathrm{Mod}_{s,c}$ arranged in $r$ layers, $c$ copies per layer. For every block $B$ appearing in $\mathrm{Mod}_{s,c}$ , we denote by $B[i,j]$ the $j$ th copy of the block in layer $i$ .
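A quick numerical check of these parameters shows that the $r$ layers, each handling $kq$ variables, together cover all $n$ variables. The sketch below assumes base-2 logarithms and $k\geq 2$ (both assumptions made here for illustration), ignores rounding to integers, and uses sample values chosen so that all quantities are exact.

```python
import math


def parameters(n: int, k: int):
    """Return (r, q, c, s); assumes k >= 2 and base-2 logarithms."""
    log_k = math.log2(k)
    r = math.sqrt(n / (k * log_k))    # number of layers
    q = math.sqrt(n * log_k / k)      # log2 of the number of modules per layer
    c = 2 ** q                        # modules per layer
    s = n * c                         # derivation length inside a module
    return r, q, c, s


r, q, c, s = parameters(512, 4)       # sample values: n = 512, k = 4
```

For these sample values the product $r\cdot k\cdot q$ equals $n$ , matching the fact that layer $i$ is responsible for the variables $x_{(i-1)kq+1},\ldots,x_{ikq}$ .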

Root axioms

We refer to the block $D=D_{s+1}[1,1]$ as the root block. This first group of axioms consists of the unit clause $\textsf{enbl}^{D}$ and the $2n$ unit clauses:

$\displaystyle\neg\textsf{lit}_{\ell}^{D}\qquad\forall\ell\in\{x_{1},\overline{x}_{1},\dots,x_{n},\overline{x}_{n}\}$ (16)

enforcing the root to be enabled and to contain the empty clause.

Formula $F$ axioms

For $i\in[m]$ , let $C_{i}$ be the $i$ th clause of $F$ . We need to encode that the final $A$ -blocks in the last layer of modules are actually weakenings of clauses in $F$ . Therefore, for $\tau,j\in[c]$ , and $A=A_{\tau}[r,j]$ , we include pointer variables $\mathsf{axiom}^{A}\in[m]$ and, for each $i\in[m]$ and each literal $\ell$ in $C_{i}$ , the axioms encoding

$\displaystyle(\textsf{enbl}^{A}\land\mathsf{axiom}^{A}=i)\rightarrow\textsf{lit}_{\ell}^{A}\,.$ (17)

Layer axioms

We would like to encode that the only variables resolved on in modules in layer $i$ are $x_{(i-1)kq+1},\ldots,x_{ikq}$ . We therefore include, for all $i\in[r]$ , $j\in[c]$ , $\ell\not\in\{x,\bar{x}:x\in\{x_{(i-1)kq+1},\ldots,x_{ikq}\}\}$ , $\tau\in[c]$ and $\tau^{\prime}\in[s]$ , the axioms

	$\displaystyle\textsf{enbl}^{A_{\tau}[i,j]}\wedge\textsf{lit}^{A_{\tau}[i,j]}_{\ell}\rightarrow\textsf{lit}^{D_{s+1}[i,j]}_{\ell},$		(18)
	$\displaystyle\textsf{enbl}^{A_{\tau}[i,j]}\wedge\textsf{lit}^{D_{\tau^{\prime}}[i,j]}_{\ell}\rightarrow\textsf{lit}^{D_{s+1}[i,j]}_{\ell}\,,$		(19)

encoding that all blocks in the module contain the same literals as the final derived clause $D_{s+1}$ , except possibly for the literals over variables that could have been resolved over in this module. Moreover, for $\tau\in[c]$ , if $A_{\tau}$ is enabled we want that for every $x\in\{x_{(i-1)kq+1},\ldots,x_{ikq}\}$ , either $x$ or $\bar{x}$ appears as a literal in $A_{\tau}$ , that is

$\displaystyle\textsf{enbl}^{A_{\tau}[i,j]}\rightarrow(\textsf{lit}^{A_{\tau}[i,j]}_{x}\lor\textsf{lit}^{A_{\tau}[i,j]}_{\bar{x}})\qquad\forall x\in\{x_{(i-1)kq+1},\ldots,x_{ikq}\}\,.$ (20)

Connection axioms

Now, let $\Gamma_{\mathrm{sh}}=(U\mathop{\dot{\cup}}V,E)$ be a bipartite unique-neighbor expander with $|U|=c^{2}$ , $|V|=c$ and uniform left degree $\Delta_{\mathrm{sh}}=O(1)$ given by Lemma 7 with the parameter $k$ set to $2$ . Moreover, let $\mathcal{C}$ be a circuit computing the neighbor function of a bipartite unique-neighbor expander $\Gamma_{\mathrm{sc}}=(\widetilde{U}\mathop{\dot{\cup}}\widetilde{V},\widetilde{E})$ with $|\widetilde{U}|=c^{k}$ , $|\widetilde{V}|=c$ and uniform left degree $\Delta_{\mathrm{sc}}=O(k)$ given by Lemma 7.

In order to connect the different modules we use the other two formulas. For each $i\in[r]$ , we consider $c$ copies of $\mathrm{ShortcutBlocks}(\mathcal{C},(i-1)qk)$ , arranged in layer $i$ . Again, for every block $B$ appearing in the formula, we denote by $B[i,j]$ the $j$ th copy of the block in layer $i$ . Moreover, between two consecutive layers, we introduce a copy of the formula $\mathrm{ShrinkBlocks}(\Gamma_{\mathrm{sh}})$ , and for each block $B$ in $\mathrm{ShrinkBlocks}(\Gamma_{\mathrm{sh}})$ we denote by $B[i]$ the copy that appears after layer $i$ .

To connect the blocks in these different formulas we include the following extra axioms.

$\blacksquare$

Connecting $\mathrm{Mod}_{s,c}$ to $\mathrm{ShortcutBlocks}(\mathcal{C},t)$ . For all $i\in[r]$ and $j,\tau\in[c]$

$\displaystyle\textsf{enbl}^{A_{\tau}[i,j]}$ $\displaystyle\leftrightarrow\textsf{enbl}^{{\widetilde{H}}_{\tau}[i,j]}$ (21)

$\displaystyle\textsf{lit}^{A_{\tau}[i,j]}_{\ell}$ $\displaystyle\leftrightarrow\textsf{lit}^{{\widetilde{H}}_{\tau}[i,j]}_{\ell}$ $\displaystyle\forall\ell\in\{x_{1},\overline{x}_{1},\dots,x_{n},\overline{x}_{n}\}\,.$ (22)
$\blacksquare$

Connecting $\mathrm{ShortcutBlocks}(\mathcal{C},t)$ to $\mathrm{ShrinkBlocks}(\Gamma_{\mathrm{sh}})$ . For all $i\in[r]$ and $j,\tau\in[c]$

$\displaystyle\textsf{enbl}^{{\widetilde{H}}_{\tau}[i,j]}$ $\displaystyle\leftrightarrow\textsf{enbl}^{P_{jc+\tau}[i]}$ (23)

$\displaystyle\textsf{lit}^{{\widetilde{H}}_{\tau}[i,j]}_{\ell}$ $\displaystyle\leftrightarrow\textsf{lit}^{P_{jc+\tau}[i]}_{\ell}$ $\displaystyle\forall\ell\in\{x_{1},\overline{x}_{1},\dots,x_{n},\overline{x}_{n}\}\,.$ (24)
$\blacksquare$

Connecting $\mathrm{ShrinkBlocks}(\Gamma_{\mathrm{sh}})$ to $\mathrm{Mod}_{s,c}$ . For all $i\in[r]$ and $j\in[c]$ ,

$\displaystyle\textsf{enbl}^{H_{j}[i]}$ $\displaystyle\leftrightarrow\textsf{enbl}^{D_{s+1}[i+1,j]}$ (25)

$\displaystyle\textsf{lit}^{H_{j}[i]}_{\ell}$ $\displaystyle\leftrightarrow\textsf{lit}^{D_{s+1}[i+1,j]}_{\ell}$ $\displaystyle\forall\ell\in\{x_{1},\overline{x}_{1},\dots,x_{n},\overline{x}_{n}\}\,.$ (26)

Extension variables and axioms

Finally, in order for the upper bound to go through, we need to include some extension variables encoding that certain sets of literals are not present in a block.

For each block in each copy of $\mathrm{Mod}_{s,c}$ and of $\mathrm{ShortcutBlocks}(\mathcal{C},\star)$ we include $k\cdot 2^{q}$ extension variables encoding whether subsets of literals are not present in the block. That is, for $i\in\{0,1,\ldots,r-1\}$ and for each block $B$ appearing in any of the copies of $\mathrm{Mod}_{s,c}$ or of $\mathrm{ShortcutBlocks}(\mathcal{C},\star)$ in layer $i+1$ , we have

$\blacksquare$

extension literal variables $\mathsf{absent}^{B}_{a,\rho}$ for each $a\in\{0,1,\ldots,k-1\}$ and $\rho\in\{0,1\}^{q}$ , with the intended meaning that $\mathsf{absent}^{B}_{a,\rho}=1$ if and only if the block is enabled and the clause in the block does not contain any of the literals $x_{(ik+a)q+1}^{\rho_{1}},x_{(ik+a)q+2}^{\rho_{2}},\ldots,x_{(ik+a)q+q}^{\rho_{q}}$ .

To enforce the intended meaning of the extension variables, we have, for each $a\in\{0,1,\ldots,k-1\}$ and $\rho\in\{0,1\}^{q}$ , clauses encoding the following constraint

$\displaystyle\mathsf{absent}^{B}_{a,\rho}\leftrightarrow\left(\textsf{enbl}^{B}\land\bigwedge_{\ell\in\mathcal{L}(i,a,\rho)}\lnot\textsf{lit}_{\ell}^{B}\right)\,,$ (27)

where $\mathcal{L}(i,a,\rho)=\{x_{(ik+a)q+1}^{\rho_{1}},x_{(ik+a)q+2}^{\rho_{2}},\ldots,x_{(ik+a)q+q}^{\rho_{q}}\}$ .
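The index arithmetic in $\mathcal{L}(i,a,\rho)$ and the meaning of constraint (27) can be sketched as follows. The function names, the signed-integer encoding of literals, and the convention that $x^{1}$ denotes $x$ and $x^{0}$ denotes $\overline{x}$ are assumptions made for this illustration.

```python
from typing import FrozenSet, List, Sequence


def extension_literals(i: int, a: int, rho: Sequence[int],
                       k: int, q: int) -> List[int]:
    """Signed literals x_{(ik+a)q+t}^{rho_t} for t = 1, ..., q."""
    base = (i * k + a) * q
    return [(base + t) if rho[t - 1] == 1 else -(base + t)
            for t in range(1, q + 1)]


def absent(enabled: bool, lits: FrozenSet[int], i: int, a: int,
           rho: Sequence[int], k: int, q: int) -> bool:
    """Value forced on absent^B_{a,rho} by constraint (27)."""
    return enabled and not any(l in lits
                               for l in extension_literals(i, a, rho, k, q))
```

For example, with $k=2$ , $q=3$ , $i=1$ , $a=0$ and $\rho=(1,0,1)$ the set consists of the literals $x_{7},\overline{x}_{8},x_{9}$ , so an enabled block whose clause is $x_{1}\lor\overline{x}_{7}$ has its extension variable set to $1$ .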

This concludes the presentation of the formula $\mathrm{ShortcutRef}_{k}(F)$ . As we mentioned, the algorithm that outputs our formula is randomized. However, as is clear from the construction presented, the only randomness needed is for constructing the expander graph $\Gamma_{\mathrm{sh}}$ and the circuits computing the neighbor functions of the expander graph $\Gamma_{\mathrm{sc}}$ .

4 Upper bound

In this section, we prove the upper bound in Theorem 2, namely we show that if $F$ is a satisfiable formula on $n$ variables, there is a tree-like Res( $k$ ) refutation of $\mathrm{ShortcutRef}_{k}(F)$ of size at most ${\mathsf{poly}}(n)\cdot 2^{O(\sqrt{(n\log 2k)/k})}$ . Before presenting the formal proof, we introduce some notation and provide a brief description of the strategy and the intuition behind it.

We view clauses and conjunctions as functions, that is, given a clause $C$ on literals over variables $x_{1},\ldots,x_{n}$ , we say that $C(x)=0$ for $x\in\{0,1\}^{n}$ if $x$ falsifies all literals in $C$ , and $C(x)=1$ otherwise. Similarly, given a conjunction $A$ we say that $A(x)=1$ if $x$ satisfies all literals in $A$ , and $A(x)=0$ otherwise.

Let $F=C_{1}\wedge\cdots\wedge C_{m}$ be a CNF formula on $n$ variables. The relation $\mathsf{Search}(F)\subseteq\{0,1\}^{n}\times[m]$ is defined by

$\displaystyle(x,i)\in\mathsf{Search}(F)\Longleftrightarrow C_{i}(x)=0\,.$ (28)

We view this relation as a search problem: given $x\in\{0,1\}^{n}$ as input, find an $i\in[m]$ such that $(x,i)\in\mathsf{Search}(F)$ , i.e., such that $x$ falsifies $C_{i}$ . We refer to this relation as the falsified clause search problem.

Definition 8.

Let $F$ be an unsatisfiable CNF formula on $n$ variables. A $k$ -decision tree solving $\mathsf{Search}(F)$ is given by a rooted binary tree such that:

$\blacksquare$

Non-leaf nodes. Each non-leaf node $v$ in the tree has two children – the $0$ -child and the $1$ -child – and is associated with a conjunction $A_{v}$ of at most $k$ Boolean literals over the variables of $F$ .
$\blacksquare$

Paths. Given an assignment $x\in\{0,1\}^{n}$ we can follow a path from the root to a leaf by starting at the root and, at each non-leaf node $v$ , moving to the $A_{v}(x)$ -child. Denote by $v_{x}$ the leaf reached when following this path for $x$ .
$\blacksquare$

Leaf nodes. Each leaf node $v$ is labeled with the index $i$ to a clause $C_{i}$ in $F$ that satisfies the following statement: for all assignments $x\in\{0,1\}^{n}$ such that $v=v_{x}$ it holds that $(x,i)\in\mathsf{Search}(F)$ , i.e., it holds that $x$ falsifies $C_{i}$ .

The size of a $k$ -decision tree is the number of nodes in the tree.
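Definition 8 can be illustrated on a toy instance. The sketch below is only an example: the class names, the helper functions and the small formula $F=(x_{1})\land(x_{2})\land(\overline{x}_{1}\lor\overline{x}_{2})$ are assumptions chosen for illustration, with literals encoded as signed integers and clauses indexed from $0$ .

```python
from dataclasses import dataclass
from typing import Dict, List, Union


@dataclass
class Leaf:
    clause_index: int             # index of a clause falsified at this leaf


@dataclass
class Node:
    term: List[int]               # conjunction A_v of at most k literals
    zero: "Tree"                  # child followed when A_v(x) = 0
    one: "Tree"                   # child followed when A_v(x) = 1


Tree = Union[Leaf, Node]


def holds(term: List[int], x: Dict[int, int]) -> bool:
    """A(x) = 1 iff x satisfies every literal of the conjunction."""
    return all(x[abs(l)] == (1 if l > 0 else 0) for l in term)


def follow(tree: Tree, x: Dict[int, int]) -> int:
    """Follow the path determined by x and return the label of the leaf v_x."""
    while isinstance(tree, Node):
        tree = tree.one if holds(tree.term, x) else tree.zero
    return tree.clause_index


def size(tree: Tree) -> int:
    """Number of nodes of the tree."""
    if isinstance(tree, Leaf):
        return 1
    return 1 + size(tree.zero) + size(tree.one)


F = [[1], [2], [-1, -2]]          # unsatisfiable toy CNF
tree = Node([1, 2],
            zero=Node([1], zero=Leaf(0), one=Leaf(1)),
            one=Leaf(2))
```

The root queries the $2$ -term $x_{1}\land x_{2}$ ; this tree has five nodes, and for each of the four assignments the clause labelling the reached leaf is falsified.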

We describe a $k$ -decision tree of size ${\mathsf{poly}}(n)\cdot 2^{O(\sqrt{(n\log 2k)/k})}$ solving the falsified clause search problem on $\mathrm{ShortcutRef}_{k}(F)$ and, by the classical equivalence between $k$ -decision trees and tree-like Res( $k$ ) [38], the desired upper bound follows.

4.1 Intuition

The proof follows the general strategy of the upper bound proof for $\mathrm{Ref}(F)$ as presented in, e.g., [18]. Recall that the $\mathrm{ShortcutRef}_{k}(F)$ formula is organized as a grid of modules with $r=\sqrt{n/(k\log k)}$ layers and $c=2^{q}$ modules per layer, where $q=\sqrt{(n\log k)/k}$ , and that $\Delta_{\mathrm{sc}}=O(k)$ is the degree of the expander graph $\Gamma_{\mathrm{sc}}$ whose neighbor function is encoded in the subformulas $\mathrm{ShortcutBlocks}(\mathcal{C},\cdot)$ of $\mathrm{ShortcutRef}_{k}(F)$ .

Given a block $B$ of $\mathrm{ShortcutRef}_{k}(F)$ , we refer to the clause encoded by the variables $\textsf{lit}_{\ell}^{B}$ as the clause at block $B$ .

We start by explaining how to define a decision tree (that is, a $1$ -decision tree) that solves $\mathsf{Search}(\mathrm{ShortcutRef}_{k}(F))$ when $F$ is satisfiable. The only problem with this upper bound is that it will be too large. We later explain how $k$ -decision trees can use the shortcuts of $\mathrm{ShortcutRef}_{k}(F)$ to yield a better upper bound.

Let $x^{*}$ be a satisfying assignment of $F$ . Starting at root block $D_{s+1}[1,1]$ , we can query the enabled variable and, one by one, all literal variables that $x^{*}$ satisfies. Either we have found a falsified clause (one of the root axioms) or we have that the root block is enabled and has a clause with no literals, which therefore is falsified by $x^{*}$ . The goal is to move down the blocks of $\mathrm{ShortcutRef}_{k}(F)$ until we reach the axiom blocks at the last layer, all the while keeping the invariant that the block we are at is enabled and contains a clause that is falsified by $x^{*}$ . In this process, we either find a non-sound step, which immediately gives us a falsified clause of $\mathrm{ShortcutRef}_{k}(F)$ , or the path reaches an axiom block at the last layer. If we reach an axiom block $A$ at the last layer while keeping the invariant, we can easily find a falsified clause of $\mathrm{ShortcutRef}_{k}(F)$ . Indeed, since $x^{*}$ falsifies the clause at block $A$ but $x^{*}$ satisfies all clauses of $F$ , one of the axioms of $\mathrm{ShortcutRef}_{k}(F)$ encoding that the clause at $A$ must be a weakening of some clause in $F$ will be falsified.

As mentioned, this strategy works, but the decision tree obtained is too large: the size corresponds to roughly the number of possible root-to-axiom paths that maintain the invariant. In order to obtain a better upper bound, we make use of the shortcut variables and the extension variables. The idea is to move down layer by layer of $\mathrm{ShortcutRef}_{k}(F)$ , skipping modules entirely. We later define these special root-to-axiom paths that skip modules. The advantage is that there are much fewer such paths, roughly $\Delta_{\mathrm{sc}}^{r}$ , and we have included extension variables so that $k$ -decision trees can realize this upper bound.
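To see why roughly $\Delta_{\mathrm{sc}}^{r}$ paths is the right order of magnitude, the following worked calculation (a sketch that ignores constant factors and uses the parameters $r=\sqrt{n/(k\log k)}$ , $q=\sqrt{(n\log k)/k}$ and $\Delta_{\mathrm{sc}}=O(k)$ from Section 3.4, assuming $k\geq 2$ ) shows that the number of such paths matches the number $c=2^{q}$ of modules per layer up to a constant in the exponent:

$\displaystyle r\cdot\log k=\sqrt{\frac{n}{k\log k}}\cdot\log k=\sqrt{\frac{n\log k}{k}}=q\,,\qquad\text{hence}\qquad\Delta_{\mathrm{sc}}^{r}=2^{O(r\log k)}=2^{O(q)}\,.$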

4.2 Formal proof

Let $x^{*}$ be a satisfying assignment of $F$ and let $\mathcal{L}$ be all literals $x_{i}$ or $\overline{x}_{i}$ that $x^{*}$ satisfies. Given $a,b\in[n]$ , $a\leq b$ , we denote by $x^{*}[a,b]\in\{0,1\}^{b-a+1}$ the projection of $x^{*}$ to indices in the interval $[a,b]$ . The upper bound proceeds in $r$ rounds, from round $0$ to round $r-1$ . At the beginning of round $i$ we are at a node $v$ of the $k$ -decision tree which is labeled by some block $D=D_{s+1}[i+1,j]$ for some $j\in[c]$ and the queries leading to $v$ determine that the following invariant holds: “ $D$ is enabled and the clause at block $D$ is falsified by $x^{*}$ ”. More formally, the invariant is that the queries leading up to node $v$ enforce that $\textsf{enbl}^{D}=1$ and that $\textsf{lit}_{\ell}^{D}=0$ for each $\ell\in\mathcal{L}$ . At each round $i$ , we define a subtree of the $k$ -decision tree: some of the leaves of this subtree correspond to leaves of the $k$ -decision tree since we can already identify a clause in $\mathrm{ShortcutRef}_{k}(F)$ that is falsified by all assignments (to the variables of $\mathrm{ShortcutRef}_{k}(F)$ ) that are compatible with the queries leading to this leaf. Other leaves of the subtree we refer to as open nodes, and they are labeled by some $D_{s+1}[i+2,j]$ which we argue satisfies the invariant.

Before starting round $0$ , we define $D=D_{s+1}[1,1]$ and query the enabled variable $\textsf{enbl}^{D}$ and, one by one, each literal variable $\textsf{lit}_{\ell}^{D}$ for $\ell\in\mathcal{L}$ . Either we have identified a falsified clause of $\mathrm{ShortcutRef}_{k}(F)$ (one of the root axioms) or we now have that block $D$ satisfies the invariant. We now start round $0$ .

Suppose we are at round $i$ , at a node $v$ labeled by $D=D_{s+1}[i+1,j]$ for some $j\in[c]$ and that $D$ satisfies the invariant. For $a\in\{0,1,\ldots,k-1\}$ , let $\rho({a})=x^{*}[(ik+a)q+1,(ik+a)q+q]$ . Given a block $B$ that is either $A_{\tau}[i+1,j]$ for some $\tau\in[c]$ or $D_{\tau}[i+1,j]$ for some $\tau\in[s]$ , we define the $k$ -term

$\displaystyle Q(B)=\bigwedge_{a\in\{0,1,\ldots,k-1\}}\mathsf{absent}_{a,\rho(a)}^{B}\,.$ (29)

We start by querying $Q(A_{1})$ and, while the answers to the queries so far are all $0$ , we continue querying $Q(A_{2}),Q(A_{3}),\ldots,Q(A_{c})$ . Note that this defines a depth- $c$ tree with $c+1$ leaves: there is one leaf for every $\tau\in[c]$ corresponding to an answer $1$ to the query $Q(A_{\tau})$ , and one leaf corresponding to all answers being $0$ . We argue that in all but $\Delta_{\mathrm{sc}}$ of these nodes, we can easily (in ${\mathsf{poly}}(sn)$ size) find a clause of $\mathrm{ShortcutRef}_{k}(F)$ that is falsified. In each of the other $\Delta_{\mathrm{sc}}$ cases, we either find a falsified clause of $\mathrm{ShortcutRef}_{k}(F)$ or we reach a single node where the invariant is satisfied for some block $D_{s+1}[i+2,j]$ and we move to round $i+1$ .
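The sequential scan over $Q(A_{1}),\ldots,Q(A_{c})$ can be sketched as follows. This is an illustration under stated assumptions: the function names are hypothetical, $x^{*}$ is stored as a 0/1 list whose entry `x_star[t - 1]` is the value of $x_{t}$ , and each block is represented only by a dictionary mapping pairs $(a,\rho)$ to the value of $\mathsf{absent}_{a,\rho}$ .

```python
from typing import Dict, List, Optional, Tuple

Absent = Dict[Tuple[int, Tuple[int, ...]], bool]


def rho(x_star: List[int], i: int, a: int, k: int, q: int) -> Tuple[int, ...]:
    """Projection x*[(ik+a)q+1, (ik+a)q+q] as a tuple of bits."""
    start = (i * k + a) * q
    return tuple(x_star[start:start + q])


def Q(absent: Absent, x_star: List[int], i: int, k: int, q: int) -> bool:
    """The k-term Q(B): conjunction of absent_{a, rho(a)} over a = 0..k-1."""
    return all(absent.get((a, rho(x_star, i, a, k, q)), False)
               for a in range(k))


def first_hit(blocks: List[Absent], x_star: List[int],
              i: int, k: int, q: int) -> Optional[int]:
    """1-based index tau of the first block with Q(A_tau) = 1, or None."""
    for tau, absent in enumerate(blocks, start=1):
        if Q(absent, x_star, i, k, q):
            return tau
    return None
```

The return value `None` corresponds to the leaf where all answers are $0$ (Case 1 below), and a returned index $\tau$ corresponds to the leaf reached when $Q(A_{\tau})$ is the first query answered $1$ (Case 2 below).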

Case 1: Queries $Q(A_{1}),Q(A_{2}),\ldots,Q(A_{c})$ are all $0$

Let $u$ be the node where all the queries $Q(A_{\tau})$ for $\tau\in[c]$ are $0$ . We show that the subtree rooted at $u$ ends in closed leaves and is of size at most $(sn)^{O(1)}$ . The idea is that, unless we find a falsified clause of $\mathrm{ShortcutRef}_{k}(F)$ , we have that for every $\tau\in[c]$ , either $A_{\tau}$ is not enabled, or the clause at $A_{\tau}$ is satisfied by $x^{*}$ . Then, since $x^{*}$ falsifies the clause at $D=D_{s+1}[i+1,j]$ and the module encodes that we can derive the clause at $D$ from the clauses at the enabled $A_{\tau}$ ’s, there is a falsified clause of $\mathrm{ShortcutRef}_{k}(F)$ in the $[i+1,j]$ copy of $\mathrm{Mod}_{s,c}$ .

To find a falsified clause of $\mathrm{ShortcutRef}_{k}(F)$ we proceed as follows. We query $Q(D_{1}),\ldots,Q(D_{s+1})$ in order until we get an answer $1$ . Let $\tau$ be such that $Q(D_{\tau})=1$ . If $\tau=s+1$ , then we falsify one of the extension axioms. If $\tau\leq s$ we query $\textsf{enbl}^{D_{\tau}}$ and $\textsf{lit}_{\ell}^{D_{\tau}}$ for $\ell\in\mathcal{L}$ . If $\textsf{enbl}^{D_{\tau}}=0$ or $\textsf{lit}_{\ell}^{D_{\tau}}=1$ for some $\ell\in\mathcal{L}$ , then we falsify one of the extension axioms. Otherwise, we query $\mathsf{type}^{D_{\tau}}$ to determine whether the clause is derived or whether it is a weakening of an axiom.

If $D_{\tau}$ is derived, we query $\mathsf{resvar}^{D_{\tau}}$ to determine what variable is resolved on at this clause. Let $i^{\prime}=\mathsf{resvar}^{D_{\tau}}$ and let $\star=+$ if $x^{*}_{i^{\prime}}=0$ (i.e., $\overline{x}_{i^{\prime}}\in\mathcal{L}$ ) and $\star=-$ if $x^{*}_{i^{\prime}}=1$ . We query the pointer variables to the $\star$ -child and let $\tau_{\star}=\textsf{point}_{\star}^{D_{\tau}}$ . If $\tau_{\star}\geq\tau$ then we falsify one of the module axioms. Let $D_{\tau_{\star}}=D_{\tau_{\star}}[i+1,j]$ . We now query $\textsf{enbl}^{D_{\tau_{\star}}}$ and $\textsf{lit}_{\ell}^{D_{\tau_{\star}}}$ for $\ell\in\mathcal{L}$ . If $\textsf{enbl}^{D_{\tau_{\star}}}=0$ or $\textsf{lit}_{\ell}^{D_{\tau_{\star}}}=1$ for some $\ell\in\mathcal{L}$ , then we falsify one of the module axioms. We now query $\mathsf{absent}^{D_{\tau_{\star}}}_{a,\rho(a)}$ for $a\in\{0,1,\ldots,k-1\}$ until we get an answer $0$ . It cannot be that all answers are $1$ because $Q(D_{\tau_{\star}})=0$ (since ${\tau_{\star}}<\tau$ ). Let $a^{\prime}$ be such that $\mathsf{absent}^{D_{\tau_{\star}}}_{a^{\prime},\rho(a^{\prime})}=0$ . We have that $\textsf{enbl}^{D_{\tau_{\star}}}=1$ and $\textsf{lit}_{\ell}^{D_{\tau_{\star}}}=0$ for all $\ell\in\mathcal{L}$ and $\mathsf{absent}^{D_{\tau_{\star}}}_{a^{\prime},\rho(a^{\prime})}=0$ . This contradicts one of the extension axioms.

Now, if $D_{\tau}$ is a weakening of an axiom, we query the $\textsf{axiom}^{D_{\tau}}$ pointer to obtain the index $t$ of the axiom block $A_{t}$ of which $D_{\tau}$ is supposedly a weakening. We query, one by one, $\textsf{enbl}^{A_{t}}$ and $\textsf{lit}_{\ell}^{A_{t}}$ for $\ell\in\mathcal{L}$ . If $\textsf{enbl}^{A_{t}}=0$ or if $\textsf{lit}_{\ell}^{A_{t}}=1$ for some $\ell\in\mathcal{L}$ , we falsify either axiom (6) or axiom (7) encoding that this is indeed a weakening of an axiom. Finally, if $\textsf{enbl}^{A_{t}}=1$ and for all $\ell\in\mathcal{L}$ , $\textsf{lit}_{\ell}^{A_{t}}=0$ , then, similarly to the previous case, we contradict one of the extension axioms since $Q(A_{t})=0$ .

Case 2: Query $Q(A_{\tau})$ is $1$ for some $\tau\in[c]$

Let $u_{\tau}$ be the node corresponding to answer $1$ for $Q(A_{\tau})$ , for $\tau\in[c]$ . We start by querying $\textsf{enbl}^{A_{\tau}}$ : if it is $0$ we immediately contradict one of the extension axioms. We then query all variables $\textsf{lit}_{\ell}^{A_{\tau}}$ for $\ell\in\mathcal{L}$ . If any of these returns $1$ , then we either contradict an extension axiom or layer axiom (18) (the latter due to the invariant). We query $\textsf{lit}_{x_{t}}^{A_{\tau}}$ for all $t\in\{{ikq+1},\ldots,{(i+1)kq}\}$ such that $x^{*}_{t}=0$ . If for some such $t$ we have that $\textsf{lit}_{x_{t}}^{A_{\tau}}=0$ , then we contradict layer axiom (20).

In preparation for the next steps, we query, for $\widetilde{H}_{\tau}=\widetilde{H}_{\tau}[i+1,j]$ , the variables $\textsf{enbl}^{\widetilde{H}_{\tau}}$ and $\textsf{lit}_{\ell}^{\widetilde{H}_{\tau}}$ for all $\ell\in\mathcal{L}\cup\{x_{(i-1)kq+1},\ldots,x_{ikq}\}$ . If either $\textsf{enbl}^{\widetilde{H}_{\tau}}=0$ or $\textsf{lit}_{\ell}^{\widetilde{H}_{\tau}}\neq\textsf{lit}_{\ell}^{A_{\tau}}$ , we contradict one of the connection axioms. Now, let $y\in\{0,1\}^{kq}$ be such that $y_{i^{\prime}}=1-x^{*}_{ikq+{i^{\prime}}}$ for all ${i^{\prime}}\in[kq]$ . Note that $y_{i^{\prime}}=\textsf{lit}_{x_{ikq+{i^{\prime}}}}^{\widetilde{H}_{\tau}}$ . We consider two cases.

Case 2.1: There exists a $d\in{[\Delta_{\mathrm{sc}}]}$ such that circuit $\mathcal{C}_{d}(y)=\tau$. Let $P=P_{jc+\tau}[i]$. We query $\textsf{enbl}^{P}$ and $\textsf{lit}_{\ell}^{P}$ for $\ell\in\mathcal{L}$. If $\textsf{enbl}^{P}=0$ or $\textsf{lit}_{\ell}^{P}=1$, then we falsify one of the connection axioms. Otherwise, we query $\textsf{fun}_{1}^{P},\textsf{fun}_{2}^{P},\ldots,\textsf{fun}_{\Delta_{\mathrm{sh}}}^{P}$ until one of the queries returns $1$. If they all return $0$, then we contradict the shrinking axiom (8). Let $u_{t}$ be the node where $\textsf{fun}_{t}^{P}=1$ for $t\in[\Delta_{\mathrm{sh}}]$ and let block $H=H_{j^{\prime}}[i,j]$ be the $t$th neighbor of block $P$. At $u_{t}$ we query $\textsf{enbl}^{H}$ and $\textsf{lit}_{\ell}^{H}$ for $\ell\in\mathcal{L}$. If $\textsf{enbl}^{H}=0$ or $\textsf{lit}_{\ell}^{H}=1$, then we falsify one of the shrinking axioms (9)-(10). Finally, let $D^{\prime}=D_{s+1}[i+2,j^{\prime}]$. We query $\textsf{enbl}^{D^{\prime}}$ and $\textsf{lit}_{\ell}^{D^{\prime}}$ for $\ell\in\mathcal{L}$. If $\textsf{enbl}^{D^{\prime}}=0$ or $\textsf{lit}_{\ell}^{D^{\prime}}=1$, then we falsify one of the connection axioms. Otherwise, $D^{\prime}$ satisfies the invariant and we move on to round $i+1$.
Case 2.2: For all $d\in{[\Delta_{\mathrm{sc}}]}$ it holds that circuit $\mathcal{C}_{d}(y)\neq\tau$. In this case, it is enough to query variables of the $[i+1,j]$ copy of the formula $\mathrm{ShortcutBlocks}(\mathcal{C},ikq)$ to find a contradiction. Indeed, we can query all circuit variables in $\widetilde{H}_{\tau}$ in topological order, starting from the inputs. If the input to the circuit is not $y$, then we have falsified the axiom that defines the input of the circuit. If there is any incorrect computation, we immediately find a circuit axiom that is falsified. If the output does not correspond to the output gates, we also immediately falsify the axioms defining the output of the circuit. Finally, if everything is correct so far, then we are falsifying axiom (15).
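The querying strategy of Case 2.2 can be illustrated with a small sketch. It is not the paper's encoding: the gate format, the function name `first_violated_gate`, and the restriction to AND/OR/NOT gates are all assumptions made for illustration. The sketch walks the gates in topological order and reports the first gate whose assigned value is inconsistent with the values assigned to its inputs, which corresponds to finding a falsified input or circuit axiom.

```python
from typing import Dict, List, Optional, Tuple

# Hypothetical gate format: (name, op, input names), listed in topological order.
Gate = Tuple[str, str, Tuple[str, ...]]


def first_violated_gate(
    gates: List[Gate],
    expected_inputs: Dict[str, int],
    assignment: Dict[str, int],
) -> Optional[str]:
    """Return the first gate whose assigned value is locally inconsistent.

    Input gates are compared against expected_inputs (the role of y);
    every other gate is compared against the value computed from the
    values *assigned* to its inputs, so the check is purely local.
    Returns None if every gate is consistent.
    """
    for name, op, args in gates:
        if op == "INPUT":
            expected = expected_inputs[name]
        else:
            vals = [assignment[a] for a in args]
            if op == "AND":
                expected = int(all(vals))
            elif op == "OR":
                expected = int(any(vals))
            elif op == "NOT":
                expected = 1 - vals[0]
            else:
                raise ValueError(f"unknown gate type: {op}")
        if assignment[name] != expected:
            return name  # analogue of a falsified input or circuit axiom
    return None
```

If the function returns `None`, the assignment describes a correct computation on the given input, which in the proof is the situation where axiom (15) is the one that ends up falsified.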

After $r$ rounds, we reach a block $D_{s+1}[r,j]$ at the last layer. We repeat most of the round once more until we reach $A_{\tau}$ in the last row of the module. Here the decision tree queries the pointer $\textsf{axiom}^{A_{\tau}}$, which encodes a clause in $F$ that $A_{\tau}$ is supposedly a weakening of. In this way, we obtain a subtree with $m={\mathsf{poly}}(n)$ open nodes, one for each clause of $F$. At each of these nodes, we construct a subtree of size $O(n)$ that checks the clause axioms for the corresponding clause of $F$, which will necessarily lead to a contradiction since the assignment $x^{*}$ satisfies $F$. This completes the description of the $k$-decision tree since there are no more open nodes.

Note that, every node $v$ labeled by a $D=D_{s+1}[i+1,j]$ that satisfies the invariant at round $i$ , is the root of a subtree of size $(sn)^{O(1)}=2^{O(q)}$ , with only $\Delta_{\mathrm{sc}}$ open leaves $v_{\tau}$ that are labeled by some $D^{\prime}=D_{s+1}[i+1,j^{\prime}]$ that satisfy the invariant at round $i+1$ . This implies that the number of leaves of the tree is at most

$$\left(\sum_{i=0}^{r-1}\Delta_{\mathrm{sc}}^{i}\cdot{\mathsf{poly}}(n)\cdot 2^{O(q)}\right)+\Delta_{\mathrm{sc}}^{r}\cdot{\mathsf{poly}}(n)={\mathsf{poly}}(n)\cdot 2^{O(q)}\cdot\Delta_{\mathrm{sc}}^{r}\,. \tag{30}$$

Therefore, the size of the tree is at most ${\mathsf{poly}}(n)\cdot 2^{O(q)}$ which is $2^{O(q)}$ for our range of $k$ . This concludes the upper bound.
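The bound on the number of leaves can be unpacked as follows. This is a worked sketch under the assumption that $\Delta_{\mathrm{sc}}\geq 2$ (not stated in this section), with $T$ used as illustrative shorthand for the per-round subtree size ${\mathsf{poly}}(n)\cdot 2^{O(q)}$.

```latex
\begin{align*}
\sum_{i=0}^{r-1}\Delta_{\mathrm{sc}}^{i}
  &= \frac{\Delta_{\mathrm{sc}}^{r}-1}{\Delta_{\mathrm{sc}}-1}
   \leq \Delta_{\mathrm{sc}}^{r}-1
   < \Delta_{\mathrm{sc}}^{r}
   && \text{(assuming } \Delta_{\mathrm{sc}}\geq 2\text{)} \\
\Bigl(\sum_{i=0}^{r-1}\Delta_{\mathrm{sc}}^{i}\cdot T\Bigr)
  + \Delta_{\mathrm{sc}}^{r}\cdot\mathsf{poly}(n)
  &\leq \Delta_{\mathrm{sc}}^{r}\cdot T
  + \Delta_{\mathrm{sc}}^{r}\cdot\mathsf{poly}(n)
   = \mathsf{poly}(n)\cdot 2^{O(q)}\cdot\Delta_{\mathrm{sc}}^{r}
\end{align*}
```

The final $2^{O(q)}$ bound then presumably relies on $\Delta_{\mathrm{sc}}^{r}$ itself being $2^{O(q)}$ for the parameters fixed earlier in the paper.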

5 Lower bound

It remains to prove the lower bound in Theorem 2, namely to show that if $F$ is an unsatisfiable formula on $n$ variables, any tree-like Res($k$) refutation of $F$ requires size at least $2^{\Omega(n)}$. The crucial step consists in exhibiting the following lower bound for the width of a resolution refutation of $\mathrm{ShortcutRef}_{k}(F)$.

$$w(\mathrm{ShortcutRef}_{k}(F)\vdash\bot)\geq\exp\left(\Omega(q)\right), \tag{31}$$

where $q=\sqrt{(n\log 2k)/k}$ . Indeed, a tree-like Res( $k$ ) size lower bound follows from the size-width relation implicit in [29], and proven in [30].

Lemma 9 ([30]).

Let $F$ be a CNF formula of width $w$. If there exists a tree-like Res($k$) refutation of $F$ of size $\tau$, then $F$ has a resolution refutation of width $k\cdot\lceil\log\tau\rceil+\max(k,w)$.

If we assume that (31) holds, then for any $k^{\prime}$ such that $\exp\left(\Omega(q)\right)\gg nk^{\prime}$, since the width of $\mathrm{ShortcutRef}_{k}(F)$ is $O(n)$, Lemma 9 implies that any tree-like ${\text{Res}}(k^{\prime})$ refutation of $\mathrm{ShortcutRef}_{k}(F)$ requires size $\exp\left(\exp\left(\Omega(q)\right)\right)\geq 2^{n}$.
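The step from the width bound to the size bound can be spelled out as follows. In this sketch, $W$ is illustrative shorthand for the width lower bound $\exp(\Omega(q))$ of (31), $w=O(n)$ is the width of $\mathrm{ShortcutRef}_{k}(F)$, and $\tau$ is the size of a hypothetical tree-like ${\text{Res}}(k^{\prime})$ refutation.

```latex
\begin{align*}
W &\leq k^{\prime}\cdot\lceil\log\tau\rceil+\max(k^{\prime},w)
   && \text{(Lemma 9 with } k^{\prime}\text{, combined with (31))} \\
\lceil\log\tau\rceil &\geq \frac{W-\max(k^{\prime},w)}{k^{\prime}} \\
\tau &\geq 2^{\frac{W-\max(k^{\prime},w)}{k^{\prime}}-1}
\end{align*}
```

When $W\gg nk^{\prime}$ and $w=O(n)$, the exponent exceeds $n$ for sufficiently large $n$, which gives the size bound of at least $2^{n}$.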

We present the proof of the width lower bound in the full version of this paper. The main idea, following [18], is to reduce some pigeonhole principle formulas, for which we do have width lower bounds, to $\mathrm{ShortcutRef}_{k}(F)$. In our case, however, we need to extend this notion of reduction to allow reducing a collection of formulas to a single formula. We note that using such generalized reductions would lead to an improvement in parameters in [18] (matching those of [6]). We use this framework to reduce a large collection of suitable variants of pigeonhole principles (too large to obtain any non-trivial result via the vanilla reduction) to $\mathrm{ShortcutRef}_{k}(F)$. The width lower bound for $\mathrm{ShortcutRef}_{k}(F)$ then follows from width lower bounds for the different variants of the pigeonhole principle.

6 Concluding remarks

In this work, we prove that automating tree-like $k$-DNF resolution in time $n^{o\left(\frac{k}{\log k}\cdot\log n\right)}$ is rETH-hard. This implies that, assuming rETH, the automating algorithm proposed by Atserias and Bonet [4] is optimal up to an $O(\log k)$ factor in the exponent. A natural open problem is whether a similar result can be obtained under ETH. If we maintain the overall structure of our argument, obtaining the derandomization would be equivalent to designing a deterministic poly-time algorithm able to construct constant-degree polynomially unbalanced expander graphs. This question has been extensively studied in the area of pseudorandomness and remains an outstanding open problem. It is worth mentioning that, for our application, we are allowed to relax the condition classically imposed on the size of the sets of vertices that expand, and in particular we may assume it to be polylogarithmic in the number of vertices.

Numerous intriguing unresolved problems involve automatability. We just focus on some of the ones most relevant to this work. As mentioned in the introduction, one of the frontiers of automatability results lies in understanding tree-like proof systems. It is highly plausible that the vast majority of tree-like proof systems are as hard to automate as their DAG-like counterparts. Nevertheless, many (weak) tree-like proof systems are unable to establish the upper bound needed to prove non-automatability results by the now standard techniques. Can we bypass this obstacle for concrete proof systems such as tree-like ${\text{Res}}(\oplus)$ (resolution over linear equations mod 2) or tree-like cutting planes and obtain a non-automatability result which is stronger than what we know for tree-like resolution?

Very few non-trivial automating algorithms are known when automatability is defined with respect to size. However, there exist alternative definitions for automatability where the size parameter is substituted, for example, by degree or width, and in those cases there are some automating algorithms [13, 9, 32, 35]. Are these algorithms optimal? Partial results can be derived from [6].

References

[1] Michael Alekhnovich, Sam Buss, Shlomo Moran, and Toniann Pitassi. Minimum propositional proof length is NP-hard to linearly approximate. Journal of Symbolic Logic, 66(1):171–191, 2001. doi:10.2307/2694916.
[2] Michael Alekhnovich and Alexander A. Razborov. Resolution is not automatizable unless W[P] is tractable. SIAM Journal on Computing, 38(4):1347–1363, 2008. Preliminary version in FOCS ’01. doi:10.1137/06066850X.
[3] Noel Arteche, Gaia Carenini, and Matthew Gray. Quantum automating $TC^{0}$ -Frege is LWE-hard. In Proceedings of the 39th Computational Complexity Conference (CCC ’24), volume 300 of Leibniz International Proceedings in Informatics (LIPIcs), pages 15:1–15:25, July 2024. doi:10.4230/LIPIcs.CCC.2024.15.
[4] Albert Atserias and María Luisa Bonet. On the automatizability of resolution and related propositional proof systems. Technical Report TR02-010, Electronic Colloquium on Computational Complexity (ECCC), 2002. URL: https://eccc.weizmann.ac.il/report/2002/010/.
[5] Albert Atserias and María Luisa Bonet. On the automatizability of resolution and related propositional proof systems. Information and Computation, 189(2):182–201, March 2004. Preliminary version in CSL ’02. doi:10.1016/j.ic.2003.10.004.
[6] Albert Atserias and Moritz Müller. Automating resolution is NP-hard. Journal of the ACM, 67(5), 2020. Preliminary version in FOCS ’19. doi:10.1145/3409472.
[7] Paul Beame and Toniann Pitassi. Simplified and improved resolution lower bounds. In Proceedings of the 37th Annual IEEE Symposium on Foundations of Computer Science (FOCS ’96), pages 274–282, October 1996. doi:10.1109/SFCS.1996.548486.
[8] Zoë Bell. Automating regular or ordered resolution is NP-hard. Technical Report TR20-105, Electronic Colloquium on Computational Complexity (ECCC), 2020. URL: https://eccc.weizmann.ac.il/report/2020/105/.
[9] Eli Ben-Sasson and Avi Wigderson. Short proofs are narrow—resolution made simple. Journal of the ACM, 48(2):149–169, 2001. doi:10.1145/375827.375835.
[10] John Adrian Bondy and Uppaluri Siva Ramachandra Murty. Graph theory. Springer, 2008.
[11] María Luisa Bonet, Carlos Domingo, Ricard Gavaldà, Alexis Maciel, and Toniann Pitassi. Non-automatizability of bounded-depth Frege proofs. Computational Complexity, 13(1-2):47–68, December 2004. Preliminary version in CCC ’99. doi:10.1007/s00037-004-0183-5.
[12] María Luisa Bonet, Toniann Pitassi, and Ran Raz. On interpolation and automatization for Frege systems. SIAM Journal on Computing, 29(6):1939–1967, 2000. Preliminary version in FOCS ’97. doi:10.1137/S0097539798353230.
[13] Matthew Clegg, Jeff Edmonds, and Russell Impagliazzo. Using the Groebner basis algorithm to find proofs of unsatisfiability. In Proceedings of the 28th Annual ACM Symposium on Theory of Computing (STOC ’96), pages 174–183, 1996. doi:10.1145/237814.237860.
[14] Martin Davis, George Logemann, and Donald Loveland. A machine program for theorem proving. Communications of the ACM, 5(7):394–397, July 1962. doi:10.1145/368273.368557.
[15] Martin Davis and Hilary Putnam. A computing procedure for quantification theory. Journal of the ACM, 7(3):201–215, 1960. doi:10.1145/321033.321034.
[16] Susanna F. de Rezende. Automating tree-like resolution in time $n^{o(\log n)}$ is ETH-hard. In Proceedings of the 11th Latin and American Algorithms, Graphs and Optimization Symposium (LAGOS ’21), May 2021. doi:10.1016/j.procs.2021.11.021.
[17] Susanna F. de Rezende. Automating tree-like resolution in time $n^{o(\log n)}$ is ETH-hard. Technical Report TR21-033, Electronic Colloquium on Computational Complexity (ECCC), 2021. URL: https://eccc.weizmann.ac.il/report/2021/033/.
[18] Susanna F. de Rezende, Mika Göös, Jakob Nordström, Toniann Pitassi, Robert Robere, and Dmitry Sokolov. Automating algebraic proof systems is NP-hard. In Proceedings of the 53rd Annual ACM Symposium on Theory of Computing (STOC ’21), June 2021. doi:10.1145/3406325.3451080.
[19] Holger Dell, Thore Husfeldt, Dániel Marx, Nina Taslaman, and Martin Wahlén. Exponential time complexity of the permanent and the Tutte polynomial. ACM Trans. Algorithms, 10(4):21:1–21:32, August 2014. doi:10.1145/2635812.
[20] Kord Eickmeyer, Martin Grohe, and Magdalena Grüber. Approximation of natural W[P]-complete minimisation problems is hard. In Proceedings of the 23rd Annual IEEE Conference on Computational Complexity (CCC ’08), pages 8–18, June 2008. doi:10.1109/CCC.2008.24.
[21] Nicola Galesi and Massimo Lauria. On the automatizability of polynomial calculus. Theory of Computing Systems, 47(2):491–506, 2010. doi:10.1007/s00224-009-9195-5.
[22] Michal Garlík. Failure of feasible disjunction property for $k$ -dnf resolution and NP-hardness of automating it. In Proceedings of the 39th Computational Complexity Conference (CCC ’24), volume 300 of Leibniz International Proceedings in Informatics (LIPIcs), pages 33:1–33:23, July 2024. doi:10.4230/LIPIcs.CCC.2024.33.
[23] Mika Göös, Sajin Koroth, Ian Mertz, and Toniann Pitassi. Automating cutting planes is NP-hard. In Proceedings of the 52nd Annual ACM Symposium on Theory of Computing (STOC ’20), pages 68–77, June 2020. doi:10.1145/3357713.3384248.
[24] Russell Impagliazzo and Ramamohan Paturi. On the complexity of $k$ -SAT. Journal of Computer and System Sciences, 62(2):367–375, March 2001. Preliminary version in CCC ’99. doi:10.1006/jcss.2000.1727.
[25] Dmitry Itsykson and Artur Riazanov. Automating OBDD proofs is NP-hard. In Proceedings of the 47th International Symposium on Mathematical Foundations of Computer Science (MFCS ’22), volume 241 of Leibniz International Proceedings in Informatics (LIPIcs), pages 59:1–59:15, August 2022. doi:10.4230/LIPIcs.MFCS.2022.59.
[26] Kazuo Iwama. Complexity of finding short resolution proofs. In Proceedings of the 22nd International Symposium on Mathematical Foundations of Computer Science (MFCS ’97), pages 309–318, 1997. doi:10.1007/BFb0029974.
[27] Jan Krajíček. On the weak pigeonhole principle. Fundamenta Mathematicae, 170(1-3):123–140, 2001. doi:10.4064/fm170-1-8.
[28] Jan Krajíček and Pavel Pudlák. Some consequences of cryptographical conjectures for $S^{1}_{2}$ and EF. Information and Computation, 140(1):82–94, 1998. doi:10.1006/inco.1997.2674.
[29] Jan Krajíček. Lower bounds to the size of constant-depth propositional proofs. The Journal of Symbolic Logic, 59(1):73–86, 1994. doi:10.2307/2275250.
[30] Massimo Lauria. A note about $k$ -DNF resolution. Information Processing Letters, 137:33–39, September 2018. doi:10.1016/j.ipl.2018.04.014.
[31] Ian Mertz, Toniann Pitassi, and Yuanhao Wei. Short proofs are hard to find. In Proceedings of the 46th International Colloquium on Automata, Languages, and Programming (ICALP ’19), pages 84:1–84:16, 2019. doi:10.4230/LIPIcs.ICALP.2019.84.
[32] Ryan O’Donnell. SOS is not obviously automatizable, even approximately. In Proceedings of the 8th Innovations in Theoretical Computer Science Conference (ITCS), volume 67, pages 59:1–59:10. Schloss Dagstuhl, 2017. doi:10.4230/LIPIcs.ITCS.2017.59.
[33] Theodoros Papamakarios. Depth- $d$ Frege systems are not automatable unless P=NP. In Proceedings of the 39th Computational Complexity Conference (CCC ’24), volume 300 of Leibniz International Proceedings in Informatics (LIPIcs), pages 22:1–22:17, July 2024. doi:10.4230/LIPIcs.CCC.2024.22.
[34] Pavel Pudlák. On reducibility and symmetry of disjoint NP pairs. Theoretical Computer Science, 295:323–339, February 2003. doi:10.1016/S0304-3975(02)00411-5.
[35] Prasad Raghavendra and Benjamin Weitz. On the bit complexity of sum-of-squares proofs. In Proceedings of the 44th International Colloquium on Automata, Languages, and Programming (ICALP), pages 80:1–80:13, 2017. doi:10.4230/LIPIcs.ICALP.2017.80.
[36] Alexander A Razborov. On provably disjoint NP-pairs. Aarhus Universitet. Basic Research in Computer Science [BRICS], November 1994. doi:10.7146/brics.v1i36.21607.
[37] John Alan Robinson. The generalized resolution principle. In Automation of Reasoning: 2: Classical Papers on Computational Logic 1967–1970, pages 135–151. Springer, 1968. doi:10.1007/978-3-642-81955-1_9.
[38] Nathan Segerlind, Samuel R. Buss, and Russell Impagliazzo. A switching lemma for small restrictions and lower bounds for $k$ -DNF resolution. SIAM Journal on Computing, 33(5):1171–1200, 2004. Preliminary version in FOCS ’02. doi:10.1109/SFCS.2002.1181984.
[39] Michael Sipser. The history and status of the P versus NP question. In Proceedings of the 24th Annual ACM Symposium on Theory of Computing (STOC ’92), 1992. doi:10.1145/129712.129771.


$$(\textsf{enbl}^{P}\wedge\textsf{fun}^{P}_{i})\rightarrow\textsf{enbl}^{H} \tag{9}$$
$$(\textsf{enbl}^{P}\wedge\textsf{fun}^{P}_{i})\rightarrow(\textsf{lit}_{\ell}^{H}\leftrightarrow\textsf{lit}_{\ell}^{P}) \qquad \forall\ell\in\{x_{1},\overline{x}_{1},\dots,x_{n},\overline{x}_{n}\}\,. \tag{10}$$

$$\textsf{enbl}^{A_{\tau}[i,j]}\leftrightarrow\textsf{enbl}^{{\widetilde{H}}_{\tau}[i,j]} \tag{21}$$
$$\textsf{lit}^{A_{\tau}[i,j]}_{\ell}\leftrightarrow\textsf{lit}^{{\widetilde{H}}_{\tau}[i,j]}_{\ell} \qquad \forall\ell\in\{x_{1},\overline{x}_{1},\dots,x_{n},\overline{x}_{n}\}\,. \tag{22}$$

$$\textsf{enbl}^{{\widetilde{H}}_{\tau}[i,j]}\leftrightarrow\textsf{enbl}^{P_{jc+\tau}[i]} \tag{23}$$
$$\textsf{lit}^{{\widetilde{H}}_{\tau}[i,j]}_{\ell}\leftrightarrow\textsf{lit}^{P_{jc+\tau}[i]}_{\ell} \qquad \forall\ell\in\{x_{1},\overline{x}_{1},\dots,x_{n},\overline{x}_{n}\}\,. \tag{24}$$

$$\textsf{enbl}^{H_{j}[i]}\leftrightarrow\textsf{enbl}^{D_{s+1}[i+1,j]} \tag{25}$$
$$\textsf{lit}^{H_{j}[i]}_{\ell}\leftrightarrow\textsf{lit}^{D_{s+1}[i+1,j]}_{\ell} \qquad \forall\ell\in\{x_{1},\overline{x}_{1},\dots,x_{n},\overline{x}_{n}\}\,. \tag{26}$$