Abstract 1 Introduction 2 Preliminaries 3 Randomized Proof Complexity and Explicit Separations 4 Nephew 5 Open Problems References

Total Search Problems in ZPP

Noah Fleming ORCID Lund University, Sweden
Columbia University, New York, NY, USA
Stefan Grosser ORCID McGill University, Montreal, Canada Siddhartha Jain ORCID UT Austin, TX, USA Jiawei Li ORCID UT Austin, TX, USA Hanlin Ren ORCID Institute for Advanced Study, Princeton, NJ, USA Morgan Shirley ORCID Lund University, Sweden Weiqiang Yuan ORCID EPFL, Lausanne, Switzerland
Abstract

We initiate a systematic study of TFZPP, the class of total NP search problems solvable by polynomial time randomized algorithms. TFZPP contains a variety of important search problems such as Bertrand-Chebyshev (finding a prime between N and 2N), refuter problems for many circuit lower bounds, and Lossy-Code. The Lossy-Code problem has found prominence due to its fundamental connections to derandomization, catalytic computing, and the metamathematics of complexity theory, among other areas.

While TFZPP collapses to FP under standard derandomization assumptions in the white-box setting, we are able to separate TFZPP from the major TFNP subclasses in the black-box setting. In fact, we are able to separate it from every uniform TFNP class assuming that NP is not in quasi-polynomial time. To do so, we extend the connection between proof complexity and black-box TFNP to randomized proof systems and randomized reductions.

Next, we turn to developing a taxonomy of TFZPP problems. We highlight a problem called Nephew, originating from an infinity axiom in set theory. We show that Nephew is in 𝖯𝖶𝖯𝖯𝖳𝖥𝖹𝖯𝖯 and conjecture that it is not reducible to Lossy-Code. Intriguingly, except for some artificial examples, most other black-box TFZPP problems that we are aware of reduce to Lossy-Code:

  • We define a problem called Empty-Child capturing finding a leaf in a rooted (binary) tree, and show that this problem is equivalent to Lossy-Code. We also show that a variant of Empty-Child with “heights” is complete for the intersection of SOPL and Lossy-Code.

  • We strengthen Lossy-Code with several combinatorial inequalities such as the AM-GM inequality. Somewhat surprisingly, we show the resulting new problems are still reducible to Lossy-Code. A technical highlight of this result is that they are proved by formalizations in bounded arithmetic, specifically in Jeřábek’s theory 𝖠𝖯𝖢1 (JSL 2007).

  • Finally, we show that the Dense-Linear-Ordering problem reduces to Lossy-Code.

Keywords and phrases:
TFNP, lossy code, randomized proof systems, query complexity
Funding:
Noah Fleming: Supported by an NSERC Discovery grant and the Swedish Research Council under grant number 2025-06762.
Stefan Grosser: Supported by the NSERC CGS D fellowship.
Siddhartha Jain: Supported by Scott Aaronson’s Berkeley CIQC grant and an Amazon AI Fellowship.
Jiawei Li: Supported by Scott Aaronson’s Open Philanthropy grant.
Morgan Shirley: Supported by NSERC and by Knut and Alice Wallenberg grant KAW 2023.0116.
Weiqiang Yuan: Supported by the Swiss State Secretariat for Education, Research and Innovation (SERI) under contract number MB22.00026.
Copyright and License:
[Uncaptioned image] © Noah Fleming, Stefan Grosser, Siddhartha Jain, Jiawei Li, Hanlin Ren, Morgan Shirley, and Weiqiang Yuan; licensed under Creative Commons License CC-BY 4.0
2012 ACM Subject Classification:
Theory of computation Oracles and decision trees
; Theory of computation Proof complexity
Related Version:
Full Version: https://eccc.weizmann.ac.il/report/2025/204/
Acknowledgements:
We thank Robert Robere, Yuhao Li, and Ben Davis for extensive discussions about the Nephew problem and TFZPP. As well, we thank the reviewers for suggestions which improved the presentation of this paper.
Editor:
Shubhangi Saraf

1 Introduction

Total search problems are abundant in theoretical computer science. The formal study of these problems has been highly impactful to a wide range of areas including game theory [24, 19], cryptography [43, 30, 6], proof complexity [5, 25, 62, 38, 81, 35, 61, 28], and recently in the study of explicit construction problems and derandomization [51, 53, 55]. Central to the latter has been the Range Avoidance (or simply Avoid) problem.

Avoid.

Given a circuit D:{0,1}n1{0,1}n, find x{0,1}n such that for every y{0,1}n1, D(y)x.

Avoid captures the explicit construction problems for many combinatorial objects whose existence follows from the probabilistic method. Notable examples include functions with high circuit complexity, rigid matrices, Ramsey graphs, strong error correcting codes, and many more [53, 47, 37, 31]. By developing algorithms for Avoid, a line of work has shown circuit lower bounds against a variety of classes [77, 20, 14, 63].

Avoid belongs to the class 𝖳𝖥Σ2P, the second level of the total function polynomial hierarchy. If one Herbrandizes111Herbrandization is a basic construction in logic; see Appendix A in the full version for a short overview. the Avoid problem, then one obtains its TFNP sibling, the Lossy-Code problem (see [55] for a survey). This problem asks to find an element that is not in the range of a pair of compressing and decompressing maps C and D.

Lossy-Code.

Given a pair of circuits C:{0,1}n{0,1}n1 and D:{0,1}n1{0,1}n, find x{0,1}n such that D(C(x))x.

Lossy-Code was originally defined by Jeřábek in [47, 48], under the name retraction weak pigeonhole principle, showing that it is equivalent to the set of problems whose totality is provable in 𝖠𝖯𝖢1. Since then, it has been considered predominantly through the lens of bounded arithmetic as a TFNP problem and as a combinatorial principle [66, 52]. Korten [54] asked to understand the set of TFNP problems that are reducible to Lossy-Code. Besides being an interesting problem on its own, Lossy-Code also arises naturally in a few other places, further motivating its study:

  • Derandomization. In the recent certified derandomization framework [75], the derandomization algorithm is required to either output the correct answer, or report that the underlying circuit lower bound assumption is false by providing a small circuit violating the assumption. It turns out that certified derandomization is characterized by Lossy-Code [60].

    Such derandomization ideas are particularly explored in the context of catalytic computing [9, 65] in a framework known as “compress-or-random” [74, 23, 57, 1]: If the contents of the catalytic tape is “incompressible” (which usually means that it is not a solution of a certain Lossy-Code instance), then it can be used for derandomization; otherwise we can compress the catalytic tape and obtain more free space. As a result, although we are currently unable to prove that 𝖢𝖫 (catalytic logspace) is in P, we can show that 𝖢𝖫 reduces to Lossy-Code [23].

  • Metamathematics of complexity theory. It turns out that (variants of) Lossy-Code captures the complexity of many refuter problems [15], which are natural total search problems reflecting the metamathematical complexity of proving lower bounds. Many lower bounds in circuit complexity and communication complexity have refuter problems equivalent to Lossy-Code [54, 16], and the refuter complexity for some proof complexity lower bounds is captured by variants of Lossy-Code as well [61].

  • Bounded arithmetic. A basic theory of bounded arithmetic for approximate counting and reasoning about randomized computation is 𝖠𝖯𝖢1, developed in a series of papers by Jeřábek [45, 46, 47]. Wilkie’s witnessing theorem [80, 45] implies that Lossy-Code is “complete” for 𝖠𝖯𝖢1 in the following sense: 𝖠𝖯𝖢1 proves the totality of Lossy-Code, and every TFNP problem provably total in 𝖠𝖯𝖢1 reduces to Lossy-Code.

Lossy-Code belongs to the class TFZPP, the subclass of TFNP containing the total search problems that admit polynomial-time randomized algorithms, introduced in [10]. Since we are dealing with total NP search problems, every randomized algorithm that may make mistakes can be turned into one that does not make any mistakes.222This was observed by Jeřábek [49]; in his terminology, we have TFRP=TFZPP. Hence, it seems that TFZPP is the only natural (semantic) subclass of TFNP capturing randomized polynomial time.

Besides Lossy-Code, there are a variety of important total search problems that sit inside TFZPP. We list two of them that we think reflect the importance of TFZPP the best:

Example 1.

The Bertrand–Chebyshev theorem states that for every integer N1, there is a prime number between N and 2N. This motivates the following total search problem called Bertrand-Chebyshev: Given an integer N (represented in binary), output a prime number between N and 2N. In fact, the Prime Number Theorem implies that there are Θ(N/logN) such prime numbers, and the AKS primality test [2] provides a deterministic method for verifying solutions, hence Bertrand-Chebyshev is in TFZPP.

Example 2.

A family of important total search problems is refuter problems [17, 15, 18] for complexity lower bounds. Let 𝒞 be a circuit class and L be a hard problem for 𝒞, the refuter problem, Refuter(L𝒞), is the following total search problem: given a small 𝒞 circuit C attempting to compute L, the goal is to output an instance x such that C(x)L(x). The complexity of these refuter problems are closely related to the provability of complexity lower bounds [16, 61].

Such refuter problems are often in TFZPP: In fact, if L is average-case hard against 𝒞 (and both 𝒞 and L are in polynomial-time), then Refuter(L𝒞)TFZPP as the algorithm for the refuter problem can repeatedly sample inputs from the hard distribution until it finds a solution x where C(x)L(x). Even though average-case lower bounds against AC0[p] circuits have been proved for nearly 40 years [76, 79], we are not aware of any non-trivial TFNP upper bound for the problem Refuter(MAJAC0[2]).

Finally, an additional motivation for studying TFZPP is its connection to Avoid and APEPP (the class of total search problems mapping reducible to Avoid [51]): it is the “projection” of Avoid to TFNP in the following sense:

Theorem 3.

TFZPP=TFNPAPEPP.

Our Contributions

In this work, we initiate a formal study of TFZPP as a class of total search problems. Analogous to the setting of decision problems, we expect that TFZPP=FP. Indeed, this follows from the same assumption used in [44] – namely that E requires circuits of exponential size. Moreover, TFZPP=FP appears to be weaker than a full derandomization of BPP.

However, we show that this is not the case in the black-box setting in a very strong sense. In the black-box setting one only has access to the input via an oracle; black-box classes are denoted by a dt superscript (for “decision trees”). We say that a TFNPdt class is uniformly generated if it has a complete problem R={Rn}n such that there is a Turing Machine which on input 1n outputs Rn in polynomial time. Note that all of the major TFNPdt subclasses in the literature are uniformly generated. Under the assumption that NP is not in quasi-polynomial time (𝖰𝖯), we show that no uniformly generated TFNPdt class contains TFZPPdt.

Theorem 4.

TFZPPdt𝒞 for every uniformly generated class 𝒞TFNPdt, unless NP𝖰𝖯.

To prove these separations, we employ a close connection between total search problems and proof complexity [5, 35, 11, 28]. This connection shows that, in the black-box setting, a search problem belongs to a class if and only if an associated proof system can prove the totality of that search problem. In this case, we say that the class is characterized by that proof system. To prove our separations, we first show that TFZPPdt is characterized by the random tree-like resolution proof systems of Buss et al. [13].

Theorem 5.

TFZPPdt is characterized by random tree-like resolution.

More generally, we show that if a class 𝒞 of total search problems is characterized by a proof system Π, then the class of problems that are efficiently randomized reducible to a complete problem in 𝒞 is characterized by the proof system random Π. Theorem 4 then follows by combining the following two results: (1) Buss et al. [11] showed that every uniformly generated TFNP class has a characterizing proof system, and (2) Pudlák and Thapen [73] showed that a propositional proof system simulating random tree-like resolution would imply faster algorithms for NP.

A Highly Unsatisfiable Cook-Reckhow Program

Theorem 4 is striking as it suggests that TFZPP problems can be arbitrarily hard in the black-box model. This motivates an interesting direction of research: find explicit TFZPP problems that are hard for stronger and stronger subclasses of TFNPdt. By the close connection between TFNPdt and proof complexity [11], this can be seen as a Cook-Reckhow program for highly unsatisfiable formulas: for increasingly more expressive proof systems, exhibit a highly unsatisfiable CNF formula which is hard for that system.

Towards this program, we provide explicit separations of TFZPPdt from every major TFNP class defined in the 1990s [50, 68]. Note that all those TFNP classes are contained in PLS, PPP, and PPA. The separation from PPA was shown by Beame et al. [5]. Leveraging the recent work of Hopkins and Lin [41], we are able to show the following.

Theorem 6.

There are explicit (polynomial-time constructable) problems in TFZPPdt which are not in either PPPdt or PLSdt.

A TFZPP Zoo

We now turn to studying the structure of problems inside of TFZPP. Figure 1 shows the zoo of problems within TFZPP that we consider, as well as their relationships.

Like ZPP, TFZPP is a semantic class and therefore it is unlikely to admit complete problems unless FP=TFZPP. However, we observe an interesting phenomenon: almost every TFZPP problem that has been studied in the literature is reducible to Lossy-Code!333One exception is the Bertrand–Chebyshev problem for which it is not known whether it is reducible to Lossy-Code. However, it is unclear how to define the Bertrand–Chebyshev problem in the black-box model, and we are unable to separate any natural black-box TFZPP problem from Lossy-Code. This raises the question: are there “natural” TFZPP problems which are not reducible Lossy-Code in the black-box setting, and what do they look like? Theorem 4 and Theorem 6 already provide examples of problems that are not reducible to Lossy-Code; however, we do not consider these problems natural – we are looking for problems that would be studied outside of the context of proving such separations.

While we are unable to resolve this question – indeed, many of our conjectured separating examples turned out to be reducible to Lossy-Code in surprising ways! – we provide natural TFZPP problems which we conjecture witness a separation, and which we believe are of independent interest. As well, we show several surprising reductions to Lossy-Code.

Figure 1: The TFZPP zoo.
Nephew

Our primary candidate is the following.

Nephew.

Given a set V of vertices and two functions f:VV and g:VV. Think of f(v) as the father of v and g(v) as the nephew of v. A solution is one of the following.

s1.

vV such that f(f(g(v)))f(v) (your nephew’s grandparent is not your parent)

s2.

vV such that f(g(v))=v (you are your nephew’s parent)

It may not be immediately obvious that Nephew is a total search problem. A proof may be found in the textbook of Börger, Grädel, and Gurevich [8, Proposition 6.5.5]; we sketch the argument here. Create a directed graph Gf with vertex set V and with an edge from u to v if f(u)=v. Then one can assign to each vertex a “level” that represents its distance to the core444We intentionally leave “core” undefined for this brief sketch. of Gf. Let v be a vertex with maximum level max. However, if v is not a solution to Nephew, it must be that g(v) has level max+1, a contradiction. A similar intuition will be used in Section 4 for other proofs involving Nephew.

Nephew is derived from the minimal axioms of infinity in model theory. One method for constructing a total search problem is to begin with a sentence in logic that has an infinite model but no finite model. Such a sentence is known as an axiom of infinity, since any model for it must be infinite. Axioms of infinity are classified under the number of quantifiers and predicate and function symbols of certain arity that they contain, and there are 10 minimal classes ([8, Theorem 6.5.4]). Each of these classes corresponds to a total search problem, and in most cases, this problem is complete for a well-known TFNP class. The only exception is the one to which Nephew belongs; Nephew can be interpreted as the Herbrandization of

xy(F(F(y))=F(x)F(y)x). (1)

The proof that Nephew belongs to TFZPP is highly non-trivial. Furthermore, our best upper bound on the complexity of Nephew is that it is contained within PWPP, the problems reducible to the weak pigeonhole principle, a relaxation of the Lossy-Code problem.

Theorem 7.

NephewTFZPPPWPP.

TFZPP Problems Studied in the Full Version

We define a number of natural problems in TFZPP which may be of independent interest. These problems are studied in detail in the full version of this paper.

Empty Child

The proof that Nephew is in TFZPP proceeds by reducing it to the observation that a leaf in a binary tree can be found in logarithmic time in expectation. We define a total search problem whose membership in TFZPP formalizes this observation.

Empty-Child.

Given a set V of vertices and three functions F,L,R:VV, where F(u) is the father of u, and L(u),R(u) are the left and the right child of u respectively, a solution is one of the following.

s1.

uV such that F(L(u))u or F(R(u))u or L(u)=R(u)u; (Empty child)

s2.

1, if L(1)=1 or R(1)=1 or F(1)1. (Wrong root)

Surprisingly, Empty-Child is equivalent to Lossy-Code under decision tree reductions, denoted =dt. Thus, if Nephew is indeed not reducible to Lossy-Code, the hardness of Nephew does not come from this portion of the reduction.

Theorem 8.

Empty-Child=dtLossy-Code.

As a warm-up to the techniques needed to prove this theorem, we consider a variant of Empty-Child which includes an additional “height” function that outputs the height of a given node in the tree. We show that this is a complete problem for the class LOSSYPLS=LOSSYSOPL, where LOSSY is the class of problems efficiently reducible to Lossy-Code. The proof resembles previous intersection theorems from TFNP [26, 34].

Then, we use Empty-Child as an intermediate problem to study the relationship between Nephew and Lossy-Code.

Theorem 9.

Empty-ChilddtNephew.

Thus, combining with Theorem 8, we have Lossy-Code reduces to Nephew.

AM-GM Lossy-Code

One of our original (and failed) candidates for separation from Lossy-Code was a problem called AM-GM Lossy-Code, obtained by combining Lossy-Code itself with the AM-GM Inequality: a+b2ab. This problem was inspired by the Bad 2-Coloring problem in [70]: Given an undirected graph G=(V,E) with |V|=2N vertices and |E|=N2+1 edges along with a 2-coloring C:V{0,1} of V, find an edge (x,y)E that is not colored properly. This problem is total exactly because of the AM-GM inequality: suppose there are a black vertices and b=2Na white vertices and every edge is colored properly, then there are at most ab(a+b2)2=N2 edges, contradicting |E|>N2.

To compose this problem with Lossy-Code, we need to make two adaptations: First, to put it inside TFZPP, the number of edges needs to be much larger than N2, say (1+ε)N2; second, we are given a function F from [(1+ε)N2] to the set of properly colored edges and its purported inverse G and we need to find x[(1+ε)N2] such that G(F(x))x. We arrive at the following problem:

𝒄-AMGM-LC.

Let c>1 be a constant, V:=[2N] and P:=[cN2]. The input is a coloring function C:V{0,1} and two mappings F:PV×V, G:V×VP. Let H:=C1(0)×C1(1). The goal is to find solutions of either type:

s1.

a pigeon xP such that G(F(x))x; (Wrong Encoding-Decoding)

s2.

a pigeon xP such that F(x)H; (Invalid Hole)

Indeed, it seems unclear how to massage a Lossy-Code instance of the shape [cN2]H into a standard Lossy-Code instance of the form [2M][M], even though the AM-GM inequality implies that cN2|H|. However, it turns out that such a reduction is possible (although highly non-trivial)! We encourage the reader to take a moment to think about how to reduce AMGM-LC to Lossy-Code.

Theorem 10.

For every constant c>1, c-AMGM-LC dt Lossy-Code.

Our reduction uses bounded arithmetic: we formalize the totality of c-AMGM-LC in Jeřábek’s theory 𝖠𝖯𝖢1 [47], and Wilkie’s witnessing theorem for 𝖠𝖯𝖢1 [80, 45] implies a reduction from c-AMGM-LC to Lossy-Code. In particular, like every formalization in 𝖠𝖯𝖢1, our reduction makes use of the Nisan–Wigderson generator [67].

Moreover, the techniques underlying 𝖠𝖯𝖢1 allow us to reduce problems to Lossy-Code in a systematic way; we provide two additional examples (c-Dual-AMGM-LC and a problem capturing the Inclusion-Exclusion principle). A secondary goal of expounding these reductions is to introduce the ideas of 𝖠𝖯𝖢1 to audiences who are less familiar with bounded arithmetic.

 Remark 11.

Unfortunately, it seems unclear how to formalize the totality of Nephew in 𝖠𝖯𝖢1, hence the bounded arithmetic approach does not seem to provide a reduction from Nephew to Lossy-Code. For example, our proof that NephewTFZPP requires reasoning about the level of each node, which seems to be global reasoning that is infeasible in 𝖠𝖯𝖢1.

Linear Ordering

Finally, we consider one more natural problem in TFZPP. The Linear Ordering Principle has a storied history in proof complexity [59, 7, 71] and bounded arithmetic [21, 39, 13, 4]. It has been studied in the context of total search problems as well [56, 40], where it was used in order to construct new algorithms for Avoid. A line of works in proof complexity [78, 3, 36, 22] has also considered a dense variant of this problem defined as follows, which lies in TFZPP.

Dense-Linear-Ordering.

The input consists of the descriptions of a linear ordering over N elements and a median function 𝗆𝖾𝖽:[N]×[N][N]. Without loss of generality, we may assume that for xy[N], exactly one of (xy) and (yx) is true, and that 𝗆𝖾𝖽(x,y)=𝗆𝖾𝖽(y,x). (That is, is represented by a string of (N2) bits and 𝗆𝖾𝖽 is represented by a list of (n2) elements in [N].) A solution is one of the following.

s1.

x,y,z[N] such that xy, yz, and zx; or (Transitivity violation)

s2.

x,y[N] such that xy, but neither x𝗆𝖾𝖽(x,y) nor 𝗆𝖾𝖽(x,y)y.

(Invalid median)

While Avoid reduces to the Linear Ordering Principle [56], we show a converse in the dense setting.

Theorem 12.

Dense-Linear-OrderingdtLossy-Code.

2 Preliminaries

2.1 Basics of TFNP

TFNP contains all search problems which are (i) total: a solution is guaranteed to exist, and (ii) in NP: there is an efficient procedure to check whether a candidate solution is valid. It is believed that TFNP does not admit complete problems [72] and much of the research in this area has focused on studying syntactic subclasses (those with complete problems) which capture many of the total search problems of interest. These classes are typically defined by simple existence principles that capture the totality of the problems within that class. These naturally give rise to total search problems. For example, PWPP is the class of all search problems whose totality is witnessed by the existence principle: any map from 2N to N must have a collision [49]. To make these problems non-trivial, the input is presented succinctly as a circuit C that on input i outputs the i-th bit of the search problem. For example, the existence principle for PWPP gives rise to the following (white-box) total search problem.

Weak-Pigeon.

Given P:{0,1}n{0,1}n1, a solution is xy such that P(x)=P(y).

PWPP is then the class of all total search problems which are efficiently reducible to Weak-Pigeon.

A major thrust of this line of work is to understand the relationships between these classes. However, a separation between classes would imply PNP. As a proxy, and as natural objects in their own right, researchers have studied total search problems in the black-box model. In this setting, the input C is given as a black box which can be queried, but we no longer have access to the description of C.

In this setting a (query) search problem is a sequence of relations Rn{0,1}n×𝒪n, for each n. It is total if for every x{0,1}n there is o𝒪n such that (x,o)Rn. We think of the input x{0,1}n as being accessed by querying the individual bits, and we will measure the complexity of solving Rn as the number of bits that must be queried to determine some suitable o𝒪n. An efficient algorithm is one that makes at most poly(logn)-many queries555As the input is succinctly encoded, this corresponds to looking at a polynomial part of the entire input.; these problems belong to the class FPdt, where dt indicates that it is the black-box version of the class. Similarly, RTFNPdt if for every n and each o𝒪n there exists a poly(logn)-depth decision tree To:{0,1}n{0,1} such that To(x)=1 iff (x,o)Rn. While search problems are formally defined as a sequence R=(Rn)n, we will often want to speak about individual members of this sequence. For readability, we will abuse notation and refer to elements Rn in the sequence as total search problems. Furthermore, we will often drop the subscript n and rely on context to differentiate.

We compare the complexity of total search problems by reductions between them; the following is the black-box (decision tree) analogue of a deterministic polynomial-time reduction between search problems.

Definition 13.

For total search problems R{0,1}n×𝒪n and S{0,1}m×𝒪m, there is an S-formulation of R if for every i[m] and o𝒪m there are functions fi:{0,1}n{0,1} and go:{0,1}n𝒪n such that

(f(x),o)S)(x,go(x))R, (2)

where f(x):=(f1(x),,fm(x)). The depth of the S-formulation is

d:=max({𝖽𝖾𝗉𝗍𝗁(fi):i[m]}{𝖽𝖾𝗉𝗍𝗁(go):o𝒪m}),

where 𝖽𝖾𝗉𝗍𝗁(f) denotes the minimum depth of any decision tree which computes f. The size of the S-formulation is m, the number of input bits to S. The complexity of an S-formulation is logm+d and the complexity of reducing R to S is the minimum complexity of any S-formulation of R.

This definition extends to sequences naturally. If S=(Sn) is a sequence and Rn is a single search problem, then the complexity of reducing Rn to S is the minimum over m of the complexity of reducing Rn to Sm. For two sequences S=(Sn) and R=(Rn), the complexity of reducing R to S is the complexity of reducing Rn to S for each n. We say that a reduction from R to S is efficient if its complexity is poly(log(n)) and denote this by RdtS.

2.2 TFZPP

In this work, we will be particularly interested in the total search problems which are solvable in randomized polynomial time. Formally, R{0,1}×{0,1}TFZPP if there is a distribution 𝒟 over polynomial-time Turing Machines A with range {0,1,}, such that PrA𝒟[A(x)=]1/3 and

TFZPP is defined semantically and it is unlikely to have complete problems. However, we show that it is exactly the TFNP problems in the 𝖳𝖥Σ2P class 𝖠𝖯𝖤𝖯𝖯, where 𝖠𝖯𝖤𝖯𝖯 is the class of total search problems that are reducible to Avoid, as defined in [51].

Theorem 3. [Restated, see original statement.]

TFZPP=TFNPAPEPP.

Proof.

Let LTFNP𝖠𝖯𝖤𝖯𝖯. This means that given an instance x of L, there are deterministic polynomial-time algorithms V, C, and R, such that:

  • V is a TFNP verifier for L. For every string z of length polynomial in |x|, V(x,z)=1 if and only if z is a valid solution for x.

  • (C,R) is a reduction from x to Avoid. The output of C(x) is a circuit Cx mapping input bits to +1 output bits, where poly(|x|); given any y{0,1}+1Range(Cx), R(x,y) outputs a valid solution for x.

Then we can solve L in TFZPP via the following procedure. Guess y{0,1}+1 uniformly at random and compute z:=R(x,y). If V(x,z) accepts, then we output z; otherwise, we output . By the correctness of V, if we did not output , then our output is a valid solution of x. On the other hand, since at least a 1/2 fraction of strings y{0,1}+1 are valid outputs of Avoid on the instance Cx, by the correctness of (R,C), we will output a valid solution w.p. at least 1/2.

Now we prove the converse direction. If LTFZPP then clearly LTFNP; hence we only need to show that there is a mapping reduction from L to Avoid. Let U(x,r) be the zero-error randomized algorithm for L, i.e., U(x,r) outputs a valid solution for x w.p. at least 1/2 over its randomness r, and it outputs whenever it fails to output a valid solution.

By standard results in derandomization [67, 44, 82], there exist absolute constants c,d1 and a deterministic polynomial-time algorithm 𝖯𝖱𝖦 such that the following holds. For every truth table f of length s10c, if the circuit complexity of f is at least sc, then 𝖯𝖱𝖦(f) outputs a list of sd strings that (1/s2)-fools every size-s2 circuit. That is, for every circuit C:{0,1}s{0,1} of size at most s2,

|Prx{0,1}s[C(x)=1]Prx𝖯𝖱𝖦(f)[C(x)=1]|1/s2.

Now, let spoly(|x|) be the circuit complexity of U. Consider the truth table generator 𝖳𝖳:{0,1}O(sclogs){0,1}s10c that takes the description of a size-sc circuit C:{0,1}10clogs{0,1} as input and outputs the length-s10c truth table of C. We treat 𝖳𝖳 as an instance for Avoid and reduce x to 𝖳𝖳.666An unusual aspect of this reduction is that 𝖳𝖳 does not depend on x!

It remains to show how to solve the instance x deterministically given a non-output of 𝖳𝖳. Note that if f{0,1}s10c is a non-output of 𝖳𝖳, then the circuit complexity of f is at least sc, hence 𝖯𝖱𝖦(f) outputs a list of poly(s) strings that (1/s2)-fools every size-s2 circuits. This implies that

Prr𝖯𝖱𝖦(f)[U(x,r)]Prr{0,1}s[U(x,r)]1/s21/21/s2>0,

and in particular, there exists at least one string r𝖯𝖱𝖦(f) such that U(x,r). We can solve the instance x by cycling through every r𝖯𝖱𝖦(f) and outputting U(x,r) whenever we encounter such a good r.

Note that this equivalence holds even in the black-box model, since its proof is relativizing.

Definition 14 (TFZPP).

A total NP search problem R{0,1}×{0,1} is in TFZPP if there is a distribution 𝒟 over polynomial-time algorithms A with output in {0,1,} such that:

  1. 1.

    For every x{0,1} and every A𝒟, if A(x) then (x,A(x))R,

  2. 2.

    For every x{0,1},

    PrA𝒟[A(x)=]1/3.

Similarly, RTFZPPdt if there is a family of distributions 𝒟={𝒟n}n over polylog(n)-depth decision trees with leaves labeled in {0,1,}, where on input x we sample a decision tree A𝒟|x|, and 𝒟 satisfies (1) and (2).

2.3 Lossy-Code

As mentioned in the introduction, Lossy-Code is the Herbrandization of Avoid. Let N<M be two parameters (think of NM), (the black-box version of) Lossy-Code is the following problem:

Lossy-CodeN→M.

Given query access to a pair of functions f:[N][M] and g:[M][N], find x[M] such that f(g(x))x.

We need the following basic fact about Lossy-Code that roughly states that the “stretch function” of Lossy-Code does not influence its complexity as long as it is in the “weak” regime. This fact and similar statements for other variants of the weak pigeonhole principle have been very useful in bounded arithmetic [69, 80, 58, 45, 47, 16], total search problems [53, 54, 61], and cryptography [33, 64].

Lemma 15.

Let ε>0 and M>(1+ε)N. There is a decision tree reduction of complexity O(ε1log(M/N)) from Lossy-CodeN(1+ε)N to Lossy-CodeNM.

By Lemma 15, Lossy-CodeN1.01N, Lossy-CodeN2N, and Lossy-CodeNN100 are equivalent up to decision tree reductions of polylog(N) depth. In this paper, unless otherwise stated, Lossy-Code always stands for Lossy-CodeN2N.

We denote LOSSY as the class of total search problems reducible to Lossy-Code.777Previous literature [60, 23] defined LOSSY as the class of decision problems reducible to Lossy-Code. In our context, it is more natural to define LOSSY as a class of total search problems. It follows from Lemma 15 that LOSSY is robust in the sense that it does not matter whether it is defined using Lossy-CodeN1.01N or Lossy-CodeNN100 as the complete problem. In fact, LOSSY is extremely robust: it is closed under Turing reductions (FPLOSSY=LOSSY [12, 60]) and it is self-low (LOSSYLOSSY=LOSSY [32]).

3 Randomized Proof Complexity and Explicit Separations

We begin by describing the connection between black-box TFZPP and proof complexity, and how this can be leveraged to obtain explicit separations from other natural classes. Proof complexity is concerned with the efficient provability of propositional theorems (unsatisfiable CNF formulas) in various proof systems – simply a verifier for the language UNSAT of unsatisfiable CNF formulas.

Definition 16.

A propositional proof system is a polynomial-time machine 𝒫 such that for every CNF formula F, FUNSAT iff there exists a proof Π{0,1} such that 𝒫(F,Π)=1. We say that Π is a 𝒫-proof of F and define the size of Π to be s(Π):=|Π|.

When studying connections between proof systems and TFNP classes, it is standard to also consider an associated notion of the width of a proof w(Π). This is typically specific to the proof system – for example, in resolution (defined next), it is the maximum number of literals in a clause in Π, while for algebraic systems such as Sum-of-Squares, the width is the degree of the polynomials occurring in the proof.

With a definition of width, the complexity of proving F in 𝒫 is

𝒫(F)min𝒫-proof Π of Fw(Π)+logs(Π).

A standard example is the resolution proof system. A resolution proof of an unsatisfiable CNF formula F consists of a sequence of clauses C1,,Ct= ending with the empty clause which contains no literals, such that each clause Ci either belongs to F or is derived from earlier clauses in the sequence according to the resolution rule.

Resolution rule.

From two clauses with complementary literals Ax and Bx¯, derive AB.

The size of a resolution proof is the number of clauses that it contains, while the width is the maximum number of literals within any clause in the proof. A resolution proof C1,,Ct is tree-like if each Ci is used at most once as a premise for the resolution proof. They are named as such because the implication graph of such proofs is a tree.

There is a long line of work connecting proof complexity and black-box TFNP [5, 35, 38, 11, 28, 42, 81, 25, 62, 27]. These connections show that a total search problem is contained within a class iff an associated proof system can prove the totality of that search problem. We can phrase the totality of any total search problem R{0,1}n×𝒪 as an unsatisfiable CNF formula in the following way: for each o𝒪 let Vo be a decision tree which checks whether o is a solution; that is, Vo(x)=1 iff (x,o)R. A root-to-leaf path in Vo is a 1-path if its leaf is labeled 1. We will associate with any path p the conjunction of literals that it follows. Then the totality of R is expressed as

FR¬(o𝒪1-path pVop).

If RTFNPdt then Vo can be assumed to have depth polylog(n), and hence the width of FR is also polylog(n).

Similarly, we can associate with any unsatisfiable CNF formula F=C1Cm a total search problem SearchF{0,1}n×[m] such that (x,o)SearchF iff Co(x)=0. Observe that whenever F has poly(log(n)) width then SearchFTFNPdt and furthermore that SearchFR is reducible to R by decision trees of depth at most the width of FR.

For a syntactic class 𝒞TFNPdt we will denote by 𝒞(R) the complexity of reducing R to S, where S is any complete problem for 𝒞. We say that a proof system 𝒫 is characterized by a class 𝒞TFNPdt if R𝒞 iff 𝒫(F)=poly(𝒞(R)). A standard example is that FPdt characterizes tree-like resolution. Said differently, decision trees are equivalent to tree-like resolution proofs.

We extend these characterizations to capture randomized reductions. We show that randomized reductions between total search problems give rise to proofs in randomized proof systems, a notion introduced by Buss, Kołodziejczyk, and Thapen [13].

Definition 17.

Let 𝒫 be any propositional proof system. A randomized 𝒫-proof, denoted r𝒫, of an unsatisfiable formula F is a distribution 𝒟 supported on pairs (Π,B), such that

  1. 1.

    Each B is a CNF formula over the variables of F,

  2. 2.

    Π is a 𝒫 proof of FB,

  3. 3.

    For any assignment x{0,1}n, Pr(Π,B)𝒟[B(x)=1]2/3.

The size s(𝒟), and width w(𝒟) of an r𝒫-proof 𝒟 is the maximum width and size of a proof Π in the support of 𝒟. The complexity of proving F in r𝒫 is

r𝒫(F):=minr𝒫-proof 𝒟 of Fw(𝒟)+logs(𝒟).

Note that a randomized proof system is not a Cook-Reckhow proof system in the sense of Definition 16 since its proofs typically cannot be polynomial-time verified [73].

The main theorem of this section, Theorem 19, shows that a proof system 𝒫 is characterized by class 𝒞 iff the totality of the total search problems R which are randomly reducible to any complete problem for 𝒞 is provable in r𝒫. The following definition is equivalent to the probabilistic reduction in [49].

Definition 18.

A randomized (ZPP) reduction from a search problem S[t]×𝒪 to R[n]×𝒬 is a distribution 𝒟 over deterministic reductions 𝒯=(T,{To}) such that each output decision tree is labeled either by some j𝒪 or by , and 𝒟 satisfies

  1. 1.

    For every x[t] and every 𝒯𝒟, if (T(x),o)R then either To(x)= or (x,To(x)) S.

  2. 2.

    For every x[t],

    Pr𝒯𝒟[o𝒪:(T(x),o)RTo(x)=]ε
Theorem 19.

If a proof system 𝒫 is characterized by the total search problems reducible to RTFNPdt, then r𝒫 is characterized by the total search problems that are randomized-reducible to R.

The intuition for this theorem is most clear in the case of randomized reductions to FPdt (which is TFZPPdt) and random tree-like resolution. This is also the case that we will use to derive consequences about TFZPPdt. We leave the proof of Theorem 19 to the full version.

We remark that Theorem 19 reduces the task of showing that a TFNP class is closed under randomized reduction to showing that the corresponding proof system is closed in the sense that 𝒫=r𝒫. We are not aware of any such proof system, and it would be interesting to exhibit one.

Lemma 20.

There is a quasi-polynomial size random tree-like resolution proof of F=C1Cm iff 𝖲𝖾𝖺𝗋𝖼𝗁FTFZPPdt.

Proof.

Suppose that SearchFTFZPPdt and let 𝒟 be a distribution over depth-d decision trees solving SearchF as in the definition of TFZPPdt. We construct a ε-error randomized tree resolution proof 𝒫, which is defined by the following sampling procedure:

  1. 1.

    Sample T𝒟.

  2. 2.

    Let B be the set of clauses obtained by taking the negation of each root-to-leaf path in T ending in ,

    B:={¬p:pT is a root-to- path},

    where we think of a path as the conjunctions of the literals that appear along it (a queried variable is a positive literal if p took the 1-branch, and a negative literal if p took the 0-branch).

  3. 3.

    To construct Π, we will use the equivalence between a decision tree solving the false-clause search problem and tree-resolution proofs. Let T be obtained from T by relabeling each path p ending in by the clause ¬pB. Let FB be the CNF formula whose clauses are the clauses of F and those in B, and observe that T is a depth-d decision tree solving SearchFB. Thus, there is a depth-d tree resolution proof Π of FB.

As B states that we do not follow any root-to- path in T, for any x{0,1}n, T(x) iff x satisfies all of B. Therefore, PrT𝒟[T(x)=]ε implies that Pr(B,Π)𝒫[B(x)=1]1ε for every x{0,1}n.

In the other direction, suppose that 𝒫 is a ε-random tree resolution proof of F with complexity c. We construct a distribution 𝒟 over decision trees solving SearchF by the following sampling procedure:

  1. 1.

    Sample (Π,B)𝒫.

  2. 2.

    Let T be the depth-d decision tree solving SearchFB obtained from Π obtained by the equivalence between tree resolution proofs and decision trees in the same manner as in point (3) above. It is well-known that the depth of a resolution proof is bounded by its width, and hence T has depth at most c.

  3. 3.

    Let T be the decision tree obtained from T by relabeling each leaf of Π that is labeled by a clause in B by .

As for any x{0,1}n, Pr(B,Π)𝒟[B(x)=1]1ε, we have that PrT𝒟[T(x)=]ε.

3.1 Separations

We now use Lemma 20 to show that TFZPPdt is not contained within any uniformly generated TFNPdt class unless NP is contained within quasi-polynomial time (𝖰𝖯).

Theorem 21.

TFZPPdt𝒞 for any uniformly generated class 𝒞TFNPdt unless NP𝖰𝖯.

This theorem follows by combining the characterization of uniform TFNPdt classes by proof systems of Buss et al. [11], Lemma 20, and the following result of Pudlák and Thapen [73].

This separation relies on a theorem of Pudlák and Thapen [73] who showed that random resolution cannot be simulated by any propositional proof system unless PNP. A straightforward examination of their theorem reveals that it also holds for tree-like resolution and can be stated in the following form.

Theorem 22 (Proposition 10 in [73]).

There is a family of unsatisfiable 3-CNFs such that:

  1. 1.

    There are O(log(n))-complexity random tree-like resolution proofs of .

  2. 2.

    If there is a propositional proof system which has polylog(n)-complexity proofs of then NP𝖰𝖯.

Indeed, to prove their theorem Pudlák and Thapen observe that random (treelike) resolution has small proofs of any highly unsatisfiable formula (one for which any assignment falsifies many clauses), and that the PCP theorem can be used to put any 3-CNF formula into this form (the family ). Using this, they show that if there existed a propositional proof system which could (quasi-polynomially) simulate random tree-like resolution, then one could use its variability to decide SAT.

Combining this theorem with Lemma 20 and the characterization of TFNPdt classes by propositional proof systems due to Buss et al. [11] proves Theorem 21. Say that R={Rn}TFNPdt is uniformly generated if there is a Turing Machine which on input 1n outputs Rn, and say that a class is uniformly generated if it has a uniformly generated complete problem. Note that all major TFNPdt subclasses are uniformly generated.

Proof of Theorem 21.

Let 𝒞 be a uniformly generated class such that TFZPPdt𝒞. Buss et al. [11] showed that every uniformly generated TFNPdt subclass is characterized by a proof system; let 𝒫 be the system for 𝒞. Consider the family of formulas from Theorem 22. As TFZPPdt𝒞, there are polylog(n)-complexity 𝒫-proofs of . Hence, Theorem 22 implies that NP𝖰𝖯.

3.2 Explicit Separations

The separating examples in Theorem 21 rely on an unproven hypothesis. We end this section by proving explicit separations between TFZPPdt and every major TFNPdt subclass which do not rely on any unproven assumptions. A separation of PPAdt from TFZPPdt was implicitly shown by Beame et. al. [5], who proved Nullstellensatz lower bounds for Lossy-Code.888More specifically, Beame et. al. [5, Theorem 12] proved the Nullstellensatz degree lower bounds for Weak-Pigeon. They also showed that any Nullstellensatz degree lower bounds for Weak-Pigeon implies the same Nullstellensatz degree lower bounds for Lossy-Code in [5, Lemma 10] (see also Definition 3.1, 3.2 in [5] for the definition of Lossy-Code and Weak-Pigeon). The remaining major TFNPdt classes are contained within PLSdt and PPPdt. We show the following.

Theorem 23.

There exist explicit total search problems in TFZPPdt which are not contained in PLSdt nor PPPdt.

It is known that if a total search problem SearchF is in PLSdt or PPPdt, then F has a small low-degree Sum-of-Squares (SoS) proof; see [29] for an exposition on this proof system. Our hard instance is based on the recent work of Hopkins and Lin [41], who exhibited the first explicit hard 3-XOR instance for SoS.

Theorem 24 ([41]).

There exist constants μ1,μ2(0,1) and a polynomial time algorithm which, given 1n as input, outputs a 3-XOR formula F=C1Cm on n variables such that:

  • For every x{0,1}n, Pri[m][Ci(x)=1]1μ1.

  • Any Sum-of-Squares refutation of F requires degree at least μ2n.

Proof of Theorem 23.

Let F be as in Theorem 24. We show that RSearchF satisfies the desired properties. We first prove RTFZPPdt. Consider the following simple algorithm: sample i[m] uniformly at random, and make three queries to check if Ci(x)=0. If so, output i; otherwise, output . By the first item of Theorem 24, the algorithm succeeds with probability at least μ1. Repeating the procedure O(1/μ1) times boosts the success probability to at least 2/3.

To separate R from PPPdt and PLSdt, we only need to show there is no efficient black-box reduction from R to a complete problem for one of these classes. If there was, then there would be an efficient SoS proof of F, contradicting the second item of Theorem 24.

4 Nephew

Recall the Nephew problem, which is our main candidate for a total search problem in TFZPP but not reducible to Lossy-Code:

Nephew.

Given a set V of vertices and two functions f:VV and g:VV. Think of f(v) as the father of v and g(v) as the nephew of v. A solution is one of the following.

s1.

vV such that f(f(g(v)))f(v) (your nephew’s grandparent is not your parent)

s2.

vV such that f(g(v))=v (you are your nephew’s parent)

The main result of this section is the inclusion theorem:

Theorem 7. [Restated, see original statement.]

NephewTFZPPPWPP.

The intuition behind Theorem 7 is as follows:

  • We can treat certain vertices in a Nephew instance like the root of a directed binary tree, where the leaves of the tree correspond to solutions of the Nephew instance.

  • Finding a leaf of a rooted binary tree is easy for both PWPP and TFZPP computations.

Once we have proven the above, we are nearly done. First, choose an arbitrary vertex. Perhaps that vertex is one of the many that we can treat as the root of a binary tree, and so from it we can find a solution. Otherwise, we use a procedure to find such a root vertex. In fact, we will be able to find two vertices, one of which must be a good root vertex. To show inclusion in TFZPP, we just need to pick one of these two randomly. For PWPP, we will have to consider both. We make an adjustment to the argument for a single root vertex so that it works for two potential root vertices.

To be more concrete, we will reduce Nephew to the following promise search problem.

Leaf-of-Rooted-Tree.

An instance consists of a set V of vertices, a special vertex v, and two functions L:VV{} and R:VV{}. We define a subset VV recursively: vV and, for every vV, we add L(v) to V if L(v) and R(v) to V if R(v). We promise that the induced subgraph on V is a (directed) tree rooted at v and that for all vV either:

  • L(v)=R(v)=, or

  • L(v) and R(v) and L(v)R(v).

A solution is a (log|V|+1)-length path, represented by a string p{L,R}log|V|+1, where starting at v and descending by the functions specified by the characters of p in order will at some point reach a vertex v where L(v)=R(v)=.

Note that instead of simply asking for a leaf, we require a root-to-leaf path for a solution. This is to confirm that the leaf is in the binary tree rooted at v.

Finding a root-to-leaf path in a rooted binary tree

It is easy for a PWPP or TFZPP computation to find a solution to Leaf-of-Rooted-Tree. This follows from the simple observation that there are (many) more paths of length (log|V|+1) than vertices in the tree, so most paths must be solutions.

Lemma 25.

A solution to Leaf-of-Rooted-Tree can be found with high probability using a randomized algorithm.

Proof.

The algorithm guesses a random path of length log|V|+1, which will be a solution with probability at least 5/6. This is because if a path p is not a solution, then following p reaches a vertex vp with two children. The path p is the only non-solution path of length log|V|+1 to reach vp; furthermore, because V is an induced tree, no other path ends at a vertex with L(vp) or R(vp) as children. This means that the set {vp,L(vp),R(vp)} are uniquely reached by p out of all of the non-solution paths. Therefore, there are at least 3 times as many vertices in V as there are non-solution paths. Let the fraction of non-solution paths be α. Then

|V|3α|{L,R}log|V|+1|3α2|V|,

and so α1/6.

To show inclusion in PWPP, we reduce to the PWPP-complete problem Weak-Pigeon. (Note that this is not a reduction between TFNP problems, as Leaf-of-Rooted-Tree is a promise problem.) We recall its definition here:

Weak-Pigeon.

Given h:{0,1}n{0,1}n1 a solution is xy such that h(x)=h(y).

Lemma 26.

Leaf-of-Rooted-Tree reduces to Weak-Pigeon.

Proof.

Let n=log|V|+1. Let vp be the vertex reached by p if p is a non-solution path. Then we define h as a map from (log|V|+1)-length paths to vertices:

h(p)={vpif p is not a solution;votherwise.

(Recall that since v is the root, we have that vpv for any path p.) Thus, paths (x,y) are a collision if and only if x and y are both solutions to Leaf-of-Rooted-Tree, and so from any collision we can find a solution to Leaf-of-Rooted-Tree by arbitrarily choosing one from the pair.

The structure of Nephew instances and finding a rooted binary tree

For any Nephew instance (V,f,g), let Gf be the directed graph with vertex set V and where (u,v) is an edge if and only if v=f(u). Then Gf is a directed graph with out-degree one, and therefore the connected components of Gf have a simple structure: they are composed of a cycle (perhaps a self-loop) and trees (with edges oriented leaf-to-root) that are rooted at vertices in the cycle. For any vertex vV, define its level (denoted (v)) as the distance from v to any vertex on the cycle of its connected component. So, any vertex on the cycle has level 0, any non-cycle vertex pointing to a cycle vertex has level 1, and so on. See Figure 2 for an illustration.

We give a reduction from Nephew to Leaf-of-Rooted-Tree under the assumption that we can find a vertex v with (v)2. After proving this, the hard part will be finding such a vertex. The main component of this reduction is the procedure Find-Children (Algorithm 1), which is based on the functions f and g from an Nephew instance. Define CheckSol(u) to be the procedure that returns True iff u is a solution to the Nephew instance, that is, iff f(f(g(u)))f(u) or f(g(u))=u.

Algorithm 1 Procedure Find-Childrenf,g(v).

See Figure 3(a) for an illustration. The idea is to construct the tree by defining the left and right children of v to be two nodes reachable from v, unless the procedure finds a nearby solution, in which case we can make v into a leaf, as it is easy to compute a solution given v.

Although the intuition behind the reduction is straightforward, some work needs to be done to show that Find-Children gives a valid binary tree. The following properties will help.

Lemma 27.

Let Find-Childrenf,g(v)=(a,b). If (a,b)(,), then

  1. (i)

    ab,

  2. (ii)

    f(a)f(b),

  3. (iii)

    f(f(a))=f(f(b))=f(v), and

  4. (iv)

    if (v)2, then (a)=(b)=(v)+1.

Proof.
  1. (i)

    Since (a,b)(,), we have that f(g(v))=f(a) is not an Nephew solution, and in particular f(g(f(a)))f(a). This means that ab, as b=g(f(a)).

  2. (ii)

    As f(g(v)) is not a solution, applying f(g(u))u to u=f(g(v)) gives f(b)=f(g(f(g(v))))f(g(v))=f(a).

  3. (iii)

    As v is not a solution, f(f(a))=f(f(g(v)))=f(v). As f(g(v)) is not a solution, applying f(f(g(u)))=f(u) for u=f(g(v)) gives f(f(b))=f(f(g(f(g(v)))))=f(f(g(v)))=f(v).

  4. (iv)

    Under the assumption that (v)2, we know that (f(v))=(v)1>0, which implies that any vertex with f(f(u))=f(v) has (u)=(f(v))+2=(v)+1. By Item iii, this applies to a and b.

 Remark 28.

Lemma 27 iv does not hold without the hypothesis (v)2, as if (f(v))=0 there is no guarantee that vertices with edges pointing to f(v) have level greater than 0. This is why it is necessary to assume (v)2 for the simple reduction given here.

Lemma 29.

Let (V,f,g) be an instance of Nephew. Then define L and R by

(L(v),R(v))Find-Childrenf,g(v).

Given vV with (v)2, (V,v,L,R) is a valid instance to Leaf-of-Rooted-Tree.

Proof.

All we need to show is that v is the root of an induced tree. Recall that V denotes the set of vertices reachable from v via L and R (i.e., using edges of the form (uL(u)) and (uR(u))). By Lemma 27 iv, the induced subgraph on V can be assigned levels such that edges are directed only from lower levels to higher levels, i.e. it is a DAG. By Lemma 27 i, the DAG has outdegree 2. To prove this induced DAG is indeed a tree, it suffices to show that no two vertices share the same child. Indeed, if there are vertices u,uV such that {L(u),R(u)}{L(u),R(u)}, then by Lemma 27 iii, we have f(u)=f(u). Hence, it suffices to prove the following claim:

Claim 30.

If u,uV are two different vertices, then f(u)f(u).

Proof of Claim 30.

We may assume that (u)=(u), as otherwise f(u)f(u) by the observation about levels in V. The proof is by induction on levels. At level (v), there is only one vertex, hence the base case is trivially true. Assume that at level k (where k(v)), no two vertices in V map via f to the same vertex. We will prove that it also holds for level k+1.

For two different vertices u,u with (u)=(u)=k+1, we may assume u and u are the children of two different vertices in V: if they are children of the same vertex, by Lemma 27 ii {u,u}={L(w),R(w)} implies f(u)f(u) and we are done. So, let ww be such that u{L(w),R(w)} and u{L(w),R(w)}. Then (w)=(w)=k and by the inductive hypothesis f(w)f(w).

Assume that f(u)=f(u). We will prove shortly that all vertices in V that map to u or u by L or R map to some common vertex z by f, a contradiction with f(w)f(w). Indeed, set z=f(f(u))=f(f(u)). If a vertex xV has L(x)=g(x)=u (the same arguments will apply to x that maps to u), then

z=f(f(u))=f(f(g(x)))=f(x).

If a vertex xV has R(x)=g(f(g(x)))=u, then

z=f(f(u))=f(f(g(f(g(x)))))=f(f(g(x)))=f(x),

where we have applied f(f(g(y)))=f(y) to y=f(g(x)) which we know is not a Nephew solution because it was checked by Find-Children.

See Figure 3(b) for an illustration of the argument behind Claim 30.

Completing the argument

It remains to find a vertex v with (v)2. We do not know how to achieve this deterministically; instead, we will find a pair of vertices (v,v) such that either (v)2 or (v)2.

Lemma 31.

Let uV be an arbitrary starting vertex. Let v=g(u) and v=g(f(u)). Then either

  • at least one of {u,v,v} is a solution, or

  • at least one of {v,v} is at level at least 2.

Proof.

Assume that there are no solutions among {u,v,v}. There are three cases:

  • Case I: Suppose that (u)=0. We claim that (v)=2 in this case. Indeed, u is the only vertex at level 0 that maps via f to f(u). Since f(f(v))=f(f(g(u)))=f(u) but f(v)=f(g(u))u, f(v) has to be at level 1, hence v is at level 2.

  • Case II: Suppose that (u)=1. In this case, we have (f(u))=0, hence the same argument as above applies to show that v=g(f(u)) is at level 2.

  • Case III: Suppose that (u)2. In this case, f(u) is at level at least 1. Since f(f(v))=f(f(g(u)))=f(u), we have that v is at level at least 2.

Figure 2: An example Gf with levels marked.
(a) Find-Childrenf,g(v). Shaded vertices are checked by CheckSol. Vertices g(v) and h(v) will be returned as the children of v.
(b) The argument behind Claim 30. The shaded vertices are possible locations of the wV such that either L(w)=u or R(w)=u.
(c) Case I of Lemma 31. If (u)=0, then (g(u))=2.
(d) Case II of Lemma 31. If (u)=1, then (g(f(u)))=2.
Figure 3: Illustrations of Nephew substructures from proofs in this section. Solid arrows represent f and dashed arrows represent g. Zigzag f arrows are between vertices of level 0. Parentheses are omitted in labels.

See Figures 3(c) and 3(d) for an illustration of cases I and II of Lemma 31.

Now we are ready to prove our main results. Recall the statement of Theorem 7: See 7

Proof.

The proof of NephewTFZPP is easy. Pick an arbitrary vertex u and define v=g(u) and v=g(f(u)). By Lemma 31, if none of u,v,v are solutions of Nephew, then at least one of v and v is at level at least 2. Therefore, we can randomly select one vertex in {v,v} as the root v and run the randomized algorithm for Leaf-of-Rooted-Tree on the instance (V,v,L,R). The correctness is guaranteed by Lemma 29.

Now we prove that NephewPWPP. Intuitively, we will run the reduction of Lemma 29 in parallel on (the direct product of) two graphs where the root is v in one graph and is v in the other. Because we can only guarantee that our starting point is at level 2 in only one of these graphs, we only know that one of the graphs has an induced tree rooted at its respective starting point by the reduction process. Fortunately, the overall graph on V×V will have a rooted induced tree.

By Lemma 31, starting with an arbitrary vertex, we can obtain either a Nephew solution (in which case we are done) or a pair (v,v) where at least one is at level at least 2. We will reduce to Leaf-of-Rooted-Tree on the vertex set V×V. Set

(A(u,u),B(u,u))Find-Childrenf,g(u),
(C(u,u),D(u,u))Find-Childrenf,g(u).

Set L(u,u)=(A(u,u),C(u,u)), unless one of A(u,u),C(u,u) is , in which case L(u,u)=. Set R(u,u)=(B(u,u),D(u,u)), again unless one side of the pair is in which case R(u,u)=. By the construction of Find-Children, if one of L,R is set to the other one will as well. The special vertex in V×V will be (v,v).

Then VV×V induces a binary tree (recall that V is the set of vertices reachable from (v,v) by L,R). Say that (v)2; the other case follows by symmetry. Starting at (v,v), L and R will yield a tree structure on the first member of the pair by the same argument as Lemma 29. Let p,p be two distinct paths of any length which start at (v,v) and traverse with L and R. The vertex reached by p will have a different first pair element than the vertex reached by p (by the tree structure on the first element of the pair), and thus p and p arrive at distinct vertices. This means that there are no (undirected) cycles in the induced graph on V, and so it is a tree.

5 Open Problems

We end with some future directions. The main problem left open by this work is to exhibit a natural problem in TFZPPdt which is not reducible to Lossy-Code; we conjecture that Nephew is such a problem. Some additional open questions are the following:

  • Find a TFZPP upper bound for Bertrand-Chebyshev, i.e., a natural problem in TFZPP to which Bertrand-Chebyshev reduces. The best upper bound we are aware of is only LOSSYFactoring [69, 54], which is in LOSSYPPA and LOSSYPWPP under the generalized Riemann Hypothesis [49]. Unfortunately, none of these upper bound classes are in TFZPP.

  • Find a TFZPP upper bound for the following problem, which we call Razborov-Smolensky [76, 79]: The input is an AC0[2] circuit C of depth d and size at most 2n1/10d that attempts to compute MAJ (the Majority function), and the goal is to output an instance x{0,1}n such that C(x)MAJ(x). Since MAJ is average-case hard against such circuits, this problem sits in TFZPP. This problem is trivially solvable in deterministic quasi-polynomial time (note that the naïve algorithm runs in 2O(n) time while the input size is 2nΩ(1)), hence we are interested in the regime where only polynomial-time reductions are allowed. We are not aware of any syntactic subclass of TFZPP that contains this problem.

See the full paper for open questions related to problems studied there which have been omitted in this shorter version.

References

  • [1] Aryan Agarwala and Ian Mertz. Bipartite matching is in catalytic logspace. In FOCS, 2025. To appear. doi:10.48550/arXiv.2504.09991.
  • [2] Manindra Agrawal, Neeraj Kayal, and Nitin Saxena. PRIMES is in P. Annals of Mathematics, 160(2):781–793, 2004. doi:10.4007/annals.2004.160.781.
  • [3] Albert Atserias and Víctor Dalmau. A combinatorial characterization of resolution width. J. Comput. Syst. Sci., 74(3):323–334, 2008. doi:10.1016/J.JCSS.2007.06.025.
  • [4] Albert Atserias and Neil Thapen. The ordering principle in a fragment of approximate counting. ACM Trans. Comput. Log., 15(4):29:1–29:11, 2014. doi:10.1145/2629555.
  • [5] Paul Beame, Stephen A. Cook, Jeff Edmonds, Russell Impagliazzo, and Toniann Pitassi. The relative complexity of NP search problems. J. Comput. Syst. Sci., 57(1):3–19, 1998. doi:10.1006/JCSS.1998.1575.
  • [6] Huck Bennett, Surendra Ghentiyala, and Noah Stephens-Davidowitz. The more the merrier! On total coding and lattice problems and the complexity of finding multicollisions. In 16th Innovations in Theoretical Computer Science Conference, volume 325 of LIPIcs. Leibniz Int. Proc. Inform., pages Art. No. 14, 22. Schloss Dagstuhl – Leibniz-Zentrum für Informatik, 2025. doi:10.4230/lipics.itcs.2025.14.
  • [7] Maria Luisa Bonet and Nicola Galesi. Optimality of size-width tradeoffs for resolution. Comput. Complex., 10(4):261–276, 2001. doi:10.1007/S000370100000.
  • [8] Egon Börger, Erich Grädel, and Yuri Gurevich. The classical decision problem. Springer Science & Business Media, 2001.
  • [9] Harry Buhrman, Richard Cleve, Michal Koucký, Bruno Loff, and Florian Speelman. Computing with a full memory: catalytic space. In STOC, pages 857–866. ACM, 2014. doi:10.1145/2591796.2591874.
  • [10] Joshua Buresh-Oppenheim. On the TFNP complexity of factoring. Unpublished, 2006.
  • [11] Sam Buss, Noah Fleming, and Russell Impagliazzo. TFNP characterizations of proof systems and monotone circuits. In Yael Tauman Kalai, editor, 14th Innovations in Theoretical Computer Science Conference, ITCS 2023, January 10-13, 2023, MIT, Cambridge, Massachusetts, USA, volume 251 of LIPIcs, pages 30:1–30:40. Schloss Dagstuhl – Leibniz-Zentrum für Informatik, 2023. doi:10.4230/LIPIcs.ITCS.2023.30.
  • [12] Samuel R Buss and Alan S Johnson. Propositional proofs and reductions between np search problems. Annals of Pure and Applied Logic, 163(9):1163–1182, 2012. doi:10.1016/J.APAL.2012.01.015.
  • [13] Samuel R. Buss, Leszek Aleksander Kołodziejczyk, and Neil Thapen. Fragments of approximate counting. J. Symb. Log., 79(2):496–525, 2014. doi:10.1017/JSL.2013.37.
  • [14] Lijie Chen, Shuichi Hirahara, and Hanlin Ren. Symmetric exponential time requires near-maximum circuit size. In STOC’24—Proceedings of the 56th Annual ACM Symposium on Theory of Computing, pages 1990–1999. ACM, New York, [2024] ©2024. doi:10.1145/3618260.3649624.
  • [15] Lijie Chen, Ce Jin, Rahul Santhanam, and Ryan Williams. Constructive separations and their consequences. TheoretiCS, 3, 2024. doi:10.46298/THEORETICS.24.3.
  • [16] Lijie Chen, Jiatu Li, and Igor C. Oliveira. Reverse mathematics of complexity lower bounds. In FOCS, pages 505–527. IEEE, 2024. doi:10.1109/FOCS61266.2024.00040.
  • [17] Lijie Chen, Xin Lyu, and R. Ryan Williams. Almost-everywhere circuit lower bounds from non-trivial derandomization. In FOCS, pages 1–12. IEEE, 2020. doi:10.1109/FOCS46700.2020.00009.
  • [18] Lijie Chen, Roei Tell, and Ryan Williams. Derandomization vs refutation: A unified framework for characterizing derandomization. In FOCS, pages 1008–1047. IEEE, 2023. doi:10.1109/FOCS57990.2023.00062.
  • [19] Xi Chen and Xiaotie Deng. Settling the complexity of two-player Nash equilibrium. In FOCS, pages 261–272. IEEE Computer Society, 2006. doi:10.1109/FOCS.2006.69.
  • [20] Yeyuan Chen, Yizhi Huang, Jiatu Li, and Hanlin Ren. Range avoidance, remote point, and hard partial truth table via satisfying-pairs algorithms. In STOC, pages 1058–1066. ACM, 2023. doi:10.1145/3564246.3585147.
  • [21] Mario Chiari and Jan Krajícek. Witnessing functions in bounded arithmetic and search problems. J. Symb. Log., 63(3):1095–1115, 1998. doi:10.2307/2586729.
  • [22] Jonas Conneryd, Susanna F. de Rezende, Jakob Nordström, Shuo Pang, and Kilian Risse. Graph colouring is hard on average for Polynomial Calculus and Nullstellensatz. In FOCS, pages 1–11. IEEE, 2023. doi:10.1109/FOCS57990.2023.00007.
  • [23] James Cook, Jiatu Li, Ian Mertz, and Edward Pyne. The structure of catalytic space: Capturing randomness and time via compression. In STOC, pages 554–564. ACM, 2025. doi:10.1145/3717823.3718112.
  • [24] Constantinos Daskalakis, Paul W. Goldberg, and Christos H. Papadimitriou. The complexity of computing a Nash equilibrium. SIAM J. Comput., 39(1):195–259, 2009. doi:10.1137/070699652.
  • [25] Ben Davis and Robert Robere. Colourful TFNP and propositional proofs. In Amnon Ta-Shma, editor, 38th Computational Complexity Conference, CCC 2023, July 17-20, 2023, Warwick, UK, volume 264 of LIPIcs, pages 36:1–36:21. Schloss Dagstuhl – Leibniz-Zentrum für Informatik, 2023. doi:10.4230/LIPIcs.CCC.2023.36.
  • [26] John Fearnley, Paul Goldberg, Alexandros Hollender, and Rahul Savani. The complexity of gradient descent: CLS=PPADPLS. J. ACM, 70(1):7:1–7:74, 2023. doi:10.1145/3568163.
  • [27] Noah Fleming, Stefan Grosser, Toniann Pitassi, and Robert Robere. Black-box PPP is not Turing-closed. In Bojan Mohar, Igor Shinkar, and Ryan O’Donnell, editors, Proceedings of the 56th Annual ACM Symposium on Theory of Computing, STOC 2024, Vancouver, BC, Canada, June 24-28, 2024, pages 1405–1414. ACM, 2024. doi:10.1145/3618260.3649769.
  • [28] Noah Fleming, Deniz Imrek, and Christophe Marciot. Provably total functions in the polynomial hierarchy. In Srikanth Srinivasan, editor, 40th Computational Complexity Conference, CCC 2025, August 5-8, 2025, Toronto, Canada, volume 339 of LIPIcs, pages 28:1–28:40. Schloss Dagstuhl – Leibniz-Zentrum für Informatik, 2025. doi:10.4230/LIPIcs.CCC.2025.28.
  • [29] Noah Fleming, Pravesh Kothari, and Toniann Pitassi. Semialgebraic proofs and efficient algorithm design. Found. Trends Theor. Comput. Sci., 14(1-2):1–221, 2019. doi:10.1561/0400000086.
  • [30] Lukáš Folwarczný, Mika Göös, Pavel Hubáček, Gilbert Maystre, and Weiqiang Yuan. One-way functions vs. TFNP: simpler and improved. In 15th Innovations in Theoretical Computer Science Conference, volume 287 of LIPIcs. Leibniz Int. Proc. Inform., pages Art. No. 50, 14. Schloss Dagstuhl – Leibniz-Zentrum für Informatik, 2024. doi:10.4230/lipics.itcs.2024.50.
  • [31] Karthik Gajulapalli, Alexander Golovnev, Satyajeet Nagargoje, and Sidhant Saraogi. Range avoidance for constant depth circuits: Hardness and algorithms. In APPROX/RANDOM, volume 275 of LIPIcs, pages 65:1–65:18. Schloss Dagstuhl – Leibniz-Zentrum für Informatik, 2023. doi:10.4230/LIPIcs.APPROX/RANDOM.2023.65.
  • [32] Surendra Ghentiyala and Zeyong Li. Hierarchies within TFNP: building blocks and collapses. CoRR, 2025. doi:10.48550/arXiv.2507.21550.
  • [33] Oded Goldreich, Shafi Goldwasser, and Silvio Micali. How to construct random functions. J. ACM, 33(4):792–807, 1986. doi:10.1145/6490.6503.
  • [34] Mika Göös, Alexandros Hollender, Siddhartha Jain, Gilbert Maystre, William Pires, Robert Robere, and Ran Tao. Further collapses in TFNP. SIAM J. Comput., 53(3):573–587, 2024. doi:10.1137/22M1498346.
  • [35] Mika Göös, Pritish Kamath, Robert Robere, and Dmitry Sokolov. Adventures in monotone complexity and TFNP. In Avrim Blum, editor, 10th Innovations in Theoretical Computer Science Conference, ITCS 2019, January 10-12, 2019, San Diego, California, USA, volume 124 of LIPIcs, pages 38:1–38:19. Schloss Dagstuhl – Leibniz-Zentrum für Informatik, 2019. doi:10.4230/LIPIcs.ITCS.2019.38.
  • [36] Svyatoslav Gryaznov. Notes on resolution over linear equations. In CSR, volume 11532 of Lecture Notes in Computer Science, pages 168–179. Springer, 2019. doi:10.1007/978-3-030-19955-5_15.
  • [37] Venkatesan Guruswami, Xin Lyu, and Xiuhan Wang. Range avoidance for low-depth circuits and connections to pseudorandomness. ACM Trans. Comput. Theory, 17(2):14:1–14:23, 2025. doi:10.1145/3718745.
  • [38] Mika Göös, Alexandros Hollender, Siddhartha Jain, Gilbert Maystre, William Pires, Robert Robere, and Ran Tao. Separations in proof complexity and TFNP. In 2022 IEEE 63rd Annual Symposium on Foundations of Computer Science (FOCS), pages 1150–1161, 2022. doi:10.1109/FOCS54457.2022.00111.
  • [39] Jiří Hanika. Search Problems and Bounded Arithmetic. PhD thesis, Charles University, Prague, 2004.
  • [40] Edward A. Hirsch and Ilya Volkovich. Upper and lower bounds for the linear ordering principle. CoRR, 2025. doi:10.48550/arXiv.2503.19188.
  • [41] Max Hopkins and Ting-Chun Lin. Explicit lower bounds against Ω(n)-rounds of sum-of-squares. In 63rd IEEE Annual Symposium on Foundations of Computer Science, FOCS 2022, Denver, CO, USA, October 31 - November 3, 2022, pages 662–673. IEEE, 2022. doi:10.1109/FOCS54457.2022.00069.
  • [42] Pavel Hubácek, Erfan Khaniki, and Neil Thapen. TFNP intersections through the lens of feasible disjunction. In Venkatesan Guruswami, editor, 15th Innovations in Theoretical Computer Science Conference, ITCS 2024, January 30 to February 2, 2024, Berkeley, CA, USA, volume 287 of LIPIcs, pages 63:1–63:24. Schloss Dagstuhl – Leibniz-Zentrum für Informatik, 2024. doi:10.4230/LIPIcs.ITCS.2024.63.
  • [43] Pavel Hubáček, Chethan Kamath, Karel Král, and Veronika Slívová. On average-case hardness in TFNP from one-way functions. In Theory of cryptography. Part III, volume 12552 of Lecture Notes in Comput. Sci., pages 614–638. Springer, Cham, [2020] ©2020. doi:10.1007/978-3-030-64381-2_22.
  • [44] Russell Impagliazzo and Avi Wigderson. P=BPP if E requires exponential circuits: Derandomizing the XOR lemma. In STOC, pages 220–229. ACM, 1997. doi:10.1145/258533.258590.
  • [45] Emil Jeřábek. Dual weak pigeonhole principle, Boolean complexity, and derandomization. Ann. Pure Appl. Log., 129(1-3):1–37, 2004. doi:10.1016/j.apal.2003.12.003.
  • [46] Emil Jeřábek. Weak pigeonhole principle and randomized computation. PhD thesis, Charles University in Prague, 2005.
  • [47] Emil Jeřábek. Approximate counting in bounded arithmetic. J. Symb. Log., 72(3):959–993, 2007. doi:10.2178/JSL/1191333850.
  • [48] Emil Jeřábek. On independence of variants of the weak pigeonhole principle. J. Log. Comput., 17(3):587–604, 2007. doi:10.1093/LOGCOM/EXM017.
  • [49] Emil Jeřábek. Integer factoring and modular square roots. J. Comput. Syst. Sci., 82(2):380–394, 2016. doi:10.1016/J.JCSS.2015.08.001.
  • [50] David S. Johnson, Christos H. Papadimitriou, and Mihalis Yannakakis. How easy is local search? J. Comput. Syst. Sci., 37(1):79–100, 1988. doi:10.1016/0022-0000(88)90046-3.
  • [51] Robert Kleinberg, Oliver Korten, Daniel Mitropolsky, and Christos H. Papadimitriou. Total functions in the polynomial hierarchy. In ITCS, volume 185 of LIPIcs, pages 44:1–44:18. Schloss Dagstuhl – Leibniz-Zentrum für Informatik, 2021. doi:10.4230/LIPIcs.ITCS.2021.44.
  • [52] Leszek Aleksander Kolodziejczyk and Neil Thapen. Approximate counting and NP search problems. J. Math. Log., 22(3):2250012:1–2250012:31, 2022. doi:10.1142/S021906132250012X.
  • [53] Oliver Korten. The hardest explicit construction. In 2021 IEEE 62nd Annual Symposium on Foundations of Computer Science—FOCS 2021, pages 433–444. IEEE Computer Soc., Los Alamitos, CA, 2021. doi:10.1109/FOCS52979.2021.00051.
  • [54] Oliver Korten. Derandomization from time-space tradeoffs. In CCC, volume 234 of LIPIcs, pages 37:1–37:26. Schloss Dagstuhl – Leibniz-Zentrum für Informatik, 2022. doi:10.4230/LIPIcs.CCC.2022.37.
  • [55] Oliver Korten. Range avoidance and the complexity of explicit constructions. Bull. EATCS, 145, 2025. URL: http://eatcs.org/beatcs/index.php/beatcs/article/view/825.
  • [56] Oliver Korten and Toniann Pitassi. Strong vs. weak range avoidance and the linear ordering principle. In FOCS, pages 1388–1407. IEEE, 2024. doi:10.1109/FOCS61266.2024.00089.
  • [57] Michal Koucký, Ian Mertz, Edward Pyne, and Sasha Sami. Collapsing catalytic classes. In FOCS, 2025. To appear. doi:10.48550/arXiv.2504.08444.
  • [58] Jan Krajíček. Dual weak pigeonhole principle, pseudo-surjective functions, and provability of circuit lower bounds. J. Symb. Log., 69(1):265–286, 2004. doi:10.2178/jsl/1080938841.
  • [59] Balakrishnan Krishnamurthy. Short proofs for tricky formulas. Acta Informatica, 22(3):253–275, 1985. doi:10.1007/BF00265682.
  • [60] Jiatu Li, Edward Pyne, and Roei Tell. Distinguishing, predicting, and certifying: On the long reach of partial notions of pseudorandomness. In FOCS, pages 1–13. IEEE, 2024. doi:10.1109/FOCS61266.2024.00095.
  • [61] Jiawei Li, Yuhao Li, and Hanlin Ren. Metamathematics of resolution lower bounds: A TFNP perspective. CoRR, abs/2411.15515, 2024. doi:10.48550/arXiv.2411.15515.
  • [62] Yuhao Li, William Pires, and Robert Robere. Intersection classes in TFNP and proof complexity. In Venkatesan Guruswami, editor, 15th Innovations in Theoretical Computer Science Conference, ITCS 2024, January 30 to February 2, 2024, Berkeley, CA, USA, volume 287 of LIPIcs, pages 74:1–74:22. Schloss Dagstuhl – Leibniz-Zentrum für Informatik, 2024. doi:10.4230/LIPIcs.ITCS.2024.74.
  • [63] Zeyong Li. Symmetric exponential time requires near-maximum circuit size: Simplified, truly uniform. In STOC, pages 2000–2007. ACM, 2024. doi:10.1145/3618260.3649615.
  • [64] Ralph C. Merkle. A digital signature based on a conventional encryption function. In CRYPTO, volume 293 of Lecture Notes in Computer Science, pages 369–378. Springer, 1987. doi:10.1007/3-540-48184-2_32.
  • [65] Ian Mertz. Reusing space: Techniques and open problems. Bull. EATCS, 141, 2023. URL: http://eatcs.org/beatcs/index.php/beatcs/article/view/780.
  • [66] Moritz Müller. Typical forcings, NP search problems and an extension of a theorem of riis. Ann. Pure Appl. Log., 172(4):102930, 2021. doi:10.1016/J.APAL.2020.102930.
  • [67] Noam Nisan and Avi Wigderson. Hardness vs randomness. J. Comput. Syst. Sci., 49(2):149–167, 1994. doi:10.1016/S0022-0000(05)80043-1.
  • [68] Christos H. Papadimitriou. On the complexity of the parity argument and other inefficient proofs of existence. J. Comput. Syst. Sci., 48(3):498–532, 1994. doi:10.1016/S0022-0000(05)80063-7.
  • [69] Jeff B. Paris, A. J. Wilkie, and Alan R. Woods. Provability of the pigeonhole principle and the existence of infinitely many primes. J. Symb. Log., 53(4):1235–1244, 1988. doi:10.1017/S0022481200028061.
  • [70] Amol Pasarkar, Christos H. Papadimitriou, and Mihalis Yannakakis. Extremal combinatorics, iterated pigeonhole arguments and generalizations of PPP. In Yael Tauman Kalai, editor, 14th Innovations in Theoretical Computer Science Conference, ITCS 2023, January 10-13, 2023, MIT, Cambridge, Massachusetts, USA, volume 251 of LIPIcs, pages 88:1–88:20. Schloss Dagstuhl – Leibniz-Zentrum für Informatik, 2023. doi:10.4230/LIPIcs.ITCS.2023.88.
  • [71] Aaron Potechin. Sum of squares bounds for the ordering principle. In Shubhangi Saraf, editor, 35th Computational Complexity Conference, CCC 2020, July 28-31, 2020, Saarbrücken, Germany (Virtual Conference), volume 169 of LIPIcs, pages 38:1–38:37. Schloss Dagstuhl – Leibniz-Zentrum für Informatik, 2020. doi:10.4230/LIPIcs.CCC.2020.38.
  • [72] Pavel Pudlák. On the complexity of finding falsifying assignments for Herbrand disjunctions. Arch. Math. Log., 54(7-8):769–783, 2015. doi:10.1007/S00153-015-0439-6.
  • [73] Pavel Pudlák and Neil Thapen. Random resolution refutations. Comput. Complex., 28(2):185–239, 2019. doi:10.1007/S00037-019-00182-7.
  • [74] Edward Pyne. Derandomizing logspace with a small shared hard drive. In CCC, volume 300 of LIPIcs, pages 4:1–4:20. Schloss Dagstuhl – Leibniz-Zentrum für Informatik, 2024. doi:10.4230/LIPIcs.CCC.2024.4.
  • [75] Edward Pyne, Ran Raz, and Wei Zhan. Certified hardness vs. randomness for log-space. In FOCS, pages 989–1007. IEEE, 2023. doi:10.1109/FOCS57990.2023.00061.
  • [76] Alexander A Razborov. Lower bounds on the size of bounded depth circuits over a complete basis with logical addition. Mathematical Notes of the Academy of Sciences of the USSR, 41(4):333–338, 1987.
  • [77] Hanlin Ren, Rahul Santhanam, and Zhikun Wang. On the range avoidance problem for circuits. In FOCS, pages 640–650. IEEE, 2022. doi:10.1109/FOCS54457.2022.00067.
  • [78] Søren Riis. A complexity gap for tree resolution. Comput. Complex., 10(3):179–209, 2001. doi:10.1007/S00037-001-8194-Y.
  • [79] Roman Smolensky. Algebraic methods in the theory of lower bounds for boolean circuit complexity. In STOC, pages 77–82. ACM, 1987. doi:10.1145/28395.28404.
  • [80] Neil Thapen. The weak pigeonhole principle in models of bounded arithmetic. PhD thesis, University of Oxford, 2002.
  • [81] Neil Thapen. How to fit large complexity classes into TFNP. CoRR, abs/2412.09984, 2024. doi:10.48550/arXiv.2412.09984.
  • [82] Christopher Umans. Pseudo-random generators for all hardnesses. J. Comput. Syst. Sci., 67(2):419–440, 2003. doi:10.1016/S0022-0000(03)00046-1.