One-Way Functions and Boundary Hardness of Randomized Time-Bounded Kolmogorov Complexity
Abstract
We revisit the question of whether worst-case hardness of the time-bounded Kolmogorov complexity problem, – that is, determining whether a string is “structured” (i.e., ) or “random” (i.e., ) – suffices to imply the existence of one-way functions (OWF). Liu-Pass (CRYPTO’25) recently showed that worst-case hardness of a boundary version of – where, roughly speaking, the goal is to decide whether given an instance , (a) is -random (i.e., ), or just close to -random (i.e., but ) – characterizes OWF, but with either of the following caveats (1) considering a non-standard notion of probabilistic , as opposed to the standard notion of , or (2) assuming somewhat strong, and non-standard, derandomization assumptions.
In this paper, we present an alternative method for establishing their result which enables significantly weakening the caveats. First, we show that boundary hardness of the more standard randomized problem suffices (where randomized is defined just like except that the program generating the string may be randomized).
As a consequence of this result, we can provide a characterization also in terms of just “plain” under the most standard derandomization assumption (used to derandomize just into ) – namely .
Our proof relies on language compression schemes of Goldberg-Sipser (STOC’85); using the same technique, we also present the the first worst-case to average-case reduction for the exact problem (under the same standard derandomization assumption), improving upon Hirahara’s celebrated results (STOC’18, STOC’21) that only applied to a gap version of the problem, referred to as , where the goal is to decide whether or and under the same derandomization assumption.
Keywords and phrases:
One-way functions, Time-Bounded Kolmogorov Complexity, Worst-case to Average-case ReductionsFunding:
Yanyi Liu: Supported in part by NSF Award CNS 2149305.Copyright and License:
2012 ACM Subject Classification:
Theory of computation Computational complexity and cryptographyEditor:
Shubhangi SarafSeries and Publisher:
Leibniz International Proceedings in Informatics, Schloss Dagstuhl – Leibniz-Zentrum für Informatik
1 Introduction
A one-way function [5] (OWF) is a function that can be efficiently computed (in polynomial time), yet no probabilistic polynomial-time () algorithm can invert with inverse polynomial probability for infinitely many input lengths . Whether one-way functions exist is unequivocally the most important open problem in Cryptography (and arguably the most important open problem in the theory of computation, see e.g., [32]): OWFs are both necessary [27] and sufficient for many of the most central cryptographic primitives and protocols (e.g., pseudorandom generators [3, 13], pseudorandom functions [9], private-key encryption [10], digital signatures [49], commitment schemes [44], identification protocols [6], coin-flipping protocols [2], and more). These primitives and protocols are often referred to as private-key primitives, or “Minicrypt” primitives [26] as they exclude the notable task of public-key encryption [5, 48]. Additionally, as observed by Impagliazzo [11, 26], the existence of a OWF is equivalent to the existence of polynomial-time method for sampling hard solved instances for an language (i.e., hard instances together with their witnesses).
Worst-case Characterizations of OWFs.
We here focus on the question of whether there exists a natural complexity-theoretic problem whose worst-case hardness characterizes OWFs. Recently, [33] presented an average-case characterization of OWFs through a natural computational problem – the time-bounded Kolmogorov complexity problem [31, 50, 30, 12]: Let the Kolmogorov complexity of a string , denoted , be defined as the length of the shortest program that outputs , and the -bounded version, , be defined as the length of the shortest program that outputs the string within time . While determining (or deciding for some particular threshold ) the Kolmogorov complexity of a string is uncomputable, as surveyed by Trakhtenbrot [51], the problem of efficiently computing/deciding -complexity (when is a polynomial) predates the theory of -completeness and was studied in the Soviet Union since the 60s as a candidate for a problem that requires “brute-force search”. Indeed, so far no non-trivial attacks are known in the uniform setting (and only very recently a non-trivial attack of circuit size roughly was presented in the non-uniform setting [43, 18]).
The work of [33] showed that mild average-case hardness of computing (or simply deciding whether it is large or small) w.r.t. the uniform distribution over instances, characterizes the existence of OWF. (Subsequently, several other average-case characterizations of OWF were obtained, e.g., [34, 47, 1, 35, 36].)
Recently, also worst-case characterizations of OWFs were obtained [37, 20], considering some variants of the time-bounded Kolmogorov complexity problem. In particular:
-
[37] characterize OWFs through the problem of determining whether is large or small, but restricting attention to instances with very large unbounded Kolmogorov complexity (i.e., ), or alternatively, to strings whose so-called “computational depth”, , is small (whereby “restricting attention” means that we consider worst-case hardness of a promise problem that only considers those instances; that is, any efficient algorithm must fail on one of those instances in the promise).
-
[20] provide a worst-case characterization of a variant of OWFs, referred to as infinitely-often OWFs, through the problem of “estimating the probability that a random time-bounded program outputs a certain string” (which can be shown to be related to the notion of probabilistic ), while restricting attention to instances satisfying an analog of small computational depth (with respect to this complexity notion).111An alternative type of worst-case characterization of OWFs was also obtained in [20], where it is shown that OWFs exists iff and a certain “distributional”- problem is -complete w.r.t. a certain type of (restricted) reductions. Note that simply worst-case hardness of the distributional problem alone is not sufficient to get OWFs. Rather, the characterization is in terms of both a hardness assumption () combined with an feasibility assumption (the existence of a certain type of -completeness reduction).
The unappealing aspect of the above characterizations is that once we add the “conditioning”, it becomes less clear what the intuitive interpretation of the problems is. On a technical level, the property that we condition on (i.e., computational depth being small, or Kolmogorov complexity being large) is not decidable.
Boundary-Hardness of Time-bounded Kolmogorov Complexity.
Very recently, [39] presented an approach towards a more natural problems whose worst-case hardness may characterize OWFs. Ideally, we would like to show that the classic problem – that is, the problem of simply determining whether a string is “structured” (i.e., ) or “random” (i.e., ) – suffices to imply the existence of one-way functions (OWF). [39] recently showed that worst-case hardness of a boundary version of – where, roughly speaking, the goal is to decide whether given an instance , deciding whether (a) is -random (i.e., ), or just close to -random (i.e., but ) – characterizes OWF, but with either of the following caveats:
-
Considering a non-standard notion of probabilistic [8], as opposed to the standard notion of .222Roughly speaking, for some constant (think of ), is defined as the smallest such at with probability over the choice of a random string , there exists a program of length at most that generates if being provided . One can think of this notion as time-bounded Kolmogorov complexity in the presence of a “Common Random String” (CRS), but note that program is allowed to depend on the CRS.
Our Results in a Nutshell.
Our main results will improve upon the above result in two ways:
-
Instead of showing an unconditional characterization in terms of probabilistic , we will show that the same result holds with respect to the notion of randomized [41, 46, 40]. Roughly speaking, randomized – denoted – is defined as the length of the shortest randomized program that generates with some probability (think of as .) Note that despite the similarity in the names between probabilistic- and randomized-, there is a major conceptual difference. In , the same program is supposed to generate with high probability over the randomness string (i.e., generates with high probability over ) – this is arguably the most natural way of generalizing to consider randomized programs; in contrast, for , we simply require that with high probability over the string , there exists some “short” program such that generates .
1.1 Our Results
To state our results more formally, let us first recall the standard problem; the randomized version of the problem, , is defined analogously (see Section 4). For any polynomials , let denote the promise problem consisting of:
-
: , .
-
: , ,
In essence, the goal is to distinguish between so-called time-bounded Kolmogorov-random strings (i.e., s.t. ) and those that are not. We say that if for all polynomials , , .333Recall that denotes the class of promise problems that admit a probabilistic polynomial time algorithm on infinitely many input lengths.
The boundary version (denoted by ) is defined as the promise problem consisting of:
-
: , and .
-
: , .
and is analogously defined. In other words, the problem requires:
Deciding whether a string is (a) time-bounded Kolmogorov random, or (b) just “near” time-bounded Kolmogorov random (i.e., having “intermediate” time-bounded Kolmogorov complexity.)
We are now ready to state our main theorems:
Theorem 1.1.
Assume there exists some constant such that . Then, OWFs exist if and only if .
Theorem 1.2.
OWFs exist if and only if .
Comparing our Derandomization Assumption to the one used in [39].
As mentioned, we are relying on the most classic derandomization assumption in the literature; that is, the existence of a function computable in time for some constant , that cannot be computed in by circuits of size for some constant . This is the assumption used by [28] to demonstrate that .
This assumption was first strengthened by [29] to require hardness against non-deterministic circuits. Following [4], [39], considered an even stronger version, requiring the existence of such that for every (i.e., we require a hierarchy of hard functions), there exists a function running in time that is secure against non-deterministic attackers with running time but still only advice. Note that compared to this assumption, ours is weaker in two respects: (a) we no longer require non-deterministic hardness, and most importantly (b) we no longer require an infinite hierarchy of hard functions over a fixed input length.
Does Worst-case Hardness of just Suffice?
The above results still require worst-case of the boundary version of . As mentioned, ideally, we would like to base OWFs on worst-case hardness of just the problem. Beyond being interesting in its own right, this question is motivated by the fact that in recent years, there has been great progress (see e.g. [21, 25, 22] [23, 36, 17, 24]) towards showing that problem is -complete. Notably, if OWFs can be based on the worst-case hardness of this problem, and the problem can be shown to be -complete, then we would base OWFs on just the assumption that (a problem left open since the work by Diffie and Hellman [5], and referred to as the “holy-grail”).
Intriguingly, a result by Hirahara [14] shows that if we were to consider a “gap” version of the problem, referred to as , where the goal is to distinguish between such that and , for all , then worst-case hardness of this problem implies what is referred to as errorless average-case hardness (w.r.t. deterministic algorithms). Roughly speaking, errorless average-case hardness refers to average-case hardness w.r.t. algorithms that either provide the right answer or (and should only be output with small probability). In other words, hardness holds w.r.t. algorithms that “know” when they fail. This is in contrast to the notion of two-sided error hardness where we allow the attacker to fail (without knowing it) on some small set of inputs. (Unfortunately, to apply the result of [33], we require the more standard notion of two-sided error average-case hardness, so this result is not helpful in terms of getting OWFs, at least given current machinery.) Hirahara [16], subsequently showed that under the assumption that , the same result can be established when the “gap” is just , as opposed to (i.e., for ).
As our final result, and using the same technique as the proof of our main theorem, we show how to strengthen Hirahara’s result to directly apply to (as opposed to the gap version of this problem).
Theorem 1.3.
Assuming for some , if and only if (which in turn implies that ).
As mentioned, Hirahara [14, 15] shows (under the same derandomization assumption) a worst-case to average-case reduction for a gap version of the problem, where there is a gap of between the complexity of YES and NO instances (on top of the gap between the running times). Theorem 1.3 improves the results of [14, 15, 8] by removing the gap in the large threshold regime (i.e, when deciding whether is larger or smaller than ): As far as we know, Theorem 1.3 is the first worst-case to average-case reduction for simply the problem (and not the problem).
We mention, however, that while our proof technique is very different from the one in [14, 15], and is also significantly simpler, it has the disadvantage that it only works in the large threshold regime (i.e., ) , whereas Hirhara’s technique also works for smaller thresholds (but again, only for the gap version).
Additionally, if focusing on (as opposed to ) and only considering average-case hardness against deterministic polynomial-time algorithms (as in [14]), then we can show unconditionally obtain a worst-case to average-case reduction.
Theorem 1.4.
if and only if (which in turn implies that ).
Taken together, our results provide the appealing characterization of the worst-case hardness of the time-bounded Kolmogorov complexity problem (under standard derandomization assumption) provided in Figure 1.
1.2 Proof Overview
We proceed to a proof outline of our results. As mentioned, [39] obtained a characterization of OWFs through the boundary hardness of time-bounded Kolmogorov complexity, but considering a non-standard notion of probabilistic , . We focus on explaining how to prove that the same result holds w.r.t. the notion of randomized , . Note that the notion of was crucial in obtaining a OWF from the boundary hardness. So we will primarily discuss how to prove the “if” direction of Theorem 1.2. The converse direction will follow mostly from [39], using the techniques from [37], which passed through the notion of an “entropy-preserving PRG” constructed in [33] (and later improved in [37]). Our weakening of the derandomization assumption in Theorem 1.1 follows from the fact that we are able to deal with (as opposed to ).
Towards this, it is instructive to briefly review how [39] established the existence of OWFs from the boundary hardness of .
Why is Needed in [39].
Roughly speaking, the results of [39] first rely on the result of [33, 38] showing that average-case hardness of implies the existence of OWFs. In more detail, [33, 38] present a reduction for solving on average given a OWF inverter (for their specific OWF) (referred to as the LP20 reduction). [39] showed that the LP20 reduction, in fact, also correctly decides on all instances that “are along boundary”.
This was proved in the following two steps: (1) [39] showed that elements on which the reduction fails can be sampled with probability for any , by an algorithm running in time ; and (2) relying on a so-called “Coding Theorem” [42], [39] proved the complexity of an element is, roughly, at most the logarithm of the inverse of the probability by which it can be sampled. Taken together, the reduction only fails on the elements of small complexity (roughly at most ), and thus will succeed on elements of either “intermediate” or “large” complexity – instances that are along the boundary.
We plan to rely on the same outline: take the LP20 reduction for and prove that it is correct on all boundary instances. However, to make the above proof also work w.r.t. , we would need a “Coding Theorem” for without assuming any derandomization assumptions, which is still open. A step towards such a Coding Theorem was recently taken by Hirahara et al [19], which shows that an “average-case coding theorem for ” can be achieved assuming access to a OWF inverter. While in our setting, it is allowed to assume a OWF inverter (since we aim to establish the existence of OWFs), their notion unfortunately will not suffice for us – we require a coding theorem (that achieves non-trivial compression) for all strings in the support of the sampler.
Language Compression and .
While it does not apply directly, it is instructive here to recall the techniques used in [19] to achieve -type compression. [19] relied on the result of Goldberg and Sipser [7], in which they showed that all sparse languages in admit a so-called language compression scheme, where a language compression scheme allows to compress all instances in a language by logarithmically many bits. [19] observed that admitting a language compression also implies that every instance in the language has small complexity (at most ).
Thus, it would be sufficient for us to prove that the language of all the elements on which the LP20 reduction fails, denoted by the language , admits a language compression scheme. is indeed a sparse language (since, as mentioned, [39] showed that any instance in can be sampled with probability – there can be at most of them). However, it is unclear that is in : to check whether the reduction fails, we would need a “-witness”, which cannot be efficiently found.
Our key observation is that instead of considering , the set of bad instances, we can consider the set of -witnesses of bad instances, denoted as . We first remark that as a corollary of the [33] reduction, we have that not only is sparse, but also . In addition, is in since one can produce the bad instance from a bad witness (by running the witness as a program), run the reduction on the instance, and check whether it fails using the witness. Finally, notice that if all -witness of bad instances have small complexity, the instances themselves will also have small -complexity, since, again, the instances can be obtained from running the witnesses. The above simple approach suffices to show that worst-case hardness of deciding whether is large or small conditioned on . To obtain the full result from just boundary hardness of we need to adjust the OWF construction from [33] to consider also randomized programs, which requires additional care – we refer the reader to the formal proof for the details.
Dealing with just using Derandomization.
We highlight that the only reason the above approach requires using as opposed to is that the decoding (“decompression”) algorithm of [7] is randomized. However under standard derandomization assumption, we can simply derandomize this algorithm (by enumerating all seeds to a complexity-theoretic PRG and picking the most likely output) and obtain a deterministic decoding. This suffices to show the same result but starting with just boundary hardness of .
Let us briefly mention how this contrasts with the approach in [39] (and in particular, how we are able to weaken the derandomization assumption). There, the authors established an unconditional characterization of OWFs in terms of boundary of and next relied on a strong derandomization assumption to obtain a similar bound w.r.t. ; the fact that we can get the characterization in terms of is what enables us to use a much weaker derandomization assumptions.444We highlight that it is an open problem that obtain a tight bound on the distance of and based on standard derandomization assumptions.
A New Worst-case to Errorless Average-case Reduction.
We finally outline the idea behind our new worst-case to average-case reduction for under derandomization assumptions. For simplicity, we here argue that worst-case hardness implies errorless hardness against deterministic algorithms (but the argument also readily extends to the case of randomized heuristic with some care). The key point is that instances on which an errorless reduction fails are sparse and explicitly identifiable (since the heuristic needs to output ). Thus, we can again use the language compression algorithm of [7] to compress them, and thus by the above argument, their -complexity must be bounded by (under derandomization assumptions, and unconditionally if considering ). So any errorless heuristic can be turned in to a worst-case decider for (resp. ) by simply outputting whenever the heuristic outputs !
2 Preliminaries
2.1 One-Way Functions
We recall the definition of one-way functions [5]. Roughly speaking, a function is one-way if it is polynomial-time computable, but hard to invert for attackers.
Definition 2.1.
Let be a polynomial-time computable function. is said to be a one-way function (OWF) if for every algorithm , there exists a negligible function such that for all ,
We may also consider a weaker notion of a weak one-way function [52], where we only require all attackers to fail with probability noticeably bounded away from 1:
Definition 2.2.
Let be a polynomial-time computable function. is said to be an -weak one-way function (-weak OWF) if for every algorithm , for all sufficiently large ,
We say that is simply a weak one-way function (weak OWF) if there exists some polynomial such that is a -weak OWF.
Yao’s hardness amplification theorem [52] shows that any weak OWF can be turned into a (strong) OWF.
Theorem 2.3 ([52]).
Assume there exists a weak one-way function. Then there exists a one-way function.
2.2 Language Compression Schemes for
We introduce the notion of language compression we rely on. We highlight here that we do not consider the complexity of the compressing algorithm (and the compression can be just existential).
We generalize the notion of language compression to promise problems; additionally, we want the definition to apply for all input lengths where the language is sparse (i.e, the compression works input-length by input-length). We say that a promise problem is -sparse over input of length if .
Definition 2.4.
For any promise problem , we say that admits a -language-compression scheme if there exists a decompressing such that the following holds. For all sufficiently large , if is -sparse over input of length , then for any , there exists a string , such that
We recall the language compression scheme of Goldberg and Sipser [7]. (We note that while the language compression scheme considers languages in , as pointed out in [7], their scheme also works when the language is a promise problem; and we observe that the compression works in an input-length by input-length manner.)
Theorem 2.5 ([7]).
For any promise problem , any integer , admits a -language-compression scheme.
2.3 Errorless Average-case Complexity
We finally recall the definition of errorless average-case complexity with respect to the uniform distribution on instances.
Definition 2.6 ().
For a promise problem , we say that if for all polynomial , there exists a probabilistic polynomial-time heuristic , such that for all sufficiently large , for every (or ),555We remark that the constant can be made arbitrarily small – any constants bounded away from works as we can amplify it using a standard Chernoff-type argument.
(where if and if ) and
We will refer to problems in as problems that admit errorless heuristics. To better understand the class , it may be useful to compare it to the class (problems solvable by deterministic errorless heuristics): if for every polynomial , there exists some deterministic polynomial-time heuristic such that (a) for every input , outputs either the correct answer or , and (b) the probability over uniform -bit inputs that outputs is bounded by . In other words, the only way an errorless heuristic may make a “mistake” is by saying (“I don’t know”). is simply the natural “-analog” of where the heuristic is allowed to be randomized.
3 Time-Bounded Kolmogorov Complexity Problems
Before presenting the main results in this work, let us formally define the computational problems needed in our results.
We first recall the notion of time-bounded Kolmogorov complexity [31, 50, 30, 12], and its randomized extension [41, 46, 40].
Time-bounded Kolmogorov Complexity [31, 50, 30, 12].
Roughly speaking, let the -bounded Kolmogorov complexity of a string , denoted , be defined as the length of the shortest program such that outputs the string within time . Formally, fix to be a universal Turing machine. For any string , define
Randomized Time-bounded Kolmogorov complexity [41, 46, 40].
Roughly speaking, the randomized -time-bounded Kolmogorov complexity, , of a string is the length of the shortest randomized program such that (where denotes the program with its random tape fixed to be ) outputs in steps for at least a fraction of its random tapes .
Formally, fix to be a universal Turing machine. For any string , define
where for any (randomized) program , let denote the deterministic program where the random tape is fixed to be . We will also consider a generalized version where the probability can be replaced by : For any , we define as the same quantity but with replaced by .
The Problem and Its Boundary Version.
For any polynomials , , define the time-bounded Kolmogorov complexity problem (denoted by ) as the promise problem consisting of
-
: strings , .
-
: strings , .
In addition, we can define its boundary version (denoted by ) as
-
: strings , and .
-
: strings , .
Problem and Its Boundary Version.
For any polynomials , , and a parameter , define the time-bounded randomized Kolmogorov complexity problem (denoted by ) as the promise problem consisting of
-
: strings , .
-
: strings , .
We remark that the threshold (compared with the threshold in ) is shifted from to to ensure that the NO instance set is not empty.
Analogously, we can also define the boundary version (denoted by ) as
-
: strings , and .
-
: strings , .
We will consider and simply denote (resp ) by (resp ) where .
Finally, for (resp ), we say that (resp ) if for all polynomials , , (resp ).
4 New Characterizations of OWF Through Boundary Hardness
4.1 Main Theorems
We are ready to describe the main results in this work. Our first result shows that the existence of OWFs is equivalent to the worst-case hardness of the boundary version of .
Theorem 4.1.
OWFs exist if and only if .
Proof.
The theorem follows from Theorem 4.3 (which together with Yao’s hardness amplification Theorem, Theorem 2.3), proves the “if” direction; the “only-if” direction is proved in Theorem 4.7.
Our next result extends this theorem to obtain a characterization of OWFs based on the worst-case hardness of the boundary version of ; this time, however, we will rely on a standard derandomization assumption.
Theorem 4.2.
Assume that there exists a constant such that . Then, OWFs exist if and only if .
Proof.
4.2 OWFs from Boundary Hardness of
We first show a construction of OWF from the worst-case hardness of . Since can be derandomized to under standard derandomization assumption, this result will directly also extend to under the same derandomization assumption.
Theorem 4.3.
Assume that . Then, weak one-way functions exist.
Proof.
Consider any polynomial , and assume that for all , . We consider the function , which takes an input where , and for each , computes for each
where is the -bit prefix of (and the bit-string is interpreted as an integer ), and denotes the program with random tape . If there exists a string such that the majority among equals , it outputs
Otherwise (i.e., there is no majority among ) it outputs .
This function is only defined over some input lengths, but by an easy padding trick, it can be transformed into a function defined over all input lengths, such that if is weakly one-way (over the restricted input lengths), then will be weakly one-way (over all input lengths): simply truncates its input (as little as possible) so that the (truncated) input now becomes of length for some and outputs . This will decrease the input length by a polynomial factor (since is a polynomial) so the padding trick can be applied here.
We show that is a weak OWF (over the restricted input length). Let . We assume for contradiction that is not -weak one-way. (In the proof below, although the input length of we consider is for some , we will view as the “security parameter”, computing the running time and the inverting success probability in terms of the security parameter , but analyzing the one-wayness of on input length . Since and are polynomially related, we can still conclude that is weak one-way.) Then, there exists a attacker that inverts with probability at least for infinitely many .
We turn to constructing a algorithm deciding whether an input is of -complexity . Our algorithm , on input , samples random strings . Then, it runs for every where is represented as a -bit string. In addition, checks whether successfully finds a valid pre-image of on . Finally, outputs YES if and only if the shortest index on which succeeds in inverting is . (Otherwise, output NO.) Since runs in polynomial time, the algorithm will also terminate in polynomial time.
We will show that the algorithm decides the promise problem for some polynomial (which will be fixed later). Toward this, we consider a promise problem defined as follows: For any , we view as where and . Let denote the probability that fails to invert on where are sampled at random. We define if (and if ). Since both and are efficient, we have that .
Pick . By Theorem 2.5, there exists a -language-compression scheme for (where denotes the input length for ). Let denote the decoding (or decompressing) algorithm for the scheme.
Select to be the runtime bound for the following algorithm : Run the decoding algorithm (on some valid compression of ) to get an instance , . Since can be viewed in the form of , let and let be the -bit prefix of the string . Run over a random random-tape for of steps and output the output of .
Now we are ready to show that decides for infinitely many input lengths . Fix some sufficiently large input length where inverts the function with probability on . We first show that if is a NO instance of , outputs NO with probability .
Claim 4.4.
If , then outputs NO with probability .
Proof.
We show that outputs YES with probability . Recall that will output YES if there exists an index and a program of length such that outputs for a half fraction of random-tapes , . We refer to such as being “bad”.
We turn to showing that among all possible choices of random tapes, there are at most a fraction of them being bad. Consider any program of length . Since , by a standard Chernoff bound, there is no more than fraction of such that outputs for a majority of . Taking a Union bound over all possible , we conclude that the probability that is bad is at most
when is sufficiently large.
Finally, we show that if is a YES instance of , will output YES with probability . Since , it follows that there exists a program of length such that will output within steps over at least fraction of random-tape . Let denote . If finds a valid pre-image of on , will outputs YES. We show that this will happen with high probability.
Claim 4.5.
will return a valid pre-image of with probability over random .
Proof.
Since is a -witness of of length , by the Chernoff bound, there exists a valid pre-image of on for at least a fraction of . Assume for contradiction that fails to find a valid pre-image on with probability . We will show that
which is a contradiction since is a YES instance of .
Consider any string of length such that its -bit prefix is . By taking a Union bound, it follows that fails to invert on with probability . Thus, the concatenation will be an YES-instance of (where is the promise problem we defined before). We next show that is -sparse on input of length . Suppose not. Then, it follows that the inverter fails to invert with probability
which contradicts to the fact that is a good inverter on . Recall that is the decoding algorithm for the -language-compress scheme for . Thus, there exists a string (i.e., the compression of ), , such that outputs with probability . It follows (from our definition of algorithm ) that outputs with probability . Note that can be written as a program of length and it outputs with probability within steps (due to our choice of ), and therefore
which completes the proof.
We proceed to extend this result to work also for under standard derandomization assumptions.
Theorem 4.6.
Assume that there exists a constant such that . Then, implies that weak one-way functions exist.
Proof (sketch).
The proof will proceed largely as the proof of Theorem 4.3, but with minor adjustments. In particular, we will go back to the original OWF construction of [33] (which is exactly the same as the one in Theorem 4.3 except that we no longer consider randomized programs , so no longer including the random strings ). Specifically, we define
where notations are as in Theorem 4.3.
With the new OWF construction, we will be needing to adapt the decider to deal with . The new algorithm follows roughly , except that (1) no longer needs to sample random strings ; and (2) it replaces the threshold with (as required by the Theorem).
Finally, we need to show that Claim 4.4 and Claim 4.5 hold w.r.t. , the new construction , threshold , and algorithm . Claim 1 trivially holds since outputs YES only when it finds a -witness of length . Note that , so there will never be such a witness when . Claim 2 is where we require some additional work. Note that to conclude Claim 2, we need to show that if fails, then (as opposed to ). Recall that in Claim 2, it is shown that the string can be generated w.p by a randomized decoding algorithm on input a string of length . So to demonstrate the above claim, we just need to show how to derandomize , which can be done using standard techniques. In more detail, let denote the circuit that receives a random-tape for as input, and outputs 1 if the random-tape leads to output . By [28], under the assumption of the theorem, we have that there exists a (complexity-theoretic) PRG with seed length that fools . Thus, can be derandomized by emulating on all (pseudo) random-tapes generated by running on all possible seeds, and outputting the string that appears the most often.
4.3 Boundary Hardness of from OWFs
We here prove the worst-case hardness of from the existence of OWFs.
Theorem 4.7.
Assume that OWFs exist. Then, .
Proof.
The proof for this theorem closely follows from the proofs in [39], in which they show that OWFs imply the boundary hardness of the same problem but with respect to another complexity measure (). Their proof proceeds in two steps: First, they show that OWFs implies the existence of a so-called conditionally-secure entropy-preserving pseudorandom generator [33] (Cond-EP PRG); second, they show that a Cond-EP PRG (with certain parameters) implies the hardness of the boundary problem (in [39, Lemma 6.3]).
In order to prove the above theorem, the first step follows from the same proof. The following two claims are needed in the proof for the second step: (1) the output of Cond-EP PRG has “small” -complexity (which follows from the fact that the output of Cond-EP PRG can be generated using a small program (without using the random tape)) and (2) for any string and runtime bound , (c.f., [8, Proposition 19]). Thus, the second step follows from the proof of [39, Lemma 6.3] with replaced by .
5 New Worst-case to (Errorless) Average-case Reductions
We move on to proving that the worst-case hardness of is equivalent to its errorless average-case hardness under standard derandomization assumptions; additionally, we show the same result unconditionally holds for (i.e., without derandomization assumptions).
We start by showing the result for as its proof is simpler; the proof for is very similar but requires some additional work to deal with the derandomization.
Theorem 5.1.
If for all polynomials , , it holds that
then for all polynomials , , it holds that
Moreover, the converse is also true.
Proof.
To see that the backward direction is true, simply consider any and notice that if , the worst-case algorithm for is trivially an (average-case) errorless heuristic for the same problem, and thus .
We turn to proving the forward direction. Consider any polynomial , and let be a polynomial. Suppose for contradiction that . Therefore, there exists a deterministic errorless heuristic for such that for all sufficiently large . Let be the set of all ’s such that ; note that this language is in (by the definition of ) and is also sparse (by the above). Thus, by Theorem 2.5, admits a -language-compression scheme for . Let be the decompressing algorithm in the language compression scheme, and let be its running time. Note that for any , , there exists some string of length (i.e., the compression of ) such that with probability ; thus
We are now ready to present an efficient algorithm for , which will be a contradiction. On input , our algorithm simply runs ; if outputs , outputs 1, and otherwise it simply outputs whatever outputs. Note that when does not output , its answer needs to be correct. And when outputs , as argued above, we have that , and thus will also provide the right answer.
We move on to show the result for (under standard derandomization assumptions).
Theorem 5.2.
Assume that there exists a constant such that . If for all polynomials , , it holds that
then for all polynomials , , it holds that
Moreover, the converse is also true.
Proof.
Let (and recall that for any , we have that and for any , we have that ).
To see that the backward direction is true, simply consider any and notice that if , the worst-case algorithm for is trivially an (average-case) errorless heuristic for the same problem, and thus .
We turn to proving the forward direction. Consider any polynomial , and let be a polynomial. Suppose for contradiction that . Therefore, there exists an errorless heuristic for such that for all sufficiently large . Note that for any , , we have that (and also for any such that , ).
We will be needing the following promise problem , where YES instances of consist of all instances on which (and NO instances are those on which ). Since we assume that there exists a constant such that , [28]. Let be the language that derandomizes to. Since is a good errorless heuristic, on all sufficiently large , it holds that
Thus, by Theorem 2.5, admits a -language-compression scheme for . Let be the decompressing algorithm in the language compression scheme. We are going to show that there exists a polynomial such that for any sufficiently long , ,
Consider any such , and let be the string of length (i.e., the compression of ) guaranteed to exist by Theorem 2.5. Notice that will output with probability . It remains to derandomize using the derandomization assumption. Let denote the circuit that receives a random-tape for as input, and outputs 1 if the random-tape leads to output on input . By [28], there exists a (complexity-theoretic) PRG with seed length that fools for any such . Thus, can be derandomized by emulating on all (pseudo) random-tapes generated by running on all possible seeds, and outputting the string that appears the most often. Denote this algorithm by , and notice that runs in polynomial time. Let be a polynomial that upper bounds the running time of . Since the machine with and hardcoded will generate in time and it has description length , we conclude that
We are now ready to present an efficient algorithm for , which will be a contradiction. On input , our algorithm first checks whether . If so, simply outputs 1. Otherwise, outputs if (otherwise it output 0). Observe that runs in polynomial time (since and is a algorithm).
We turn to analysis the correctness of . For any such that (namely that is a YES instance), if , it follows that outputs 1. If , it follows that outputs with probability at most . Since outputs either or 1 with probability at least , by a Union bound, output 1 with probability at least . For any such that (namely that is a NO instance), it follows that . In addition, , and thus outputs either or with probability , and therefore output 0 with probability .
References
- [1] Eric Allender, Mahdi Cheraghchi, Dimitrios Myrisiotis, Harsha Tirumala, and Ilya Volkovich. One-way functions and a conditional variant of MKTP. Electron. Colloquium Comput. Complex., 28:9, 2021.
- [2] Manuel Blum. Coin flipping by telephone - A protocol for solving impossible problems. In COMPCON’82, Digest of Papers, Twenty-Fourth IEEE Computer Society International Conference, San Francisco, California, USA, February 22-25, 1982, pages 133–137. IEEE Computer Society, 1982.
- [3] Manuel Blum and Silvio Micali. How to generate cryptographically strong sequences of pseudo-random bits. SIAM Journal on Computing, 13(4):850–864, 1984. doi:10.1137/0213053.
- [4] Lijie Chen and Roei Tell. Simple and fast derandomization from very hard functions: eliminating randomness at almost no cost. In Proceedings of the 53rd Annual ACM SIGACT Symposium on Theory of Computing, pages 283–291, 2021. doi:10.1145/3406325.3451059.
- [5] Whitfield Diffie and Martin Hellman. New directions in cryptography. IEEE Transactions on Information Theory, 22(6):644–654, 1976.
- [6] Uriel Feige and Adi Shamir. Witness indistinguishable and witness hiding protocols. In STOC ’90, pages 416–426, 1990. doi:10.1145/100216.100272.
- [7] Andrew Goldberg and Michael Sipser. Compression and ranking. In Proceedings of the seventeenth annual ACM symposium on Theory of computing, pages 440–448, 1985. doi:10.1145/22145.22194.
- [8] Halley Goldberg, Valentine Kabanets, Zhenjian Lu, and Igor C Oliveira. Probabilistic kolmogorov complexity with applications to average-case complexity. In Shachar Lovett, editor, 37th Computational Complexity Conference (CCC 2022), volume 234 of Leibniz International Proceedings in Informatics (LIPIcs), pages 16:1–16:60, Dagstuhl, Germany, 2022. Schloss Dagstuhl – Leibniz-Zentrum für Informatik. doi:10.4230/LIPIcs.CCC.2022.16.
- [9] Oded Goldreich, Shafi Goldwasser, and Silvio Micali. How to construct random functions. In FOCS, 1984.
- [10] Shafi Goldwasser and Silvio Micali. Probabilistic encryption. J. Comput. Syst. Sci., 28(2):270–299, 1984. doi:10.1016/0022-0000(84)90070-9.
- [11] Yuri Gurevich. The challenger-solver game: variations on the theme of p=np. In Logic in Computer Science Column, The Bulletin of EATCS. 1989.
- [12] J. Hartmanis. Generalized kolmogorov complexity and the structure of feasible computations. In 24th Annual Symposium on Foundations of Computer Science (sfcs 1983), pages 439–445, November 1983. doi:10.1109/SFCS.1983.21.
- [13] Johan Håstad, Russell Impagliazzo, Leonid A. Levin, and Michael Luby. A pseudorandom generator from any one-way function. SIAM J. Comput., 28(4):1364–1396, 1999. doi:10.1137/S0097539793244708.
- [14] Shuichi Hirahara. Non-black-box worst-case to average-case reductions within NP. In 59th IEEE Annual Symposium on Foundations of Computer Science, FOCS 2018, pages 247–258, 2018. doi:10.1109/FOCS.2018.00032.
- [15] Shuichi Hirahara. Non-disjoint promise problems from meta-computational view of pseudorandom generator constructions. In Shubhangi Saraf, editor, 35th Computational Complexity Conference (CCC 2020), volume 169 of Leibniz International Proceedings in Informatics (LIPIcs), pages 20:1–20:47, Dagstuhl, Germany, 2020. Schloss Dagstuhl – Leibniz-Zentrum für Informatik. doi:10.4230/LIPIcs.CCC.2020.20.
- [16] Shuichi Hirahara. Symmetry of information in heuristica. Manuscript, 2021.
- [17] Shuichi Hirahara. Np-hardness of learning programs and partial mcsp. In 2022 IEEE 63rd Annual Symposium on Foundations of Computer Science (FOCS), pages 968–979. IEEE, 2022. doi:10.1109/FOCS54457.2022.00095.
- [18] Shuichi Hirahara, Rahul Ilango, and Ryan Williams. Beating brute force for compression problems. Electronic Colloquium on Computational Complexity, 2024. URL: https://eccc.weizmann.ac.il/report/2023/171/.
- [19] Shuichi Hirahara, Zhenjian Lu, and Mikito Nanashima. Optimal coding for randomized kolmogorov complexity and its applications. arXiv preprint arXiv:2409.12744, 2024. doi:10.48550/arXiv.2409.12744.
- [20] Shuichi Hirahara and Mikito Nanashima. Learning in pessiland via inductive inference. In 64nd Annual Symposium on Foundations of Computer Science (FOCS). IEEE, 2023. doi:10.1109/FOCS57990.2023.00033.
- [21] Rahul Ilango. Approaching MCSP from above and below: Hardness for a conditional variant and ACˆ0[p]. In 11th Innovations in Theoretical Computer Science Conference, ITCS 2020, pages 34:1–34:26, 2020. doi:10.4230/LIPIcs.ITCS.2020.34.
- [22] Rahul Ilango. The minimum formula size problem is (eth) hard. In 2021 IEEE 62nd Annual Symposium on Foundations of Computer Science (FOCS), pages 427–432. IEEE, 2021. doi:10.1109/FOCS52979.2021.00050.
- [23] Rahul Ilango. Constant depth formula and partial function versions of mcsp are hard. SIAM Journal on Computing, (0):FOCS20–317, 2022.
- [24] Rahul Ilango. Sat reduces to the minimum circuit size problem with a random oracle. In 2023 IEEE 64th Annual Symposium on Foundations of Computer Science (FOCS), pages 733–742, 2023. doi:10.1109/FOCS57990.2023.00048.
- [25] Rahul Ilango, Bruno Loff, and Igor Carboni Oliveira. NP-hardness of circuit minimization for multi-output functions. In 35th Computational Complexity Conference, CCC 2020, pages 22:1–22:36, 2020.
- [26] Russell Impagliazzo. A personal view of average-case complexity. In Structure in Complexity Theory ’95, pages 134–147, 1995. doi:10.1109/SCT.1995.514853.
- [27] Russell Impagliazzo and Michael Luby. One-way functions are essential for complexity based cryptography (extended abstract). In 30th Annual Symposium on Foundations of Computer Science, Research Triangle Park, North Carolina, USA, 30 October - 1 November 1989, pages 230–235, 1989. doi:10.1109/SFCS.1989.63483.
- [28] Russell Impagliazzo and Avi Wigderson. P = BPP if e requires exponential circuits: Derandomizing the xor lemma. In STOC ’97, pages 220–229, 1997. doi:10.1145/258533.258590.
- [29] Adam R Klivans and Dieter van Melkebeek. Graph nonisomorphism has subexponential size proofs unless the polynomial-time hierarchy collapses. SIAM Journal on Computing, 31(5):1501–1526, 2002. doi:10.1137/S0097539700389652.
- [30] Ker-I Ko. On the notion of infinite pseudorandom sequences. Theor. Comput. Sci., 48(3):9–33, 1986. doi:10.1016/0304-3975(86)90081-2.
- [31] A. N. Kolmogorov. Three approaches to the quantitative definition of information. International Journal of Computer Mathematics, 2(1-4):157–168, 1968.
- [32] L. A. Levin. The tale of one-way functions. Problems of Information Transmission, 39(1):92–103, 2003. doi:10.1023/A:1023634616182.
- [33] Yanyi Liu and Rafael Pass. On one-way functions and Kolmogorov complexity. In 61st IEEE Annual Symposium on Foundations of Computer Science, FOCS 2020, Durham, NC, USA, November 16-19, 2020, pages 1243–1254. IEEE, 2020. doi:10.1109/FOCS46700.2020.00118.
- [34] Yanyi Liu and Rafael Pass. Cryptography from sublinear time hardness of time-bounded kolmogorov complexity. In STOC, 2021.
- [35] Yanyi Liu and Rafael Pass. On the possibility of basing cryptography on EXP BPP. In CRYPTO, 2021.
- [36] Yanyi Liu and Rafael Pass. On one-way functions from np-complete problems. In Proceedings of the 37th Computational Complexity Conference, pages 1–24, 2022. doi:10.4230/LIPIcs.CCC.2022.36.
- [37] Yanyi Liu and Rafael Pass. On one-way functions and the worst-case hardness of time-bounded kolmogorov complexity. Cryptology ePrint Archive, page 1086, 2023. URL: https://eprint.iacr.org/2023/1086.
- [38] Yanyi Liu and Rafael Pass. One-way functions and the hardness of (probabilistic) time-bounded kolmogorov complexity w.r.t. samplable distributions. In CRYPTO’23, 2023.
- [39] Yanyi Liu and Rafael Pass. Hardness along the boundary: Towards one-way functions from the worst-case hardness of time-bounded kolmogorov complexity. In CRYPTO, 2025.
- [40] Zhenjian Lu and Igor C Oliveira. An efficient coding theorem via probabilistic representations and its applications. In Nikhil Bansal, Emanuela Merelli, and James Worrell, editors, 48th International Colloquium on Automata, Languages, and Programming (ICALP 2021), volume 198 of Leibniz International Proceedings in Informatics (LIPIcs), pages 94:1–94:20, Dagstuhl, Germany, 2021. Schloss Dagstuhl – Leibniz-Zentrum für Informatik. doi:10.4230/LIPIcs.ICALP.2021.94.
- [41] Zhenjian Lu, Igor C Oliveira, and Rahul Santhanam. Pseudodeterministic algorithms and the structure of probabilistic time. In Proceedings of the 53rd Annual ACM SIGACT Symposium on Theory of Computing, pages 303–316, 2021. doi:10.1145/3406325.3451085.
- [42] Zhenjian Lu, Igor C Oliveira, and Marius Zimand. Optimal coding theorems in time-bounded kolmogorov complexity. In MikoLargeaj Bojańczyk, Emanuela Merelli, and David P. Woodruff, editors, 49th International Colloquium on Automata, Languages, and Programming (ICALP 2022), volume 229 of Leibniz International Proceedings in Informatics (LIPIcs), pages 92:1–92:14, Dagstuhl, Germany, 2022. Schloss Dagstuhl – Leibniz-Zentrum für Informatik. doi:10.4230/LIPIcs.ICALP.2022.92.
- [43] Noam Mazor and Rafael Pass. The non-uniform perebor conjecture for time-bounded kolmogorov complexity is false. Cryptology ePrint Archive, 2023.
- [44] Moni Naor. Bit commitment using pseudorandomness. J. Cryptology, 4(2):151–158, 1991. doi:10.1007/BF00196774.
- [45] Noam Nisan and Avi Wigderson. Hardness vs randomness. J. Comput. Syst. Sci., 49(2):149–167, 1994. doi:10.1016/S0022-0000(05)80043-1.
- [46] Igor Carboni Oliveira. Randomness and intractability in kolmogorov complexity. In Christel Baier, Ioannis Chatzigiannakis, Paola Flocchini, and Stefano Leonardi, editors, 46th International Colloquium on Automata, Languages, and Programming (ICALP 2019), volume 132 of Leibniz International Proceedings in Informatics (LIPIcs), pages 32:1–32:14, Dagstuhl, Germany, 2019. Schloss Dagstuhl – Leibniz-Zentrum für Informatik. doi:10.4230/LIPIcs.ICALP.2019.32.
- [47] Hanlin Ren and Rahul Santhanam. Hardness of KT characterizes parallel cryptography. Electron. Colloquium Comput. Complex., 28:57, 2021. URL: https://eccc.weizmann.ac.il/report/2021/057.
- [48] Ronald L. Rivest, Adi Shamir, and Leonard M. Adleman. A method for obtaining digital signatures and public-key cryptosystems (reprint). Commun. ACM, 26(1):96–99, 1983. doi:10.1145/357980.358017.
- [49] John Rompel. One-way functions are necessary and sufficient for secure signatures. In STOC, pages 387–394, 1990. doi:10.1145/100216.100269.
- [50] Michael Sipser. A complexity theoretic approach to randomness. In Proceedings of the 15th Annual ACM Symposium on Theory of Computing, 25-27 April, 1983, Boston, Massachusetts, USA, pages 330–335. ACM, 1983. doi:10.1145/800061.808762.
- [51] Boris A Trakhtenbrot. A survey of Russian approaches to perebor (brute-force searches) algorithms. Annals of the History of Computing, 6(4):384–400, 1984. doi:10.1109/MAHC.1984.10036.
- [52] Andrew Chi-Chih Yao. Theory and applications of trapdoor functions (extended abstract). In 23rd Annual Symposium on Foundations of Computer Science, Chicago, Illinois, USA, 3-5 November 1982, pages 80–91, 1982. doi:10.1109/SFCS.1982.45.