New Bounds for the Ideal Proof System in Positive Characteristic
Abstract
In this work, we prove upper and lower bounds over fields of positive characteristics for several fragments of the Ideal Proof System (IPS), an algebraic proof system introduced by Grochow and Pitassi (J. ACM 2018). Our results extend the works of Forbes, Shpilka, Tzameret, and Wigderson (Theory of Computing 2021) and also of Govindasamy, Hakoniemi, and Tzameret (FOCS 2022). These works primarily focused on proof systems over fields of characteristic 0, and we are able to extend these results to positive characteristic.
The question of proving general IPS lower bounds over positive characteristic is motivated by the important question of proving -Frege lower bounds. This connection was observed by Grochow and Pitassi (J. ACM 2018). Additional motivation comes from recent developments in algebraic complexity theory due to Forbes (CCC 2024) who showed how to extend previous lower bounds over characteristic to positive characteristic.
In our work, we adapt the functional lower bound method of Forbes et al. (Theory of Computing 2021) to prove exponential-size lower bounds for various subsystems of IPS. In order to establish these size lower bounds, we first prove a tight degree lower bound for a variant of Subset Sum over positive characteristic. This forms the core of all our lower bounds.
Additionally, we derive upper bounds for the instances presented above. We show that they have efficient constant-depth IPS refutations. This demonstrates that constant-depth IPS refutations are stronger than the proof systems considered above even in positive characteristic. We also show that constant-depth IPS can efficiently refute a general class of instances, namely all symmetric instances, thereby further uncovering the strength of these algebraic proofs in positive characteristic.
Keywords and phrases:
Ideal Proof Systems, Algebraic Complexity, Positive CharacteristicCategory:
Track A: Algorithms, Complexity and GamesFunding:
Amik Raj Behera: Supported by Srikanth Srinivasan’s start-up grant from the University of Copenhagen.Copyright and License:
![[Uncaptioned image]](x1.png)
2012 ACM Subject Classification:
Theory of computation Algebraic complexity theoryAcknowledgements:
We would like to thank Iddo Tzameret for his feedback and for sharing information about the independent work of Elbaz, Govindasamy, Lu, and Tzameret. NL thanks Tuomas Hakoniemi and Iddo Tzameret for several discussions on IPS proof systems.Editors:
Keren Censor-Hillel, Fabrizio Grandoni, Joël Ouaknine, and Gabriele PuppisSeries and Publisher:

1 Introduction
Propositional Proof Systems
A proof system consists of a set of axioms and inference rules. The goal is to start with the given set of axioms and apply the inference rules repeatedly to prove theorems (tautologies) within the proof system. A proof system is sound if it proves only true statements and it is complete if it proves all true statements. The area of Propositional Proof Complexity aims to understand the strength of different proof systems in the propositional setting. In a foundational work, Cook and Reckhow [10] showed that if we could prove that there exist tautologies such that they require exponential proof size (i.e., vaguely the number of times different inference rules are applied in the proof) in any proof system, then it would resolve the famous NP vs. coNP question in computational complexity theory.
Apart from the connection to this central question in complexity theory, understanding the power of different proof systems is also fundamental to mathematical reasoning. This has motivated a lot of research in the area for the last five decades. (See for instance these reference texts for more context [22, 9, 21].) There are many different kinds of propositional proof systems based on the set of axioms they start with and the kind of inference rules they are allowed to use. In this work, we will focus on algebraic proof systems. In algebraic proof systems, propositional tautologies are expressed as an unsatisfiable set of polynomial equations and the inference rules are algebraic, i.e. they involve reasoning based on polynomial arithmetic.
The study of algebraic proof systems originates from the work of Beame, Impagliazzo, Krajíček, Pitassi, and Pudlák [4] who introduced the Nullstellensatz proof system (based on Hilbert’s Nullstellensatz). Their work was followed by the work of Clegg, Edmonds, and Impagliazzo [8] who introduced Polynomial Calculus as a dynamic variant of the Nullstellensatz proof system. Over the years, substantial work on these proof systems has helped us get a good understanding of their power in terms of complexity measures such as sparsity and degree [4, 7, 29, 15, 20, 6, 1].
However, as noted in [13], sparsity and degree only roughly capture the complexity of algebraic proofs. More recently, Grochow and Pitassi [17] proposed the Ideal Proof System (IPS) as a natural generalization of these well-studied algebraic proof systems such as Polynomial Calculus and Nullstellensatz proof systems. In the last decade, several papers studied this proof system. (See for instance [17, 26, 13, 14, 19].) This has allowed us to understand many other aspects of algebraic proofs, such as proof size and proof depth.
In this paper, we extend this line of work. Specifically, we revisit some of the known upper and lower bounds for Ideal Proof Systems over characteristic and show similar bounds over fields of any characteristic111In all the results mentioned here, when we say that a result holds over characteristic , it in fact holds over large enough characteristic as well..
1.1 Ideal Proof Systems
We start by describing the general setup for an algebraic (static222In the literature, the following type of proof system is often referred to as a static proof system. There are other algebraic proof systems, where the proof is presented line-by-line and those are known as dynamic proof systems. Here, we will only discuss static proof systems.) proof system. Let denote the set of variables . We are given a set of polynomial axioms and the goal is to show that there is no - assignment to the variables such that it simultaneously satisfies over . To force a common Boolean solution, the set of axioms is appended with additional axioms, for . These are called the Boolean axioms.
Based on Hilbert’s Nullstellensatz, we know that if are simultaneously not satisfiable, then such a refutation333The words “proofs” and “refutations” are treated interchangeably in this paper. What we will be “proving” is a statement that “refutes” the existence of a common solution to a system of equations. can be given by polynomials and such that
(1) |
The complexity of such a proof can be defined using complexity parameters of the polynomials and . In the case of the Ideal Proof System, Grochow, and Pitassi proposed that we assume that are computed by algebraic circuits. (See Section 1.3 for the formal definition.) Based on this, they defined complexity measures such as circuit size and circuit depth of IPS.
This proof system in its full generality is known to be quite strong. Specifically, it can polynomially simulate Extended Frege [17], which is one of the most powerful among well-studied propositional proof systems. Additionally, the same work also showed that proving lower bounds for this proof system would also imply strong algebraic circuit lower bounds, which is also a very challenging problem.
In light of this (and other reasons explained below), many restricted variants of the IPS have been studied. Let be a class of polynomials. Then, a -IPS refutation is an IPS-refutation wherein and belong to the class . Forbes, Shpilka, Tzameret, and Wigderson [13], as well as Govindasamy, Hakoniemi, and Tzameret [14], considered different classes of polynomials, for example, the class of polynomials computed by read-once oblivious algebraic branching programs (roABPs), by multilinear formulas, or by constant-depth algebraic formulas. They proved upper and lower bounds on the size of (some variants of) -IPS refutations over characteristic .
1.2 Motivation
We extend these works and prove similar bounds in arbitrary characteristic. Our work is motivated by the following important strands of research in proof complexity.
IPS-refutations and -Frege
A long-standing open question in proof complexity, open for almost three decades [23], is to prove superpolynomial lower bounds against -Frege proof systems, i.e., a proof system in which the lines of the proof are constant-depth Boolean circuits that use modular gates. In the late 80s, Razborov [28] and Smolensky [33, 34] resolved the Boolean circuit lower bound question for , but the corresponding proof complexity question has proved to be elusive.
Over the years, several attempts have been made to resolve this question. The most relevant to our work is the result by Grochow and Pitassi [17, Theorem 3.5] which showed that constant-depth-IPS over characteristic can efficiently simulate -Frege proofs. This means that proving superpolynomial lower bounds against constant-depth-IPS refutations will give superpolynomial lower bounds against -Frege. This gives a strong motivation to prove IPS lower bounds over small characteristics.
Functional lower bounds over any characteristic
Building on the work of [17], [13] further explored the power of IPS refutations. They proposed a concrete approach towards proving size lower bounds for IPS refutations via functional lower bounds (further explained in Section 1.4). Their method was inspired by the notion of functional lower bounds in in Boolean circuit complexity [16, 12]. They demonstrated the promise of their method by proving several lower bounds for different fragments of IPS.
For example, the strong algebraic complexity lower bounds known for s [25] and multilinear formulas [27] follow from understanding the evaluation dimension complexity measure in these models. Since this measure is essentially functional in nature, [13] used it to successfully prove lower bounds for -IPS when is a class of read-once branching programs or multilinear formulas. Their bounds are over characteristic .
This approach of [13] was further adapted by Govindasamy, Hakoniemi, and Tzameret [14] to prove superpolynomial lower bounds against (multilinear) constant-depth-IPS refutations. Their proof builds on some of the key components of the superpolynomial lower bound against constant-depth algebraic circuits by Limaye, Srinivasan, and Tavenas. The latter lower bound of [24] only worked over characteristic ; for this and other reasons, the result of [14] was also limited to characteristic . In a recent paper, however, Forbes [11] improved the circuit lower bound result of [24] and proved the same444Some parameters in the lower bound by [24] were subsequently improved by [5] and [11] achieves those improved parameters. lower bound over any characteristic.
In light of these results, the next obvious step is to prove the lower bounds of [13, 14] over any characteristic. We achieve that in this work.555The subset-sum instances from [13, 14] are not always unsatisfiable over fields of positive characteristic; this requires that we tweak their instances to ensure unsatisfiability. Barring these changes, we qualitatively match their lower bounds over fields of positive characteristic.
1.3 Our results
To describe our results, we start with the formal definitions of IPS refutations and its variants.
Definition 1 ( proof systems [17, 13]).
Let be a system of unsatisfiable polynomials over the Boolean cube . In other words, there is no Boolean assignment to the variables so that for all
Given a class of algebraic circuits , a - refutation of the system of equations defined by is an algebraic circuit in variables such that
-
1.
, and
-
2.
The size of the refutation is the size of the circuit
Further, if the circuit has individual degree at most in the variables and , then we say that is a - refutation. If the circuit has individual degree at most in the variables (but not necessarily in ), then is said to be a - refutation.
Finally, we say that a circuit is a multilinear - refutation if additionally is a multilinear polynomial in the variables .
Remark 2.
We mostly employ the above definition in the case that , i.e. the case when we have a single polynomial equation that is unsatisfiable over the Boolean cube. Further, while our upper bound results are proved in the more restrictive - proof system, our lower bounds results hold in the setting of the stronger - proof systems.
We also recall some standard notions about polynomials and algebraic models of computation, which will be useful below.
Multilinear and symmetric polynomials
A polynomial is a multilinear if the individual degree is at most . For a polynomial , the multilinearization operator, denoted by , changes for each variable and any , every occurrence of in to .
A polynomial is said to be a symmetric polynomial if the polynomial remains invariant under any permutation of the input variables. For a degree parameter , the elementary symmetric polynomial is defined to be the following multilinear polynomial . Whenever is clear from the context, we will denote the elementary symmetric polynomial by .
Algebraic models of computation
We recall definitions of some of the standard models of computation relevant to our results.
Algebraic circuits and formulas. An algebraic circuit is a directed acyclic graph in which each node either computes a sum (or a linear combination) of its inputs, or a product of its inputs. The leaf nodes are either variables or constants. The size of an algebraic circuit is the number of edges in the circuit, and the depth of an algebraic circuit is the longest path from the output node (a sink) to a leaf node (a source). An algebraic formula is an algebraic circuit where the output of each node feeds into at most another node; in other words, the underlying graph of an algebraic formula is a tree. An algebraic formula is a multilinear formula if every gate of the formula computes a multilinear formula.
Sparse polynomials and constant-depth circuits. The class consists of depth-2 formulas with an addition gate in the top layer and multiplication gates in the bottom (second) layer. All the gates have unbounded fan-in. formulas essentially compute polynomials in the sparse representation i.e. as a sum of monomials. In general, a constant-depth algebraic circuit has alternating layers of additional and multiplication gates.
Read-Once Oblivious Algebraic Branching Programs. A read-once oblivious algebraic branching program in the variable-order 666 denotes the set of all permutation of . is a directed acyclic graph whose vertices are partitioned into layers . For each , there are edges directed from layer to that are labelled by univariate polynomials in the variable . For each -to- path , the polynomial computed by is defined to be product of the edge labels on . The polynomial computed by the is defined as the sum of polynomials computed by all -to- paths. The width of an is i.e. the size of the largest layer of vertices.
1.3.1 Lower bounds over positive characteristic
We start by stating our lower bound results.
Theorem 3 (Lower bounds for sparse- in positive characteristic).
The following holds for any large enough . Let be any prime number. Let such that . There exist and such that
-
1.
The polynomial has no Boolean satisfying assignment.
-
2.
Any sparse- refutation777Note that sparse- (a weaker system than sparse-) is equivalent to the Nullstellensatz proof system of [4]. of must have size at least
Note that the hard instance above is a sparse polynomial. We show that it has no small sparse refutation over positive characteristic.
Theorem 4 (Lower bounds for fixed-order in positive characteristic).
The following holds for any large enough . Let be any prime number. Let such that . There exist and such that
-
1.
The polynomial has no Boolean satisfying assignment.
-
2.
Any - refutation of in any order of variables where variables come before variables, must have width .
To obtain lower bounds against more powerful models such as - with respect to any order, or multilinear formulas, [13] used a slightly modified hard instance. We also use an instance the same as theirs up to the choice of coefficients.
Theorem 5 (Lower bounds for any order roABP- and multilinear-formula-).
The following holds for any large enough . Let be any prime number. Let such that . There exist and such that
-
1.
The polynomial has no Boolean satisfying assignment.
-
2.
Any - refutation of must have size at least .
-
3.
Moreover, any multilinear-formula- refutation of must have size at least and for , any product-depth888The product-depth of a circuit is the maximum number of product gates appearing in any leaf-to-root path.- multilinear-formula- refutation requires size .
Again notice that, is a sparse polynomial and hence has a polynomial size . It is also efficiently computable by a multilinear formula.
In general, in Boolean proof complexity, it is typical that the hard-to-refute instances are themselves easy to compute. In algebraic proof complexity, there are some lower bound results that do not have this property. That is, the instances that are hard to refute are also hard to compute. For example, the set of results obtained by the approach of multiples in [13, Theorem 1.18, Theorem 1.19, Theorem 1.20] and in a paper by Andrews and Forbes [3]. Additionally, in a recent work, Hakoniemi, Limaye, and Tzameret [19] presented instances that were hard to refute for - and for multilinear-formula- over any characteristics, i.e., similar to what we prove here. However, unfortunately, their instances were hard to compute and specifically, they could not be computed by or by multilinear formulas. Hence, our result here have the best of both the worlds; the lower bounds hold over any characteristic and the hard instances are easy to compute.
Theorem 6 (Lower bounds for multilinear constant-depth- in positive characteristic).
The following holds for any large enough . Let be any prime and let be large enough so that . There exist and such that
-
1.
The polynomial has no Boolean satisfying assignment.
-
2.
Any multilinear constant-depth- refutation of must have size .
The characteristic (or large characteristic) version of the above theorem was presented in [14]. Their lower bound is a step towards constant-depth- lower bounds. Our result above can thus be thought of as another step forward in the right direction. Moreover, our input instance is the same as the input instance in Theorem 1 [14] up to the choice of coefficients, and it is easy to compute (while being hard to refute). More specifically, it is computable by polynomial-sized constant-depth multilinear formulas.
Remark 7.
In all our results, the field characteristic is arbitrary, but the field size is quite large, i.e., is either exponential or superpolynomial. This setting is non-trivial because the field elements have polynomial bit complexity. Other results in the area, such as the work of Alekseev, Grigoriev, Hirsch, and Tzameret [2] similarly use polynomial constraints with coefficients from exponentially large domains. Specifically [2] study a variant of the subset sum instance, called the Binary Value Principle, in the context of IPS proof systems in fields of characteristic zero.
It is an interesting open question to prove similar lower bounds over finite fields of small size. Unfortunately, as we show below, this forces the polynomial instances to become more complicated. See Section 1.5 for recent independent work that makes progress in this direction.
1.3.2 Upper bounds over positive characteristic
A natural question for hard instances above is: what is the weakest proof system in which they are efficiently refutable? In personal communication, Tzameret observed that the above instances were refutable by constant-depth- hence showing that these proof systems can be exponentially more succinct than their multilinear counterpart. The theorem below shows that the above polynomials have efficient constant-depth- refutations, even in the setting of positive characteristic.
Theorem 8 (Upper bounds for (non-multilinear) constant-depth-).
Fix a prime number . The following holds for any natural numbers and
Let be any polynomial with sparsity and degree with coefficients from the field and let be any element of where is a field extension of
Then,
-
1.
The polynomial has no satisfying assignment over the Boolean cube
-
2.
There is a constant-depth- refutation of degree and size
Note that since , the polynomial does not have a zero over (in fact it does not have a solution over ). So the first item of above follows immediately.
Remark 9.
Suppose the characteristic is a fixed prime independent of the number of variables .
-
1.
Theorem 8 shows that the exponential field size in Theorem 3, Theorem 4 and Theorem 5 is not an artifact of the proofs.999Suppose the field is not large enough, say, . Then there is a refutation of degree , which is when and are constants. In particular, the sparsity of the refutation is at most , which is when .. For fields of subexponential size, the polynomials in these theorems have refutations of degree and in particular have - refutations of size 101010When the characteristic is a growing function of , this argument breaks down. It might be possible to get rid of the exponential field size.
- 2.
Our final result shows a constant-depth upper bound for multilinear and symmetric systems of polynomials, i.e. systems defined by polynomials of the form where denotes the elementary symmetric polynomial of degree in variables Such polynomial systems have been employed in [13] to prove lower bounds against restricted systems of constant-depth- Our results imply that general constant-depth circuit refutations can be exponentially more succinct than these restricted families, even for positive characteristic.
Theorem 10 (Upper bounds for multilinear symmetric systems).
Fix a field . Let be a family of multilinear and symmetric polynomials with no common Boolean solution i.e. there does not exist a such that each . This system has a constant-depth- refutation of size and depth .
1.4 Proof techniques
Lower bounds
Our proof uses the functional lower bound method introduced by [13], which can be described as follows. We know that a - refutation for consists of , such that
where belong to . As is unsatisfiable over the Boolean hypercube, this implies that over the Boolean hypercube, is a well-defined reciprocal of . Hence, to show that cannot belong to , it is enough to show that any polynomial that agrees with cannot be computed by . That is, the problem of proving a lower bound on the size of - is reduced to proving a functional lower bound for .
At the heart of such a functional lower bound lies a degree lower bound, i.e., a lower bound on the degree of , where and are related. In fact, is a lifted version of . Once we have such a degree lower bound for , we can apply proof ideas from algebraic complexity theory such as the rank-based lower bound methods. These methods allow for the degree lower bounds for to be lifted to size lower bounds for .
For their machinery to work over positive characteristic, we prove a positive characteristic version of the degree lower bound (see Lemma 17 for the formal statement). In the case of the lower bound argument in [13], it was important to obtain a tight degree lower bound of exactly . They needed it for the next step, i.e., lifting, to work. In our case, we show that such a degree lower bound holds with high probability (over the choice of coefficients of the hard instance). Once we have the degree lower bound, the rest of the lower bound proof works similar to the proof by [13]. Please refer to the full version for all the proofs.
Constant-depth upper bounds
We start by describing the main ideas in the proof of Theorem 8. Here, we proceed in two steps. First, we observe that for any sparse polynomial of degree , we can flatten it to a linear polynomial by renaming the monomials by fresh variables. Our hard instance is indeed sparse, hence the observation can be used to rewrite the polynomial as a linear polynomial over a fresh set of variables.
Now, consider a linear polynomial such that , where for some and prime and such that it is not satisfiable over - assignments.
To prove that the polynomial has a refutation over constant-depth circuits, we first prove that for every , can be expressed as a multiple of modulo the ideal , which is a shorthand for the ideal generated by .
We then observe that for , is a non-zero constant and use this observation to construct small depth circuits for the refutation of . Throughout, we use some standard but useful tricks available to positive characteristic fields. Please refer to the full version for all the proofs.
Upper bounds for symmetric polynomials
Now we discuss the proof outline for Theorem 10. For ease of exposition, we explain the ideas for the case of in Theorem 10, i.e. there is one multilinear symmetric polynomial that does not have a solution over the Boolean cube . Suppose has characteristic . Any symmetric polynomial is a polynomial of the elementary symmetric polynomials111111This follows from the Fundamental Theorem of Symmetric Polynomials. i.e. . However, if we restrict to the Boolean cube , then any symmetric polynomial is a polynomial of just elementary symmetric polynomials. Let denotes the tuple of those elementary symmetric polynomials.
Let be the variate polynomial such that agrees with on the Boolean cube . The Boolean cube is mapped to under the map because . The unsatisfiability of over the Boolean cube implies the unsatisfiability of over . Applying Hilbert’s Nullstellensatz Theorem (see Theorem 15) on the unsatisfiability121212To capture the restriction of , we add univariate polynomials, each of which vanishes on one coordinate of . of over , we get a low-variate Nullstellensatz certificate (it is a Nullstellensatz certificate in just variables)131313Loosely speaking, one can imagine this as a “dimension reduction” of our problem. The symmetric structure of led us to convert a problem in variables to a problem in just variables.. The coefficients of this low-variate Nullstellensatz certificate can be computed via -sized constant-depth circuits. This follows from the fact that we are working over constant characteristic. Refer to the diagram below for a schematic representation of what we discussed so far.
Next we “lift” the Nullstellensatz back to the variables . To do so, we plug-in in place of . Observe that this substitution by preserves the size and the depth of the coefficients of the low-variate Nullstellensatz certificate because of the Ben-Or’s construction (see Theorem 12).
It remains to prove via constant-depth circuits that agrees with on the Boolean cube, i.e. lie in the ideal . Here “to prove in constant-depth circuits” refers to giving a certificate for the ideal membership whose coefficients can be computed by constant-depth circuits. More precisely, we want to prove that there exists polynomials ’s which have -sized constant-depth circuits such that
This is the key step in our proof. To prove this, it suffices to prove the following special case.
Lemma 11.
Let and fix an arbitrary sequence where each . There exist polynomials ’s such that
and each polynomial can be computed by a -sized constant-depth circuit.
Please refer to the full version for all the proofs.
1.5 Related work
In an independent work, Elbaz, Govindasamy, Lu, and Tzameret (personal communication) consider related questions. Using the recent lower bound of Forbes [11], which proves the positive characteristic version of the constant-depth formula lower bound of [24], they obtain lower bounds for fragments of the IPS over finite fields of any size.
1.6 Preliminaries
In this subsection, we present a few more definitions and standard facts on polynomials that will be used in our proofs later on.
For a polynomial , the individual degree of is an integer such that for all , the degree of when viewed as a univariate polynomial in the variable is at most .
A classical and beautiful construction of Ben-Or shows that every elementary symmetric polynomial can be computed by -sized constant-depth circuits.
Theorem 12 (Ben-Or’s construction for elementary symmetric polynomials).
(See [31, Theorem 5.1]). Let be a field with . Then for every , the elementary symmetric polynomial has a circuit of size and depth (a circuit).
More particularly, for any choice of distinct elements and for every , there exists coefficients ’s such that
Theorem 13 (Polynomial Identity Lemma).
(See [18, Lemma 9.2.2]). Let be an arbitrary field. Let be a nonzero polynomial of degree at most and let . If we choose uniformly at random, then:
For a natural number and variables , we will use to denote the following ideal .
Next we recall the definition of an ideal and a variety, and then we state Hilbert’s Nullstellensatz.
Definition 14 (Ideal and Variety).
Fix any field and consider the commutative ring . For a set of polynomials , the ideal generated by ’s, denoted by is defined as:
For a set of polynomials , their variety, denoted by is a subset of the algebraic closure of , defined as:
Now we state Hilbert’s Nullstellensatz which essentially says that if a set of polynomials do not have a common zero, then there exists “witness” for this, i.e. one can express as a polynomial combination of ’s.
Theorem 15 (Hilbert’s Nullstellensatz).
Fix any field . Let be a set of multivariate polynomials such that they do not have any common zeros over the algebraic closure of . Then the constant lies in the ideal . In other words, there exists polynomials such that
Strictly speaking, Hilbert’s Nullstellensatz guarantees that the polynomials are in ( is the algebraic closure of ). However, the above statement also follows easily by observing that we can solve for ’s by solving a system of linear equations over . Throughout this article, we will refer to as a Nullstellensatz certificate141414There are infinitely many Nullstellensatz certificates for a system . To see this, suppose and let be a Nullstellensatz certificate. Then for any polynomial , is also a Nullstellensatz certificate. for the system . We will also refer to ’s as coefficients because if we take a polynomial combination of ’s with ’s being the coefficients, then we can generate .
2 Lower bounds in large fields of positive characteristic
In this section, we will prove size lower bounds for several fragments of IPS over positive characteristic. As explained in Section 1.3.1, we start by proving a tight degree lower bound (Lemma 17) over positive characteristic. Using our positive characteristic variant of the degree lower bound, we then recover the lower bound results from [13] and [14] over positive characteristic.
2.1 Degree lower bounds over large fields of arbitrary characteristic
For any , we use to denote its Hamming weight. For any and any subset of indices , we use to denote . All the statements in this section work over fields of arbitrary characteristic.
First, we state a standard fact about multilinear polynomials, which will be useful in the main lemma.
Fact 16.
Let be a multilinear polynomial on variables. Then,
The next lemma is our main degree lower bound which shows that a multilinear polynomial for the inverse of a random linear form will have maximal degree. While similar statements have been observed in the literature (e.g. [15, Proposition 2]), we give an explicit proof for the sake of completeness.
Lemma 17.
Let and be fields such that is a strict subfield of . Let be a natural number and let denote the tuple of variables . Fix any . For any , let be the unique multilinear polynomial that agrees with the function
on the Boolean cube . Let be any finite subset of the field. Then, for a uniformly random :
Proof.
By Fact 16, the coefficient of in is , or equivalently,
Based on the above expression, we define the rational function as follows.
We will use and to denote the numerator and denominator of . For any , we will use to denote . It follows that
Since , for any . If we prove that is a non-zero polynomial, then by the Polynomial Identity Lemma (Theorem 13), for any finite subset , , which implies that , and thus proves the theorem. Thus, it is enough to prove that some monomial in has non-zero coefficient.
For , has degree at most since will not increase the degree. The term syntactically contributes monomials of degree from , but is possible that these coefficients vanish if the field is of positive characteristic. We will show that there is a monomial of degree with coefficient 1, and thus this monomial will survive over any field.
Claim 18.
The coefficient of the monomial151515The same proof works for any monomial , where is an arbitrary permutation on . in is 1.
Proof sketch..
We would like to count the number of ways of collecting variables from each to construct the required monomial. We first observe (via a simple counting argument) that for every , the number of subsets such that , and , is . Moreover, for each , if is the collection of subsets with the above properties, then we observe that for all , , .
With these observations, it inductively follows that for each , conditioned on the degree of variables being correct (i.e. ), there is exactly one way of ensuring that the degree of is : for each that is one of the subsets satisfying the properties of the above observation, select the ’s from .
The next lemma proves a stronger version of the previous lemma: for a random linear form, the inverse of every restriction of the linear form (by setting some variables to 0) will have maximal degree. It follows from the previous lemma and an union bound.
Lemma 19.
Let and be fields such that is a strict subfield of . Let be a natural number and let denote the tuple of variables . Fix any . For any , let be the unique multilinear polynomial that agrees with the function
on the Boolean cube . Let be a finite subset of the field. Then, for an chosen uniformly at random:
In particular, with probability at least over the choice of , for every , the leading monomial of is for some .
2.2 Constant-depth multilinear lower bounds over large fields of arbitrary characteristic
In [14], Govindasamy, Hakoniemi, and Tzameret prove super polynomial lower bounds against constant-depth multilinear refutations of the subset sum variant
In particular, they prove the following theorem.
Theorem 20 (Constant-depth functional lower bounds [14]).
Let with and assume that . Let be the multilinear polynomial such that
over the Boolean cube. Then, any circuit of product-depth computing has size at least
We prove the same statement for large fields of arbitrary characteristic. Our proof exactly follows the structure of [14]. Their proof requires the condition for two reasons:
-
1.
They use the results of Limaye, Srinivasan, and Tavenas [24], which gave superpolynomial lower bounds against constant-depth circuits over any field with or greater than the degree of the hard polynomial. In particular, they use the result that over fields with or greater than , any low-degree set-multilinear polynomial computed by a constant-depth circuit can also be computed by a set-multilinear constant-depth circuit.161616They also use other ideas from [24] such as relative rank, word polynomial, etc., but those ideas do not require any restrictions on the characteristic of the underlying field.
-
2.
They use the degree lower bound for the multilinear representation of , proved by Forbes, Shpilka, Tzameret, and Wigderson [13].
To deal with the first requirement, we use the recent beautiful result of Forbes [11], which extends the results of [24] to arbitrary fields. In particular, we will use the following statement from [11], which says that the set-multilinear projection of a constant-depth circuit can be efficiently computed by a constant-depth circuit over arbitrary fields.
Theorem 21.
[11, Corollary 27]. Let be an arbitrary field. Let be a partition of the variables . Suppose can be computed by a size product-depth arithmetic circuit. Then the set-multilinear projection of (the restriction of to monomials that are set-multilinear with respect to the specified partition) can be computed by a size -size circuit of product-depth .
To deal with the second requirement, we use our degree lower bound from Lemma 19, which works for arbitrary fields of exponential size i.e. there is no restriction on the characteristic of the field.
Overview of [14]
-
1.
Using the word polynomials framework of [24], construct a knapsack polynomial (for a partition given by a word ) with the property that the set-multilinear projection of over the Boolean cube requires superpolynomially large set-multilinear constant-depth circuits.
-
2.
Consider a degree-4 subset-sum variant so that for the word that will be used to instantiate the previous point, there exists an assignment of some of the variables in , that maps to (upto a renaming of variables).
-
3.
If there is a multilinear polynomial computing over that has a small constant-depth circuit, then there is a multilinear polynomial computing over that has a small constant-depth circuit. Moreover by the set-multilinearization of [24], there is a small set-multilinear constant-depth circuit computing the set-multilinear projection of .
-
4.
Combining the first point with the contrapositive of the third point, conclude that any multilinear polynomial computing over requires superpolynomially large constant-depth circuits. The multilinear constant-depth lower bound follows.
In [14], the proof for the hardness of requires the underlying field to be of large characteristic essentially because it requires the degree lower bound from [13], which requires large characteristic. To make Theorem 20 work over fields of positive characteristic, we will employ our degree lower bound from Lemma 19 with a variant of the knapsack polynomial; the rest of the proof remains the same as that of Theorem 20. To provide the necessary details, we first describe the construction of the knapsack polynomial. Then, we state the particular claim from [14] that uses the degree lower bound from [13]. Finally, we show how our degree lower bound Lemma 19 fits into the rest of the proof.
Constructing the knapsack polynomial
We shall now recall the definitions required for defining the hard polynomial in [14] via the word polynomials template of [24].
Let be an arbitrary word. For any , let denote the subword of indexed by the set . Consider the sequence of sets of variables. Define the positive indices and negative indices of as:
Let any , the variables of will be of the form , where is a binary string indexed by the set:
We will call these sets positive indexing sets. The size of each is . The number of strings in is .
For , we similarly define the negative indexing sets that will be used to index the variables of for .
A word is balanced if:
-
1.
such that (i.e. is a witness that is balanced at )
-
2.
such that (i.e. is a witness that is balanced at )
For any , define:
(2) |
The product ranges over each that witnesses the fact that is balanced at . The sum ranges over each that is consistent with on . Now, we define the knapsack polynomial as
(3) |
where is any field element such that has no Boolean roots. To make the proof work over fields of positive characteristic, we define a variant of as:
(4) |
where , and will be chosen from an extension field so that has no Boolean roots.
For any word , denotes the matrix with rows indexed by all monomials that are set-multilinear over , and columns indexed by all monomials that are set-multilinear over . For each such pair of monomials , the corresponding entry in carries the coefficient of in . To show that the set-multilinear projection of any multilinear polynomial computing over requires superpolynomially large set-multilinear constant-depth circuits, [14] shows that is full-rank.
Lemma 22 (Rank lower bound lemma (Lemma 6 [14])).
Let be a balanced word, and let be the multilinear polynomial such that over . Then, is full-rank.
The use of degree lower bound in [14]
Suppose , where the sum runs over all multilinear monomials in the variables, and is some multilinear polynomial in the variables. They show that for any which is set-multilinear on , the leading monomial of is the set-multilinear monomial on positive variables such that is consistent with ([14] describes this formally). For each monomial that is set-multilinear on , the leading monomial of turns out to be a different set-multilinear monomial on the positive variables, and together, these leading monomials span the space of all set-multilinear monomials on the positive variables. This makes full-rank. To get a handle on (for being a monomial on , consisting only of variables), [14] sets all the variables in to 1 and all the variables outside to 0. They call this transformation . For the proof of Lemma 22, an important requirement is that:
For every and for every set-multilinear monomial on , the leading monomial of is , which is the product of all the variables that show up in the denominator of where , and for each , is the unique indexing string that agrees with on , the positive indexing set.
This requirement is satisfied due to the degree lower bound from [13], which requires the field to be of characteristic 0. The proof in [14] includes helpful figures and the reader is encouraged to refer to the paper.
Let us recall our variant of :
(5) |
where . To prove Theorem 20 in positive characteristic, we use the following lemma that follows by a union bound over all and all set-multilinear monomials on , on top of Lemma 17.
Lemma 23.
Let be a natural number and be a balanced word. Let . For any , and any that is a set-multilinear monomial on , let be the unique multilinear polynomial that agrees with the function
on the Boolean cube, where is chosen so that has no Boolean roots, and . Let be a finite subset of the field. Let . Then, for an chosen uniformly at random:
In particular, with probability at least over the choice of , for every choice of and set-multilinear monomial over , the leading monomial of is for some .
Proof.
The number of is . The number of set-multilinear monomials on for any is , which is at most . For any fixed and that is a set-multilinear monomial on , Lemma 17 implies that for an chosen uniformly at random:
Applying a union bound over all and implies that for an chosen uniformly at random:
With this lemma, the rest of the proof of [14] works out verbatim. We state the final theorem, which is a version of Theorem 20 for finite fields of positive characteristic.
Theorem 24 ([14] over positive characteristic).
Let with . Let be any prime. Let be a field of characteristic and size , where is the smallest integer that satisfies for an absolute constant 171717This is a fixed constant that depends on the exact choice of parameters in the proof of [14]. Let be an arbitrary element in , where denotes the subfield of size . For any , Let be the multilinear polynomial such that
over the Boolean cube. Then, there exists an such that any circuit of product-depth computing has size at least
The reason for in Theorem 24.
When we instantiate Lemma 23 inside the proof of Theorem 24, the parameter , which is the number of variable sets, will be , and the word will also be chosen so that for each , . Thus, , and the union bound in Lemma 23 will require the field to be larger than .
References
- [1] M. Alekhnovich and A.A. Razborov. Lower bounds for polynomial calculus: non-binomial case. In Proceedings 42nd IEEE Symposium on Foundations of Computer Science, pages 190–199, 2001. doi:10.1109/SFCS.2001.959893.
- [2] Yaroslav Alekseev, Dima Grigoriev, Edward A. Hirsch, and Iddo Tzameret. Semi-algebraic proofs, ips lower bounds, and the -conjecture: can a natural number be negative? In Proceedings of the 52nd Annual ACM SIGACT Symposium on Theory of Computing, STOC 2020, pages 54–67, New York, NY, USA, 2020. Association for Computing Machinery. doi:10.1145/3357713.3384245.
- [3] Robert Andrews and Michael A. Forbes. Ideals, determinants, and straightening: proving and using lower bounds for polynomial ideals. In Proceedings of the 54th Annual ACM SIGACT Symposium on Theory of Computing, STOC 2022, pages 389–402, New York, NY, USA, 2022. Association for Computing Machinery. doi:10.1145/3519935.3520025.
- [4] Paul Beame, Russell Impagliazzo, Jan Krajíček, Toniann Pitassi, and Pavel Pudlák. Lower bounds on hilbert’s nullstellensatz and propositional proofs. Proceedings of the London Mathematical Society, s3-73(1):1–26, 1996. doi:10.1112/plms/s3-73.1.1.
- [5] C.S. Bhargav, Sagnik Dutta, and Nitin Saxena. Improved lower bound, and proof barrier, for constant depth algebraic circuits. ACM Trans. Comput. Theory, 16(4), November 2024. doi:10.1145/3689957.
- [6] Sam Buss, Dima Grigoriev, Russell Impagliazzo, and Toniann Pitassi. Linear gaps between degrees for the polynomial calculus modulo distinct primes. Journal of Computer and System Sciences, 62(2):267–289, 2001. doi:10.1006/jcss.2000.1726.
- [7] Samuel R. Buss, Russell Impagliazzo, Jan Krajícek, Pavel Pudlák, Alexander A. Razborov, and Jirí Sgall. Proof complexity in algebraic systems and bounded depth frege systems with modular counting. Comput. Complex., 6(3):256–298, 1997. doi:10.1007/BF01294258.
- [8] Matthew Clegg, Jeffery Edmonds, and Russell Impagliazzo. Using the groebner basis algorithm to find proofs of unsatisfiability. In Proceedings of the Twenty-Eighth Annual ACM Symposium on Theory of Computing, STOC ’96, pages 174–183, New York, NY, USA, 1996. Association for Computing Machinery. doi:10.1145/237814.237860.
- [9] Peter Clote and Evangelos Kranakis. Boolean Functions and Computation Models. Texts in Theoretical Computer Science. An EATCS Series. Springer, 2002. doi:10.1007/978-3-662-04943-3.
- [10] Stephen A. Cook and Robert A. Reckhow. The relative efficiency of propositional proof systems. Journal of Symbolic Logic, 44(1):36–50, 1979. doi:10.2307/2273702.
- [11] Michael A. Forbes. Low-Depth Algebraic Circuit Lower Bounds over Any Field. In Rahul Santhanam, editor, 39th Computational Complexity Conference (CCC 2024), volume 300 of Leibniz International Proceedings in Informatics (LIPIcs), pages 31:1–31:16, Dagstuhl, Germany, 2024. Schloss Dagstuhl – Leibniz-Zentrum für Informatik. doi:10.4230/LIPIcs.CCC.2024.31.
- [12] Michael A. Forbes, Mrinal Kumar, and Ramprasad Saptharishi. Functional Lower Bounds for Arithmetic Circuits and Connections to Boolean Circuit Complexity. In Ran Raz, editor, 31st Conference on Computational Complexity (CCC 2016), volume 50 of Leibniz International Proceedings in Informatics (LIPIcs), pages 33:1–33:19, Dagstuhl, Germany, 2016. Schloss Dagstuhl – Leibniz-Zentrum für Informatik. doi:10.4230/LIPIcs.CCC.2016.33.
- [13] Michael A. Forbes, Amir Shpilka, Iddo Tzameret, and Avi Wigderson. Proof complexity lower bounds from algebraic circuit complexity. Theory Comput., 17:1–88, 2021. doi:10.4086/TOC.2021.V017A010.
- [14] Nashlen Govindasamy, Tuomas Hakoniemi, and Iddo Tzameret. Simple hard instances for low-depth algebraic proofs. In 2022 IEEE 63rd Annual Symposium on Foundations of Computer Science (FOCS), pages 188–199, 2022. doi:10.1109/FOCS54457.2022.00025.
- [15] D. Grigoriev. Tseitin’s tautologies and lower bounds for nullstellensatz proofs. In Proceedings 39th Annual Symposium on Foundations of Computer Science (Cat. No.98CB36280), pages 648–652, 1998. doi:10.1109/SFCS.1998.743515.
- [16] Dima Grigoriev and Alexander A. Razborov. Exponential lower bounds for depth 3 arithmetic circuits in algebras of functions over finite fields. Applicable Algebra in Engineering, Communication and Computing, 10:465–487, 2000. doi:10.1007/S002009900021.
- [17] Joshua A. Grochow and Toniann Pitassi. Circuit complexity, proof complexity, and polynomial identity testing: The ideal proof system. J. ACM, 65(6), November 2018. doi:10.1145/3230742.
- [18] Venkatesan Guruswami, Atri Rudra, and Madhu Sudan. Essential Coding Theory. 2023. Available online: https://cse.buffalo.edu/faculty/atri/courses/coding-theory/book/web-coding-book.pdf. URL: https://cse.buffalo.edu/faculty/atri/courses/coding-theory/book/web-coding-book.pdf.
- [19] Tuomas Hakoniemi, Nutan Limaye, and Iddo Tzameret. Functional lower bounds in algebraic proofs: Symmetry, lifting, and barriers. In Proceedings of the 56th Annual ACM Symposium on Theory of Computing, STOC 2024, pages 1396–1404, New York, NY, USA, 2024. Association for Computing Machinery. doi:10.1145/3618260.3649616.
- [20] Russell Impagliazzo, Pavel Pudlák, and Jirí Sgall. Lower bounds for the polynomial calculus and the gröbner basis algorithm. Comput. Complex., 8(2):127–144, 1999. doi:10.1007/s000370050024.
- [21] J. Krajíček. Proof Complexity. Encyclopedia of Mathematics and its Applications. Cambridge University Press, 2019. URL: https://books.google.dk/books?id=uOyKuQEACAAJ.
- [22] Jan Krajícek. Bounded arithmetic, propositional logic, and complexity theory, volume 60 of Encyclopedia of mathematics and its applications. Cambridge University Press, 1995. doi:10.1017/CBO9780511529948.
- [23] Jan Krajíček. A reduction of proof complexity to computational complexity for frege systems. Proceedings of the American Mathematical Society, 143(11):4951–4965, 2015. URL: http://www.jstor.org/stable/24507779.
- [24] Nutan Limaye, Srikanth Srinivasan, and Sébastien Tavenas. Superpolynomial lower bounds against low-depth algebraic circuits. In 2021 IEEE 62nd Annual Symposium on Foundations of Computer Science (FOCS), pages 804–814, 2021. doi:10.1109/FOCS52979.2021.00083.
- [25] Noam Nisan. Lower bounds for non-commutative computation. In Proceedings of the Twenty-Third Annual ACM Symposium on Theory of Computing, STOC ’91, pages 410–418, New York, NY, USA, 1991. Association for Computing Machinery. doi:10.1145/103418.103462.
- [26] Toniann Pitassi and Iddo Tzameret. Algebraic proof complexity: progress, frontiers and challenges. ACM SIGLOG News, 3(3):21–43, August 2016. doi:10.1145/2984450.2984455.
- [27] Ran Raz. Multi-linear formulas for permanent and determinant are of super-polynomial size. J. ACM, 56(2), April 2009. doi:10.1145/1502793.1502797.
- [28] Alexander A. Razborov. Lower bounds on the size of bounded depth circuits over a complete basis with logical addition. Mathematical notes of the Academy of Sciences of the USSR, 41:333–338, 1987. URL: https://api.semanticscholar.org/CorpusID:121744639.
- [29] Alexander A. Razborov. Lower bounds for the polynomial calculus. Comput. Complex., 7(4):291–324, 1998. doi:10.1007/s000370050013.
- [30] Ramprasad Saptharishi. A survey of lower bounds in arithmetic circuit complexity. Github survey, 2021. URL: https://github.com/dasarpmar/lowerbounds-survey/releases/tag/v9.0.3.
- [31] Amir Shpilka and Avi Wigderson. Depth-3 arithmetic circuits over fields of characteristic zero. Comput. Complex., 10(1):1–27, 2001. doi:10.1007/PL00001609.
- [32] Amir Shpilka and Amir Yehudayoff. Arithmetic circuits: A survey of recent results and open questions. Foundations and Trends® in Theoretical Computer Science, 5(3–4):207–388, 2010. doi:10.1561/0400000039.
- [33] Roman Smolensky. Algebraic methods in the theory of lower bounds for boolean circuit complexity. Proceedings of the nineteenth annual ACM symposium on Theory of computing, 1987. URL: https://api.semanticscholar.org/CorpusID:2214101.
- [34] Roman Smolensky. On representations by low-degree polynomials. Proceedings of 1993 IEEE 34th Annual Foundations of Computer Science, pages 130–138, 1993. doi:10.1109/SFCS.1993.366874.