Abstract 1 Introduction 2 Conclusion References

Uniform Bounds on Product Sylvester-Gallai Configurations

Abhibhav Garg ORCID University of Waterloo, Canada Rafael Oliveira ORCID University of Waterloo, Canada Akash Kumar Sengupta ORCID University of Waterloo, Canada
Abstract

In this work, we explore a non-linear extension of the classical Sylvester-Gallai configuration. Let 𝕂 be an algebraically closed field of characteristic zero, and let ={F1,,Fm}𝕂[x1,,xN] denote a collection of irreducible homogeneous polynomials of degree at most d, where each Fi is not a scalar multiple of any other Fj for ij. We define to be a product Sylvester-Gallai configuration if, for any two distinct polynomials Fi,Fj, the following condition is satisfied:

ki,jFkrad(Fi,Fj).

We prove that product Sylvester-Gallai configurations are inherently low dimensional. Specifically, we show that there exists a function λ:, independent of 𝕂, N, and m, such that any product Sylvester-Gallai configuration must satisfy:

dim(span𝕂())λ(d).

This result generalizes the main theorems from (Shpilka 2019, Peleg and Shpilka 2020, Oliveira and Sengupta 2023), and gets us one step closer to a full derandomization of the polynomial identity testing problem for the class of depth 4 circuits with bounded top and bottom fan-in.

Keywords and phrases:
Sylvester-Gallai theorem, arrangements of hypersurfaces, algebraic complexity, polynomial identity testing, algebraic geometry, commutative algebra
Copyright and License:
[Uncaptioned image] © Abhibhav Garg, Rafael Oliveira, and Akash Kumar Sengupta; licensed under Creative Commons License CC-BY 4.0
2012 ACM Subject Classification:
Theory of computation Algebraic complexity theory
; Theory of computation Computational geometry
Related Version:
Full Version: https://eccc.weizmann.ac.il/report/2025/037
Acknowledgements:
The authors would like to thank Amir Shpilka and Shir Peleg for several helpful conversations in the course of this work.
Editors:
Oswin Aichholzer and Haitao Wang

1 Introduction

In 1893, Sylvester posed the following basic problem in extremal combinatorial geometry.

Problem 1 ([49]).

Suppose a finite set of real points are arranged such that any line through two points of the set contains a third point in the set. Prove that the set of points lies on a straight line.

The first complete affirmative answer to Problem 1 was given by Melchior [35]. The same problem was independently posed in 1943 by Erdős [17], and was answered by Gallai [20]. The above statement is now known as the Sylvester-Gallai theorem.

It is important to note that the answer to Sylvester’s problem depends on the base field. For instance, it was known (quite possibly even to Sylvester) that the answer to Problem 1 is negative when the base field is , as the nine inflection points of any non-singular planar cubic form a 2-dimensional complex configuration satisfying Sylvester’s conditions. In 1966, Serre asked in [46] whether any complex configurations satisfying Sylvester’s conditions must be 2-dimensional – that is, whether the configurations arising from non-singular planar cubics are extremal. An affirmative answer to Serre’s question was given in 1983 by Hirzebruch [29], using highly non-elementary algebro-geometric tools.

Ever since the solution of Sylvester’s problem, several generalizations and variations have been studied in combinatorial geometry, theoretical computer science (TCS) and coding theory [15, 27, 42, 16, 7, 13, 39, 40] – see [9, 12] for brief surveys on these generalizations. The usual setup of such generalizations is as follows: given a finite collection of geometric objects (points, in the case of Sylvester’s problem) satisfying enough local conditions (collinearity of certain triples of points, in the case of Problem 1), one wants to know if such collection of geometric objects must be “low dimensional” (all points must be in one line, in the case of Problem 1). As is usual in the literature, any configuration satisfying the proposed local conditions are called Sylvester-Gallai configurations, and the result stating that such configurations are low dimensional is referred to as a Sylvester-Gallai type theorem.

One line of generalizations of Problem 1 arises from projective duality, which we now discuss. By projective duality, any point P=(p1,,pN) gives rise to a dual hyperplane, defined by the zero set of the linear form P:=p1x1++pNxN, which we denote by V(P). Given three points P,Q,R, the condition that R is in the line defined by P,Q is equivalent to V(P,Q)V(R) in the dual space. Lastly, given points P1,,Pm, the condition that they are collinear is equivalent to dimspan{P1,,Pm}=2. Thus, we can recast Sylvester’s problem as follows: given a finite set of distinct hyperplanes defined by the set of linear forms :={1,,m}[x1,,xN] such that for any ij[m], there is ki,j such that V(i,j)V(k), then it must be the case that dimspan{}=2. For simplicity, if one applies projective duality in two dimensions, the foregoing statement becomes quite natural: given a finite set of distinct lines (the set ) such that for any two lines i,j, there must be a third line in which passes through the intersection of 1,2 (i.e., V(1,2)), then it must be the case that the set of lines forms a pencil (i.e., all lines intersect at a common point). In fact, this dual formulation was precisely the setting treated in [29].

Motivated by questions in algebraic complexity theory, Gupta [24] proposed non-linear, algebro-geometric generalizations to the above (dual) formulation of Sylvester’s problem (and their set variants, such as [15]). In this work, we make progress on Gupta’s program and fully resolve one such generalization of Sylvester’s problem. Before we describe our main result, in the following subsection we describe the connection between the Sylvester-Gallai configurations that we study and the Polynomial Identity Testing (PIT) problem, a fundamental problem in algebraic complexity theory.

1.1 Polynomial Identity Testing (PIT) and Sylvester-Gallai configurations

The Polynomial Identity Testing (PIT) problem is the task of determining whether a given algebraic circuit computes the zero polynomial. While there are simple and efficient randomized algorithms for the PIT problem, developing an efficient deterministic solution remains a significant open problem in theoretical computer science. The PIT problem is closely tied to fundamental topics such as lower bounds for algebraic circuits [28, 2, 30] and derandomizing key problems in mathematics and computer science [3, 19, 33, 36, 26, 18]. For a deeper exploration of PIT and its applications, see [43, 48, 44].

The recent works [1, 25, 50] have shown that the PIT problem for general circuits can be reduced to solving it for low-depth circuits, such as unrestricted depth-3 or homogeneous depth-4 circuits, which has made these circuit classes a focus of recent study. Notably, progress has been made towards deterministic, polynomial-time PIT for depth-3 circuits of constant top fanin via the connection between this problem and linear Sylvester-Gallai configurations arising from depth-3 identities [14, 32, 45].

Let us now explain how Sylvester-Gallai configurations arise in the context of depth-3 and depth-4 PIT. Let 𝐱:=(x1,,xN) be a tuple of N variables and S:=[𝐱] be the polynomial ring in N variables. Suppose we are given a depth three circuit with top fan-in three (denoted Σ3ΠΣ) that computes a polynomial P. Such a polynomial has the form

P=i=1mi(𝐱)+j=1mgj(𝐱)+k=1mhk(𝐱), (1)

where i,gj,hk are linear forms. When P0, that is, when we have an identity, and moreover this representation of the identity is “efficient,”111The formal definition of efficient is simple and minimal, as defined in [32]. we may wonder whether the linear forms in the identity must be “low dimensional” (i.e., all such linear forms must “depend on few variables”). To capture the dimensionality of the circuit given by Equation 1, [14] defined the rank of the circuit as being the linear span of the forms that appear in the circuit. In case of Equation 1, we have that the rank is dimspan{i,gj,hk}i,j,k[m].

Consider a linear form i from the first gate and a second linear form gj from the second gate. Since P0, we have that for any 𝐚N such that i(𝐚)=gj(𝐚)=0, it must be the case that hk(𝐚)=0. In other words, we have that V(i,gj)V(hk)=kV(hk). Since the algebraic set V(i,gj) is irreducible, we must have V(i,gj)V(ha) for some a[m]. Note that this last condition (combined with the symmetry among the gates) is exactly the local constraint arising from the dual formulation of Sylvester’s problem. In [14], the authors asked whether sets {i}i[m],{gj}j[m],{hk}k[m] arising from Equation 1 which satisfy the above conditions must be low-dimensional – more precisely, whether dimspan{i,gj,hk}i,j,k=O(1).

As it turns out, the Sylvester-Gallai like condition in the previous paragraph can be seen as a set-version of Sylvester’s problem, which was studied by [15]. This work showed that such configurations must have constant rank. Thus, we deduce that span{i,gj,hk}=O(1). This shows that any Σ3ΠΣ identity essentially depends on constantly many variables. Combined with the results of [31], this gives a black box deterministic PIT algorithm for such circuits. While significantly extra complexity and subtleties arise when one works with ΣkΠΣ circuits for k>3, the linear forms in “efficient” identities arising from such circuits satisfy enough local relationships to deduce that they have low rank. This approach was carried out by [32, 45], who generalized the above approach and showed an intrinsic and elegant connection between ΣkΠΣ identities and Sylvester-Gallai configurations (see [45, Theorem 1.4]). As a result of this connection, Sylvester-Gallai theorems imply deterministic, polynomial-time PIT algorithms for ΣkΠΣ circuits, when k=O(1).

Motivated by these results, Gupta [24] studied depth four circuits with constant top and bottom fan-ins. This circuit family is denoted ΣkΠΣΠd, and it computes polynomials which can be written as a sum of k products of polynomials of degree at most d.

Consider a polynomial Q computed by a Σ3ΠΣΠd circuit. It has the form

Q=i=1m1Ai(𝐱)+j=1m2Bj(𝐱)+k=1m3Ck(𝐱),

where Ai,Bj,Ck are polynomials of degree at most d. If Q0 and the representation is efficient, as in the previous case, we have V(Ai,Bj)V(Ck).222By symmetry among the gates, we also have V(Ai,Ck)V(Bj) and V(Bj,Ck)V(Ai). However, as the forms are not necessarily linear, we have that V(Ai,Bj) is not necessarily irreducible, and thus we no longer have the same Sylvester-Gallai type configuration, as we cannot guarantee the existence of k[m] such that V(Ai,Bj)V(Ck). Nevertheless, as there are many constraints of the form V(Ai,Bj)V(Ck), [24] asked whether such Sylvester-Gallai type conditions would be enough to prove that dimspan{Ai,Bj,Ck}i,j,k is small enough.

Given any “efficient” Σ3ΠΣΠd circuit (not necessarily computing the zero polynomial), [24] called it a Sylvester-Gallai circuit if the above constraints on the zerosets (i.e. V(Ai,Bj)V(Ck)) hold. The above discussion shows that all such circuits that compute 0 are Sylvester-Gallai circuits, but the converse is not true: for instance, multiplying a single form by a constant can change whether or not the circuit computes 0 without changing the above condition.

Non Sylvester-Gallai circuits are relatively simple compared to Sylvester-Gallai circuits. Non Sylvester-Gallai circuits do not have too many cancellations (the above discussion shows that they can never compute 0), and the circuit structure can be preserved by certain linear variable reduction maps, which is the main tool used in the works [24, 23] to handle these circuits. This allows these works to certify the non-zeroness of non-Sylvester-Gallai circuits.

The cancellations in Sylvester-Gallai circuits on the other hand are a lot more subtle, and it is not clear if they are preserved by the linear maps of [24, 23]. Similar to how linear Sylvester-Gallai configurations have constant rank, Gupta conjectured that sets of higher degree forms satisfying the local condition V(Ai,Bj)V(kCk) must “depend on constantly many variables.” If this is true, then we can efficiently test if Sylvester-Gallai circuits are nonzero using the methods of [8]. A simpler form of Gupta’s main conjecture is the following generalization of the configurations studied in [15]:

Conjecture 2 (Gupta’s main conjecture - simple form).

Let {Ai}i[m1]{Bj}j[m2]{Ck}k[m3] be a set of irreducible polynomials of degree d such that for any i[m1],j[m2], we have that V(Ai,Bj)V(kCk) (and the same relations hold when exchanging the roles of the polynomials A,B,C). Then there exists a function λ: such that

dimspan{Ai,Bj,Ck}i,j,kλ(d).

A first step towards establishing the above conjecture is to establish the Sylvester-Gallai version of the above conjecture. This motivates us to consider the Sylvester-Gallai configurations we study in this article, which we now define. By way of the algebraic-geometric dictionary, we will henceforth replace the geometric condition V(A,B)V(C) by the equivalent algebraic condition rad(C)rad(A,B).

Definition 3 (Product Sylvester–Gallai configurations).

A finite set [x1,,xN] of irreducible forms of degree at most d is a d-product-SG-configuration if the following hold.

  • Fi(Fj) for any Fi,Fj. (each form encodes a different hypersurface)

  • For each Fi,Fj, we have (Sylvester-Gallai condition)

    F{Fi,Fj}Frad(Fi,Fj).

As stated above, the reader may find it strange that the product constraint no longer seems local, since the condition for each Fi,Fj involves every other form in the configuration. However, by a standard Bézout type argument it follows that the condition F{Fi,Fj}Frad(Fi,Fj) is equivalent to the (local) condition that there are indices k1,,kd2 different from i,j such that

a=1d2Fkarad(Fi,Fj).

Since d is a constant, each (local) condition now only depends on constantly many forms in the configuration as expected. As this equivalent condition is more cumbersome to state, we prefer to work with the above definition.

In a similar fashion to [24], we can also conjecture that such “product” configurations are “low dimensional.” This is the content of the following conjecture:

Conjecture 4 (Product Sylvester-Gallai Conjecture).

There is a function λ: such that if is a d-product-SG-configuration then dimspan{}=λ(d).

The above conjecture was first considered by [39], where the authors proved Conjecture 4 for d=2. Our main theorem, which we now state, generalizes their main result to forms of every degree, thus fully settling Conjecture 4.

Theorem 5 (Product Sylvester-Gallai Theorem).

There is a function λ: such that if is a d-product-SG-configuration then dimspan{}=λ(d).

Our proof techniques build upon the techniques introduced in [47, 38] to study radical Sylvester-Gallai configurations, which we define in Section 1.3. In Section 1.3 we also discuss related and previous works on non-linear Sylvester-Gallai configurations and the PIT problem for depth-4 circuits. In Section 1.4, we give a high level outline of our proof, highlighting the technical difficulties that need to be overcome, as well as the new conceptual contributions of this work. We will now discuss the main contributions of this work, where we state our main technical result, which is of independent interest. Formal statements and proofs of all claims can be found in the full version of this paper.

1.2 Contributions of this paper

This paper has two main contributions. The first is the proof of Theorem 5, which states that product Sylvester-Gallai configurations of degree bounded by d have dimension uniformly upper bounded by a function which depends only on d. This result simultaneously generalizes [47, 39, 38], putting us one step closer towards a proof of Conjecture 2, and also to the main conjecture of Gupta [24]. To achieve this, we adapt the inductive approach from [38] to work with product configurations. This adaptation, as we discuss in Section 1.4, requires us to deal with several subtleties, one of them being to establish a useful property of the general quotients of [47, 38]: we prove that such general quotients behave well with respect to (a generalized notion of) absolute irreducibility. More precisely, we prove that that these quotients send absolutely reducible forms to reducible elements, and that forms which are absolutely irreducible over the vector space being quotiented out will remain absolutely irreducible after a general quotient.

The second contribution is our main technical result which relates absolute irreducibility of a polynomial P with an effective and uniform bound on the non-primality of certain ideals containing P. In a simplified form, it can be stated as:

Theorem 6 (Prime bound - informal).

Let S:=[x1,,xN,y1,,yn] and PS(x1,,xN) be a form of degree d that is irreducible over (x1,,xN)¯[y1,,yn]. The number of non-associate irreducible forms Q[x1,,xN] such that (P,Q) is not prime is bounded above by a function of d.

We believe that the above theorem, and its more general version (which can be found in the full version), together with its proof, are of independent interest. We now briefly discuss the ingredients needed in the proof of the above theorem. If one only combines results from Gröbner basis theory and elimination theory with techniques from algebraic geometry, one can obtain an effective bound for the above theorem which depends on N,n and d. Since the goal of a Sylvester-Gallai theorem is to obtain bounds independent of the number of variables, such a bound will not be good enough for our purposes. By combining the Stillman’s uniformity bounds of [5] with results from elimination theory and algebraic geometry, we are able to obtain an effective bound (depending only on n,d) on the number of non-associate irreducible forms Q such that (P,Q) is not prime.

Once the dependence on N has been removed and a finite bound on the number of (non-associate) bad forms Q has been established, we can then use Bertini’s theorem and the Ananyan-Hochster principle to further drop the dependency on n, finally obtaining a bound which simply depends on the degree d. To achieve this, we give very effective bounds on the degrees of generators of Gröbner bases for ideals generated by a small number of variables of low degree, and also for certain elimination ideals of such ideals. These bounds could be of independent interest.

An important point to note is that [38] builds on the Ananyan-Hochster principle to show their transfer theorems, which we also use. However, to prove the above theorem, we also need to apply the Ananyan-Hochster principle in a different manner: combined with results from Gröbner basis theory, we use it to find low degree polynomials in elimination ideals of ideals generated by a small number of low degree forms.

1.3 Related Work

1.3.1 Sylvester-Gallai configurations

In [24], as a first step of his plan, Gupta proposed the study of a direct generalization of the linear Sylvester-Gallai configurations, which are captured by the following definition.

Definition 7 (Radical Sylvester–Gallai configuration).

Let [x1,,xN] be a finite set of irreducible forms of degree at most d. We say that is a d-radical Sylvester-Gallai configuration if the following hold.

  • Fi(Fj) for any Fi,Fj.

  • For every Fi,Fj, we have Fkrad(Fi,Fj) for some Fk{Fi,Fj}.

The following conjecture is a strengthening of [24, Conjecture 2].

Conjecture 8 (Radical Sylvester-Gallai conjecture).

There is a function λ: such that if is a d-radical Sylvester-Gallai configuration then dimspan{}=λ(d).

Conjecture 8 was proved by [47] in the special case when d=2. Subsequently, [37] proved Conjecture 8 for d=3, and [38] fully resolved the above conjecture in the affirmative. Note that the condition Fkrad(Fi,Fj) for forms of degree d is a natural generalization of the condition k(i,j): by the Nullstellensatz, these conditions are equivalent to V(Fk)V(Fi,Fj) and V(k)V(i,j) respectively. Therefore Conjecture 8 is a direct generalisation of the usual Sylvester-Gallai theorem for forms of higher degrees.

The first work to consider product Sylvester-Gallai configurations was the work of [39], where the authors settled Conjecture 4 for quadratic forms. In follow up work [40], the same authors settled Conjecture 2 for the case when d=2, therefore proving that Gupta’s PIT algorithm runs in deterministic, poly-time for Σ3ΠΣΠ2 circuits. Note that the above path in the proofs of Sylvester-Gallai type results is a natural path towards resolving the PIT problem for Σ3ΠΣΠd circuits, since Conjecture 8 follows from Conjecture 4, and it can be shown that Conjecture 2 would follow from a robust version of Conjecture 4.

Robust and higher dimensional generalizations of the radical Sylvester-Gallai theorems have also been studied by the works [41, 22, 21]. Aside from settling such interesting questions in extremal combinatorial geometry, such variations are also motivated by the PIT problem for depth-4 circuits, in the hope that these more general versions may be useful towards obtaining a (potentially simpler) proof of Gupta’s conjectures.

1.3.2 PIT for depth four circuits

Depth-4 circuits with bounded top fan-in are among the “easiest” classes for which we do not have deterministic, poly-time PIT algorithms. There has thus been some work on deterministic PIT for these circuits using methods other than the Sylvester-Gallai based methods discussed above.

In [11], the authors give a quasipolynomial time PIT algorithm for depth 4 circuits of bounded top and bottom fanins, via the Jacobian method of [4]. By using the logarithmic derivative and its power series, they are able to modify the top sum gate in the circuit to a powering gate.

The breakthrough work of [34] on lower bounds for bounded depth circuits gives another approach to PIT for this model. Hardness-randomness tradeoffs have been well studied in algebraic complexity [2, 30], and the work of [10] showed that these tradeoffs also hold in the bounded depth setting. Combining these results gives a subexponential time PIT algorithm for depth four circuits. Based on the lower bounds of [34], a hitting set generator for bounded depth circuits was constructed by [6], which gives another subexponential time PIT algorithm for depth four circuits, with improved parameters.

It is important to note that neither of the above methods seem to be able to give a truly polynomial time algorithm for the PIT problem for ΣkΠΣΠd circuits. The only currently known method that might do so is the approach of [24].

1.4 High-level proof overview

In this subsection we provide an overview of the proof of our main result, Theorem 5. We begin by outlining the high-level approach used in previous works to bound the dimension of the span of radical Sylvester-Gallai configurations and discuss the challenges of applying this method in our context. Next, we describe the tools that we employ to address these challenges. Finally, we provide a concise summary of how these components are combined to establish our main theorem.

To illustrate the ideas in this overview, we use a special case of product Sylvester-Gallai configurations with a simplifying assumption as an example. In the final part of this subsection, we explain how this assumption can be eliminated to extend the argument to the general case.

1.4.1 High-level approach

At a high level, our proof uses a similar approach as the previous works on higher degree Sylvester-Gallai configurations [47, 39, 40, 41, 21, 38], which we now summarize.

Let R:=[x1,,xN], and S:=R[y1,,yn] be two polynomial rings and S be a radical (or product) Sylvester-Gallai configuration, where the forms in have degree at most d. Thus, we can write =1d, where e are the forms in of degree e. We start with the assumption that each form only depends on constantly many variables in S. A number of key ideas can already be highlighted in this easier setting.

We want to control the highest degree forms d in our configuration, with the goal of reducing to the case when the highest degree is d1, where we can proceed inductively.

If many ideals of the form (Fi,Fj), where Fi,Fjd, are radical (or prime), then the set d is essentially a (robust) linear Sylvester-Gallai configuration. In this case, the linear Sylvester-Gallai theorems imply that constantly many forms F1,,Fa are a basis for span{d}, and if x1,,xr is the union of the set of variables of F1,,Fa, then d[x1,,xr]. This is sufficient to control the forms of d and apply our inductive step. The interesting case is thus when there are many ideals generated by forms in d that are not radical (or prime). In this case, the goal is to show that the forms in d must share many variables in common. The steps listed below show how previous works have dealt with this case.

  1. 1.

    First, one devises a structure theorem on ideals that are not radical (or prime). In principle, such structure theorems show that if an ideal generated by two polynomials is not radical (or not prime), then the variables appearing in the polynomials must have some special “dependencies”.

  2. 2.

    Next, combine the structure theorem with the local conditions to show the existence of constantly many “good variables” x1,,xb such that d(x1,,xb).

    In this step, our approach differs from previous works, as in a product configuration, it may not be possible to directly achieve this high level of control on the forms in d. We discuss this in more detail in Step 2 below.

  3. 3.

    Finally, we “randomly project” (i.e., apply a general quotient) the special variables x1,,xb to a new variable z in order to reduce the degree of the forms in d.

    Also, in this step our approach will differ from previous works. Instead of reducing degree directly, we will make the forms in factor more.”

We now elaborate on this approach, pointing out the new difficulties we face in this work, and sketch how we deal with these issues.

Step 1 – Structure theorems

Earlier works ([47, 37]) gave fairly strong classification of ideals that are not radical in the first step. In particular, they consider the possible minimal primes that a non-radical ideal of the form (P,Q) can have, and possible multiplicities that these minimal primes can have, giving a structural result for each of these cases. However this strategy is hard to generalise beyond the case of ideals generated by two cubics, since these ideals have very high degree and their primary decomposition becomes much more complex as the degree grows.

Subsequent work by [38] used a significantly weaker (albeit more general) structure theorem, and showed that this suffices to carry out the strategy above. The structure theorem in [38] is the following. If PS is an irreducible form of degree d that is not in the ideal (x1,,xN), then there are at most 3d3 square-free forms QiR such that (P,Qi) is not radical. Informally, this statement says that if a form P depends non-trivially on some variable(s) yi, then (P,Q) must be radical for almost every polynomial Q that only depends on x1,,xN.

For product Sylvester-Gallai configurations, the ideals of interest are those that are not prime. One could hope to generalize the structural result from [38] to this setting. However, a statement of the above form is no longer true here. Take the form P=y14x1x2y22. This form depends on the variables y1,y2. However, for every irreducible linear form QR, the ideal (P,x1x2Q2) is not prime, even though P and x1x2Q2 are irreducible. This is our first technical difficulty.

We overcome this by showing that the above only happens because P is reducible as a form in (x1,,xN)¯[y1,y2], since it factors as P=(y12x1x2y2)(y12+x1x2y2). This is where our first key conceptual and technical contribution comes in. We show that for any degree d form in R[y1,,ym] that is irreducible in (x1,,xN)¯[y1,,yn] (such forms are called absolutely irreducible over R[y1,,ym]), there are only finitely many QiR such that (P,Qi) is not prime.

A fairly subtle issue remains in the above statement. In order to solve the product Sylvester-Gallai problem, we need quantitative bounds on the number of “bad forms” Qi. Moreover, we need a bound which is solely a function of deg(P). If we use standard techniques from commutative algebra and algebraic geometry, we obtain bounds which depend on the number of variables (that is: N,n) and on deg(P). This is not sufficient for us, as we want to show that dimspan{} does not depend on the number of variables in the polynomial ring. To prove that the number of bad forms is in fact independent of N and n, we need to work a bit harder, and we need to make use of two additional tools: Bertini-type theorems (to remove the dependence on n), and the Ananyan-Hochster construction of small strong subalgebras (this will remove the dependence on N). This yields, in its basic form, Theorem 6, which we discussed in Section 1.2.

Step 2 – Finding a small set of “good common variables”

Suppose is a radical Sylvester-Gallai configuration. By the above, we can assume that d is not a robust linear Sylvester-Gallai configuration. Our assumption implies that for many pairs Fi,Fj, the ideal (Fi,Fj) is not radical. In particular, for a small 0<ε<1, we can find 3d3 forms F1,,F3d3d such that (i,j) is not radical for i3d3 and j(1ε)|d|. The radical bound mentioned in Step 1 shows that there is aO(1) and an ideal generated by linear forms (1,,a) that contains every such Fj. Repeating this argument, we can find an ideal (1,,b) with b constant such that d(1,,b).

When is a product Sylvester-Gallai configuration a number of issues arise: first, our structure theorem deals with prime ideals (instead of radical) and our assumptions are that the forms are absolutely irreducible. Hence, we will only be able to draw the following weaker conclusion: there are constantly many variables x1,,xb such that every Fd is either in (x1,,xb), or absolutely reducible in the ring S:=(x1,,xb)[xb+1,,xN,y1,,yn]. Therefore, when we apply the general quotient (i.e. a random projection), we can no longer guarantee that the degree of our forms reduce. We deal with this issue by changing the way we “decompose” the configuration =1d. This is a key difference between our approach and previous approaches, since this issue does not arise when studying radical Sylvester-Gallai configurations. This new decomposition also allows us to solve an issue that arises when we apply the general quotient. We now state this issue, and discuss the solution.

Step 3 – General quotients & degree reduction/lowering degree

The image of an irreducible form under a projection map can be reducible. In order to apply induction, the definition of product Sylvester-Gallai configurations has to be generalised to allow the forms in to be reducible. Hence, we update our definition of product Sylvester-Gallai configurations to the following.

Definition 9 (Product Sylvester–Gallai configurations).

A finite set [x1,,xN] of forms of degree at most d is a d-product-SG-configuration if the following hold.

  • gcd(Fi,Fj)=1 for all ij.

  • For every Fi,Fj, we have F{Fi,Fj}Frad(Fi,Fj).

This generalisation already appears in [38], where reducibility is not an issue when dealing with radical ideals, since the radical bound also applies to reducible forms. The prime bound however can no longer apply, irrespective of any assumption: if P is reducible then (P,Q) can never be prime. In particular, if every form in d is reducible, then we can draw no conclusion about d using the above ideas.

We tackle both these issues by turning this reducibility into an advantage: instead of trying to reduce the degree as in the inductive approaches of [47, 38, 39], we will try and make the forms in our configuration “factor more”.

In order to formalise what we mean by “factor more”, we introduce the notion of factor sets of a product configuration. Given a (generalized) product Sylvester-Gallai configuration , we define the factor set of to be the set of irreducible factors of all forms in . As before, we write =1c, where jSj.

If every form in factors a lot, then all the forms in will have low degree. We will induct on the highest integer c such that c is nonzero. Now, the set of “common variables” we look for will not be variables common to d, but variables common to c. The absolute irreducibility condition remains, and we end up with variables x1,,xb such that every form in c is either in the ideal (x1,,xb) or absolutely reducible. We postpone the discussion of how we do this step to the next subsection. We now show how these common variables are used in the rest of the induction step.

Step 2 lets us find the “common variables” x1,,xb. Define a projection map φ that maps each xi for 1ib to a random multiple of a new variable z, and acts as identity on the other variables. If F(x1,,xb) then φ(F)=zF, where deg(F)<deg(F). We define φ() to be the image of under φ, after factoring out powers of z. Such a projection map preserves radical and product Sylvester-Gallai structure, and also allows bounds on the dimension of φ() to be lifted back to bounds on the dimension of .

If we have d(x1,,xb) (as in the case of radical Sylvester-Gallai configurations) then φ() is a Sylvester-Gallai configuration with forms of degree at most d1, and induction can be applied. If on the other hand is a product Sylvester-Gallai configuration, then we have the weaker condition that every Fc is either in (x1,,xb) or absolutely reducible in S. We show that the image of any F that is absolutely reducible in S under the map φ is reducible. Thus, in either case we conclude that c=, where is a factor set of φ(), and we can apply induction.

In the next subsection we show how we find these “common variables” in our setting. Note that we are still under the assumption that every form in (and thus in ) depends on only constantly many variables. We will also show how this assumption is removed.

1.4.2 Putting everything together

Let be a product Sylvester-Gallai configuration of forms of degree at most d. We now allow forms in to be reducible, but still require that they are square-free and are pairwise relatively prime. Let :=1d be the set of irreducible factors of forms in . We will induct on the largest integer c such that c. We first go over the base case in our induction.

Our base case is when every form Fi is a product of linear forms, say Fi=aia. In this case, =1={ia}i,a. Now suppose we pick ia and jb with ij. The ideal (ia,jb) is a minimal prime of (Fi,Fj). Therefore, the product Sylvester-Gallai condition implies that Fk(ia,jb) for some k, which in turn implies that ke(ia,jb) for some choice of index e. This is a linear Sylvester-Gallai relationship among the elements of 1. The choice of ia,jb was arbitrary, except the condition that ij. Therefore, the set 1 is a robust linear Sylvester-Gallai configuration, and therefore has bounded rank. This in turn implies that the forms in have bounded rank, since they are spanned by monomials of degree at most d in a basis for =1.

We now sketch the induction step. Suppose c is the largest integer such that c. Our induction step is further split into two steps, as discussed in the previous subsection.

  1. 1.

    First we show that there are variables x1,,xb such that every form in c is either in the ideal (x1,,xb) or absolutely reducible in S:=(x1,,xb)[xb+1,,xN,y1,,yn]. This is a relaxed version of step 2 in the previous approach, applied to c instead of d.

  2. 2.

    We then show that projecting x1,,xb to a new variable z results in a product Sylvester-Gallai configuration with e= for any ec.

We elaborate on these in the special case when c=2, since this already captures all the main ideas. As before, if 2 is a robust linear Sylvester-Gallai configuration, then step 2 follows easily by picking a basis for 2. Consider A,B2, and suppose A|Fi and B|Fj for some Fi,Fj. Suppose (A,B) is prime. Then (A,B) is a minimal prime of (Fi,Fj). The product Sylvester-Gallai configuration implies Fk(A,B) for some k, and therefore C(A,B) for some C|Fk. Further, by homogeneity, it must be that C2. This gives us a linear Sylvester-Gallai relationship between the elements A,B,C2. The assumption that 2 is not a robust linear Sylvester-Gallai configuration will therefore imply that (A,B) is not prime for many pairs A,B. Since the forms in 2 are irreducible (even though the forms in might not be), by a combinatorial argument and Theorem 6, we deduce the existence of variables x1,,xb, completing step 1 above.

We now move on to second step. Suppose φ is a projection map that sends x1,,xb to random multiples of a new variable z. The map φ sends elements in (x1,,xb) to multiples of z. We also prove that φ sends elements of 2(x1,,xb) that are absolutely reducible in S to reducible forms. Since c=2, the degree of each of these forms has to be 1. Therefore, if we apply φ to some F, every factor of F will be linear, even though it is possible that deg(F)=deg(φ(F)). This reduces the problem to the base case, and the properties of φ allow us to lift the resulting bound on the dimension of φ() back to a bound on the dimension of .

Removing the initial assumption

We started with the assumption that every form in depended only on constantly many variables. In general, it is of course possible that the forms depend on many or even all the variables. This issue was overcome in [38], building on the seminal work of [5]. They show that forms H1,,Ha of high enough strength behave essentially like variables, in the sense that [H1,,Ha] is isomorphic to a polynomial ring. Further, the extension [H1,,Ha]S has many of the properties that the extension [x1,,xa]S has, the most useful of them being that this extension preserves arbitrary intersection of ideals.

Motivated by this, [38] define the notion of strong vector spaces and algebras, which are vector spaces of small dimension spanned by forms of high rank, and the algebras they generate. They show that in the general case of the radical Sylvester-Gallai problem (when the forms depend on more than constantly many variables), while there might not be variables x1,,xb such that d(x1,,xb), we can find a strong vector space V such that d(V).

The above change complicates the projection step, since random restriction of higher degree forms is no longer well defined. To fix this, [38] defined generalised projection maps, and again proved that such maps have all the properties that the linear projection maps have. In particular, they preserve the radical Sylvester-Gallai structure, and they allow bounds on the dimension of the image of to be lifted back to bounds on . With these tools, they set up an inductive framework to show radical Sylvester-Gallai configurations have bounded dimension, with linear Sylvester-Gallai configurations acting as the base case.

Extending our results to the setting of strong algebras in order to apply this framework requires two technical results. First we define the notion of absolute reducibility with respect to strong vector spaces. This allows us to extend the prime bound to strong algebras. We then show that general projection maps send forms that are absolutely reducible with respect to strong vector spaces to reducible forms. These two extensions to the above framework allow us to prove our main theorem in the general case.

2 Conclusion

In this work, we prove that product Sylvester-Gallai configurations of forms have bounded dimension, generalising the result of [38], and getting one step closer towards a PIT algorithm for ΣkΠΣΠd circuits. To achieve this, we give a novel, and very effective, sufficient condition for the primality of ideals of the form (P,Q), where P “depends on more variables” than Q, via the transfer principle from [38]. This turns out to be far more subtle, and to require more tools from algebraic geometry, than giving a sufficient condition for the ideal to be radical.

Our work leaves two important questions, in order to bridge the gap between our results and a complete analysis for a polynomial-time, deterministic PIT algorithm for ΣkΠΣΠd circuits. The first question is to extend this work and prove a higher codimension product Sylvester-Gallai theorem for all degrees, simultaneously generalising the main theorems of [21] and this work. The second question is to provide a tight connection between such Sylvester-Gallai problems and rank bounds for identities in ΣkΠΣΠd, thereby generalizing the beautiful result in [45, Theorem 1.4] to circuits of depth four.

References

  • [1] M. Agrawal and Manindra. Vinay. Arithmetic circuits: A chasm at depth four. In 49th Annual IEEE Symposium on Foundations of Computer Science, FOCS 2008, October 25-28, 2008.
  • [2] Manindra Agrawal. Proving lower bounds via pseudo-random generators. In FSTTCS 2005: Foundations of Software Technology and Theoretical Computer Science: 25th International Conference, Hyderabad, India, December 15-18, 2005. Proceedings 25, pages 92–105. Springer, 2005. doi:10.1007/11590156_6.
  • [3] Manindra Agrawal, Neeraj Kayal, and Nitin Saxena. Primes is in p. Annals of mathematics, pages 781–793, 2004.
  • [4] Manindra Agrawal, Chandan Saha, Ramprasad Saptharishi, and Nitin Saxena. Jacobian hits circuits: Hitting sets, lower bounds for depth-d occur-k formulas and depth-3 transcendence degree-k circuits. SIAM Journal on Computing, 45(4):1533–1562, 2016. doi:10.1137/130910725.
  • [5] Tigran Ananyan and Melvin Hochster. Small subalgebras of polynomial rings and stillman’s conjecture. Journal of the American Mathematical Society, 33(1):291–309, 2020.
  • [6] Robert Andrews and Michael A. Forbes. Ideals, determinants, and straightening: proving and using lower bounds for polynomial ideals. In 54th Annual ACM SIGACT Symposium on Theory of Computing, STOC 2022, pages 389–402, 2022. doi:10.1145/3519935.3520025.
  • [7] Boaz Barak, Zeev Dvir, Amir Yehudayoff, and Avi Wigderson. Rank bounds for design matrices with applications to combinatorial geometry and locally correctable codes. In Proceedings of the Forty-Third Annual ACM Symposium on Theory of Computing, STOC ’11, pages 519–528, 2011. doi:10.1145/1993636.1993705.
  • [8] M. Beecken, J. Mittmann, and N. Saxena. Algebraic independence and blackbox identity testing. Information and Computation, 222:2–19, 2013. 38th International Colloquium on Automata, Languages and Programming (ICALP 2011). doi:10.1016/j.ic.2012.10.004.
  • [9] Peter Borwein and William OJ Moser. A survey of sylvester’s problem and its generalizations. Aequationes Mathematicae, 40(1):111–135, 1990.
  • [10] Chi-Ning Chou, Mrinal Kumar, and Noam Solomon. Closure results for polynomial factorization. Theory of Computing, 15(1):1–34, 2019. doi:10.4086/TOC.2019.V015A013.
  • [11] Pranjal Dutta, Prateek Dwivedi, and Nitin Saxena. Deterministic identity testing paradigms for bounded top-fanin depth-4 circuits. In Proceedings of the 36th Computational Complexity Conference, CCC ’21, 2021. doi:10.4230/LIPIcs.CCC.2021.11.
  • [12] Zeev Dvir. Incidence theorems and their applications. arXiv preprint arXiv:1208.5073, 2012. arXiv:1208.5073.
  • [13] Zeev Dvir, Shubhangi Saraf, and Avi Wigderson. Improved rank bounds for design matrices and a new proof of kelly’s theorem. In Forum of Mathematics, Sigma, volume 2. Cambridge University Press, 2014.
  • [14] Zeev Dvir and Amir Shpilka. Locally decodable codes with two queries and polynomial identity testing for depth 3 circuits. SIAM Journal on Computing, 36(5):1404–1434, 2007. doi:10.1137/05063605X.
  • [15] Michael Edelstein and Leroy M Kelly. Bisecants of finite collections of sets in linear spaces. Canadian Journal of Mathematics, 18:375–380, 1966.
  • [16] N. Elkies, L. Pretorius, and K. Swanepoel. Sylvester–gallai theorems for complex numbers and quaternions. Discrete Comput Geom, 35:361–373, 2006. doi:10.1007/S00454-005-1226-7.
  • [17] Paul Erdos, Richard Bellman, Hubert S Wall, James Singer, and Victor Thébault. Problems for solution: 4065-4069. The American Mathematical Monthly, 50(1):65–66, 1943.
  • [18] Stephen Fenner, Rohit Gurjar, and Thomas Thierauf. A deterministic parallel algorithm for bipartite perfect matching. Communications of the ACM, 62(3):109–115, 2019. doi:10.1145/3306208.
  • [19] Michael A Forbes and Amir Shpilka. Explicit noether normalization for simultaneous conjugation via polynomial identity testing. In International Workshop on Approximation Algorithms for Combinatorial Optimization, pages 527–542. Springer, 2013. doi:10.1007/978-3-642-40328-6_37.
  • [20] Tibor Gallai. Solution of problem 4065. American Mathematical Monthly, 51:169–171, 1944.
  • [21] Abhibhav Garg, Rafael Oliveira, Shir Peleg, and Akash Kumar Sengupta. Radical Sylvester-Gallai Theorem for Tuples of Quadratics. In 38th Computational Complexity Conference (CCC 2023), pages 20:1–20:30, 2023. doi:10.4230/LIPIcs.CCC.2023.20.
  • [22] Abhibhav Garg, Rafael Oliveira, and Akash Kumar Sengupta. Robust Radical Sylvester-Gallai Theorem for Quadratics. In 38th International Symposium on Computational Geometry (SoCG 2022), pages 42:1–42:13, 2022. doi:10.4230/LIPIcs.SoCG.2022.42.
  • [23] Zeyu Guo. Variety Evasive Subspace Families. In Valentine Kabanets, editor, 36th Computational Complexity Conference (CCC 2021), volume 200 of Leibniz International Proceedings in Informatics (LIPIcs), pages 20:1–20:33, 2021. doi:10.4230/LIPIcs.CCC.2021.20.
  • [24] Ankit Gupta. Algebraic geometric techniques for depth-4 pit & sylvester-gallai conjectures for varieties. In Electron. Colloquium Comput. Complex., volume 21, page 130, 2014.
  • [25] Ankit Gupta, Pritish Kamath, Neeraj Kayal, and Ramprasad Saptharishi. Arithmetic circuits: A chasm at depth 3. SIAM Journal on Computing, 45(3):1064–1079, 2016. doi:10.1137/140957123.
  • [26] Rohit Gurjar and Thomas Thierauf. Linear matroid intersection is in quasi-nc. In Proceedings of the 49th Annual ACM SIGACT Symposium on Theory of Computing, pages 821–830, 2017. doi:10.1145/3055399.3055440.
  • [27] Sten Hansen. A generalization of a theorem of sylvester on the lines determined by a finite point set. Mathematica Scandinavica, 16(2):175–180, 1965.
  • [28] Joos Heintz and Claus-Peter Schnorr. Testing polynomials which are easy to compute. In Proceedings of the twelfth annual ACM Symposium on Theory of Computing, pages 262–272, 1980.
  • [29] Friedrich Hirzebruch. Arrangements of lines and algebraic surfaces. In Arithmetic and geometry, pages 113–140. Springer, 1983.
  • [30] Valentine Kabanets and Russell Impagliazzo. Derandomizing polynomial identity tests means proving circuit lower bounds. computational complexity, 13:1–46, 2004. doi:10.1007/S00037-004-0182-6.
  • [31] Zohar S. Karnin and Amir Shpilka. Black box polynomial identity testing of generalized depth-3 arithmetic circuits with bounded top fan-in. In 2008 23rd Annual IEEE Conference on Computational Complexity, pages 280–291, 2008. doi:10.1109/CCC.2008.15.
  • [32] Neeraj Kayal and Shubhangi Saraf. Blackbox polynomial identity testing for depth 3 circuits. In 2009 50th Annual IEEE Symposium on Foundations of Computer Science, pages 198–207. IEEE, 2009. doi:10.1109/FOCS.2009.67.
  • [33] Swastik Kopparty, Shubhangi Saraf, and Amir Shpilka. Equivalence of polynomial identity testing and polynomial factorization. computational complexity, 24:295–331, 2015. doi:10.1007/S00037-015-0102-Y.
  • [34] Nutan Limaye, Srikanth Srinivasan, and Sébastien Tavenas. Superpolynomial lower bounds against low-depth algebraic circuits. In 2021 IEEE 62nd Annual Symposium on Foundations of Computer Science (FOCS), pages 804–814. IEEE, 2022.
  • [35] Eberhard Melchior. Uber vielseite der projektiven ebene. Deutsche Math, 5:461–475, 1940.
  • [36] Ketan Mulmuley. Geometric complexity theory v: Efficient algorithms for noether normalization. Journal of the American Mathematical Society, 30(1):225–309, 2017.
  • [37] Rafael Oliveira and Akash Sengupta. Radical sylvester-gallai theorem for cubics. FOCS, 2022.
  • [38] Rafael Oliveira and Akash Kumar Sengupta. Strong algebras and radical sylvester-gallai configurations. In Proceedings of the 56th Annual ACM Symposium on Theory of Computing, pages 95–105, 2024. doi:10.1145/3618260.3649617.
  • [39] Shir Peleg and Amir Shpilka. A Generalized Sylvester-Gallai Type Theorem for Quadratic Polynomials. In 35th Computational Complexity Conference (CCC 2020), pages 8:1–8:33, 2020. doi:10.4230/LIPIcs.CCC.2020.8.
  • [40] Shir Peleg and Amir Shpilka. Polynomial time deterministic identity testing algorithm for Σ[3]ΠΣΠ[2] circuits via edelstein-kelly type theorem for quadratic polynomials. In STOC ’21: 53rd Annual ACM SIGACT Symposium on Theory of Computing, Virtual Event, Italy, June 21-25, 2021, pages 259–271. ACM, 2021. doi:10.1145/3406325.3451013.
  • [41] Shir Peleg and Amir Shpilka. Robust sylvester-gallai type theorem for quadratic polynomials. In 38th International Symposium on Computational Geometry, SoCG 2022, June 7-10, 2022, Berlin, Germany, volume 224, pages 43:1–43:15, 2022. doi:10.4230/LIPIcs.SoCG.2022.43.
  • [42] L. M. Pretorius and K. J. Swanepoel. The sylvester-gallai theorem, colourings and algebra. Discret. Math., 2009.
  • [43] Nitin Saxena. Progress on polynomial identity testing. Bull. EATCS, 99:49–79, 2009.
  • [44] Nitin Saxena. Progress on polynomial identity testing-ii. Perspectives in Computational Complexity: The Somenath Biswas Anniversary Volume, pages 131–146, 2014.
  • [45] Nitin Saxena and Comandur Seshadhri. From sylvester-gallai configurations to rank bounds: Improved blackbox identity test for depth-3 circuits. Journal of the ACM (JACM), 60(5):1–33, 2013. doi:10.1145/2528403.
  • [46] Jean-Pierre Serre. Advanced problem 5359. Amer. Math. Monthly, 73(1):89, 1966.
  • [47] Amir Shpilka. Sylvester-gallai type theorems for quadratic polynomials. Discrete Analysis, page 14492, 2020.
  • [48] Amir Shpilka and Amir Yehudayoff. Arithmetic circuits: A survey of recent results and open questions. Foundations and Trends® in Theoretical Computer Science, 5(3–4):207–388, 2010. doi:10.1561/0400000039.
  • [49] James Joseph Sylvester. Mathematical question 11851. Educational Times, 59(98):256, 1893.
  • [50] Sébastien Tavenas. Improved bounds for reduction to depth 4 and depth 3. Information and Computation, 240:2–11, 2015. doi:10.1016/J.IC.2014.09.004.