Abstract 1 Introduction 2 Our Results 3 Proof Idea and Notations 4 Lower Bound 5 Separation between homogeneous and low syntactic degree circuits 6 Discussion References

Lower Bounds for Noncommutative Circuits with Low Syntactic Degree

Pratik Shastri ORCID The Institute of Mathematical Sciences (a CI of Homi Bhabha National Institute), Chennai, India
Abstract

Proving lower bounds on the size of noncommutative arithmetic circuits is an important problem in arithmetic circuit complexity. For explicit n variate polynomials of degree Θ(n), the best known general bound is Ω(nlogn) [7, 1]. Recent work of Chatterjee and Hrubeš [4] has provided stronger (Ω(n2)) bounds for the restricted class of homogeneous circuits.

The present paper extends these results to a broader class of circuits by using syntactic degree as a complexity measure. The syntactic degree of a circuit is a well known parameter which measures the extent to which high degree computation is used in the circuit. A homogeneous circuit computing a degree d polynomial can be assumed, without loss of generality, to have syntactic degree exactly equal to d [5]. We generalize this by considering circuits that are not necessarily homogeneous but have low syntactic degree. Specifically, for an explicit n variate, degree n polynomial f we show that any circuit with syntactic degree O(n) computing f must have size Ω(n1+c) for some constant c>0. We also show that any circuit with syntactic degree o(nlogn) computing the same f must have size ω(nlogn). We further analyze the circuit size required to compute f based on the number of distinct syntactic degrees appearing in the circuit. Our analysis yields an ω(nlogn) size lower bound for all but a narrow parameter regime where an improved bound is not obtained. Finally, we observe that low syntactic degree circuits are more powerful than homogeneous circuits in a fine grained sense: there exists an n variate, degree Θ(n) polynomial that has a circuit of size O(nlog2n) and syntactic degree O(n) but any homogeneous circuit computing it requires size Ω(n2).

Keywords and phrases:
Noncommutative Circuits, Lower Bounds, Circuit Complexity, Algebraic Complexity
Copyright and License:
[Uncaptioned image] © Pratik Shastri; licensed under Creative Commons License CC-BY 4.0
2012 ACM Subject Classification:
Theory of computation Algebraic complexity theory
Editor:
Shubhangi Saraf

1 Introduction

Let 𝔽 be a field and X={x1,,xn} be a set of variables. A noncommutative polynomial f(X) over 𝔽 in the variables X is a finite 𝔽-linear combination of noncommutative monomials in X (equivalently, one may think of these as words in {x1,,xn}). The set of all such polynomials forms a ring under addition and noncommutative multiplication, this is the noncommutative polynomial ring denoted by 𝔽X. In noncommutative arithmetic circuit complexity, the central object of study is the noncommutative arithmetic circuit. A noncommutative arithmetic circuit (we will often simply say circuit for short), is a directed acyclic graph whose leaves are labeled by variables (say from X) and constants from 𝔽, and whose internal nodes, called gates, are labeled by either + or ×. A circuit is said to have fan-in 2 if each gate has in-degree 2. Each gate in the circuit computes a polynomial in the natural way: a + gate computes the sum of the polynomials computed at the children and a × gate computes the product. Importantly, since the product is noncommutative, each product gate comes with an ordering on the children. Since we will deal with fan-in 2 circuits, each such gate has a designated left child and a designated right child. We will say that a circuit has size s if the underlying graph has s vertices. We will say that a gate g in a circuit has depth Δ if the longest leaf to g path in the circuit has length Δ. The depth of a circuit is then defined as the maximum depth of any gate in it. We designate a particular gate of the circuit to be the output gate, and the polynomial computed by the circuit is defined to be the polynomial computed at this distinguished gate.

A central goal in this area is to prove exponential lower bounds on the size of circuits computing explicit111We will say that a polynomial f𝔽X is explicit if there is a polynomial time algorithm that computes, given a monomial mX, the coefficient of m in f. polynomials. The most influential work in this area is that of Nisan [6]. He showed exponential lower bounds on the size of formulas computing explicit polynomials. Formulas are circuits whose underlying graph is a tree. In order to show this, he introduced a complexity measure, now called Nisan Rank. Unfortunately, his methods do not provide such strong bounds for circuits. Indeed, the best lower bounds on the circuit size of an explicit, n variate degree Θ(n) polynomial are of the form Ω(nlogn) [7, 1] and improving upon this has been an outstanding open problem for more than 40 years. The work of Carmosino et al. [3] provides strong motivation to improve upon the nlogn bound: they show, for instance, that barely superlinear lower bounds (of the form Ω(nω/2+ϵ), where ω is the exponent of matrix multiplication) for noncommutative circuits computing constant degree polynomials can be amplified to obtain exponential lower bounds.

In a recent work [4], Chatterjee and Hrubeš improve upon the Ω(nlogn) bound, assuming that the circuit is homogeneous. A circuit is said to be homogeneous if every gate in it computes a homogeneous polynomial. In their main result ([4], Theorem 14), they construct an n variate, degree Θ(n) polynomial f such that any homogeneous circuit computing f requires size Ω(n2).

Note that noncommutative circuits can be homogenized, but this procedure incurs a multiplicative blowup of d2 (where d is the degree of the polynomial) in size and therefore the results in the paper discussed above fall short of giving lower bounds for general (not necessarily homogeneous) circuits.

2 Our Results

The work of Chatterjee and Hrubeš [4] leaves open (and tantalizingly so) the question of going beyond homogeneous circuits and proving ω(nlogn) lower bounds for general circuits. In this work, we take a step in this direction. We remove the homogeneity restriction and replace it with a bound on the syntactic degree. Let Ψ be a circuit. For each gate g in Ψ, we define the syntactic degree d(g) inductively as follows: for a leaf g labeled by a field constant, we define d(g)=0. For a leaf labeled by a variable, we define d(g)=1. For a sum gate g=g1+g2 we define d(g)=max{d(g1),d(g2)}. If g=g1×g2 is a product gate, we define d(g)=d(g1)+d(g2). Note that for each gate g, the degree of the polynomial computed at g is at most d(g). The syntactic degree of Ψ is then defined as the maximum over all gates g of Ψ of d(g).

Notice that if we have a homogeneous circuit Ψ for a degree d polynomial, we can assume without loss of generality that the syntactic degree of Ψ is exactly d. This is because in such a circuit, each gate g either computes a polynomial of degree equal to d(g), or it computes the 0 polynomial. This is well known [5] and easily proved, for example by induction. Therefore, homogeneity forbids (useful) computation of polynomials of degree larger than d. This simple observation suggests a natural relaxation of homogeneity: allowing the circuit to have syntactic degree larger than the degree of the output, but not much more. Such relaxations of homogeneity have been examined in the literature. For instance, in a recent result, the authors of [5] provided a quasi-homogenization result for commutative formulas. A formula is said to be quasi-homogeneous if the syntactic degree of the formula is a polynomially bounded (from above) by the degree of the output.

Quantitatively, we recall that Chatterjee and Hrubeš have proved an Ω(n2) lower bound on the size of any homogeneous noncommutative circuit computing some explicit polynomial on n variables of degree Θ(n). In contrast, for general circuits, the strongest known lower bound remains Ω(nlogn).

One can now ask the following intermediate question: can we prove better-than-Ω(nlogn) lower bounds on the size of circuits computing an explicit n variate, degree Θ(n) polynomial where the circuit is not necessarily homogeneous, but has syntactic degree, say, O(n)? Such circuits may compute intermediate inhomogeneous polynomials of degree higher than the degree of the output, but not much higher. We answer this question in the affirmative.

 Remark 1.

Circuits with bounded syntactic degree are provably more powerful than their homogeneous counterparts: there exists an n variate, degree Θ(n) polynomial which can be computed by a circuit of size O(nlog2n) and syntactic degree O(n) but any homogeneous circuit computing it must have size Ω(n2). This follows from inspecting a construction presented in the work of Chatterjee and Hrubeš, see Section 5.

Let X={x1,,xn} and Y={y1,,yn}. Our results apply to the palindrome polynomial Paln,n(X,Y)𝔽X,Y where Paln,d(X,Y)𝔽X,Y is defined as follows:

Paln,d(X,Y)=(i1,id)[n]d(j=1dxijj=1dyid+1j)

A version of this polynomial was already introduced by Nisan in [6] to separate circuits from formulas. Indeed, there exists a homogeneous circuit of size O(n2) that computes Paln,n(X,Y). We obtain this from the recursive expression Paln,d=i=1nxiPaln,d1yi. On the other hand, it follows from Nisan’s work [6] that any formula computing Paln,n(X,Y) requires size nΩ(n). Since a fan-in two circuit with depth Δ can be converted into a formula of size 2Δ, it follows that any circuit computing Paln,n(X,Y) must have depth, and therefore size, Ω(nlogn). In this paper we improve upon this lower bound for circuits with low syntactic degree. As alluded to before, our main results are the following.

Theorem 2 (Stated below as Corollary 8).

Let Ψ be a circuit computing Paln,n(X,Y) with syntactic degree O(n). Then there exists a constant c>0 such that Ψ has size Ω(n1+c).

Theorem 2 offers a qualitative strengthening of the main theorem in the work of Chatterjee and Hrubeš ([4], Theorem 14), since their result applies to homogeneous circuits while ours applies to a provably stronger class of circuits (Remark 1).

Next, we analyze the circuit size required to compute Paln,n based on the number of different syntactic degrees that appear in the circuit. We show that for a circuit if this number is o(nlogn), then it must have size ω(nlogn).

Theorem 3 (Stated below as Corollary 10).

Let Ψ be a circuit computing Paln,n(X,Y). Let d=|{d(g)g is a gate in Ψ}|. If d=o(nlogn), Ψ has size ω(nlogn).

Note that Theorem 3 applies to circuits with arbitrarily high syntactic degree. We only require that the number of distinct d(g)’s appearing in Ψ satisfies the constraints mentioned above. Also, note that if d=ω(nlogn), we trivially get an ω(nlogn) bound: if a circuit has ω(nlogn) types of gates then it must have ω(nlogn) gates. So, the only case where we do not obtain a lower bound asymptotically better than Ω(nlogn) is when d and the size of Ψ are both Θ(nlogn). We do not know if such circuits exist for Paln,n.

Theorem 3 has the following immediate corollary:

Theorem 4 (Stated below as Corollary 11).

Let Ψ be a circuit computing Paln,n(X,Y) with syntactic degree o(nlogn). Then Ψ has size ω(nlogn).

Finally, in Section 5 we analyze a construction provided in the work of Chatterjee and Hrubeš, and show a fine grained separation between homogeneous circuits and circuits with low syntactic degree.

3 Proof Idea and Notations

We use the methods of Nisan [6] to obtain our lower bounds. Specifically, we bound the dimension of the coefficient space of an X,Y separated polynomial computed by a circuit of low syntactic degree: Let f𝔽X,Y be a polynomial. We say f is X,Y separated if in each nonzero monomial of f, every X variable appears before every Y variable. Note that the Paln,d(X,Y) polynomials are X,Y separated. Now suppose f is X,Y separated. Thinking of f as a polynomial in the Y variables with coefficients that are elements of 𝔽X, we write

f=mYcmm

where cm𝔽X. Define 𝖼𝗈𝖾𝖿𝖿𝗆(f)cm, and 𝖢𝗈𝖾𝖿𝖿X(f){𝖼𝗈𝖾𝖿𝖿𝗆(f)mY}. Recalling that the ring 𝔽X forms an 𝔽-vector space with X as a basis, we consider the dimension of the 𝔽-span of the set 𝖢𝗈𝖾𝖿𝖿X(f). For f=Paln,n(X,Y), this quantity is nn. On the other hand, Nisan, in [6], showed that if f is computed by a formula of size s, then this dimension is bounded above by sO(1). Since a circuit of size s can be converted into a formula of size 2s, his method applied directly gives a lower bound of Ω(nlogn) on the size of any circuit computing Paln,n(X,Y). We refine this analysis and show that if an X,Y separated polynomial is computed by a circuit with restrictions on the syntactic degrees appearing in it, we may obtain a better upper bound on the dimension of the span of the coefficients, by explicitly constructing a spanning set of polynomials. This gives the lower bounds. For future convenience, we set up some more notation. Let Ψ be a noncommutative circuit. Recall that for a gate g in Ψ, d(g) denotes its syntactic degree. Define DΨ{d(g)g is a gate in Ψ}. We say that a circuit Ψ is in normal form if no product gate has a child of syntactic degree 0. Since gates with syntactic degree 0 can only compute constants, we may assume without loss of generality, by pushing constants all the way down to the leaves, that a given circuit is in normal form. We allow leaves labeled by αx where α𝔽 and x is a variable. Such leaves also have syntactic degree 1.

 Remark 5.

It is unclear whether the technique of Chatterjee and Hrubeš [4] for proving lower bounds against homogeneous circuits can be extended to circuits of low syntactic degree. One indication that their approach does not directly generalize is that, as noted in Remark 1 and further discussed in Section 5, the polynomial for which they prove an Ω(n2) homogeneous circuit lower bound in fact admits a small circuit of low syntactic degree.

4 Lower Bound

In this section, we prove our main result. Theorems 2, 4 and 3 follow as corollaries of Theorem 6, which we prove below.

Theorem 6.

Let Ψ be a size s, fan-in 2 circuit computing Paln,d(X,Y). Let |DΨ|=d. Then, s(d1)n(d2)/(d1)2d+2.

Proof.

For simplicity, we assume without loss of generality that Ψ is in normal form. For each gate gΨ, we let g^ denote the polynomial computed at g in Ψ. For each gate g, we write

g^=g^1+g^X+g^Y+g^XY+g^other

where g^1 denotes the constant term of g^, g^X denotes the sum (with coefficient) of non-constant monomials of g^ which contain only X variables, g^Y denotes the sum (with coefficient) of non-constant monomials of g^ which contain only Y variables, g^XY denotes the sum (with coefficient) of monomials of g^ in which both X and Y variables occur and all X variables appear before all the Y variables, and g^other denotes the sum (with coefficient) of all the other monomials.

Let DΨ={l1,ld} with l1<<ld. For each k[d], define 𝒢k{gg is a gate in Ψ with d(g)=lk}. For each k[d], we will build a (reasonably small) set k𝔽X of spanning polynomials such that for each g𝒢k, 𝖢𝗈𝖾𝖿𝖿X(g^1), 𝖢𝗈𝖾𝖿𝖿X(g^X), 𝖢𝗈𝖾𝖿𝖿X(g^Y), 𝖢𝗈𝖾𝖿𝖿X(g^XY)𝗌𝗉𝖺𝗇{k} and for each t<k,tk. We build these sets iteratively. Our base case will be k=1 with 1={1,x1,,xn} and |1|=n+1. Now suppose we have already constructed l for each l<k. First, set kk1. Let 𝒫k𝒢k be the set of product gates g in Ψ with d(g)=lk. For each such gate g=g×g′′ with d(g)=lt,d(g′′)=lp, we note that t,p<k since Ψ is normal. Observe the following:

g^1 =g^1×g′′^1
g^X =g^X×g′′^X+g^X×g′′^1+g^1×g′′^X
g^Y =g^Y×g′′^Y+g^Y×g′′^1+g^1×g′′^Y
g^XY =g^X×g′′^Y+g^XY×g′′^Y+g^X×g′′^XY+g^XY×g′′^1+g^1×g′′^XY

For each such g𝒫k (with g=g×g′′), we update k as kk{g^X}{g^X×hhk1}.

Claim 7.

For k[d] and each gate g𝒢k, we have that 𝖢𝗈𝖾𝖿𝖿X(g^1), 𝖢𝗈𝖾𝖿𝖿X(g^X), 𝖢𝗈𝖾𝖿𝖿X(g^Y), 𝖢𝗈𝖾𝖿𝖿X(g^XY)𝗌𝗉𝖺𝗇{k}

Proof.

We show this by inducting on the depth of the gate g.

  • If g is a leaf, it satisfies the claim by construction of 1.

  • g is a product gate: Let g=g×g′′ with d(g)=lk. As mentioned earlier, if d(g)=lt,d(g′′)=lp then t,p<k since Ψ is normal.

    1. 1.

      𝖢𝗈𝖾𝖿𝖿X(g^1)𝗌𝗉𝖺𝗇{k}: This is true since 𝖢𝗈𝖾𝖿𝖿X(g^1) just contains a constant, 1k and 1 contains the constant 1.

    2. 2.

      𝖢𝗈𝖾𝖿𝖿X(g^X)𝗌𝗉𝖺𝗇{k}: This is true since 𝖢𝗈𝖾𝖿𝖿X(g^X)={g^X} and g^X is in k

    3. 3.

      𝖢𝗈𝖾𝖿𝖿X(g^Y)𝗌𝗉𝖺𝗇{k}: Again, this is true since 𝖢𝗈𝖾𝖿𝖿X(g^Y) is just a set of field constants, 1k and 1 contains the constant 1.

    4. 4.

      𝖢𝗈𝖾𝖿𝖿X(g^XY)𝗌𝗉𝖺𝗇{k}: Consider a monomial m𝔽Y with coefficient 𝖼𝗈𝖾𝖿𝖿m(g^XY) (which is an element of 𝔽X). We write 𝖼𝗈𝖾𝖿𝖿m(g^XY)=𝖼𝗈𝖾𝖿𝖿m(g^X×g′′^Y)+𝖼𝗈𝖾𝖿𝖿m(g^XY×g′′^Y)+𝖼𝗈𝖾𝖿𝖿m(g^X×g′′^XY)+𝖼𝗈𝖾𝖿𝖿m(g^XY×g′′^1)+𝖼𝗈𝖾𝖿𝖿m(g^1×g′′^XY). It suffices to show that each term in this expansion is individually in 𝗌𝗉𝖺𝗇{k}.

      • Observe that 𝖼𝗈𝖾𝖿𝖿m(g^X×g′′^Y) is a scalar multiple of g^X which is in k (1k1 and so g^X×1k).

      • Next, 𝖼𝗈𝖾𝖿𝖿m(g^XY×g′′^Y) is a linear combination of coefficients of the form 𝖼𝗈𝖾𝖿𝖿m(g^XY) where m is a prefix of m. By induction, for each such m we have 𝖼𝗈𝖾𝖿𝖿m(g^XY)𝗌𝗉𝖺𝗇{t}𝗌𝗉𝖺𝗇{k} and so 𝖼𝗈𝖾𝖿𝖿m(g^XY×g′′^Y)𝗌𝗉𝖺𝗇{k}.

      • Further, 𝖼𝗈𝖾𝖿𝖿m(g^X×g′′^XY) is just g^X×𝖼𝗈𝖾𝖿𝖿m(g′′^XY). Since we have 𝖼𝗈𝖾𝖿𝖿m(g′′^XY)𝗌𝗉𝖺𝗇{p}𝗌𝗉𝖺𝗇{k1} by induction, 𝖼𝗈𝖾𝖿𝖿m(g^X×g′′^XY)𝗌𝗉𝖺𝗇{k}, by construction, as desired.

      • Finally, by induction, 𝖼𝗈𝖾𝖿𝖿m(g^XY) and 𝖼𝗈𝖾𝖿𝖿m(g′′^XY) are in span{t}span{k} and span{p}span{k} respectively, and therefore so are 𝖼𝗈𝖾𝖿𝖿m(g^XY×g′′^1) and 𝖼𝗈𝖾𝖿𝖿m(g^1×g′′^XY).

  • g is a sum gate: Let g=g+g′′ with d(g)=lk, d(g)=lk and d(g′′)=lt with tk. We assume the claim for g and g′′, by induction on depth. That is, we assume that 𝖢𝗈𝖾𝖿𝖿X(g^1),𝖢𝗈𝖾𝖿𝖿X(g^X),𝖢𝗈𝖾𝖿𝖿X(g^Y),𝖢𝗈𝖾𝖿𝖿X(g^XY)𝗌𝗉𝖺𝗇{k} and 𝖢𝗈𝖾𝖿𝖿X(g′′^1), 𝖢𝗈𝖾𝖿𝖿X(g′′^X), 𝖢𝗈𝖾𝖿𝖿X(g′′^Y), 𝖢𝗈𝖾𝖿𝖿X(g′′^XY)𝗌𝗉𝖺𝗇{t}𝗌𝗉𝖺𝗇{k}. The claim then follows from the fact that g^1=g^1+g′′^1, g^X=g^X+g′′^X, g^Y=g^Y+g′′^Y and g^XY=g^XY+g′′^XY.

This finishes the proof Claim 7

Let us now bound |d|. For each k[d], define sk|𝒫k|. Note that k=1dsk is at most s, the size of Ψ. Observe that for each k2, we have by construction that |k|sk(|k1|+1)+|k1|(sk+1)(|k1|+1). Define, for each 2kd, bk|k|+1. This fixes b1=n+2. Applying the inequality above, we get that for each 1kd, bk(sk+1)bk1+1(sk+2)bk1. Therefore, we see that bd=|d|+1(n+2)(k=2d(sk+2)).

Applying the AM-GM inequality, |d|(n+2)(s+2d2d1)d1. Since Ψ computes Paln,d, we must also have that |d|nd. This is because 𝖢𝗈𝖾𝖿𝖿X(Paln,d) is the entire set Xd of degree d monomials in X, and distinct monomials are independent in the vector space 𝔽X. Therefore,

nd/(d1) (n+2)1/(d1)(s+2d2d1)
s (d1)n(d/(d1))(log(n+2)/((d1)logn))2d+2
(d1)n(d2)/(d1)2d+2

This finishes the proof of Theorem 6.

Corollary 8 (Restatement of Theorem 2).

Let Ψ be a circuit computing the polynomial Paln,n(X,Y) with syntactic degree O(n). Then there exists a constant c>0 such that Ψ has size Ω(n1+c).

Proof.

Let d=|DΨ|. Suppose the syntactic degree of Ψ is αn for some constant α. Note that α2 since the degree of Paln,n is 2n. Also, note that by definition, dαn because d is the number of distinct syntactic degrees appearing in Ψ and also that dlog(2n)>1, because the degree of Paln,n(X,Y) is 2n and one may find a root to leaf path in the circuit on which the syntactic degree drops by a factor of at most half at each step. We split into two cases.

  • Case 1). dn/2: In this case, we apply Theorem 6 and find that s(d1)n22d+2=Ω(n2logn).

  • Case 2). n/2<dαn: In this case, we apply Theorem 6 and find that s(n/2)n(n2)/(αn1)2d+2=Ω(n1+1/α).

Since α2, we have s=Ω(n1+1/α) in both cases.

 Remark 9.

Homogeneous circuits computing Paln,n(X,Y) can be assumed to have syntactic degree 2n, and so Corollary 8 gives us a Ω(n3/2) lower bound for such circuits. Note that this is worse than the Ω(n2) bound proved by Chatterjee and Hrubeš (for a different polynomial). On the other hand, our technique works for a broader class of circuits.

Let us now give a more precise analysis based on the number of distinct syntactic degrees appearing in the circuit, i.e., on the size of |DΨ|.

Corollary 10 (Restatement of Theorem 3).

Let Ψ be a circuit computing the polynomial Paln,n(X,Y). If |DΨ|=o(nlogn), then Ψ has size ω(nlogn).

Proof.

Let the size of Ψ be s. Suppose |DΨ|=d=d(n)=o(nlogn). Define g(n)nlogn/d(n). Note that g(n)=ω(1). Applying Theorem 6, we find that

s (d1)×n(n2)/(d1)O(d)
(d/2)×nn/2dO(d) (for large enough n)
nlogn2g(n)×2g(n)/2o(nlogn)
ω(nlogn)

This finishes the proof of Corollary 10.

When the syntactic degree is slightly higher, we have the following corollary.

Corollary 11 (Restatement of Theorem 4).

Let Ψ be a circuit computing the polynomial Paln,n(X,Y) with syntactic degree o(nlogn). Then Ψ has size ω(nlogn).

Proof.

If the syntactic degree of Ψ is o(nlogn), then |DΨ|=o(nlogn) and the conclusion follows from Corollary 10

5 Separation between homogeneous and low syntactic degree circuits

In this section, we observe that low syntactic degree circuits are provably more powerful than homogeneous circuits in a fine grained sense. We will consider the ordered symmetric polynomials OSnk defined as follows:

OSnk(x1,,xn)=1i1<<iknj[k]xij

On the one hand, Chatterjee and Hrubeš showed ([4], Theorem 14) the following lower bound:

Theorem 12 ([4], Theorem 14).

Any homogeneous circuit computing OSnn/2 must have size Ω(n2).

On the other hand, they observe ([4], Section 5) that a well known construction ([2], Chapters 2.1-2.3) of circuits of size O(nlog2n) computing the commutative elementary symmetric polynomials (this construction uses the Fast Fourier Transform) continues to hold noncommutatively. Here, we observe that the syntactic degree of this (noncommutative) circuit, which simultaneously computes {OSnk}k=0n, is n.

For completeness, we present the construction here and analyze the syntactic degree of the resulting circuit. For simplicity, we work over the complex numbers, and we assume that n is a power of 2. The circuit Cn is constructed such that it simultaneously computes the set {OSnk}k=0n of polynomials. Cn will have size O(nlog2n) and syntactic degree n.

The basic observation is that for each k, OSnk is the coefficient of tnk in i=1n(xi+t). Here t commutes with the xi’s.

By induction, we assume that we have circuits Cn/21 and Cn/22 that compute the coefficients of 1,t,,tn/2 in the polynomials A(t)=i=1n/2(xi+t) and B(t)=i=n/2+1n(xi+t) respectively. We are now interested in computing the coefficients of 1,t,,tn in the polynomial A(t)B(t). In order to do this, we implement the well known FFT based algorithm for univariate polynomial multiplication. Let ω be a primitive 2n-th root of unity.

  • Evaluation: Using the Fast Fourier Transform (FFT) and the circuits Cn/21 and Cn/22 which represent coefficient vectors of A(t) and B(t) respectively, we construct two circuits Φ1 and Φ2 for computing A(1),A(ω),,A(ω2n1) and B(1),B(ω),,B(ω2n1) respectively. Importantly, the syntactic degree of Φ1 and Φ2 is exactly the same as that of Cn/21 and Cn/22, which by induction is n/2. This is because the FFT implements a linear transformation which never multiplies the entries of the coefficient vectors.

  • Pointwise Product: Next, we use Φ1 and Φ2 to construct a circuit Φ which simultaneously computes the polynomials A(1)B(1),A(ω)B(ω),, A(ω2n1)B(ω2n1). The syntactic degree of Φ is n.

  • Interpolation: Finally, we again use the (inverse) FFT to construct the circuit Cn which computes the coefficients {OSnk}k=0n of the power of t in A(t)B(t). This step does not increase the syntactic degree because it’s linear. Therefore, the syntactic degree of Cn is n.

Cn in particular computes the polynomial OSnn/2, has size O(nlog2n) ([4], [2]) and syntactic degree n. It heavily uses inhomogeneity but has low (in terms of n/2) syntactic degree. This construction therefore demonstrates the power of low syntactic degree circuits vis-à-vis homogeneous circuits.

6 Discussion

In this paper, we proved ω(nlogn) lower bounds for noncommutative circuits of low syntactic degree. The question of proving such lower bounds for general circuits remains open. One possible avenue of attack for this problem is inventing a size preserving procedure for reducing a circuit’s syntactic degree. As a concrete question, suppose we are given a circuit of size s and syntactic degree d=ω(n) that computes Paln,n. Can we construct another circuit computing Paln,n with syntactic degree O(n) and size s×no(1)? An affirmative answer to this question, combined with our work, would immediately yield strong lower bounds for general circuits. Reducing syntactic degree is no harder (and perhaps easier) than homogenization, so one may expect a more efficient way to do it than simple homogenization. Another question that stems from this work is whether we can handle separately the case when the number of distinct syntactic degrees appearing in a circuit computing Paln,n is Θ(nlogn).

References

  • [1] Walter Baur and Volker Strassen. The complexity of partial derivatives. Theoretical Computer Science, 22(3):317–330, 1983. doi:10.1016/0304-3975(83)90110-X.
  • [2] Peter Bürgisser, Michael Clausen, and Mohammad Amin Shokrollahi. Algebraic Complexity Theory. Grundlehren der mathematischen Wissenschaften. Springer Berlin Heidelberg, 1 edition, 1997. doi:10.1007/978-3-662-03338-8.
  • [3] Marco L. Carmosino, Russell Impagliazzo, Shachar Lovett, and Ivan Mihajlin. Hardness Amplification for Non-Commutative Arithmetic Circuits. In Rocco A. Servedio, editor, 33rd Computational Complexity Conference (CCC 2018), volume 102 of Leibniz International Proceedings in Informatics (LIPIcs), pages 12:1–12:16, Dagstuhl, Germany, 2018. Schloss Dagstuhl – Leibniz-Zentrum für Informatik. doi:10.4230/LIPIcs.CCC.2018.12.
  • [4] Prerona Chatterjee and Pavel Hrubeš. New Lower Bounds Against Homogeneous Non-Commutative Circuits. In Amnon Ta-Shma, editor, 38th Computational Complexity Conference (CCC 2023), volume 264 of Leibniz International Proceedings in Informatics (LIPIcs), pages 13:1–13:10, Dagstuhl, Germany, 2023. Schloss Dagstuhl – Leibniz-Zentrum für Informatik. doi:10.4230/LIPIcs.CCC.2023.13.
  • [5] Hervé Fournier, Nutan Limaye, Srikanth Srinivasan, and Sébastien Tavenas. On the power of homogeneous algebraic formulas. In Proceedings of the 56th Annual ACM Symposium on Theory of Computing, STOC 2024, pages 141–151, New York, NY, USA, 2024. Association for Computing Machinery. doi:10.1145/3618260.3649760.
  • [6] Noam Nisan. Lower bounds for non-commutative computation. In Shafi Goldwasser, editor, Proceedings of the Twenty-Third Annual ACM Symposium on Theory of Computing, May 6-8, 1991, New Orleans, Louisiana, USA, pages 410–418. ACM, 1991. doi:10.1145/103418.103463.
  • [7] Volker Strassen. Die berechnungskomplexität von elementarsymmetrischen funktionen und von interpolationskoeffizienten. Numerische Mathematik, 20(3):238–251, June 1973. doi:10.1007/BF01436566.