Algebraic and Analytic Methods in Computational Complexity
Abstract
Computational Complexity is concerned with the resources that are required for algorithms to detect properties of combinatorial objects and structures. It has often proven true that the best way to argue about these combinatorial objects is by establishing a connection (perhaps approximate) to a more well-behaved algebraic setting. Indeed, many of the deepest and most powerful results in Computational Complexity rely on algebraic proof techniques.
The algebraic theme continues to play a central role in some of the most exciting recent progress in computational complexity in areas like circuit complexity, polynomial identity testing, pseudorandomness and derandomization, or error correcting codes.
Beside algebraic methods, analytic methods like Fourier analysis have been used for quite some time in theoretical computer science. These methods have been used recently in some breakthrough results in complexity theory in areas like hardness of approximation, quantum computation, and scaling algorithms.
A new theme that has gained importance in the last years is the area of meta-complexity, that is, the complexity of computational problems that are themselves problems about the complexity of computation. Meta-complexity provides a link between a variety of important areas like circuit complexity, proof complexity, cryptography, and learning theory.
These new directions were in the focus of the Dagstuhl Seminar and reflect the developments in the field since the previous Dagstuhl Seminar 22371. Taking the recent exciting developments outlined above into account, we also included analytic methods this time.
This Dagstuhl Seminar aimed to capitalize on recent progress and brought together researchers who are using a diverse array of algebraic and analytic methods in a variety of settings. Researchers in these areas are relying on ever more sophisticated and specialized mathematics and we hope that this seminar has contributed in educating a diverse community about the latest new techniques.
Keywords and phrases:
Algebraic complexity theory, circuit complexity, pseudorandomnes and derandomization, error correcting codesSeminar:
September 15–20, 2024 – https://www.dagstuhl.de/243812012 ACM Subject Classification:
Theory of computation Computational complexity and cryptography ; Theory of computation Algebraic complexity theory ; Theory of computation Circuit complexity ; Theory of computation Communication complexity ; Theory of computation Randomness, geometry and discrete structures ; Theory of computation Pseudorandomness and derandomization ; Theory of computation Error-correcting codesCopyright and License:
1 Executive Summary
Markus Bläser, Shubhangi Saraf, Ronen Shaltiel, and Jacobo Torán
License:
Creative Commons BY 4.0 International license © Markus Bläser, Shubhangi Saraf, Ronen Shaltiel, and Jacobo Torán
The seminar brought together more than 50 researchers covering
a wide spectrum of complexity theory.
The focus on algebraic methods showed the great importance of such
techniques for theoretical computer science. We had 27
talks, most of them lasting about 35 minutes, leaving ample room for
discussions. We also scheduled several longer talks of a more introductory nature.
One of the days, we also had a much appreciated open problem session.
Computational complexity is a fundamental and active area of research that has produced some of the most well known results in theoretical computer science in recent years. Here we discuss a few broad themes that represented the focus areas of our seminar, mentioning some of the given talks in these areas.
Circuit complexity
Boolean circuits are one of the most fundamental models of computation. Due to their combinatorial nature, they seem more amenable to formal analysis than the uniform models such as Turing machines. The classical lower bound techniques of Razborov and Smolensky are algebraic: they work by first approximating circuits (constant-depth circuits with AND, OR, NOT, and counting modulo prime gates) by low-degree polynomials, and then proving that certain functions (like Majority) are not well correlated with such polynomials. The Fourier expansion of a Boolean function and its representation as a real multilinear polynomial as well as other analytic tools have been added in the last years to the bag of tools used for the analysis of Boolean circuits. Several seminar talks discussed recent results on circuit complexity.
Igor Carboni Oliveira established that efficient indistinguishability obfuscation for multi-output circuits necessarily incurs an additive size overhead of under reasonable complexity assumptions.
In his talk, William Hoza studied hardness amplification in the context of two well-known “moderate” average-case hardness results for circuits.
Algebraic complexity
A class of circuits especially suitable for the use of algebraic techniques is that of arithmetic or algebraic circuits. These are circuit models that compute polynomial functions using gates performing arithmetic operations (additions, subtractions, multiplications, and divisions). Two fundamental complexity measures for arithmetic circuits are size and depth.
Robert Andrews talked about constant-depth arithmetic circuits for linear algebra problems. He explained a new algorithm for the GCD that uses a combination of polynomial interpolation and Newton’s identities for symmetric polynomials.
Vishwas Bhargava talked about read-once branching programs, a special class of arithmetic circuits. His focus was the order in which variables can appear in such branching programs.
Exponential lower bounds against sums of ordered set-multilinear branching programs were shown by Prerona Chatterjee. Her bound stays superpolynomial for degree , complementing recent results by Bhargav, Dwivedi, and Saxena.
Chris Umans talked about the group-theoretic approach to fast matrix multiplication and how to extend the group-theoretic framework to the setting of infinite groups.
In his presentation, Michael Forbes explained how to extend the recent breakthrough of Limaye, Srinivasan, and Tavenas, who gave the first super-polynomial lower bounds against low-depth algebraic circuits, to fields of small characteristic. This answers an open question by Limaye, Srinivasan, and Tavenas.
In his talk, Rafael Mendes de Oliveira surveyed the connections between Sylvester-Gallai configurations and complexity theory as well as identity testing.
Peter Bürgisser studied the computational complexity of robust generalizations of orbit problems, which amount to approximating the distance of orbits in up to a factor . These algorithms run in polynomial time if and only if a version of the famous number-theoretic abc-conjecture holds.
In his talk, Nitin Saxena showed that the border of bounded-top-fanin depth-3 circuits can be computed by polynomial-size algebraic branching programs, which has applications especially in identity testing and lower bounds.
Amir Shpilka proved that for polynomials such that divides , the bit complexity of the quotient polynomial is bounded by the sparsity of g and the sparse representation size of .
Pseudorandomness and derandomization
The theory of pseudorandomness studies explicit constructions and applications of “random-like” objects of combinatorial or algebraic type. The common feature of such objects is that they are easy to construct by random sampling, but it is a very important problem to get efficient deterministic constructions.
Arkadev Chattopadhyay discussed the power of randomness for computing total Boolean functions in two models of computation, query algorithms and -party communication protocols.
In his talk, Gil Cohen used analytic techniques to explore a more refined analysis of the zig-zag graph product, that leverages the entire spectrum of the graph.
Dean Doron studied almost Chor–Goldreich (CG) random sources, and constructed deterministic condensers with constant entropy gap for them. As a consequence, one can simulate any randomized algorithm with small failure probability using almost CG sources with no multiplicative slowdown.
Seedless condenser were studied by Eshan Chattopadhyay, proving several new results for seedless condensers in the context of three related classes of sources.
Rohit Gurjar gave a characterization of principal minor equivalence and constructed a deterministic polynomial time algorithm to check if two given matrices are principal minor equivalent, which can be viewed as some sort of identity testing.
Error correcting codes
Error-correcting codes, particularly those constructed from polynomials, lie at the heart of many of the most significant results in computational complexity, like interactive proofs, PCPs, hardness amplification, or explicit constructions.
In his talk, Mrinal Kumar showed that univariate multiplicity codes can be list-decoded up to capacity in nearly linear time.
In her talk, Noga Ron-Zewi surveyed the theory behind the recent result that Reed-Solomon codes with random evaluation points are list-decodable up to capacity with optimal list size, and then discussed how it can be extended to the whole class of polynomial ideal codes.
Jad Sidblak studied codes against channels that are computationally bounded. He constructed codes with optimal rate that are explicit (assuming circuit lower bounds).
Srikanth Srinivasan presented low-degree local correction algorithms for constant-degree polynomials from to any field.
Roei Tell talked about simulations of interactive proof systems by non-interactive proof systems. He explained how to leverage approaches from recent works in derandomization to address this problem.
Further applications of algebraic methods
In addition to the focus areas mentioned above, we had a number of talks that considered applications of algebraic methods in other areas. In particular, we had a significant number of contributions that dealt with learning problems.
The talk of Neeraj Kayal explored machine learning tasks from the perspective of algebraic complexity, particularly using techniques based on arithmetic circuit lower bounds.
Pascal Koiran gave a new constructive uniqueness theorem for tensor decomposition. As a result, he obtained the first efficient algorithm for the so-called overcomplete decomposition of generic tensors of order .
Michal Koucký presented an efficient randomized algorithm for computing hierarchical Hamming sketches, which has applications in the construction of sketches for the edit distance.
The topic of the talk by Meena Mahajan was lower bounds for the polynomial calculus over non-Boolean domains. She showed how to improve a recent work by Sokolov by using an asymmetric gadget, obtaining a stronger lower bound that works over any field.
Kilian Rothmund presented an algorithm for testing whether a graph is minimally rigid when the input graph is -free.
Rocco Servedio showed that the supremum of any centered Gaussian process can be approximated to any arbitrary accuracy by a finite dimensional Gaussian process, where the dimension of the approximator just depends on the target error.
Conclusion
The talks of the seminar ranged over a broad assortment of subjects with the underlying theme of using algebraic and analytic techniques. It was a very fruitful meeting and has, we hope, initiated new directions in research. Several participants specifically mentioned that they appreciated the particular focus on a common class of techniques (rather than end results) as a unifying theme of the seminar. We look forward to our next seminar.
2 Table of Contents
3 Overview of Talks
3.1 Constant-Depth Arithmetic Circuits for Linear Algebra Problems
Robert Andrews (University of Waterloo, CA)
License:
Creative Commons BY 4.0 International license © Robert Andrews
Joint work of: Robert Andrews, Avi Wigderson
Main reference: Robert Andrews, Avi Wigderson: “Constant-Depth Arithmetic Circuits for Linear Algebra Problems”, CoRR, Vol. abs/2404.10839, 2024.
How does one compute the greatest common divisor (GCD) of two univariate polynomials? The Euclidean algorithm provides a polynomial-time solution, and fast variants of the Euclidean algorithm can solve this problem in nearly-linear time. The GCD can also be expressed in a linear-algebraic form. Basic tasks in linear algebra like computing determinants and solving linear systems can be performed in parallel time, and this can be used to compute the GCD in parallel time. This algorithm does not take advantage of any structure present in the resulting linear systems, so in principle one could compute the GCD in parallel even faster.
In this talk, I will describe a new algorithm that computes the GCD in parallel time by using a combination of polynomial interpolation and Newton’s identities for symmetric polynomials. In fact, this algorithm can essentially be implemented as an arithmetic circuit of constant depth. Similar ideas yield constant-depth circuits to compute the resultant, Bézout coefficients, and squarefree decomposition.
3.2 Fast list decoding of univariate multiplicity codes
Mrinal Kumar (TIFR – Mumbai, IN)
License:
Creative Commons BY 4.0 International license © Mrinal Kumar
Joint work of: Ashutosh Shankar, Mrinal Kumar, Prahladh Harsha, Rohan Goyal
Univariate multiplicity codes are a family of algebraic error correcting codes that are obtained by evaluating low degree univariate polynomials and all their derivatives up to a certain order at a set of distinct input points in an underlying field. These codes are a well studied generalisation of the more well known Reed-Solomon codes and are now known to have amazing list decodable properties; specifically, they are known to be efficiently list decodable up to the so-called list decoding capacity with constant list size.
In this talk, I will discuss a recent joint work with Rohan Goyal, Prahladh Harsha and Ashutosh Shankar, where we show that these codes can be list decoded up to capacity in nearly linear time. On the way, we will talk about lattices over the univariate polynomial ring, and will see a nearly linear time algorithm for solving linear differential equations of high order.
3.3 ROABPs: To order, or not to order.
Vishwas Bhargava (University of Waterloo, CA)
License:
Creative Commons BY 4.0 International license © Vishwas Bhargava
Joint work of: Vishwas Bhargava, Anamay Tengse
ROABPs are a natural model of computation, beloved for their analogies to Boolean ROBPs, their ability to subsume many interesting subclasses, and most notably for Nisan’s criterion, which can pinpoint the ROABP-complexity for any fixed order.
However, much less is understood about finding the order, and the subclasses of ROABPs that are order-oblivious, which will be the focus of this talk.
First, we will demonstrate that determining the order of an ROABP is an -hard problem. Following that, we will explore the computational power of commutative ROABPs, particularly their ability to compute polynomials with small partial derivative dimension.
3.4 Lower bounds on the overhead of indistinguishability obfuscation
Igor Carboni Oliveira (University of Warwick – Coventry, GB)
License:
Creative Commons BY 4.0 International license © Igor Carboni Oliveira
Joint work of: Igor Carboni Oliveira, Zhenjian Lu, Noam Mazor, Rafael Pass
Main reference: Zhenjian Lu, Noam Mazor, Igor C. Oliveira, Rafael Pass: “Lower Bounds on the Overhead of Indistinguishability Obfuscation”, 2024.
We consider indistinguishability obfuscation (iO) for multi-output circuits of size , where is the number of AND/OR/NOT gates in . Under the worst-case assumption that is not contained in , we establish that there is no efficient indistinguishability obfuscation scheme that outputs circuits of size . In other words, to be secure, an efficient iO scheme must incur an additive overhead in the size of the obfuscated circuit. The hardness assumption under which this negative result holds is minimal since an optimal iO scheme with no circuit size overhead exists if is contained in .
Expanding on this result, we also rule out iO for single-output database-aided circuits with an arbitrary polynomial overhead in circuit size. This strengthens an impossibility result by Goldwasser and Rothblum (2007), which considered circuits with access to an exponential-length database that the obfuscator has oracle access to; in contrast, our impossibility result holds even w.r.t. polynomial-size databases and even w.r.t. obfuscators that may run in time polynomial in the size of the database (and thus may read the whole database).
The proof of our main result builds on a connection between obfuscation and meta-complexity put forward by Mazor and Pass (2024), and on the -hardness of circuit minimization for multi-output circuits established by Loff, Ilango, and Oliveira (2020), together with other techniques from cryptography and complexity theory.
3.5 Lower Bounds for Sums of Ordered Set-Multilinear ABPs
Prerona Chatterjee (NISER – Bhubaneswar, IN)
License:
Creative Commons BY 4.0 International license © Prerona Chatterjee
Joint work of: Prerona Chatterjee, Deepanshu Kush, Shubhangi Saraf, Amir Shpilka
Algebraic Branching Programs (ABPs) are a natural model for computing polynomials formally, and proving super-polynomial lower bounds against this model is a central question in the field of algebraic complexity theory.
In a recent work, Bhargav, Dwivedi, and Saxena (2023), showed that super-polynomial lower bounds against a sum of ordered set-multilinear branching programs – for a polynomial over variables of degree – would imply super-polynomial lower bounds against general ABPs. Following up on their work, in a joint work with Kush, Saraf and Shpilka, we prove an exponential lower bound against this model but for a polynomial over variables of degree . In fact, even though our lower bound does not remain super-polynomial when the polynomial has degree , it does remain so for a polynomial with degree as low as . Moreover, the hard polynomial is computable by an ABP, thereby suggesting that our approach would not work if one wanted to prove lower bounds against general ABPs.
In this talk, we will discuss our results in the context of proving general ABP lower bounds and also give a brief overview of the proofs. The talk is based on joint work with Deepanshu Kush, Shubhangi Saraf and Amir Shpilka.
3.6 Randomness in Query and Communication Complexity
Arkadev Chattopadhyay (TIFR – Mumbai, IN)
License:
Creative Commons BY 4.0 International license © Arkadev Chattopadhyay
Understanding the power of randomness for general algorithms is one of the central themes of the theory of computation but is still seemingly widely out of reach. We consider the power of randomness for computing total Boolean functions in two simple, very well studied models of computation: query algorithms and -party communication protocols. In the former, it is known that randomness does not give super-polynomial advantage, whereas in the latter, it does, thanks to the Equality function. While these facts are folklore by now, we argue that this doesn’t mean that we understand the power of randomness satisfactorily in these simple models.
For instance, Goos-Pitassi-Watson (2016) asked if protocols can simulate every protocol. It was not even known if contained . We (Chattopadhyay-Lovett-Vinyals, 2019) give a strong negative answer to that. In the query world, motivated by the recent work of Knop-Lovett-McGuire-Yuan (2021), we consider models that could be thought of as intermediates between query and communication protocols, by allowing the queries to be more powerful. For example, when the set of available query functions are AND (OR) functions or both AND and OR functions. In this case, we (Chattopadhyay-Dahiya-Mande-Radhakrishnan-Sanyal, 2023) show that randomness does not give significant advantage for computing functions. On the other hand, it is known that randomness provides exponential advantage in the model of parity decision trees (PDT).
We conjecture that the set of boolean functions that can be computed by poly-log depth randomized PDTs are exactly those that can be computed in poly-log depth by decision trees whose query functions are testing membership in affine spaces.
3.7 Finite matrix multiplication algorithms from infinite groups
Chris Umans (California Institute of Technology – Pasadena, US)
License:
Creative Commons BY 4.0 International license © Chris Umans
Joint work of: Chris Umans, Henry Cohn, Jonah Blasiak, Josh Grochow, Kevin Pratt
The Cohn–Umans (FOCS ’03) group-theoretic framework for matrix multiplication produces fast matrix multiplication algorithms from three subsets of a finite group satisfying a simple combinatorial condition (the Triple Product Property). The complexity of such an algorithm then depends on the representation theory of . In this paper we extend the group-theoretic framework to the setting of infinite groups. In particular, this allows us to obtain constructions in Lie groups, with favorable parameters, that are provably impossible in finite groups of Lie type (Blasiak, Cohn, Grochow, Pratt, and Umans, ITCS ’23). Previously the Lie group setting was investigated purely as an <analogue of the finite group case; a key contribution in this paper is a fully developed framework for obtaining bona fide matrix multiplication algorithms directly from Lie group constructions.
As part of this framework, we introduce “separating functions” as a necessary new design component, and show that when the underlying group is , these functions are polynomials with their degree being the key parameter. In particular, we show that a construction with “half-dimensional” subgroups and optimal degree would imply . We then build up machinery that reduces the problem of constructing optimal-degree separating polynomials to the problem of constructing a single polynomial (and a corresponding set of group elements) in a ring of invariant polynomials determined by two out of the three subgroups that satisfy the Triple Product Property. This machinery combines border rank with the Lie algebras associated with the Lie subgroups in a critical way.
We give several constructions illustrating the main components of the new framework, culminating in a construction in a special unitary group that achieves separating polynomials of optimal degree, meeting one of the key challenges. The subgroups in this construction have dimension approaching half the ambient dimension, but (just barely) too slowly. We argue that features of the classical Lie groups make it unlikely that constructions in these particular groups could produce nontrivial bounds on unless they prove . One way to get via our new framework would be to lift our existing construction from the special unitary group to , and improve the dimension of the subgroups from to .
3.8 Analytic Insights into the Zig-Zag Product and Its Friends
Gil Cohen (Tel Aviv University, IL)
License:
Creative Commons BY 4.0 International license © Gil Cohen
Joint work of: Itay Cohen, Gal Maor
Main reference: Gil Cohen, Itay Cohen, Gal Maor: “Tight Bounds for the Zig-Zag Product”, Electron. Colloquium Comput. Complex., Vol. TR24-089, 2024.
The Zig-Zag product and related graph operators, such as derandomized squaring, are inherently combinatorial, and the classical bounds on their behavior typically require a blend of combinatorics and linear algebra. However, these bounds often fail to align with experimental results.
In this talk, we will explore a more refined analysis that leverages the entire spectrum of the graph (rather than just its spectral expansion), yielding results that both match experimental observations and, in some sense, are tight. The technique is analytic, deviating from the classical techniques: for the upper bound, we utilize finite free probability, while for the lower bound, we draw on results from analytic combinatorics.
No prior knowledge is assumed. This talk is based on joint works with Itay Cohen and Gal Maor.
3.9 Condensing Unpredictability Sources via Random Walks
Dean Doron (Ben Gurion University – Beer Sheva, IL)
License:
Creative Commons BY 4.0 International license © Dean Doron
Joint work of: Dean Doron, Dana Moshkovitz, Justin Oh, David Zuckerman
We consider the following adversarial, non-Markovian, random walk on “good enough” expanders: Starting from some fixed vertex, walk according to the instructions , where each is only somewhat close to having only little entropy, conditioned on any, or a typical, prefix. The -s are not independent, meaning that the distribution of the next step depends not only on the walk’s current node, but also on the path it took to get there. We show that such walks (or certain variants of them) accumulate most of the entropy in .
More specifically, we study -s that are “almost Chor–Goldreich (CG) sources”, and for them our result gives deterministic condensers (even online ones) with constant entropy gap, which were not known to exist even for standard CG sources. As a consequence, we can simulate any randomized algorithm with small failure probability using almost CG sources with no multiplicative slowdown. For a more general notion of “unpredictability sources”, we show that a random walk with a random stopping time also results in a distribution with constant entropy gap, and that such a random stopping time is necessary.
3.10 On the Existence of Seedless Condensers: Exploring the Terrain
Eshan Chattopadhyay (Cornell University – Ithaca, US)
License:
Creative Commons BY 4.0 International license © Eshan Chattopadhyay
Joint work of: Eshan Chattopadhyay, Mohit Gurumukhani, Noam Ringach
While the existence of randomness extractors, both seeded and seedless, has been thoroughly studied for many sources of randomness, currently, very little is known regarding the existence of seedless condensers in many settings. Here, we prove several new results for seedless condensers in the context of three related classes of sources: Non-Oblivious Symbol Fixing (NOSF) sources, SHELA sources as defined by Aggarwal, Obremski, Ribeiro, Siniscalchi, and Visconti [AORSV, EUROCRYPT’20], and almost Chor-Goldreich (CG) sources as defined by Doron, Moshkovitz, Oh, and Zuckerman [DMOZ, STOC’23]. Here, we will think of these sources as a sequence of random variables on symbols where at least out of these symbols are “good” (i.e., uniformly random), denoted as a -source, and the remaining “bad” symbols may adversarially depend on these good blocks. The difference between each of these sources is realized by restrictions on the power of the adversary, with the adversary in NOSF sources having no restrictions.
Prior to our work, the only known seedless condenser upper or lower bound in these settings is due to [DMOZ, STOC’23] which explicitly constructs a seedless condenser for a restricted subset of -almost CG sources.
The following are our main results concerning seedless condensers for each of these three sources.
-
1.
When , we prove for all three classes of sources that condensing with error 0.99 above rate is impossible.
-
2.
Next, we investigate the setting of , and in particular focus on and . We show that condensing with constant error above rate is impossible for uniform NOSF sources.
-
3.
Quite surprisingly, we show the existence of excellent condensers for uniform -SHELA and uniform almost CG sources, thus proving a separation from NOSF sources. Further, we explicitly construct a condenser that outputs bits and condenses any uniform -SHELA source to entropy (with error ). Our construction is based on a new type of seeded extractor that we call output-light, which could be of independent interest. In contrast, we show that it is impossible to extract from uniform -SHELA sources.
These results extend seedless extractor lower bounds on NOSF sources (Bourgain, Kahn, Kalai, Katznelson, and Linial [BKKKL, Israel J. Math’92]) and make progress on several open question from [DMOZ, STOC’23], [AORSV, EUROCRYPT’20], and Kopparty and N [KN, RANDOM’23].
3.11 Low-Depth Algebraic Circuit Lower Bounds over Any Field
Michael A. Forbes (University of Illinois – Urbana-Champaign, US)
License:
Creative Commons BY 4.0 International license © Michael A. Forbes
The recent breakthrough of Limaye, Srinivasan and Tavenas [Limaye et al., 2022] (LST) gave the first super-polynomial lower bounds against low-depth algebraic circuits, for any field of zero (or sufficiently large) characteristic. It was an open question to extend this result to small-characteristic ([Limaye et al., 2022; Govindasamy et al., 2022; Fournier et al., 2023]), which in particular is relevant for an approach to prove superpolynomial -Frege lower bounds ([Govindasamy et al., 2022]). In this work, we prove super-polynomial algebraic circuit lower bounds against low-depth algebraic circuits over any field, with the same parameters as LST (or even matching the improved parameters of Bhargav, Dutta, and Saxena [Bhargav et al., 2022]). We give two proofs. The first is logical, showing that even though the proof of LST naively fails in small characteristic, the proof is sufficiently algebraic that generic transfer results imply the result over characteristic zero implies the result over all fields. Motivated by this indirect proof, we then proceed to give a second constructive proof, replacing the field-dependent set-multilinearization result of LST with a set-multilinearization that works over any field, by using the Binet-Minc identity [Minc, 1979].
3.12 Characterizing and Testing Principal Minor Equivalence
Rohit Gurjar (Indian Institute of Technology Bombay – Mumbai, IN)
License:
Creative Commons BY 4.0 International license © Rohit Gurjar
Joint work of: Abhranil Chatterjee, Sumanta Ghosh, Rohit Gurjar, Roshan Raj
Two matrices are said to be principal minor equivalent if they have equal corresponding principal minors of all orders. We give a characterization of principal minor equivalence and a deterministic polynomial time algorithm to check if two given matrices are principal minor equivalent.
Earlier such results were known for certain special cases like symmetric matrices, skew-symmetric matrices with 0, 1, -1-entries, and matrices with no cuts (i.e., for any non-trivial partition of the indices, the top right block or the bottom left block must have rank more than 1).
As an application, we give a deterministic polynomial-time test to check equality of two multivariate polynomials, each computed by a symbolic determinant with a rank 1 constraint on coefficient matrices.
3.13 A Technique for Hardness Amplification Against
William Hoza (University of Chicago, US)
License:
Creative Commons BY 4.0 International license © William Hoza
We study hardness amplification in the context of two well-known “moderate” average-case hardness results for circuits. First, we investigate the extent to which circuits of depth can approximate circuits of some larger depth . The case is resolved by Håstad, Rossman, Servedio, and Tan’s celebrated average-case depth hierarchy theorem [1]. Our contribution is a significantly stronger correlation bound when . Specifically, we show that there exists a linear-size circuit such that for every circuit , either has size , or else agrees with on at most a -fraction of inputs where . For comparison, Håstad, Rossman, Servedio, and Tan’s result has . Second, we consider the majority function. It is well known that the majority function is moderately hard for circuits (and stronger classes). Our contribution is a stronger correlation bound for the XOR of copies of the -bit majority function, denoted . We show that if is an circuit of size , then agrees with on at most a -fraction of inputs, where .
To prove these results, we develop a hardness amplification technique that is tailored to a specific type of circuit lower bound proof. In particular, one way to show that a function is moderately hard for circuits is to (a) design some distribution over random restrictions or random projections, (b) show that circuits simplify to shallow decision trees under these restrictions/projections, and finally (c) show that after applying the restriction/projection, is moderately hard for shallow decision trees with respect to an appropriate distribution. We show that (roughly speaking) if can be proven to be moderately hard by a proof with that structure, then XORing multiple copies of amplifies its hardness. Our analysis involves a new kind of XOR lemma for decision trees, which might be of independent interest.
References
- [1] Johan Håstad, Benjamin Rossman, Rocco A. Servedio, and Li-Yang Tan: “An average-case depth hierarchy theorem for Boolean circuits”, Journal of the ACM 64.5 (2017), Art. 35, 27. https://doi.org/10.1145/3095799
3.14 Towards a practical theory of Unsupervised Learning
Neeraj Kayal (Microsoft Research India – Bangalore, IN)
License:
Creative Commons BY 4.0 International license © Neeraj Kayal
Joint work of: Pritam Chandra, Ankit Garg, Neeraj Kayal, Kunal Mittal, Tanmay Sinha
Unsupervised learning seeks to build machines that learn/understand the world primarily from unlabelled data. In our case our data is a set of points S in n-dimensional space .
In this talk, we first consider the setting wherein S is a sample from an unknown parametric distribution and the task is to learn the parameters of a distribution on points in . The distribution is that which is induced on the n outputs of a circuit with () inputs. We will see that this formulation captures some well-studied problems such as subspace clustering and learning mixtures of Gaussians. Our algorithmic techniques based on arithmetic circuit lower bounds combined with some recent powerful work on singular values of structured random matrices yield *smoothed* polynomial-time algorithms for these problems.
In the next part of this talk, I will describe a new approach towards unsupervised learning on real-world data. Our aim is to formulate an approach which will have many capabilities including out-of-distribution generalization, continual learning, construction of concept hierarchies, shareability of understanding and robustness to adversarial perturbation. It is based on try to learn the structure underlying the datapoints.
We hypothesize that the structure underlying the data is some sort of iterated combination of a few basic “structural patterns” after suitable augmentations of the data points. Such structural patterns include subspaces which almost contain a large number of points, clusters (under some appropriate distance function) and periodicity of occurrence of data points. Unsupervised learning then entails discovering the augmentations needed and discovering the parameters of the underlying structural patterns. We will illustrate this with the example of unsupervised learning for optical character recognition.
3.15 An efficient uniqueness theorem for overcomplete tensor decomposition
Pascal Koiran (ENS – Lyon, FR)
License:
Creative Commons BY 4.0 International license © Pascal Koiran
Main reference: Pascal Koiran: “An efficient uniqueness theorem for overcomplete tensor decomposition”, CoRR, Vol. abs/2404.07801, 2024.
We give a new, constructive uniqueness theorem for tensor decomposition. It applies to order tensors of format and can prove uniqueness of decomposition for generic tensors up to rank as soon as . One major advantage over Kruskal’s uniqueness theorem is that our theorem has an algorithmic proof, and the resulting algorithm is efficient. Like the uniqueness theorem, it applies in the range . As a result, we obtain the first efficient algorithm for overcomplete decomposition of generic tensors of order . For instance, prior to this work it was not known how to efficiently decompose generic tensors of format n×n×n and rank (or rank , for some constant ). Efficient overcomplete decomposition of generic tensors of format remains an open problem. Our results are based on the method of commuting extensions pioneered by Strassen for the proof of his lower bound on tensor rank and border rank. In particular, we rely on an algorithm for the computation of commuting extensions recently proposed in a companion paper, and on the classical diagonalization-based “Jennrich algorithm” for undercomplete tensor decomposition.This is a dummy text.
3.16 Hierarchical Hamming Sketch
Michal Koucký (Charles University – Prague, CZ)
License:
Creative Commons BY 4.0 International license © Michal Koucký
Joint work of: Michal Koucký, Michael Saks
In sketching of Hamming distance we want to build a sketch and recovery algorithms such that given a parameter , for any string , the sketch algorithm computes sketch of . Given two sketches and we want to compute the exact Hamming distance of and provided it is less than . Moreover, we also want the recovery algorithm to output the content of the differing positions between and .
The hierarchical Hamming sketch is a special variant of this problem, where input strings are divided into blocks (or hierarchy of blocks) and from sketches of two strings we want to recover the differences in blocks that do not contain too many differences. Namely, the recovery algorithm should ignore blocks that differ a lot.
This has applications in a construction of sketches for edit distance.
In this talk I will present the concept and show a particular randomized implementation.
3.17 New lower bounds for Polynomial Calculus over non-Boolean bases
Meena Mahajan (The Institute of Mathematical Sciences – Chennai, IN)
License:
Creative Commons BY 4.0 International license © Meena Mahajan
Joint work of: Yogesh Dahiya, Meena Mahajan, Sasank Mouli
The Polynomial Calculus PC is an algebraic proof system based on Hilbert’s Nullstellensatz, in which the unsatisfiability of polynomial equations in general, and CNF formulas in particular, can be certified. While several size lower bounds have long been known for PC over 0-1 encodings of Boolean variables, no lower bounds were known over the +1/-1 encoding until a recent work by Sokolov in STOC 2020; he established an exponential lower bound over the reals using a lifting technique with a symmetric gadget. We show how to use an asymmetric gadget and obtain a stronger lower bound that works over any field. In a different setting, we also strengthen the lower bounds obtained recently by Impagliazzo, Mouli, Pitassi, in CCC 2023: over finite fields, when extension variables are allowed, we obtain size lower bounds in terms of the number and arity of the extension axioms. In both these results, the notion of quadratic degree introduced by Sokolov plays a crucial role. The talk gives an overview of the setting, the results, and the techniques.
3.18 Sylvester-Gallai Configurations & Algebraic Complexity
Rafael Mendes de Oliveira (University of Waterloo, CA)
License:
Creative Commons BY 4.0 International license © Rafael Mendes de Oliveira
Joint work of: Rafael Oliveira, Abhibhav Garg, Akash Sengupta, Amir Shpilka, Shir Peleg
In this talk we survey the connections between Sylvester-Gallai (SG) configurations and complexity theory, starting from the connection between SG configurations and the Polynomial Identity Testing (PIT) problem given in [2] and the robust versions of linear SG configurations and its connections to coding theory [1], to the non-linear generalizations of the SG problem conjectured by [4].
References
- [1] Barak, Boaz and Dvir, Zeev and Yehudayoff, Amir and Wigderson, Avi: “Rank Bounds for Design Matrices with Applications to Combinatorial Geometry and Locally Correctable Codes”, Forty-Third Annual ACM Symposium on Theory of Computing, 2011
- [2] Dvir, Zeev and Shpilka, Amir: “Locally decodable codes with two queries and polynomial identity testing for depth 3 circuits”, SIAM Journal on Computing, 2007.
- [3] Garg, Abhibhav and Oliveira, Rafael and Sengupta, Akash K: “Uniform Bounds on Product Sylvester-Gallai Configurations”, Manuscript, 2024.
- [4] Gupta, Ankit: “Algebraic Geometric Techniques for Depth-4 PIT & Sylvester-Gallai Conjectures for Varieties”, Electron. Colloquium Comput. Complex., 2014.
- [5] Oliveira, Rafael and Sengupta, Akash K: “Radical Sylvester-Gallai theorem for cubics”, FOCS 2022.
- [6] Oliveira, Rafael and Sengupta, Akash K: “Strong Algebras & Radical Sylvester-Gallai Configurations”, STOC 2024.
- [7] Peleg, Shir and Shpilka, Amir: “Polynomial time deterministic identity testing algorithm for circuits via Edelstein-Kelly type theorem for quadratic polynomials”, 53rd Annual ACM SIGACT Symposium on Theory of Computing, 2020.
- [8] Peleg, Shir and Shpilka, Amir: “Robust Sylvester-Gallai Type Theorem for Quadratic Polynomials”, SoCG 2022.
- [9] Peleg, Shir and Shpilka, Amir: “A generalized Sylvester–Gallai-type theorem for quadratic polynomials”, Forum of Mathematics Sigma, 2022.
- [10] Saxena, Nitin and Seshadhri, Comandur: “From Sylvester-Gallai configurations to rank bounds: Improved blackbox identity test for depth-3 circuits”, Journal of the ACM (JACM).
- [11] Shpilka, Amir: “Sylvester-Gallai type theorems for quadratic polynomials”, Discrete Analysis, 2020.
3.19 Efficient List-decoding of Polynoimal Codes with Optimal List Size
Noga Ron-Zewi (University of Haifa, IL)
License:
Creative Commons BY 4.0 International license © Noga Ron-Zewi
Joint work of: Noga Ron-Zewi, S. Venkitesh, and Mary Wootters
Main reference: : “Efficient List-decoding of Polynomial Ideal Codes with Optimal List Size”, CoRR, Vol. abs/2401.14517, 2024.
In a recent breakthrough, Brakensiek, Gopi, and Makam showed that Reed-Solomon codes with random evaluation points are list-decodable up to capacity with optimal list size with high probability. This result followed by developing a new beautiful theory of higher-order MDS codes, which extends the classical notion of MDS codes to the list-decoding setting.
We extend this result to the family of polynomial ideal codes, a large class of error-correcting codes which includes several well-studied families of codes such as Reed-Solomon, folded Reed-Solomon, and multiplicity codes. As a corollary, this gives the first family of error-correcting codes that is efficiently list-decodable with optimal list size.
In the talk, I will survey the theory behind the result for Reed-Solomon codes, and discuss how it can be extended to the whole class of polynomial ideal codes.
3.20 Complexity of Robust Orbit Problems for Torus Actions and the abc-Conjecture
Peter Bürgisser (TU Berlin, DE)
License:
Creative Commons BY 4.0 International license © Peter Bürgisser
Joint work of: Peter Bürgisser, Mahmut Levent Doğan, Visu Makam, Michael Walter, Avi Wigderson
When a group acts on a set, it naturally partitions it into orbits, giving rise to orbit problems. These are natural algorithmic problems, as symmetries are central in numerous questions and structures in physics, mathematics, computer science, optimization, and more.
In 2021 we gave the first polynomial-time algorithms for orbit problems of torus actions, that is, actions of connected commutative continuous groups on Euclidean space. In this work, we study the computational complexity of robust generalizations of these orbit problems, which amount to approximating the distance of orbits in up to a factor .
On the one hand, we prove the NP-hardness of this problem for by reducing the closest vector problem for lattices to it. On the other hand, we describe algorithms for solving this problem for an approximation factor .
Our algorithms combine tools from invariant theory and algorithmic lattice theory, and they also provide group elements witnessing the proximity of the given orbits (in contrast to the algebraic algorithms of prior work). We prove that they run in polynomial time if and only if a version of the famous number-theoretic abc-conjecture holds – establishing a new and surprising connection between computational complexity and number theory.
3.21 Minimal graph rigidity is in NC for -free and -free graphs
Kilian Rothmund (Hochschule Aalen, DE)
License:
Creative Commons BY 4.0 International license © Kilian Rothmund
Joint work of: Rohit Gurjar, Kilian Rothmund, Thomas Thierauf
A graph is said to be minimal rigid in if it admits a so-called first order rigid embedding in the plane with the minimum number of edges. Graph rigidity can be formulated as a polynomial identity test [1]. We consider two problems: First, we want to decide whether a graph is minimally rigid. Second, we want to compute first order rigid embeddings for minimal rigid graphs. We present algorithms for the two problems when the input graph is -free. We also present an algorithm for the first problem when the input graph is -free. We start with the observation that both problems are in for planar graphs. This follows from a geometric characterization of planar minimally rigid graphs that has been discovered by [2] and [3].
References
- [1] Orit E. Raz, Avi Wigderson: “Subspace arrangements, graph rigidity and derandomization through submodular optimization”, https://arxiv.org/abs/1901.09423.
- [2] Ileana Streinu: “Pseudo-triangulations, rigidity and motion planning”, Discrete Computational Geometry, 34(4):587–635, September 2005.
- [3] Ruth Haas, David Orden, Günter Rote, Francisco Santos, Brigitte Servatius, Hermann Servatius, Diane Souvaine, Ileana Streinu, and Walter Whiteley: “Planar minimally rigid graphs and pseudo-triangulations”, Computational Geometry, 2005.
3.22 Surveying the border and its demystification
Nitin Saxena (Indian Institute of Technology Kanpur, IN)
License:
Creative Commons BY 4.0 International license © Nitin Saxena
Joint work of: Nitin Saxena, C.S. Bhargav, Pranjal Dutta, Prateek Dwivedi
Border (or approximative) complexity of polynomials plays an integral role in algebraic algorithms and the GCT approach to . This raises an important open question: can a border circuit be *efficiently* debordered (i.e. convert from approximative to exact)? Or, could the approximation involve exponential-precision which may not be efficiently simulable?
Circuits of depth 3 or 4, are a good testing ground for this question; although, it is unclear how such circuits can be presented *in practice* if at all. We would discuss our new presentable-border definition and its application in circuit factorization problems that were open for quite some time.
In shallow depths, Kumar (ToCT’20) proved the universal power of the border of top-fanin-2 depth-3 circuits. We recently solved some of the related open questions. In this talk we will outline our result: border of bounded-top-fanin depth-3 circuits is relatively easy– it can be computed by a polynomial-size algebraic branching program (ABP). Our de-bordering paradigm has many applications, especially in identity testing and lower bounds.
3.23 Sparsifying suprema of Gaussian processes
Rocco Servedio (Columbia University – New York, US)
License:
Creative Commons BY 4.0 International license © Rocco Servedio
Joint work of: Anindya De, Shivam Nadimpalli, Ryan O’Donnell, Rocco Servedio
We show that the supremum of any centered Gaussian process can be approximated to any arbitrary accuracy by a finite dimensional Gaussian process, where the dimension of the approximator is just dependent on the target error. As a corollary, we show that for any norm defined over and target error , there is a norm such that (i) is only dependent on dimensions, and (ii) with probability when is sampled from the Gaussian space. We prove a similar-in-spirit result for sparsifying high-dimensional polytopes in Gaussian space, and present applications to computational learning and property testing. The proof relies on Talagrand’s majorizing measures theorem.
3.24 New Bounds on Quotient Polynomials with Applications to Exact Divisibility of Sparse Polynomials
Amir Shpilka (Tel Aviv University, IL)
License:
Creative Commons BY 4.0 International license © Amir Shpilka
Joint work of: Ido Nahshon, Amir Shpilka
We prove that for polynomials in such that divides , the bit complexity of the quotient polynomial is bounded by the sparsity of g and the sparse representation size of . This result improves upon previous bounds (Gel’fond’49, Mignotte’74), which were linear in and known to be tight in the general case.
Our result implies that the trivial long division algorithm runs in quasi-linear time relative to the input size and the number of terms of the quotient polynomial , thereby solving a long-standing problem on the exact divisibility of sparse polynomials.
3.25 Explicit Codes for Poly-Size Circuits and Functions that are Hard to Sample on Low Entropy Distributions
Jad Silbak (Northeastern University – Boston, US)
License:
Creative Commons BY 4.0 International license © Jad Silbak
Joint work of: Ronen Shaltiel, Jad Silbak
Guruswami and Smith (J. ACM 2016) considered codes for channels that are computationally bounded which modify at most a p-fraction of the bits of the codeword. This class of channels is significantly stronger than Shannon’s binary symmetric channel which flips each bit independently with probability p, but weaker than Hamming’s channels which may flip any p-fraction of bits and are computationally unbounded.
Recently, there has been a growing body of work that aims to construct codes against channels that are computationally bounded (e.g., bounded memory channels, channels that are poly-size circuits). In this work, we consider bounded size channels and construct codes that:
-
Achieve an optimal rate of (matching the rate of binary symmetric channels, and beating the rate of Hamming channels).
-
Are explicit, assuming E does not have a size nondeterministic circuits.
Achieving these codes implies circuit lower bounds (and therefore explicit constructions need to be based on hardness assumptions). This result builds on the recent result by Shaltiel and Silbak (FOCS 2022) that gave a randomized Monte-Carlo construction, rather than explicit codes.
A key component in our codes (that we believe to be of independent interest) is a new complexity theoretic notion of hard to sample functions (HTS). Loosely speaking, a function f is HTS for circuits of size , if for every randomized circuit A of size that samples a distribution such that has sufficiently large min-entropy, it holds that is small. This notion is inspired by a related notion introduced by Viola (SICOMP 2012) in which is the uniform distribution and is similar in flavor to the definition of multi-collision-resistant hash functions. Building on classical works on “hardness amplification” (and using many additional tools and ideas from pseudorandomness) we construct HTS functions under hardness assumptions.
3.26 Low Degree Local Correction Over the Boolean Cube
Srikanth Srinivasan (University of Copenhagen, DK)
License:
Creative Commons BY 4.0 International license © Srikanth Srinivasan
Joint work of: Prashanth Amireddy, Amik Raj Behera, Manaswi Paraashar, Srikanth Srinivasan, Madhu Sudan
We present low-degree local correction algorithms for constant-degree polynomials from to any field (or more generally any Abelian group). The main new contributions are:
-
1.
A local-correction algorithm making many queries correcting codewords within the unique decoding radius.
-
2.
A combinatorial list-decoding bound, showing that there are not too many codewords in a ball of radius less than the distance of the code (best possible),
-
3.
A local list-correction algorithm that outputs all the codewords in a ball of radius less than the distance of the code.
3.27 Fiat-Shamir in the Plain Model from Derandomization (Or: Do Efficient Algorithms Believe that NP = PSPACE?)
Roei Tell (University of Toronto, CA)
License:
Creative Commons BY 4.0 International license © Roei Tell
Joint work of: Lijie Chen, Ron Rothblum, Roei Tell
A classical challenge in complexity theory and cryptography is to simulate interactive proof systems by non-interactive proof systems. In this work we leverage approaches from recent works in derandomization to address this challenge, focusing on non-interactive simulations that are sound against uniform adversarial algorithms.
Our results concern fundamental questions in complexity theory, such as the vs question, and also in cryptography, such as the question of constructing non-interactive zero-knowledge arguments for from unstructured assumptions. Relying on strong complexity-theoretic hardness assumptions (that will be described below):
-
1.
Complexity theory. We prove that is contained in the “computationally sound” version of . Specifically, for every , membership in can be verified by an -type (deterministic, polynomial-time) verifier with the following guarantee: The verifier accepts every when given a proof from an honest prover that runs in fixed exponential time ; and every uniform adversary running in probabilistic time cannot find and such that , except with negligible probability in . As a corollary in the area of bounded arithmetic, under the same assumptions, we deduce that is not provable in the theory . This is a strong theory, which captures many of the major results in complexity.
-
2.
Cryptography. We construct new cryptographic protocols, including succinct non-interactive arguments (s) for in the plain model, as well as non-interactive zero-knowledge and witness indistinguishable ( and ) proof systems for , all with computational soundness against uniform adversaries. The relies solely on the aforementioned complexity-theoretic assumption, whereas the and require also a sub-exponentially secure one-way function (which should be injective in the case of the ). These are the first constructions of the above protocols that do not rely on highly structured cryptographic primitives.
Roughly speaking, following Chen and Tell (FOCS 2021, STOC 2023), the complexity-theoretic hardness assumptions throughout our paper assert the existence of functions that are computable in polynomial time and hard for bounded-space machines (say, linear space) in a strong average-case sense: No efficient algorithm can find an input on which the bounded-space machine computes , except with negligible probability.
4 Participants
-
Robert Andrews – University of Waterloo, CA
-
Vikraman Arvind – The Institute of Mathematical Sciences – Chennai, IN
-
Vishwas Bhargava – University of Waterloo, CA
-
Markus Bläser – Universität des Saarlandes – Saarbrücken, DE
-
Andrej Bogdanov – University of Ottawa, CA
-
Peter Bürgisser – TU Berlin, DE
-
Harry Buhrman – University of Amsterdam, NL
-
Igor Carboni Oliveira – University of Warwick – Coventry, GB
-
Prerona Chatterjee – NISER – Bhubaneswar, IN
-
Arkadev Chattopadhyay – TIFR – Mumbai, IN
-
Eshan Chattopadhyay – Cornell University – Ithaca, US
-
Gil Cohen – Tel Aviv University, IL
-
Radu Curticapean – Universität Regensburg, DE
-
Dean Doron – Ben Gurion University – Beer Sheva, IL
-
Klim Efremenko – Ben Gurion University – Beer Sheva, IL
-
Michael A. Forbes – University of Illinois – Urbana-Champaign, US
-
Anna Gál – University of Texas – Austin, US
-
Rohit Gurjar – Indian Institute of Technology Bombay – Mumbai, IN
-
Shuichi Hirahara – National Institute of Informatics – Tokyo, JP
-
William Hoza – University of Chicago, US
-
Christian Ikenmeyer – University of Warwick – Coventry, GB
-
Valentine Kabanets – Simon Fraser University – Burnaby, CA
-
Neeraj Kayal – Microsoft Research India – Bangalore, IN
-
Pascal Koiran – ENS – Lyon, FR
-
Antonina Kolokolova – Memorial University of Newfoundland – St. John’s, CA
-
Michal Koucký – Charles University – Prague, CZ
-
Donald Kougang Yombi – AIMS Rwanda – Kigali, RW
-
Mrinal Kumar – TIFR Mumbai, IN
-
Nutan Limaye – IT University of Copenhagen, DK
-
Meena Mahajan – The Institute of Mathematical Sciences – Chennai, IN
-
Or Meir – University of Haifa, IL
-
Rafael Mendes de Oliveira – University of Waterloo, CA
-
Noga Ron-Zewi – University of Haifa, IL
-
Kilian Rothmund – Hochschule Aalen, DE
-
Rahul Santhanam – University of Oxford, GB
-
Ramprasad Saptharishi – TIFR Mumbai, IN
-
Shubhangi Saraf – University of Toronto, CA
-
Nitin Saxena – Indian Institute of Technology Kanpur, IN
-
Rocco Servedio – Columbia University – New York, US
-
Ronen Shaltiel – University of Haifa, IL
-
Amir Shpilka – Tel Aviv University, IL
-
Jad Silbak – Northeastern University – Boston, US
-
Srikanth Srinivasan – University of Copenhagen, DK
-
Sathyawageeswar Subramanian University of Cambridge, GB
-
Amnon Ta-Shma – Tel Aviv University, IL
-
Sébastien Tavenas – University Savoie Mont Blanc – Le Bourget-du-Lac, FR
-
Roei Tell – University of Toronto, CA
-
Thomas Thierauf – Hochschule Aalen, DE
-
Jacobo Torán – Universität Ulm, DE
-
Christopher Umans – California Institute of Technology – Pasadena, US
-
Jeroen Zuiddam – University of Amsterdam, NL