Multiparty Communication Complexity of Collision-Finding and Cutting Planes Proofs of Concise Pigeonhole Principles

Beame, Paul; Whitmeyer, Michael

doi:10.4230/LIPIcs.ICALP.2025.21

Multiparty Communication Complexity of Collision-Finding and Cutting Planes Proofs of Concise Pigeonhole Principles

Paul Beame

University of Washington, Seattle, WA, USA Michael Whitmeyer

University of Washington, Seattle, WA, USA

Abstract

We prove several results concerning the communication complexity of a collision-finding problem, each of which has applications to the complexity of cutting-plane proofs, which make inferences based on integer linear inequalities.

In particular, we prove an $\Omega(n^{1-1/k}\log k\ /2^{k})$ lower bound on the $k$ -party number-in-hand communication complexity of collision-finding. This implies a $2^{n^{1-o(1)}}$ lower bound on the size of tree-like cutting-planes refutations of the bit pigeonhole principle CNFs, which are compact and natural propositional encodings of the negation of the pigeonhole principle, improving on the best previous lower bound of $2^{\Omega(\sqrt{n})}$ . Using the method of density-restoring partitions, we also extend that previous lower bound to the full range of pigeonhole parameters.

Finally, using a refinement of a bottleneck-counting framework of Haken and Cook and Sokolov for DAG-like communication protocols, we give a $2^{\Omega(n^{1/4})}$ lower bound on the size of fully general (not necessarily tree-like) cutting planes refutations of the same bit pigeonhole principle formulas, improving on the best previous lower bound of $2^{\Omega(n^{1/8})}$ .

Keywords and phrases:

Proof Complexity, Communication Complexity

Category:

Track A: Algorithms, Complexity and Games

Copyright and License:

2012 ACM Subject Classification:

Mathematics of computing

Related Version:

Full Version: https://eccc.weizmann.ac.il/report/2025/057/ [3]

Funding:

Research supported by NSF grants CCF-2006359 and CCF-2422205.

DOI:

10.4230/LIPIcs.ICALP.2025.21

Event:

52nd International Colloquium on Automata, Languages, and Programming (ICALP 2025)

Editors:

Keren Censor-Hillel, Fabrizio Grandoni, Joël Ouaknine, and Gabriele Puppis

Series and Publisher:

Leibniz International Proceedings in Informatics, Schloss Dagstuhl – Leibniz-Zentrum für Informatik

1 Introduction

The pigeonhole principle, which asserts that there is no injective function $f:[m]\to[n]$ for $m>n$ , is a cornerstone problem in the study of proof complexity. It is typically encoded as unsatisfiable conjunctive normal form formula (CNF), henceforth denoted $\mathrm{PHP}^{m}_{n}$ , on the variables $y_{i,j}$ , each of which is an indicator that “pigeon” $i$ is mapped to “hole” $j$ .

It is well known that any refutation of $\mathrm{PHP}^{n+1}_{n}$ using resolution proofs requires size $2^{\Omega(n)}$ [12] and the same asymptotic bound holds for all $m$ that are $O(n)$ [6]. On the other hand, if we allow our proof system to reason about linear inequalities (for example using cutting-planes proofs), then it is easy to see that refuting $\mathrm{PHP}^{n+1}_{n}$ becomes easy – indeed, there exist polynomial size refutations of $\mathrm{PHP}^{n+1}_{n}$ .

Despite the pigeonhole principle having short cutting-planes refutations, the related clique-coloring formulas, which state that a graph cannot have both $k$ -cliques and $k-1$ -colorings, requires exponential-size cutting-planes refutation [24].¹¹1Lower bounds for restricted cutting-planes refutations of these formulas were earlier shown in [17, 4] The clique-coloring formula can be viewed as a kind of indirect pigeonhole principle: The $k$ nodes of the clique correspond to the pigeons and $k-1$ colors correspond to the holes, but the representation of possible mappings is quite indirect.

It is natural to wonder about the extent to which indirection is required for the pigeonhole principle to be hard for cutting-planes reasoning. As part of studying techniques for cutting-planes proofs, Hrubeš and Pudlák [16] considered a very natural compact and direct way of expressing the pigeonhole principle, known as the bit or binary pigeonhole principle²²2This encoding of the pigeonhole principle was introduced in [1].. The bit pigeonhole principle analog of $\mathrm{PHP}^{m}_{n}$ (henceforth denoted $\mathrm{BPHP}^{m}_{n}$ ) has $m\log n$ variables $x_{i,j}$ for $i\in[m],j\in[\log n]$ and the principle asserts that, when we organize these variables as an $m\times[\log n]$ matrix, the rows of the matrix all have distinct values. $\mathrm{BPHP}^{m}_{n}$ is the following CNF formula: for each $i\neq j\in[m]$ , include the clauses of a CNF encoding that $x_{i}\neq x_{j}$ . One can achieve this by including a clause for each $\alpha\in\{0,1\}^{\log n}$ expressing that $x_{i}\neq\alpha\vee x_{j}\neq\alpha$ . The end result is a CNF with $\binom{m}{2}n$ clauses of size $2\log n$ .

Using techniques related to those of [24], Hrubeš and Pudlák [16] showed that $\mathrm{BPHP}^{m}_{n}$ requires cutting-planes refutations of size $2^{\Omega(n^{1/8})}$ for any $m>n$ , proving that even a very direct representation of the pigeonhole principle is hard for cutting-planes proofs. Their arguments, like those of Pudlák, also apply to any proof system that has proof lines consisting of integer linear inequalities with two antecedents per inference that are sound with respect to $01$ -valued variables; such proofs are known alternatively as semantic cutting-planes proofs or ${\mathsf{Th}}(1)$ proofs [2].

Recently, Dantchev, Galesi, Ghani, and Martin [8] exhibited a $2^{\Omega(n/\log n)}$ lower bound on the size of any general resolution refutation of $\mathrm{BPHP}^{m}_{n}$ for all $m>n$ . In fact, they showed that $\mathrm{BPHP}^{m}_{n}$ requires proofs of size $2^{\Omega(n^{1-\varepsilon})}$ for a more powerful class of proof systems that extend resolution by operating on $k$ -DNFs (known as $\mathrm{Res}(k)$ proofs) for $k\leq\log^{1/2-\varepsilon^{\prime}}n$ . (Note that any sound proof system operating on DNFs requires size at least $2^{n^{\Omega(1)}}$ to refute $\mathrm{PHP}^{n+1}_{n}$ [23, 20, 14].) In addition, [8] showed that $\mathrm{BPHP}^{m}_{n}$ has no refutations in the Sherali-Adams proof system [27] of size smaller than $2^{\Omega(n/\log^{2}n)}$ . Finally, just as $\mathrm{PHP}^{m}_{n}$ has polynomial-size Sum-of-Squares refutations [11], Dantchev et al. showed that $\mathrm{BPHP}^{m}_{n}$ has polynomial-sized Sum-of-Squares refutations.

Given the large lower bounds for resolution, $\mathrm{Res}(k)$ , and Sherali-Adams refutations of $\mathrm{BPHP}^{m}_{n}$ , it is natural to ask the extent to which the sub-exponential lower bounds can be improved for cutting-planes proofs; how close to a $2^{\Omega(n)}$ lower bound is possible?

1.1 Tree-like Proofs and Multiparty Communication

In prior work there has been progress towards this question for the restricted class of tree-like refutations. Tree-like proofs require that any time an inequality is used, it must be re-derived (i.e., the underlying graph of deductions is a tree); the polynomial-size cutting-planes refutations of $\mathrm{PHP}^{n+1}_{n}$ can be made tree-like. In contrast, Itsykson and Riazanov [18] showed that $\mathrm{BPHP}^{m}_{n}$ requires tree-like cutting-planes refutations of size $2^{\Omega(\sqrt{n})}$ when $m\leq n+\sqrt{n}$ .

Our first result pushes this bound almost to its limit. Specifically, we prove that any tree-like semantic cutting-planes refutation of $\mathrm{BPHP}^{m}_{n}$ requires size $2^{n^{1-o(1)}}$ whenever $m\leq n+2^{2\sqrt{\log n}-2}$ .

In order to show this, we utilize a well-known connection between tree-like refutations and communication complexity. While the results of [18] for cutting planes rely on two-party communication complexity (and number-on-forehead multiparty communication for other results that we mention below), our stronger results are based on multiparty number-in-hand communication. In particular they are based on a similar natural collision-finding communication problem ${\mathsf{Coll}}^{k}_{m,\ell}$ , in which each player $p\in[k]$ in the number-in-hand model receives an input in $x^{(p)}\in[\ell]^{m}$ , and their goal is to communicate and find a pair $i\neq j\in[m]$ such that $x^{(p)}_{i}=x^{(p)}_{j}$ for all players $p\in[k]$ . Such a communication problem is well-defined (in the sense that such a pair $i, j$ always exists) when $m>\ell^{k}$ .

This collision-finding problem is intimately related to the unsatisfiable $\mathrm{BPHP}^{m}_{n}$ formula via the following natural search problem associated with any unsatisfiable CNF formula: Given unsatisfiable CNF $\varphi$ , the associated search problem ${\mathsf{Search}}_{\varphi}$ takes as input a truth assignment $\alpha$ to the variables of $\varphi$ and requires the output of the index of a clause of $\varphi$ that is falsified by $\alpha$ . In particular the connection follows by considering a natural $k$ -party number-in-hand communication game that we denote by ${\mathsf{Search}}_{\varphi}^{k}$ wherein the assignment $\alpha$ to the variables of $\varphi$ is evenly distributed among the $k$ players. and the players must communicate to find an answer for ${\mathsf{Search}}_{\varphi}(\alpha)$ . It is not hard to see that if we have a communication protocol solving ${\mathsf{Search}}_{\mathrm{BPHP}^{m}_{n}}^{k}(\alpha)$ then such a protocol also solves ${\mathsf{Coll}}^{k}_{m,n^{1/k}}$ on input $\alpha$ .

Our first result is a lower bound on ${\mathsf{Coll}}^{k}_{m,n^{1/k}}$ that holds even when we allow randomized protocols.

Theorem 1.1.

The randomized number-in-hand communication complexity of ${\mathsf{Coll}}^{k}_{m,n^{1/k}}$ is $\Omega(n^{1-1/k}\log k\ /2^{k})$ whenever $n+1\leq m\leq n+2^{k-2}n^{1/k}$ .

We pause here to note that this bound is nearly tight. There is a deterministic protocol wherein the first player sends a subset of coordinates of size $\lceil m/n^{1/k}\rceil$ in which their inputs are all equal. This requires $\log\binom{m}{m/n^{1/k}}\lesssim(m/n^{1/k})\log(m/n^{1/k})$ bits; when $m\approx n$ , this is $O(n^{1-1/k}\log(n^{1/k}))=O(n^{1-1/k}\log n\ /k)$ bits of communication. Player two then announces a subset of these coordinates on which they are equal of size $\lceil m/n^{2/k}\rceil$ . The players can continue in this manner until they have found a collision (which is guaranteed by the pigeonhole principle). Note that the amount of communication is bounded by a geometric series, and is dominated by the first term, which results in communication $O(n^{1-1/k}\log n\ /k)$ . This shows that up to logarithmic factors and a factor of $2^{k}$ , Theorem 1.1 is tight.

We state here a simplified corollary³³3When $m$ is somewhat larger, we can obtain somewhat weaker lower bounds. of Theorem 1.1 which formalizes our lower bounds for cutting-planes refutations of $\mathrm{BPHP}^{m}_{n}$ .

Theorem 1.2.

Any tree-like semantic cutting-planes refutation of $\mathrm{BPHP}^{m}_{n}$ requires size $2^{n^{1-2/\sqrt{\log n}-o(1/\sqrt{\log n})}}$ when $m\leq n+2^{2\sqrt{\log n}-2}$ .

We remark that Itsykson and Riazanov [18] utilized the same connection between communication and proof complexity to achieve their results. They were also interested in a $k$ -party number-on-forehead version of ${\mathsf{Coll}}^{k}_{m,\ell}$ (in particular, in their version, the matrices are added rather than concatenated), which leads to weaker lower bounds in stronger proof systems ${\mathsf{Th}}(k-1)$ that manipulate degree $k-1$ polynomial inequalities.

Itsykson and Riazanov also left as an open problem whether their bounds for ${\mathsf{Search}}_{\mathrm{BPHP}^{m}_{n}}$ could be extended to the regime of the “weak” pigeonhole principle when $m=n+\Omega(n)$ . Göös and Jain [10] first answered this in the affirmative, giving an $\Omega(n^{1/12})$ lower bound on the randomized communication complexity of ${\mathsf{Coll}}^{2}_{2n,n^{1/2}}$ . Yang and Zhang [32] subsequently improved this to an $\Omega(n^{1/4})$ bound, which is tight for randomized computation. Because of the connection between communication protocols and tree-like communication, Yang and Zhang’s bound showed that any tree-like cutting planes refutation of $\mathrm{BPHP}^{m}_{n}$ requires size $2^{\Omega(n^{1/4})}$ . We extend the lower bounds of Yang and Zhang for all $k\geq 2$ as follows:

Theorem 1.3.

For all $m>n$ and $2\leq k\leq\log n\,/4$ , the randomized number-in-hand communication complexity of ${\mathsf{Coll}}^{k}_{m,n^{1/k}}$ is $\Omega(n^{1/2-1/(2k)}/k)$ .

This bound, whose proof combines techniques of Yang, Zhang, and Wang [31, 32], is much weaker than that of Theorem 1.1 when $m$ is close to $n$ but it is tight, up to an $O(\log n)$ factor, for randomized protocols when $m$ is $n+\Omega(n)$ . For $k=\Omega(\log n)$ , it also implies the following proof complexity bound which asymptotically matches the bound of [18], but applies for all $m>n$ .

Corollary 1.4.

For all $m>n$ , any tree-like semantic cutting-planes refutation of $\mathrm{BPHP}^{m}_{n}$ requires size $2^{\widetilde{\Omega}(\sqrt{n})}$ .

1.2 General Proofs and DAG-like Protocols

Our second result improves on the $2^{\Omega(n^{1/8})}$ lower bound on the size of cutting-planes refutations of $\mathrm{BPHP}^{m}_{n}$ (for all $m>n$ ) of Hrubeš and Pudlák [16], and subsumes the $2^{\Omega(n^{1/4})}$ lower bound for tree-like proofs of Yang and Zhang [32]. Hrubeš and Pudlák used the method of interpolation involving a version of the method of approximation due to Jukna [19]. On the other hand, we use a bottleneck-counting method inspired by recent work of Sokolov [29] that refines a method introduced by Haken and Cook [13].

Theorem 1.5.

Any semantic cutting planes refutation of $\mathrm{BPHP}^{m}_{n}$ requires size at least $2^{n^{1/4}/\sqrt{2}-2}$ , for any $m>n$ .

We prove Theorem 1.5 via a connection between cutting-planes refutations and certain types of DAG-like communication protocols [15, 28]. Specifically, we prove a lower bound on the size of any two party triangle-DAG protocol computing ${\mathsf{Search}}_{\mathrm{BPHP}^{m}_{n}}$ . We define such protocols formally in Section 2.3.

For now, we state the connection as the following proposition, originally due to Sokolov and Hrubeš and Pudlák [28, 15]. Its proof can be found in Section 2.3 for completeness.

Proposition 1.6 ([28, 15]).

Given a semantic cutting planes refutation for CNF formula $\varphi$ of size $s$ and any partition of the variables of $\varphi$ into two sets, there is a size $s$ triangle-DAG computing ${\mathsf{Search}}_{\varphi}$ .

In our setting, for $\mathrm{BPHP}^{m}_{n}$ , there is a natural partition of the variables such that each clause has exactly half of its variables coming from $V_{x}$ and half of its variables coming from $V_{y}$ . This can be achieved, for example, by letting $V_{x}$ be the set of variables corresponding to the first $\frac{1}{2}\log n$ columns of the matrix associated with $\mathrm{BPHP}^{m}_{n}$ . In light of Proposition 1.6, we derive Theorem 1.5 by proving the following.

Theorem 1.7.

For the natural partition of variables $V_{x}\sqcup V_{y}=[m\log n]$ , any triangle-DAG computing ${\mathsf{Search}}_{\mathrm{BPHP}^{m}_{n}}$ requires size $2^{n^{1/4}/\sqrt{2}-2}$ .

2 Preliminaries

2.1 Proof Complexity

Proof complexity studies how the size required for refutations of unsatisfiable formulas depends on the size of the formulas themselves. The size of a refutation in general depends on the allowable structure (lines, derivation rules, etc.) of a proof. In general, a proof system corresponds to a polynomial-time verifier that can check proofs of a certain format. It generally suffices to derive refutations of unsatisfiable CNF formulas; so it is usual to focus solely on inputs of this form.

For most proof systems, a sequence of deductions can be thought of as a directed graph, where two (or possibly more) lines (whether given or derived) are combined soundly to create a new line. The underlying graph then has edges pointing from the derived inequality to its antecedents.⁴⁴4The direction of arrows in this digraph may seem counterintuitive, but it is convenient when thinking of the graph as a search problem for a violated axiom. In this case, we can follow a path in the graph to find such a a violated axiom on one of the leaves/sources. We say that a proof is tree-like if every inequality is used as an antecedent at most once in the proof – that is, if we want to use an inequality twice, we must derive it twice.

For example, in the resolution proof system the lines are clauses, and we have the derivation rule $(A\vee x)\wedge(B\vee\neg x)\vdash A\vee B.$

A more powerful proof system is the cutting planes proof system⁵⁵5Cutting planes proofs can simulate resolution proofs, see e.g. [19, 25]., denoted ${\mathsf{CP}}$ , where lines are linear inequalities. Clauses can be trivially converted into linear inequalities⁶⁶6For example, $x\vee y\vee z$ could be converted to the inequality $x+y+z\geq 1$ . but, more generally, cutting planes can start with any system of inequalities $Ax\leq b$ where $A$ is a matrix with integer entries. We say that the system is unsatisfiable iff it is unsatisfiable for any $x\in\{0,1\}^{n}$ . The most basic form of the cutting planes proof system consists of three rules: addition of inequalities, multiplication of inequalities by positive integers, and most crucially, the rounded division rule, also known as the Gomory-Chvátal rule. The rounded division rule is the simple but powerful observation that if $a_{1},\ldots,a_{n}$ are all integers with a common factor $c$ , and we have the inequality $a^{T}x\leq b$ , then we can derive that $\frac{1}{c}a^{T}x\leq\lfloor\frac{b}{c}\rfloor$ .

In general, there are many more sound derivation rules for integer/linear inequalities than just rounded division (such as saturation [9] for example), and even more generally, one may allow any sound derivation rule for linear inequalities, which yields what is known as semantic cutting planes or ${\mathsf{Th}}(1)$ – we use the two names interchangeably.

There is a well-known connection between communication complexity and tree-like proofs, which we will now detail. Given any unsatisfiable formula $\varphi$ , an assignment $\alpha$ to the variables can be distributed among $k$ players, who must then communicate in order to find a a violated clause in $\varphi$ . This is a search problem, and is denoted ${\mathsf{Search}}_{\varphi}(\alpha)$ . Short tree-like proofs of the unsatisfiability of $\varphi$ can often be converted into short protocols for ${\mathsf{Search}}_{\varphi}$ using standard techniques.

For example, a short tree-like proof of unsatisfiability of $\varphi$ using the resolution rule naturally corresponds to a decision tree for finding a violated clause of $\varphi$ in the following way. For every derivation of the form $(A\vee x)\wedge(B\vee\neg x)\vdash(A\vee B)$ where $A\vee B$ is known to be false, we query $x$ in order see whether $A\vee x$ or $B\vee\neg x$ is necessarily false. We can continue in this manner, from the root of the tree-like refutation, until we hit an unsatisfied clause in the original formula.

On the other hand, tree-like semantic cutting planes refutations naturally correspond to threshold decision trees, which we now define.

Definition 2.1 (Threshold Decision Tree).

A threshold decision tree is a tree whose vertices are labeled with inequalities of the form $a_{1}x_{1}+\cdots a_{n}x_{n}\leq b$ where $a_{1},\ldots,a_{n},b$ are integers. Edges are labelled with 0 or 1, and leaves are axioms of a system of inequalities $Ax\leq b$ .

We traverse a threshold decision tree by computing the threshold function at the root, following the corresponding edge, and continuing in this manner until we hit a leaf. We say that a threshold decision tree computes the search problem for a formula $\varphi$ if this process leads to a leaf corresponding to a violated clause in $\varphi$ .

First, we have the following well-known lemma, which states that one can derive a low-depth threshold decision tree from a small ${\mathsf{Th}}(1)$ refutation.⁷⁷7As noted in [19], there is no meaningful converse to this statement, since if there are $m$ inequalities in our unsatisfiable system, there exists a trivial depth $m$ threshold decision tree finding one that is violated.

Proposition 2.2.

Given a size $S$ tree-like ${\mathsf{Th}}(1)$ refutation of an unsatisfiable system $Ax\leq b$ , there is a depth $O(\log S)$ threshold decision tree finding a violated axiom.

Proposition 2.2 goes back to the work of Impagliazzo, Pitassi, and Urquhart [17], and can also be found for instance in [19]. We omit the proof, but the idea is a common one: find a node in the tree with roughly half (between 1/3 and 2/3) of the leaves as its descendants, make that the root of the threshold decision tree, and recurse.

2.2 Communication Complexity

We mainly focus on $k$ -party number-in-hand communication, wherein each player $p$ receives an input $x^{(p)}$ , and the players’ goal is to communicate as little as possible in order to compute a known function or relation involving their collective inputs. In general, players may have access to shared randomness, and we allow incorrect answers with probability $1/3$ .

An important function for us is the number-in-hand disjointness problem with $k$ players and input size $n$ , henceforth denoted ${\mathsf{DISJ}}^{k}_{n}$ . This is the communication problem wherein each player and input in $\{0,1\}^{n}$ , and they must decide if there exists a coordinate $i$ for which they all have a 1 in that coordinate. Disjointness in general is an extremely well-studied problem [7, 25], and for the specific case of the NIH model, we have the following lower bound due to Braverman and Oshman.

Theorem 2.3 ([5]).

The randomized communication complexity of ${\mathsf{DISJ}}^{k}_{n}$ is $\Omega(n\log k)$ .

Our results rely on the following connection between threshold decision trees for finding violated clauses and $k$ -party NIH communication. It is closely related to previous work.

Lemma 2.4.

For $x\in\{0,1\}^{n}$ , if an unsatisfiable system $Ax\leq b$ on has a threshold decision tree of depth $d\leq n$ finding a violated axiom, then for any partition of the input variables into $k$ parts there is a randomized protocol for ${\mathsf{Search}}^{k}_{NIH}(Ax\leq b)$ using $O(dk\log k\log n)$ bits of communication.

Lemma 2.4 is similar for example to Lemma 5 in [17] or Lemma 19.11 in [19], but is slightly stronger so we give a brief proof. Using a well-known theorem of Muroga [21] we can assume, without loss of generality, that each of the threshold functions in the decision tree has integer coefficients of absolute value at most $2^{-n}(n+1)^{(n+1)/2}$ .

The players evaluate these threshold functions using the following efficient randomized protocol given by Viola [30] which tightens and refines a protocol first given by Nisan [22].

Proposition 2.5 ([30]).

Suppose that each player $p\in[k]$ receives an input $x^{(p)}\in[-2^{n},2^{n}]$ . Then there is a randomized number-in-hand protocol with error at most $\varepsilon$ that determines whether $\sum_{p}x^{(p)}>s$ and communicates $O(k\log k\log(n/\varepsilon))$ bits.

Corollary 2.6.

Suppose that each player $p\in[k]$ receives an input $x^{(p)}\in[2^{t}]$ . Then they can execute a randomized number-in-hand protocol to determine whether $\sum_{p}a_{p}x^{(p)}\leq b$ , where each $|a_{p}|\leq 2^{w}$ with error at most $\varepsilon$ using at most $O(k\log k\log((w+t)/\varepsilon))$ bits of communication.

Proof of Lemma 2.4.

Given a threshold decision tree of depth $d$ , which has coefficients of magnitude at most $2^{O(n\log n)}$ without loss of generality [21], we simply traverse it from the root, evaluating the threshold functions until a root is reached. By Corollary 2.6, the players can communicate $O(k\log((n\log n)/\varepsilon)\log k)$ bits to compute the threshold function with error probability $\varepsilon$ . Setting $\varepsilon=\Theta(1/d)$ and continuing in this manner, by a union bound the threshold functions are all computed correctly with constant probability. Using the assumption that $d\leq n$ , this yields a protocol communicating $O(dk\log k\log n)$ bits. $\hfill\blacktriangleleft$

Given a system of unsatisfiable inequalities $Ax\leq b$ , and a partition of an assignment $\alpha\in\{0,1\}^{n}$ between $k$ players, there is a natural (number-in-hand) communication game wherein players must communicate to a find an axiom violated by $\alpha$ . Lemma 2.4 implies the following result connecting communication complexity and proof complexity.

Lemma 2.7.

For any partition of $n$ variables into $k$ parts, if ${\mathsf{Search}}^{k}_{NIH}(Ax\leq b)$ requires $t$ bits of communication, then any tree-like ${\mathsf{Th}}(1)$ refutation of $Ax\leq b$ requires size $2^{\Omega(t/(k\log k\log n)})$ .

Proof.

By Proposition 2.2, given a size $S$ tree-like ${\mathsf{Th}}(1)$ proof, we get a depth $d=O(\log S)$ threshold decision tree finding a violated axiom. By Lemma 2.4, there exists a communication protocol finding a violated axiom using $O(\log S\ k\log k\ \log n)$ bits of communication. This implies that $\log(S)k\log k\log n\geq ct$ for constant $c$ , which in turn implies $S$ is $2^{\Omega(t/(k\log k\ \log n)}$ , as desired. $\hfill\blacktriangleleft$

2.3 Triangle-DAGs

Given a bipartite domain $X\times Y$ , a triangle $T\subseteq X\times Y$ is any set that can be written as $T=\{(x,y):a_{T}(x)<b_{T}(y)\}$ for some labelling $a_{T}:X\to\mathbb{R}$ and $b_{T}:Y\to\mathbb{R}$ .

A triangle-DAG computing a search problem ${\mathsf{Search}}\subseteq(X\times Y\times\mathcal{O})$ is a directed acyclic graph $D$ of fan-out at most 2 where each node $u\in D$ is associated with a triangle $T_{u}\subseteq X\times Y$ satisfying the following:

$\blacksquare$

there is a distinguished root node $r$ with fan-in zero, and $T_{r}=X\times Y$ , and
$\blacksquare$

for each non-sink node $u$ with children $v,v^{\prime}$ we have $T_{u}\subseteq T_{v}\cup T_{v^{\prime}}$ , and
$\blacksquare$

each sink node $u$ is labeled with an output $o$ such that $T_{u}\subseteq{\mathsf{Search}}^{-1}(o)$ .

Given these definitions, we restate Proposition 1.6 for convenience. Proofs may be found in [28, 15] or in our full paper [3].

See 1.6

3 Multiparty Communication Lower Bounds for $m\leq n+\sqrt{n}$

In this section we prove Theorem 1.1, which we now recall. See 1.1 The idea for the proof is to exhibit a random reduction from the decision problem ${\mathsf{DISJ}}^{k}_{m^{k-1}}$ to the collision problem. This is analogous to the approach of Itsykson and Riazanov [18] for number-on-forehead communication complexity which used lower bounds for disjointness in that model and a randomized decision-to-search reduction paradigm introduced by Raz and Wigderson [26] to prove lower bounds on the monotone depth complexity of matching. The details and parameters of our reduction are necessarily quite different.

In our setting, we embed $k$ players’ inputs to a disjointness problem into $k$ matrices such that, when these matrices are concatenated, the resulting matrix has distinct rows if and only if the players’ inputs were disjoint. We can then add a few “fake” rows to this matrix and run our algorithm for ${\mathsf{Coll}}^{k}_{m,n^{1/k}}$ , and see if the collisions it finds involve the fake rows or not. If so, we conclude that the inputs were disjoint and if not, we know that they were not disjoint.

The following key combinatorial lemma allows us to carry out the first step of this process.

Lemma 3.1.

For all integers $k\geq 1$ there exist matrices $M^{1}_{k}\in\{0,1\}^{2^{k}\times k}$ and $M^{0}_{k}\in\{0,1\}^{2^{k}\times k}$ such that

1.

$M^{1}_{k}$ has $2^{k-1}$ unique pairs of identical rows.
2.

For any string $b\in\{0,1\}^{k}$ , define the matrix $M_{k}(b_{1},\ldots b_{k})$ as the matrix formed by making its $i$ -th column equal to the $i$ -th column of $M^{0}_{k}$ if $b_{i}=0$ , and equal to the $i$ -th column of $M^{1}_{k}$ if $b_{i}=1$ . Then $M_{k}(b)$ has unique rows for all $b\neq\vec{1}$ .

We defer the proof of Lemma 3.1 to Section 3.1.

Proof of Theorem 1.1 using Lemma 3.1.

As alluded to, we will reduce the NIH disjointness problem to our bit-pigeonhole problem. Namely, we will reduce ${\mathsf{DISJ}}^{k}_{m^{k-1}}$ to a ${\mathsf{Coll}}^{k}_{\widetilde{m},\widetilde{\ell}}$ communication game, where $\widetilde{m}=m^{k}2^{k}+2^{k-1}m$ , and $\widetilde{\ell}=2m$ .

The players get $x^{(1)},\ldots,x^{(k)}\subseteq[m^{k-1}]$ (viewed as bit strings of length $m^{k-1}$ ), and need to determine whether $x^{(1)}\cap\cdots\cap x^{(k)}=\varnothing$ . First, we define $B_{m^{k}}(j):=\begin{bmatrix}\mathrm{bin}_{m^{k}}(j),&\cdots,&\mathrm{bin}_{m^% {k}}(j)\end{bmatrix}^{T}\in\{0,1\}^{2^{k}\times k\log m},$ where the same row is repeated $2^{k}$ times.

For each $i\in[m^{k-1}]$ , consider the matrix $M_{k}(x^{(1)}_{i},\ldots,x^{(k)}_{i})$ from Lemma 3.1. Note that each player $p$ knows the $p$ -th column of $M_{k}(x^{(1)}_{i},\ldots,x^{(k)}_{i})$ for all $i$ , but without communication the other players do not know this column. Then we can define

\displaystyle\widetilde{M}:=\begin{bmatrix}M_{k}(x^{(1)}_{1},\ldots,x^{(k)}_{1% })&B_{m^{k}}(0)\\ M_{k}(x^{(1)}_{1},\ldots,x^{(k)}_{1})&B_{m^{k}}(1)\\ \vdots&\vdots\\ M_{k}(x^{(1)}_{1},\ldots,x^{(k)}_{1})&B_{m^{k}}(m-1)\\ M_{k}(x^{(1)}_{2},\ldots,x^{(k)}_{2})&B_{m^{k}}(m)\\ \vdots&\vdots\\ M_{k}(x^{(1)}_{2},\ldots,x^{(k)}_{2})&B_{m^{k}}(2m-1)\\ \vdots&\vdots\\ M_{k}(x^{(1)}_{m^{k-1}},\ldots,x^{(k)}_{m^{k-1}})&B_{m^{k}}(m^{k}-1)\end{% bmatrix}\in\{0,1\}^{m^{k}\cdot 2^{k}\times(k\log m+k)}.

Observe that each player $p$ can construct their “part” of this matrix without communicating by constructing the $p$ -th column of every $M^{S_{i}}_{k}$ matrix (which only depends on $x^{(p)}$ ), and then taking the the $p$ -th part of each of the $B_{m^{k}}(j)$ matrices.

Lemma 3.1 lets us connect the distinctness property of this matrix with the disjointness property of the players’ inputs.

Claim 3.2.

If $(x^{(1)},\ldots,x^{(k)})$ are disjoint, then $\widetilde{M}$ has distinct rows.

Proof.

The only possible collisions happen in every group of $2^{k}$ rows, since $B_{m^{k}}(j)$ has every row different from $B_{m^{k}}(i)$ for all $i\neq j$ . Within these groups, by Lemma 3.1, if the inputs are disjoint then there are no collisions. $\hfill\vartriangleleft$

Claim 3.3.

If $X$ is not disjoint, then there are at least $2^{k-1}m$ pairs of colliding rows in $\widetilde{M}$ .

Proof.

Any coordinate $i$ for which $x^{(p)}_{i}=1$ for all $p$ generates $S_{i}=\varnothing$ , which by Lemma 3.1 generates $2^{k-1}m$ such pairs, since input $i$ was repeated $m$ times. $\hfill\vartriangleleft$

We cannot run any collision protocol for $\widetilde{M}$ yet, as there are not guaranteed collisions. To address this, the players use shared randomness to put an additional $2^{k-1}m$ rows at the bottom of $\widetilde{M}$ . These rows will be chosen randomly with the following two properties:

1.

Each fake row will be distinct.
2.

Each player’s “part” of the matrix (which consists of $\log m+1$ columns) when restricted to these rows will repeat the $2m$ unique possible bit strings an addition $2^{k-1}m/(2m)=2^{k-2}$ times.⁸⁸8This is important because if we bias and have a certain fake row appear more often in the input for player $p$ , then $\mathcal{A}$ could potentially detect and use this to its advantage.

Denote this new matrix $M$ . $M$ now has “fake” collisions which involve any of the last $2^{k-1}m$ rows.

Let $\mathcal{A}$ denote the randomized protocol solving the ${\mathsf{Coll}}^{k}_{(2m)^{k}+2^{k-1}m,2m}$ problem.

Observe that if the inputs are disjoint, then the only collisions in $M$ involve fake rows. The players would like to feed their parts of this matrix into $\mathcal{A}$ and conclude that their inputs are disjoint if the output involves a fake row, and conclude that they were not disjoint if the output involves two non-fake rows. However, this is problematic, as we have no guarantees over how $\mathcal{A}$ behaves, and it could always find a collision involving one of the last $2m$ rows (which it knows are fake), regardless of if there are other collisions. This necessitates the following random shuffling.

1.

Each player applies (the same) random permutation $\pi:[2^{k}m^{k}+2^{k-1}m]\to[2^{k}m^{k}+2^{k-1}m]$ which shuffles the rows of $M$ .
2.

Each player applies an individual random permutation $\pi^{(p)}:[2m]\to[2m]$ to each of their rows. Note that this preserves collisions/distinctness in the concatenation.

Denote $\vec{\pi}:=(\pi,\pi^{(1)},\ldots,\pi^{(k)})$ , and call this final matrix $M_{\vec{\pi}}$ .

Algorithm.

The algorithm for disjointness is as follows: the players use their inputs and shared randomness to compute (without communication) their respective parts of 5 independent copies of an $M_{\vec{\pi}}$ constructed in the above manner, and run $\mathcal{A}$ using these as inputs. The players then examine the outputs $(i_{1},j_{1}),\ldots(i_{5},j_{5})$ of the algorithm on these five inputs. They then exchange an additional $O(k\log m)$ bits to determine if each claimed collision actually was a collision. Finally, if any of the claimed collisions were actually collisions on rows that were not fake (under the appropriate permutation), then the players can conclude with certainty that their inputs were not disjoint. Otherwise, if $\mathcal{A}$ only ever finds colliding pairs that involve a row the players know is fake (or otherwise fail to find any collisions), then players guess that $(x^{(1)},\ldots,x^{(k)})$ were disjoint.

Analysis.

We analyze one iteration of the algorithm. Suppose $\mathcal{A}$ has error probability at most $1/3$ – that is, with probability at least $2/3$ , it outputs two rows $i, j$ that are equal.

Suppose $(x^{(1)},\ldots,x^{(k)})$ are disjoint. Then by assumption, $\mathcal{A}$ finds a collision with probability at least $2/3$ , and we know this collision will always involve a fake row by ˜3.2. Therefore the players will correctly output that their inputs were distinct with probability at least $2/3$ , and this is only improved by the five-fold repetition.

Otherwise, suppose that the players’ inputs were not disjoint. Suppose further that $\mathcal{A}$ successfully finds a collision – this happens with probability at least $2/3$ . Recall that by ˜3.3 $M_{\vec{\pi}}$ will have at least $2^{k-1}m$ distinct pairs of real collisions. Adding the $2^{k-1}m$ fake rows produced additional “fake” collisions. These fake rows could have created up to $2^{k-1}m$ additional unique pairs of fake collisions, or could have “joined” the real collisions, creating up to $2^{k-1}m$ groups of 3 equal rows in $M_{\vec{\pi}}$ .

If $\mathcal{A}$ outputs a collision from one of the groups of three, then because we applied random permutations to the rows, it is equally likely to have chosen any of the 3 possible pairs. Therefore, with probability at least $1/3$ , it outputs a real collision, and the players successfully discover that they are not disjoint. Otherwise, if $\mathcal{A}$ outputs one of the unique collision pairs, then (again because we have applied random permutations to the rows), any such unique collision is equally likely to be output. If $t$ of the fake rows formed a group of three with real collisions, then that leaves at most $2^{k-1}m-t$ fake rows to collide with a different unique row. It also leaves $2^{k-1}m-t$ untouched real collisions, so $\mathcal{A}$ outputs a real collision with probability at least $1/2$ . Either way, the probability that $\mathcal{A}$ outputs a real collision is at least $2/3\cdot 1/3=2/9$ .

Therefore, after repeating this 5 times independently, the probability of seeing at least one real collision is at least $1-(7/9)^{5}>2/3$ .

Let $n:=(2m)^{k}$ . We have shown that if ${\mathsf{Coll}}^{k}_{\widetilde{m},\widetilde{\ell}}$ with input size $\widetilde{m}=2^{k}m^{k}+2^{k-1}m=n+2^{k-2}n^{1/k}$ can be solved with $o(n^{1-1/k}\log k/2^{k})$ communication, then we can solve the decision disjointness problem with $o(n^{1-1/k}\log k/2^{k})+O(k\log m)$ which is at most $o(m^{k-1}\log k)$ , contradicting the $\Omega(m^{k-1}\log k)$ lower bound from Theorem 2.3. $\hfill\blacktriangleleft$

3.1 Proof of Lemma 3.1

We first recall Lemma 3.1.

See 3.1

Proof.

Let $\mathcal{E}_{k}\subseteq\{0,\ldots,k-1\}$ be the set of integers with an even number of 1s in their binary representation.

Define

\displaystyle M^{1}_{k}=\begin{bmatrix}\mathrm{bin}_{k}(y_{1})\\ \mathrm{bin}_{k}(y_{1})\\ \mathrm{bin}_{k}(y_{2})\\ \mathrm{bin}_{k}(y_{2})\\ \vdots\\ \mathrm{bin}_{k}(y_{2^{k-1}})\\ \mathrm{bin}_{k}(y_{2^{k-1}})\end{bmatrix}\in\{0,1\}^{2^{k}\times k},

where $y_{i}$ is the $i$ -th smallest number in $\mathcal{E}_{k}$ .

We pause here to note that each of the columns of $M^{1}_{k}$ are actually each the truth table of a linear function. Let $f_{1}:\{0,1\}^{k}\to\{0,1\}$ be the linear function $f_{1}(x)=\langle{x,e_{1}}\rangle$ , where $e_{1}$ is the first standard basis vector. Then we can describe the first column of $M^{1}_{k}$ as the truth table of $f_{1}$ . More generally, we have that for $i<k$ the $i$ -th column of $M^{1}_{k}$ is the truth table of $f_{i}(x)=\langle{x,e_{i}}\rangle$ , and the last column is the truth table of $f_{k}(x)=\langle{x,e_{1}+\cdots+e_{k-1}}\rangle$ .

If we define $F^{1}_{k}$ to be the matrix whose column $i$ is the vector whose inner product we are taking with $x$ in $f_{i}$ , and $B_{k}\in\mathbb{F}_{2}^{2^{k}\times k}$ whose rows are the binary strings written in order, then we have that

\displaystyle M^{1}_{k}=\begin{bmatrix}\mathrm{bin}_{k}(0)\\ \mathrm{bin}_{k}(1)\\ \vdots\\ \mathrm{bin}_{k}(2^{k}-1)\end{bmatrix}\begin{bmatrix}1&0&0&\cdots&1\\ 0&1&0&\cdots&1\\ 0&0&1&\ddots&1\\ \,&\vdots&\,&\ddots&\vdots\\ 0&0&0&\cdots&1\\ 0&0&0&\cdots&0\end{bmatrix}=:B_{k}F^{1}_{k}

where all operations are over $\mathbb{F}_{2}$ . $M^{1}_{k}$ has repeated rows precisely because $F^{1}_{k}$ has linearly dependent columns.

With this perspective in mind, if we can find a $k\times k$ matrix $F_{k}^{0}$ over $\mathbb{F}_{2}$ such that replacing any (nonzero) number of columns of $F_{k}^{1}$ with corresponding columns in $F^{0}_{k}$ produces a matrix with linearly independent columns, then we are done, as we can let $M^{0}_{k}:=B_{k}F^{0}_{k}$ .

We define $F^{0}_{k}$ to be the following lower triangular matrix:

\displaystyle F^{0}_{k}:=\begin{bmatrix}1&0&0&\cdots&0\\ 1&1&0&\cdots&0\\ 1&1&1&\ddots&0\\ \vdots&\vdots&\vdots&\ddots&\vdots\\ 1&1&1&\cdots&1\end{bmatrix},

Clearly $F^{0}_{k}$ is full rank.

We claim that replacing any set nonempty set $S$ of columns of $F^{1}_{k}$ with the corresponding columns in $F_{k}^{0}$ produces a matrix $F^{S}_{k}$ with linearly independent columns. Consider arbitrary $k$ , and arbitrary nonempty $S\subseteq[k]$ .

Case 1.

Suppose $k\in S$ , that is, the last column of $F_{k}^{S}$ is $e_{k}$ . Then our matrix is lower triangular with 1s on the diagonal, and so has linearly independent columns.

Case 2.

Suppose $k\not\in S$ , that is, the last column of $F_{k}^{S}$ is $e_{1}+\cdots+e_{k-1}$ . In this case, the first $k-1$ columns are lower triangular with 1s on the diagonal, and therefore their span is equal to ${\mathsf{span}}\{e_{1},\ldots,e_{k-1}\}$ . It suffices then to show that also $e_{k}$ is in the column span. Towards this goal, take the minimal $0<i<k$ such that $i\in S$ , and observe that summing the first $i$ columns of $F^{S}_{k}$ equals $\vec{1}$ , the all 1s vector. Adding this to the last column produces $e_{k}$ , so we are done. $\hfill\blacktriangleleft$

4 Tree-like Proof Complexity Lower Bounds

In this section, we prove the following more detailed version of Theorem 1.2.

Theorem 4.1.

When $m\leq n+2^{k-2}n^{1/k}$ , any tree-like semantic cuttings planes refutation of $\mathrm{BPHP}^{m}_{n}$ must have size at least $2^{\Omega(n^{1-1/k}2^{-k}/(k\log n))}$ .

Proof.

The theorem follows quite readily from Lemma 2.7 and the fact that ${\mathsf{Search}}^{k}_{\mathrm{BPHP}^{m}_{n}}$ reduces to ${\mathsf{Coll}}^{k}_{m,n^{1/k}}$ . If we translate $\mathrm{BPHP}^{m}_{n}$ to a system of inequalities, then there are $O(nm^{2})=O(n^{3})$ inequalities on $m\log n$ variables.

Lemma 2.7 then says that any tree-like semantic cutting planes proof of the unsatisfiability of $\mathrm{BPHP}^{m}_{n}$ must have size at least $2^{\Omega(t/(k\log k\ \log n)}.$ By Theorem 1.1, $t=\Omega(n^{1-1/k}2^{-k}\log k)$ . Plugging this in yields that the tree-like size must be at least $2^{\Omega(n^{1-1/k}2^{-k}/(k\log n)}$ . $\hfill\blacktriangleleft$

From Theorem 4.1 we achieve Theorem 1.2, which we restate now.

Corollary 4.2.

Any tree-like semantic cutting planes refutation of $\mathrm{BPHP}^{m}_{n}$ requires size $2^{n^{1-2/\sqrt{\log n}-o(1/\sqrt{\log n})}}$ when $m\leq n+2^{2\sqrt{\log n}-2}$ .

Proof.

Let $k=\sqrt{\log n}$ . Plugging this into the bound from Theorem 4.1, we get that $m\leq n+2^{\sqrt{\log n}-2}n^{1/\sqrt{\log n}}=2^{2\sqrt{\log n}-2}$ . In this regime, Theorem 4.1 gives the size lower bound

\displaystyle 2^{\Omega(n^{1-1/\sqrt{\log n}}n^{-1/\sqrt{\log n}}/(\log^{3/2}n% ))}=2^{cn^{1-2/\sqrt{\log n}-1.5\log\log n/\log n}},

for appropriate constant $c$ . Using the fact that $c=n^{\log c/\log n}$ , the bound becomes

2^{n^{1-2/\sqrt{\log n}-o(1/\sqrt{\log n})}}.\

$\hfill\blacktriangleleft$

5 Triangle-DAG Lower Bounds

In this section, we prove Theorem 1.7, which we restate here for convenience.

See 1.7

Before proving Theorem 1.7, we need the following notion of triangle slices. For a triangle $T\subseteq X\times Y$ and a given $x\in X$ , we define the slice $T^{x}$ to be $T\cap(\{x\}\times Y)$ , and define the slice $T^{y}$ analogously. Note that for any triangle $T$ defined by functions $a$ and $b$ there is a natural total pre-order on $X$ where $x\preceq x^{\prime}$ iff $a(x^{\prime})\leq a(x)$ and hence $T^{x}\subseteq T^{x^{\prime}}$ ; similarly, there is an total pre-order on $Y$ where $y\preceq y^{\prime}$ iff $b(y)\leq b(y^{\prime})$ and hence $T^{y}\subseteq T^{y^{\prime}}$ . This immediately implies the following.

Proposition 5.1.

Given a triangle $T$ , if $y\in Y$ is such that $|T^{y}|$ is maximized, then for all $(x,y^{\prime})\in T$ we have $(x,y)\in T$ .

We will have the following proposition.

Proposition 5.2.

For any triangle $T$ and rectangle $R$ , $T\cap R$ is a triangle.

Proof.

By definition, $T=\{(x,y):a(x)<b(y)\}$ for some $a, b$ . If $R=X^{\prime}\times Y^{\prime}$ , simply modify $a(x)=\infty$ for all $x\not\in X^{\prime}$ and $b(y)=-\infty$ for all $y\not\in Y^{\prime}$ . $\hfill\blacktriangleleft$

The proof of Theorem 1.7 uses the bottleneck counting method of Haken and Cook for cutting planes proofs [13] as adapted for direct use on triangle-DAGs by Sokolov [29].

Let $V(D)$ denote the nodes of a triangle-DAG $D$ computing ${\mathsf{Search}}_{\mathrm{BPHP}^{m}_{n}}$ under the natural partition of variables. At a high level, to prove Theorem 1.7, we will construct a partial function $\mu:X\cup Y\to V(D)$ such that

: (a) (Lemma 5.5) most $x\in X$ and $y\in Y$ are mapped by $\mu$ to a node of $D$ , and
: (b) (Lemma 5.6) there is no node of $D$ with too many $x$ or $y$ assigned to it by $\mu$ .

It is then easy to combine these two properties and conclude that $|V(D)|$ must be large⁹⁹9The name bottleneck counting came from its original version due to Haken [12] in which a class of full input assignments was mapped by a function $\mu$ to nodes in a proof DAG and each input assignment could be viewed as “flowing” from the root to a particular sink node (defined by the assignment). In that case, $\mu$ identified a node on such a path, a bottleneck node, that did not permit many assignments to flow through it. Since there were many assignments, only few of which could pass through any bottleneck node identified by $\mu$ , there must be many bottleneck nodes in the DAG. In the modified form for cutting planes/triangle-DAGs, while the full assignment is flowing through the triangle-DAG, the mapping $\mu$ is based on half of the input assignment, either the $x$ or $y$ portion, and the sink node is not unique; not all assignments are assigned in this way, but many are..

For any $i\neq j\in[m]$ , there is an associated rectangle $R_{i,j}\subseteq X\times Y$ of inputs that violate at least one of the $n$ clauses enforcing an inequality between coordinates $i$ and $j$ given by

\displaystyle R_{i,j}=\{(x,y)\in X\times Y\mid x_{i}=x_{j},\ y_{i}=y_{j}\}.

Observe that the set of all rectangles $\{R_{i,j}:i\neq j\in[m]\}$ covers the entire space $X\times Y$ as long as $m>n$ . (That is, $X\times Y=\bigcup_{i\neq j\in[m]}R_{i,j}$ .) Note that in order for a triangle-DAG to correctly compute ${\mathsf{Search}}_{\mathrm{BPHP}^{m}_{n}}$ , it must, at the very least, have each sink be contained in a single $R_{i,j}$ .

Definition 5.3.

For a node $u\in V(D)$ , let $T_{u}$ denote the triangle associated with that node. Define the width $w(T,z)$ of $z$ in triangle $T$ to be the minimal size of any subset $S\subseteq\{R_{i,j}:i\neq j\in[m]\}$ such that $T^{z}\subseteq\bigcup_{R\in S}R$ .

With this definition in hand, we can define our $\mu:X\cup Y\to V(D)$ using Algorithm 1 which depends on a parameter $k$ that we will eventually set to $\sqrt{n}/4$ .

Algorithm 1 Defining the partial map

\mu:X\cup Y\to V(D)

.

Intuitively, in using Algorithm 1 we are going backwards through our DAG $D$ , and whenever the protocol had too many options for what clause to output when the protocol was at a node $u$ for a fixed $z\in X\cup Y$ (which corresponds to large width), we assign $z$ to $u$ . Arguing that $|\mu^{-1}(u)|$ is not too large is thus arguing that a protocol cannot make huge strides in learning about which clause to output for too many $z\in X\cup Y$ suddenly in one single node $u$ .

We now make some basic observations about our definition of $\mu$ . First, note that no $z\in X\cup Y$ is assigned to any sink node of $D$ , since every sink node $s\in V(D)$ is wholly contained in an $R_{i,j}$ , and therefore we have that $w(T_{s},z)=1$ for all $z$ . The following observation will be of critical importance in our analysis.

Claim 5.4.

During the execution of Algorithm 1, for every $u$ in triangle-DAG $D$ and every $z\in X^{\prime}\cup Y^{\prime}$ , $w(\widetilde{T}_{u},z)\leq 2k$ .

Proof.

By definition in a triangle-DAG, if $v$ and $v^{\prime}$ are children of node $u$ , then we have that $T_{u}\subseteq T_{v}\cup T_{v^{\prime}}$ . In particular, we know that if $z\in X\cup Y$ was not assigned to nodes $v$ or $v^{\prime}$ , then $w(\widetilde{T}_{u},z)\leq 2k$ , since we can take the union of the coverings for $\widetilde{T}_{v}^{z}$ and $\widetilde{T}_{v^{\prime}}^{z}$ . $\hfill\vartriangleleft$

Finally, we note that at the end of Algorithm 1 at the root $r$ , for all $z\in X^{\prime}\cup Y^{\prime}$ , we have that $w(\widetilde{T}_{r},z)<k$ .

Lemma 5.5.

Let $k\leq\sqrt{n}/2$ . For $\mu$ constructed using Algorithm 1, at least $|X|/2$ elements of $X$ are assigned by $\mu$ or at least $|Y|/2$ elements of $Y$ are assigned by $\mu$ .

Proof.

At the end of Algorithm 1 after processing the root $r$ of $D$ , since $T_{r}=X\times Y$ , we have $\widetilde{T}_{r}=X^{\prime}\times Y^{\prime}$ which is the rectangle of unassigned inputs. If $|X^{\prime}|\leq|X|/2$ then we are done; otherwise, let $x\in X^{\prime}$ . Since $w(\widetilde{T}_{r},x)<k$ , there exists a set $S$ , $|S|<k$ , such that for all $y\in Y^{\prime}$ , $y_{i}=y_{j}$ for at least one $(i,j)\in S$ .

So, we obtain

\displaystyle\mathop{\bf Pr\/}_{y\sim Y}[y\in Y^{\prime}]

\displaystyle\leq\mathop{\bf Pr\/}[\exists(i,j)\in S\text{ s.t. }y_{i}=y_{j}]% \leq|S|\max_{(i,j)\in S}\mathop{\bf Pr\/}[y_{i}=y_{j}]<k/\sqrt{n}\leq 1/2.

Hence, $|Y^{\prime}|<|Y|/2$ , as desired. $\hfill\blacktriangleleft$

This shows that a large number of elements of $X\cup Y$ are assigned by $\mu$ . It remains to show that each node in the protocol $D$ only has a bounded number of elements of $X\cup Y$ assigned by $\mu$ .

Lemma 5.6.

Let $\mu$ be defined by Algorithm 1 with $k=\sqrt{n}/4$ . For all all $u\in V(D)$ , $\mu$ maps at most $|X|\cdot 2^{-n^{1/4}/\sqrt{2}+1}$ elements of $X$ to $u$ ; the analogous bound also holds for $Y$ .

Proof.

We fix any $u\in V(D)$ and focus on the number of elements $x\in X$ that $\mu$ maps to $u$ ; the bound for the number of such $y\in Y$ is identical. In particular, consider the values of $X^{\prime}$ and $Y^{\prime}$ and $\widetilde{T}_{u}$ in Algorithm 1 immediately before processing node $u$ ; any $x\in X$ that $\mu$ maps to $u$ must be in $X^{\prime}$ .

Our strategy will be to construct a set of potential coverings for the slices $\{\widetilde{T}_{u}^{x}:x\in X^{\prime}\}$ by monochromatic rectangles. We will then aim to show that most $x\in X^{\prime}$ have width $<k$ even when restricted to this set of potential coverings, thereby limiting how many $x\in X^{\prime}$ could be assigned by $\mu$ to $u$ .

Algorithm 2 Defining tree

\mathcal{T}

, which encodes a set of potential coverings of

\widetilde{T}^{x}

for all

x\in X

.

We now describe this potential covering in detail. Intuitively, our strategy will be to construct a tree $\mathcal{T}$ , each of whose nodes is labelled by a triangle of inputs that are not yet covered, where each edge of the tree is labelled with a pair $i\neq j\in[m]$ . We say that $x\in X^{\prime}$ is consistent with a node $t\in\mathcal{T}$ iff for all $(i,j)$ labels on the edges of the unique path to $t$ in $\mathcal{T}$ , we have that $x_{i}=x_{j}$ . Furthermore, $\mathcal{T}$ will be constructed so that if $x\in X^{\prime}$ is consistent with a leaf $t$ of $\mathcal{T}$ , then $\widetilde{T}_{u}^{x}$ is covered by $\{R_{i,j}:(i,j)\text{ is on the path to $t$}\}$ . We produce the tree $\mathcal{T}$ of potential coverings of of $\widetilde{T}_{u}^{x}$ for all $x\in X^{\prime}$ using Algorithm 2 applied with $\widetilde{T}=\widetilde{T}_{u}$ , $X=X^{\prime}$ , and $Y=Y^{\prime}$ which satisfy the preconditions by ˜5.4.

Refer to caption — Figure 1: A single iteration of Algorithm 2.

First, observe that any $x\in X^{\prime}$ such that $\widetilde{T}^{x}\neq\varnothing$ is consistent with at least one leaf in $\mathcal{T}$ , by the choice of $y$ in Algorithm 2 along with Proposition 5.1. We define the set of equality constraints of a leaf $t\in\mathcal{T}$ to be the set $E(t):=\{i\neq j\in[m]:\text{$(i,j)$ is on the unique path to $t$ in $\mathcal{% T}$}\}$ . Observe that we have the desired property that if $x$ is consistent with a leaf $t$ of $\mathcal{T}$ , then $\widetilde{T}_{u}^{x}$ is covered by $\{R_{i,j}:(i,j)\in E(t)\}$ , thereby ensuring that $w(\widetilde{T}_{u},x)\leq 0pt_{\mathcal{T}}(t)$ .

Therefore, if $\mu$ maps $x\in X^{\prime}$ to $u$ then $x$ must be consistent with a leaf of $\mathcal{T}$ of depth at least $k$ . Let $\mathcal{T}^{\prime}$ denote tree $\mathcal{T}$ truncated at depth $k$ . Thus, the total number of $x\in X^{\prime}$ that $\mu$ maps to $u$ is at most

\sum_{\begin{subarray}{c}t\in\mathsf{Leaves}(\mathcal{T}^{\prime})\\ 0pt_{\mathcal{T}^{\prime}}(t)=k\end{subarray}}|\{x\in X:\text{$x$ is % consistent with $t$}\}|.

(1)

$\blacktriangleright$ Remark 5.7.

Before explaining the details of our analysis, it is useful to discuss a naive bound on (1) that does not suffice to yield our bounds: Naively, there could be $(2k)^{k}$ leaves in $\mathcal{T}^{\prime}$ at depth $k$ , and each such leaf $t$ could have the property that $E(t)$ forms a clique on $\ell=O(\sqrt{k})$ coordinates when viewed as a graph. In this case, the fraction of $x\in X$ that could be consistent with the intersection $R_{i_{1},j_{1}}\cap\cdots\cap R_{i_{k},j_{k}}$ is $(1/\sqrt{n})^{\ell}$ . If we simply take a union bound over all $(2k)^{k}$ paths, the upper bound we would get on (1) is $(2k)^{k}\cdot(1/\sqrt{n})^{\ell}$ which, since $\ell$ is only $O(\sqrt{k})$ , is larger than 1 unless $k^{\sqrt{k}}$ is at most polynomial in $n$ , which requires that $k$ is $o(\log^{2}n)$ . $\lrcorner$

In light of Remark 5.7, we need to be more careful about how we bound (1). For $t\in V(\mathcal{T}^{\prime})$ , we define its equality graph $G(t)$ to be a graph with vertex set $[m]$ and edge set $E(t)$ . We use this name because if $x$ is consistent with $t$ , then $x_{i}=x_{j}$ for all $(i,j)\in E(t)$ . We say that an edge $(i,j)$ denoting $x_{i}=x_{j}$ is implied by equality graph $G(t)$ if adding it completes a cycle in $G(t)$ .

The key observation is the following.

Claim 5.8.

Let $t$ be a node in $\mathcal{T}^{\prime}$ , and $G(t)$ its associated equality graph. If $t$ has out-degree strictly larger than 1 in $\mathcal{T}^{\prime}$ , then none of the outgoing edges can be labelled by an $(i,j)$ that is implied by $G(t)$ .

Proof.

We prove the contrapositive. Consider any $x$ consistent with the path to $t$ in $\mathcal{T}^{\prime}$ (meaning $x_{i}=x_{j}$ for all $(i,j)\in E(t)$ ). Let $T$ be the triangle labelling node $t$ and consider the $y$ that we choose when processing $T$ in Algorithm 2. If there is some outgoing edge from $t$ labelled $(i,j)$ that is implied by $G(t)$ , then $R_{i,j}\cap T^{y}\neq\varnothing$ and hence $y_{i}=y_{j}$ . Since we know that $x_{i}=x_{j}$ is already implied in $T$ , the single rectangle $R_{i,j}$ covers all of $T^{y}$ , so $t$ will have out-degree 1 in $\mathcal{T}$ . $\hfill\vartriangleleft$

˜5.8 implies that every path in $\mathcal{T}^{\prime}$ goes through a node that branches at least $\sqrt{2k}$ times, since $k$ edges must imply equality constraints on at least $\sqrt{2k}$ coordinates (the worst case is a clique). We simplify $\mathcal{T}^{\prime}$ further by collapsing nodes of $\mathcal{T}^{\prime}$ with out-degree 1. Then we are left with $\mathcal{T}^{\prime\prime}$ , which is a tree with every path of length between $\sqrt{2k}$ and $k$ . Since the input distribution on $X$ is uniform and none of the out-degree 1 edges of $\mathcal{T}^{\prime}$ involve new constraints by ˜5.8, the probability that $x$ is consistent with any leaf $t\in\mathcal{T}^{\prime\prime}$ is precisely $(1/\sqrt{n})^{0pt_{\mathcal{T}^{\prime\prime}}(t)}$ . Then we can bound (1) as follows:

	$\displaystyle\sum_{\begin{subarray}{c}t\in\mathcal{T}^{\prime}\\ 0pt_{\mathcal{T}^{\prime}}(t)=k\end{subarray}}$	$\displaystyle\|\{x\in X:\text{$x$ is consistent with $t$}\}\|$
		$\displaystyle=\sum_{t\in\mathsf{Leaves}(\mathcal{T}^{\prime\prime})}\|\{x\in X:% \text{$x$ is consistent with $t$}\}\|$
		$\displaystyle=\sum_{i=\sqrt{2k}}^{k}\ \sum_{\begin{subarray}{c}t\in\mathsf{% Leaves}(\mathcal{T}^{\prime\prime})\\ 0pt_{\mathcal{T}^{\prime\prime}}(t)=i\end{subarray}}\|\{x\in X:\text{$x$ is % consistent with $t$}\}\|$
		$\displaystyle\leq\sum_{i=\sqrt{2k}}^{k}(2k)^{i}(1/\sqrt{n})^{i}\qquad\textrm{% since $\mathcal{T}$ and hence $\mathcal{T}^{\prime\prime}$ has out-degree at % most $2k$}$
		$\displaystyle\leq\|X\|\cdot\sum_{i=\sqrt{2k}}^{k}(1/2)^{i}\qquad\textrm{since $k% =\sqrt{n}/4$}$
		$\displaystyle\leq 2^{-\sqrt{2k}+1}\|X\|.$

This implies that the number of $x\in X$ that get assigned to $u$ by $\mu$ is at most $|X|\cdot 2^{-\sqrt{2k}+1}\leq|X|2^{-n^{1/4}/\sqrt{2}+1}$ as required. The analogous bound holds for the number of $y$ assigned to $u$ by $\mu$ . $\hfill\blacktriangleleft$ We now have all we need to prove our main theorem.

Proof of Theorem 1.5.

Let $k=\sqrt{n}/4$ . By Lemma 5.5, $\mu$ given by Algorithm 1 maps at least $1/2$ of the elements of $X$ or elements of $Y$ to vertices of $D$ . Choose the applicable $X$ or $Y$ . By Lemma 5.6, at most a $2^{-n^{1/4}/\sqrt{2}+1}$ fraction of these can map to a single vertex of $D$ . Therefore $D$ has at least $2^{n^{1/4}/\sqrt{2}-2}$ vertices. $\hfill\blacktriangleleft$

$\blacktriangleright$ Remark 5.9.

Hrubeš and Pudlák [16] proved their lower bound using the interpolation method combined with a modified form of a general monotone switching lemma of Jukna [19]. The construction of Algorithm 2 has some flavor of the monotone switching lemma arguments, but the overall argument here seems fairly different. It would not be surprising that if one could modify this method to yield a similar lower bound to the one we give here, but exactly how to do that is not clear. In any case, we find the form of the bottleneck counting argument appealing.

6 Multiparty Communication Lower Bounds for all $m>n$

See 1.3

We note that this is essentially optimal in that a simple randomized protocol nearly matches our lower bound.

Proposition 6.1.

For $m\geq n+n^{1-1/k}$ and $k\leq\log n\,/4$ , there is a randomized $k$ -party number-in-hand protocol for ${\mathsf{Coll}}^{k}_{m,n^{1/k}}$ with complexity $O(n^{1/2-1/(2k)}\log n)$ .

Details of both proofs are in our full paper [3]. We sketch the proof of Theorem 1.3, which simplifies and generalizes ideas of Yang, Zhang, and Wang [32, 31] to the $k>2$ -party setting for ${\mathsf{Coll}}^{k}_{m,n^{1/k}}$ . It is based on a method for decomposing the input space into structured rectangles on which a protocol’s output is constant.

Definition 6.2.

For a random variable $X$ supported on $\Omega^{J}$ , we say it is $(1-\delta)$ -dense if the min-entropy of all its projections is nearly as large as possible; that is, for all $I\subseteq J$ , $H_{\infty}(X(I))\geq(1-\delta)\;|I|\log|\Omega|$ , where $X(I)$ denotes $X$ marginalized to the coordinates in $I$ .

For ${\mathsf{Coll}}^{k}_{m,n^{1/k}}$ we have $\mathrm{supp}(X_{i})=\Omega^{m}$ for $\Omega=[n^{1/k}]$ and we will choose $\delta=\frac{1}{\log n}$ .

Definition 6.3 (Structured rectangle).

Given a rectangle $R=X_{1}\times\ldots\times X_{k}$ , where each $X_{i}\subseteq\Omega^{m}$ , we say that $R$ is structured iff there exist $J_{1},\ldots,J_{k}\subseteq[m]$ and $\tau_{1},\ldots,\tau_{k}$ where each $\tau_{i}:[m]\setminus J_{i}\rightarrow\Omega$ such that

$\blacksquare$

$X_{i}$ is $(1-\delta)$ -dense on the coordinates in $J_{i}$ and
$\blacksquare$

$X_{i}$ is fixed to $\tau_{i}$ on the coordinates in $\overline{J_{i}}:=[m]\setminus J_{i}$ .

Using ideas from [31] we obtain:

Lemma 6.4.

Let $\Pi$ be a deterministic $k$ -party number-in-hand protocol solving ${\mathsf{Coll}}^{k}_{m,n^{1/k}}$ . There is a partition of the inputs into structured rectangles, each of which yields a fixed output of $\Pi$ , such that the probability distribution $\mathcal{R}^{\mathrm{leaf}}$ on these structured rectangles induced by uniform inputs satisfies $\mathop{\bf E\/}_{R\sim\mathcal{R}^{\mathrm{leaf}}}\sum_{i}|\overline{J_{i}}|% \leq 2k\cdot CC(\Pi)$ where $J_{1},\ldots,J_{k}$ are the sets associated with the structured property of $R$ .

Our proof considers rectangles in the support of $\mathcal{R}^{\mathrm{leaf}}$ of three kinds: (1) Those where $\sum_{i}|\overline{J_{i}}|$ is larger than its expected value of $2k\cdot CC(\Pi)$ , (2) those where it is not much larger and every element of the rectangle contains some fixed collision, and (3) the rest. The fraction of inputs in rectangles of type (1) is small by Markov’s inequality and the fraction in rectangles of type (3) on which any fixed answer of $\Pi$ can be correct, is small because they are structured. Analyzing the total measure of inputs in rectangles of type (2) relies on a subtle charging scheme that allocates the total number of such inputs to rectangles of all types using a per-rectangle fraction proportional to $\min(\sum_{i}\binom{|\overline{J_{i}}|}{2},\ k^{2}\cdot CC^{2}(\Pi))/n^{1-1/k}$ .

7 Discussion

We end by discussing some related problems and directions for future study. In particular, we highlight three possible directions.

$\blacksquare$

Multiparty DAG-like communication lower bounds. In general, it would be interesting to see if the $2^{\Omega(n^{1/4})}$ we give can be further improved, or if any nontrivial (size $2^{o(n)}$ ) cutting planes refutation of $\mathrm{BPHP}^{n+1}_{n}$ exists. One possible way to improve the lower bound could be to study a multiparty ( $k>2$ ) version of DAG-like protocols. Concretely, could one can prove DAG-like communication lower bounds for the $k$ -player analog ${\mathsf{Search}}_{\mathrm{BPHP}^{m}_{n}}^{k}$ ? We have shown that generalizing to $k$ players helps in the tree-like case, and perhaps this holds true in the DAG-like setting as well.
$\blacksquare$

Stronger bounds for tree-like cutting-planes proof complexity of the weak bit pigeonhole principle When $m=n+\Omega(n)$ , we only obtain the weaker $2^{\Omega(\sqrt{n})}$ size lower bound for $\mathrm{BPHP}^{m}_{n}$ , rather than the $2^{n-o(1)}$ size lower bound we obtain for values of $m$ closer to $n$ . Can we extend the range of our strong lower bound to all values of $m>n$ ? To do so we must use a technique that does not use randomized communication complexity.
$\blacksquare$

Finally, we highlight that for $m$ close to $n$ , the loss of $2^{k}$ in the denominator of Theorem 1.1 could potentially be improved when $m\leq n+n^{1/k}$ . Indeed, we do not suspect that it should be present at all, and we conjecture that ${\mathsf{Search}}^{k}_{\mathrm{BPHP}^{m}_{n}}$ should remain hard for $k$ all the way up to $\log n$ . However, it seems unlikely that any reduction from $k$ -party disjointness would be able to achieve this since an additional input bit per player seems essential in implementing the reduction.

References

[1] Albert Atserias, Moritz Müller, and Sergi Oliva. Lower bounds for DNF-refutations of a relativized weak pigeonhole principle. J. Symb. Log., 80(2):450–476, 2015. doi:10.1017/JSL.2014.56.
[2] Paul Beame, Toniann Pitassi, and Nathan Segerlind. Lower bounds for Lovász–Schrijver systems and beyond follow from multiparty communication complexity. SIAM J. Comput., 37(3):845–869, 2007. doi:10.1137/060654645.
[3] Paul Beame and Michael Whitmeyer. Multiparty communication complexity of collision-finding and cutting planes proofs of concise pigeonhole principles. Electron. Colloquium Comput. Complex., TR25-057, 2025. URL: https://eccc.weizmann.ac.il/report/2025/057.
[4] Maria Luisa Bonet, Toniann Pitassi, and Ran Raz. Lower bounds for cutting planes proofs with small coefficients. In Proceedings of the Twenty-Seventh Annual ACM Symposium on Theory of Computing (STOC 1995), pages 575–584, Las Vegas, Nevada, 1995. ACM. doi:10.1145/225058.225275.
[5] Mark Braverman and Rotem Oshman. The communication complexity of number-in-hand set disjointness with no promise. Electron. Colloquium Comput. Complex., TR15-002, 2015. URL: https://eccc.weizmann.ac.il/report/2015/002.
[6] Samuel R. Buss and György Turán. Resolution proofs of generalized pigeonhole principles. Theor. Comput. Sci., 62(3):311–317, 1988. doi:10.1016/0304-3975(88)90072-2.
[7] Arkadev Chattopadhyay and Toniann Pitassi. The story of set disjointness. SIGACT News, 41(3):59–85, 2010. doi:10.1145/1855118.1855133.
[8] Stefan S. Dantchev, Nicola Galesi, Abdul Ghani, and Barnaby Martin. Proof complexity and the binary encoding of combinatorial principles. SIAM J. Comput., 53(3):764–802, 2024. doi:10.1137/20M134784X.
[9] Stephan Gocht, Jakob Nordström, and Amir Yehudayoff. On division versus saturation in pseudo-boolean solving. In Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, IJCAI 2019, pages 1711–1718. ijcai.org, August 2019. doi:10.24963/IJCAI.2019/237.
[10] Mika Göös and Siddhartha Jain. Communication complexity of collision. In Approximation, Randomization, and Combinatorial Optimization. Algorithms and Techniques, APPROX/RANDOM 2022, volume 245 of LIPIcs, pages 19:1–19:9. Schloss Dagstuhl – Leibniz-Zentrum für Informatik, 2022. doi:10.4230/LIPICS.APPROX/RANDOM.2022.19.
[11] Dima Grigoriev, Edward A. Hirsch, and Dmitrii V. Pasechnik. Complexity of semi-algebraic proofs. In STACS 2002, 19th Annual Symposium on Theoretical Aspects of Computer Science, Proceedings, volume 2285 of Lecture Notes in Computer Science, pages 419–430. Springer, 2002. doi:10.1007/3-540-45841-7_34.
[12] Armin Haken. The intractability of resolution. Theor. Comput. Sci., 39:297–308, 1985. doi:10.1016/0304-3975(85)90144-6.
[13] Armin Haken and Stephen A. Cook. An exponential lower bound for the size of monotone real circuits. J. Comput. Syst. Sci., 58(2):326–335, 1999. doi:10.1006/JCSS.1998.1617.
[14] Johan Håstad. On small-depth Frege proofs for PHP. In 64th IEEE Annual Symposium on Foundations of Computer Science, FOCS 2023, pages 37–49. IEEE, November 2023. doi:10.1109/FOCS57990.2023.00010.
[15] Pavel Hrubes and Pavel Pudlák. A note on monotone real circuits. Electron. Colloquium Comput. Complex., TR17-048, 2017. URL: https://eccc.weizmann.ac.il/report/2017/048.
[16] Pavel Hrubes and Pavel Pudlák. Random formulas, monotone circuits, and interpolation. Electron. Colloquium Comput. Complex., TR17-042, 2017. URL: https://eccc.weizmann.ac.il/report/2017/042.
[17] Russell Impagliazzo, Toniann Pitassi, and Alasdair Urquhart. Upper and lower bounds for tree-like cutting planes proofs. In Proceedings of the Ninth Annual Symposium on Logic in Computer Science (LICS ’94), pages 220–228, Paris, France, 1994. IEEE Computer Society. doi:10.1109/LICS.1994.316069.
[18] Dmitry Itsykson and Artur Riazanov. Proof complexity of natural formulas via communication arguments. In Proceeedings of the 36th Computational Complexity Conference, CCC 2021, volume 200 of LIPIcs, pages 3:1–3:34. Schloss Dagstuhl – Leibniz-Zentrum für Informatik, 2021. doi:10.4230/LIPICS.CCC.2021.3.
[19] Stasys Jukna. Boolean Function Complexity - Advances and Frontiers, volume 27 of Algorithms and combinatorics. Springer, 2012. doi:10.1007/978-3-642-24508-4.
[20] Jan Krajícek, Pavel Pudlák, and Alan R. Woods. An exponential lower bound to the size of bounded depth Frege proofs of the pigeonhole principle. Random Struct. Algorithms, 7(1):15–40, 1995. doi:10.1002/RSA.3240070103.
[21] Saburo Muroga, Iwao Toda, and Satoru Takasu. Theory of majority decision elements. Journal of the Franklin Institute, 271(5):376–418, 1961.
[22] Noam Nisan. The communication complexity of threshold gates. Combinatorics, Paul Erdos is Eighty, 1(301-315):6, 1993.
[23] Toniann Pitassi, Paul Beame, and Russell Impagliazzo. Exponential lower bounds for the pigeonhole principle. Comput. Complex., 3:97–140, 1993. doi:10.1007/BF01200117.
[24] Pavel Pudlák. Lower bounds for resolution and cutting plane proofs and monotone computations. J. Symb. Log., 62(3):981–998, 1997. doi:10.2307/2275583.
[25] Anup Rao and Amir Yehudayoff. Communication Complexity: and Applications. Cambridge University Press, 2020. doi:10.1017/9781108671644.
[26] Ran Raz and Avi Wigderson. Monotone circuits for matching require linear depth. J. ACM, 39(3):736–744, 1992. doi:10.1145/146637.146684.
[27] Hanif D. Sherali and Warren P. Adams. A hierarchy of relaxations between the continuous and convex hull representations for zero-one programming problems. SIAM J. Discret. Math., 3(3):411–430, 1990. doi:10.1137/0403036.
[28] Dmitry Sokolov. Dag-like communication and its applications. In Computer Science - Theory and Applications - 12th International Computer Science Symposium in Russia, CSR 2017, Proceedings, volume 10304 of Lecture Notes in Computer Science, pages 294–307. Springer, 2017. doi:10.1007/978-3-319-58747-9_26.
[29] Dmitry Sokolov. Random (log n)-CNF are hard for cutting planes (again). In Proceedings of the 56th Annual ACM Symposium on Theory of Computing, STOC 2024, Vancouver, BC, Canada, June 24-28, 2024, pages 2008–2015. ACM, 2024. doi:10.1145/3618260.3649636.
[30] Emanuele Viola. The communication complexity of addition. Combinatorica, 35(6):703–747, 2015. doi:10.1007/S00493-014-3078-3.
[31] Shuo Wang, Guangxu Yang, and Jiapeng Zhang. Communication complexity of set-intersection problems and its applications. Electron. Colloquium Comput. Complex., TR23-164, 2023. URL: https://eccc.weizmann.ac.il/report/2023/164.
[32] Guangxu Yang and Jiapeng Zhang. Communication lower bounds for collision problems via density increment arguments. In Proceedings of the 56th Annual ACM Symposium on Theory of Computing, STOC 2024, pages 630–639. ACM, June 2024. doi:10.1145/3618260.3649607.

[bib.bib1] [1] Albert Atserias, Moritz Müller, and Sergi Oliva. Lower bounds for DNF-refutations of a relativized weak pigeonhole principle. J. Symb. Log., 80(2):450–476, 2015. doi:10.1017/JSL.2014.56.

[bib.bib2] [2] Paul Beame, Toniann Pitassi, and Nathan Segerlind. Lower bounds for Lovász–Schrijver systems and beyond follow from multiparty communication complexity. SIAM J. Comput., 37(3):845–869, 2007. doi:10.1137/060654645.

[bib.bib3] [3] Paul Beame and Michael Whitmeyer. Multiparty communication complexity of collision-finding and cutting planes proofs of concise pigeonhole principles. Electron. Colloquium Comput. Complex., TR25-057, 2025. URL: https://eccc.weizmann.ac.il/report/2025/057.

[bib.bib4] [4] Maria Luisa Bonet, Toniann Pitassi, and Ran Raz. Lower bounds for cutting planes proofs with small coefficients. In Proceedings of the Twenty-Seventh Annual ACM Symposium on Theory of Computing (STOC 1995), pages 575–584, Las Vegas, Nevada, 1995. ACM. doi:10.1145/225058.225275.

[bib.bib5] [5] Mark Braverman and Rotem Oshman. The communication complexity of number-in-hand set disjointness with no promise. Electron. Colloquium Comput. Complex., TR15-002, 2015. URL: https://eccc.weizmann.ac.il/report/2015/002.

[bib.bib6] [6] Samuel R. Buss and György Turán. Resolution proofs of generalized pigeonhole principles. Theor. Comput. Sci., 62(3):311–317, 1988. doi:10.1016/0304-3975(88)90072-2.

[bib.bib7] [7] Arkadev Chattopadhyay and Toniann Pitassi. The story of set disjointness. SIGACT News, 41(3):59–85, 2010. doi:10.1145/1855118.1855133.

[bib.bib8] [8] Stefan S. Dantchev, Nicola Galesi, Abdul Ghani, and Barnaby Martin. Proof complexity and the binary encoding of combinatorial principles. SIAM J. Comput., 53(3):764–802, 2024. doi:10.1137/20M134784X.

[bib.bib9] [9] Stephan Gocht, Jakob Nordström, and Amir Yehudayoff. On division versus saturation in pseudo-boolean solving. In Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, IJCAI 2019, pages 1711–1718. ijcai.org, August 2019. doi:10.24963/IJCAI.2019/237.

[bib.bib10] [10] Mika Göös and Siddhartha Jain. Communication complexity of collision. In Approximation, Randomization, and Combinatorial Optimization. Algorithms and Techniques, APPROX/RANDOM 2022, volume 245 of LIPIcs, pages 19:1–19:9. Schloss Dagstuhl – Leibniz-Zentrum für Informatik, 2022. doi:10.4230/LIPICS.APPROX/RANDOM.2022.19.

[bib.bib11] [11] Dima Grigoriev, Edward A. Hirsch, and Dmitrii V. Pasechnik. Complexity of semi-algebraic proofs. In STACS 2002, 19th Annual Symposium on Theoretical Aspects of Computer Science, Proceedings, volume 2285 of Lecture Notes in Computer Science, pages 419–430. Springer, 2002. doi:10.1007/3-540-45841-7_34.

[bib.bib12] [12] Armin Haken. The intractability of resolution. Theor. Comput. Sci., 39:297–308, 1985. doi:10.1016/0304-3975(85)90144-6.

[bib.bib13] [13] Armin Haken and Stephen A. Cook. An exponential lower bound for the size of monotone real circuits. J. Comput. Syst. Sci., 58(2):326–335, 1999. doi:10.1006/JCSS.1998.1617.

[bib.bib14] [14] Johan Håstad. On small-depth Frege proofs for PHP. In 64th IEEE Annual Symposium on Foundations of Computer Science, FOCS 2023, pages 37–49. IEEE, November 2023. doi:10.1109/FOCS57990.2023.00010.

[bib.bib15] [15] Pavel Hrubes and Pavel Pudlák. A note on monotone real circuits. Electron. Colloquium Comput. Complex., TR17-048, 2017. URL: https://eccc.weizmann.ac.il/report/2017/048.

[bib.bib16] [16] Pavel Hrubes and Pavel Pudlák. Random formulas, monotone circuits, and interpolation. Electron. Colloquium Comput. Complex., TR17-042, 2017. URL: https://eccc.weizmann.ac.il/report/2017/042.

[bib.bib17] [17] Russell Impagliazzo, Toniann Pitassi, and Alasdair Urquhart. Upper and lower bounds for tree-like cutting planes proofs. In Proceedings of the Ninth Annual Symposium on Logic in Computer Science (LICS ’94), pages 220–228, Paris, France, 1994. IEEE Computer Society. doi:10.1109/LICS.1994.316069.

[bib.bib18] [18] Dmitry Itsykson and Artur Riazanov. Proof complexity of natural formulas via communication arguments. In Proceeedings of the 36th Computational Complexity Conference, CCC 2021, volume 200 of LIPIcs, pages 3:1–3:34. Schloss Dagstuhl – Leibniz-Zentrum für Informatik, 2021. doi:10.4230/LIPICS.CCC.2021.3.

[bib.bib19] [19] Stasys Jukna. Boolean Function Complexity - Advances and Frontiers, volume 27 of Algorithms and combinatorics. Springer, 2012. doi:10.1007/978-3-642-24508-4.

[bib.bib20] [20] Jan Krajícek, Pavel Pudlák, and Alan R. Woods. An exponential lower bound to the size of bounded depth Frege proofs of the pigeonhole principle. Random Struct. Algorithms, 7(1):15–40, 1995. doi:10.1002/RSA.3240070103.

[bib.bib21] [21] Saburo Muroga, Iwao Toda, and Satoru Takasu. Theory of majority decision elements. Journal of the Franklin Institute, 271(5):376–418, 1961.

[bib.bib22] [22] Noam Nisan. The communication complexity of threshold gates. Combinatorics, Paul Erdos is Eighty, 1(301-315):6, 1993.

[bib.bib23] [23] Toniann Pitassi, Paul Beame, and Russell Impagliazzo. Exponential lower bounds for the pigeonhole principle. Comput. Complex., 3:97–140, 1993. doi:10.1007/BF01200117.

[bib.bib24] [24] Pavel Pudlák. Lower bounds for resolution and cutting plane proofs and monotone computations. J. Symb. Log., 62(3):981–998, 1997. doi:10.2307/2275583.

[bib.bib25] [25] Anup Rao and Amir Yehudayoff. Communication Complexity: and Applications. Cambridge University Press, 2020. doi:10.1017/9781108671644.

[bib.bib26] [26] Ran Raz and Avi Wigderson. Monotone circuits for matching require linear depth. J. ACM, 39(3):736–744, 1992. doi:10.1145/146637.146684.

[bib.bib27] [27] Hanif D. Sherali and Warren P. Adams. A hierarchy of relaxations between the continuous and convex hull representations for zero-one programming problems. SIAM J. Discret. Math., 3(3):411–430, 1990. doi:10.1137/0403036.

[bib.bib28] [28] Dmitry Sokolov. Dag-like communication and its applications. In Computer Science - Theory and Applications - 12th International Computer Science Symposium in Russia, CSR 2017, Proceedings, volume 10304 of Lecture Notes in Computer Science, pages 294–307. Springer, 2017. doi:10.1007/978-3-319-58747-9_26.

[bib.bib29] [29] Dmitry Sokolov. Random (log n)-CNF are hard for cutting planes (again). In Proceedings of the 56th Annual ACM Symposium on Theory of Computing, STOC 2024, Vancouver, BC, Canada, June 24-28, 2024, pages 2008–2015. ACM, 2024. doi:10.1145/3618260.3649636.

[bib.bib30] [30] Emanuele Viola. The communication complexity of addition. Combinatorica, 35(6):703–747, 2015. doi:10.1007/S00493-014-3078-3.

[bib.bib31] [31] Shuo Wang, Guangxu Yang, and Jiapeng Zhang. Communication complexity of set-intersection problems and its applications. Electron. Colloquium Comput. Complex., TR23-164, 2023. URL: https://eccc.weizmann.ac.il/report/2023/164.

[bib.bib32] [32] Guangxu Yang and Jiapeng Zhang. Communication lower bounds for collision problems via density increment arguments. In Proceedings of the 56th Annual ACM Symposium on Theory of Computing, STOC 2024, pages 630–639. ACM, June 2024. doi:10.1145/3618260.3649607.

	$\displaystyle\sum_{\begin{subarray}{c}t\in\mathcal{T}^{\prime}\\ 0pt_{\mathcal{T}^{\prime}}(t)=k\end{subarray}}$	$\displaystyle\|\{x\in X:\text{$x$ is consistent with $t$}\}\|$
		$\displaystyle=\sum_{t\in\mathsf{Leaves}(\mathcal{T}^{\prime\prime})}\|\{x\in X:% \text{$x$ is consistent with $t$}\}\|$
		$\displaystyle=\sum_{i=\sqrt{2k}}^{k}\ \sum_{\begin{subarray}{c}t\in\mathsf{% Leaves}(\mathcal{T}^{\prime\prime})\\ 0pt_{\mathcal{T}^{\prime\prime}}(t)=i\end{subarray}}\|\{x\in X:\text{$x$ is % consistent with $t$}\}\|$
		$\displaystyle\leq\sum_{i=\sqrt{2k}}^{k}(2k)^{i}(1/\sqrt{n})^{i}\qquad\textrm{% since $\mathcal{T}$ and hence $\mathcal{T}^{\prime\prime}$ has out-degree at % most $2k$}$
		$\displaystyle\leq\|X\|\cdot\sum_{i=\sqrt{2k}}^{k}(1/2)^{i}\qquad\textrm{since $k% =\sqrt{n}/4$}$
		$\displaystyle\leq 2^{-\sqrt{2k}+1}\|X\|.$

Multiparty Communication Complexity of Collision-Finding and Cutting Planes Proofs of Concise Pigeonhole Principles

Abstract

Keywords and phrases:

Category:

Copyright and License:

2012 ACM Subject Classification:

Related Version:

Funding:

DOI:

Event:

Editors:

Series and Publisher:

1 Introduction

1.1 Tree-like Proofs and Multiparty Communication

Theorem 1.1.

Theorem 1.2.

Theorem 1.3.

Corollary 1.4.

1.2 General Proofs and DAG-like Protocols

Theorem 1.5.

Proposition 1.6 ([28, 15]).

Theorem 1.7.

2 Preliminaries

2.1 Proof Complexity

Definition 2.1 (Threshold Decision Tree).

Proposition 2.2.

2.2 Communication Complexity

Theorem 2.3 ([5]).

Lemma 2.4.

Proposition 2.5 ([30]).

Corollary 2.6.

Proof of Lemma 2.4.

Lemma 2.7.

Proof.

2.3 Triangle-DAGs

3 Multiparty Communication Lower Bounds for 𝒎≤𝒏+𝒏

Lemma 3.1.

Proof of Theorem 1.1 using Lemma 3.1.

Claim 3.2.

Proof.

Claim 3.3.

Proof.

Algorithm.

Analysis.

3.1 Proof of Lemma 3.1

Proof.

Case 1.

Case 2.

4 Tree-like Proof Complexity Lower Bounds

Theorem 4.1.

Proof.

Corollary 4.2.

Proof.

5 Triangle-DAG Lower Bounds

Proposition 5.1.

Proposition 5.2.

Proof.

Definition 5.3.

Claim 5.4.

Proof.

Lemma 5.5.

Proof.

Lemma 5.6.

Proof.

▶ Remark 5.7.

Claim 5.8.

Proof.

Proof of Theorem 1.5.

▶ Remark 5.9.

6 Multiparty Communication Lower Bounds for all 𝒎>𝒏

Proposition 6.1.

Definition 6.2.

Definition 6.3 (Structured rectangle).

Lemma 6.4.

7 Discussion

References

3 Multiparty Communication Lower Bounds for $m\leq n+\sqrt{n}$

$\blacktriangleright$ Remark 5.7.

$\blacktriangleright$ Remark 5.9.

6 Multiparty Communication Lower Bounds for all $m>n$