Abstract 1 Introduction 2 Preliminaries 3 Lower bounds on 𝗛𝗮𝗺𝗖𝘆𝗰𝗹𝗲(𝒏) 4 Randomized lower bound for PC(𝒏,𝟓) 5 Randomized lower bound for bPC(𝒏,𝟕) References Appendix A Information Theory Preliminaries Appendix B Statements and proofs of lemmas used Appendix C Missing proofs from Section 3 Appendix D Missing proofs from Section 4 Appendix E Missing proofs from Section 5 Appendix F Upper bounds

Optimal Two-Round Communication Lower Bound for Graph Connectivity via Pointer Chasing

Jaikumar Radhakrishnan ORCID ICTS-TIFR, Bengaluru, India Chaitanya Reddy ORCID Department of Computer Science and Engineering, IIT Hyderabad, India Rakesh Venkat ORCID Department of Computer Science and Engineering, IIT Hyderabad, India
Abstract

We consider the communication complexity of the graph connectivity problem, where the edges of an n-vertex undirected graph G are distributed between two parties Alice and Bob, who are then required to communicate to determine if G is connected. We show that in any randomized protocol with two-rounds of communication, Alice and Bob must exchange Ω(nlogn) bits; such a lower bound for one-round protocols was shown by Sun and Woodruff (APPROX/RANDOM 2015). A one-round deterministic protocol, where Alice sends O(nlogn) bits and Bob determines the answer, was observed by Hajnal, Maass and Turan (STOC 1988); they also showed a matching lower bound of Ω(nlogn) bits for deterministic protocols with unbounded rounds of communication. For randomized protocols, a reduction from the set disjointness problem due to Babai, Frankl and Simon (FOCS 1986) implies a randomized lower bound of Ω(n) even with unbounded rounds of communication. Whether this lower bound can be improved to Ω(nlogn) has been an outstanding open question, whose algorithmic implications were recently emphasized by Apers, Efron, Gawrychowski, Lee, Mukopadhyay and Nanongkai (FOCS 2022). Our lower bound for randomized two-round protocols is based on a reduction from a restricted version of the two-player pointer chasing problem originally studied by Papadimitriou and Sipser (JCSS 1984). Using this reduction, we show an ω(n) lower bounds on graph connectivity for any constant number of rounds by extending deterministic lower bounds shown by Ponzio, Radhakrishnan and Venkatesh (JCSS 2001) to the randomized setting.

Keywords and phrases:
Communication complexity
Funding:
Jaikumar Radhakrishnan: Department of Atomic Energy, Government of India, under project number RTI4001.
Copyright and License:
[Uncaptioned image] © Jaikumar Radhakrishnan, Chaitanya Reddy, and Rakesh Venkat; licensed under Creative Commons License CC-BY 4.0
2012 ACM Subject Classification:
Theory of computation Communication complexity
Editor:
Shubhangi Saraf

1 Introduction

Graph problems have been extensively studied in various computational settings – algorithms, decision trees, streaming algorithms, distributed algorithms and communication protocols. In this paper, we focus on the two-party communication setting: the edges of the n-vertex undirected graph G=(V,E) are distributed between two parties Alice and Bob (some of the edges may be given to both), who then need to exchange messages in order to compute some function of G; the goal is to minimize the number of bits exchanged. Early works [4, 7] investigated properties such as connectivity, st-connectivity, bipartiteness, proving both lower and upper bounds on the amount of communication needed in this model. Other properties such as triangle-freeness, existence of Eulerian tours, k-edge connectivity, and cycle freeness among others have been studied and new lower bounds have been obtained in subsequent works [8, 19]. More recently, graph problems have been extensively studied for trade-offs between space requirements and the number of passes for streaming algorithms for estimating quantities like matchings, cuts, etc., in various settings (see [2] and the references therein). Lower bounds in these settings are also typically shown by considering the communication complexity of some associated graph problem.

A fundamental problem in this domain is the connectivity problem 𝖢𝗈𝗇𝗇n: Alice and Bob need to determine if G is connected. The randomized complexity of this problem was studied by Babai, Frankl and Simon [4], who showed a reduction from the set disjointness problem; when combined with the Ω(n) lower bound for the set disjointness problem [18, 9], this implies an Ω(n) lower bound for the randomized communication complexity of 𝖢𝗈𝗇𝗇n. There exists an O(nlogn)-bit 1-round deterministic communication protocol: Alice sends a spanning forest of her edges to Bob, who then determines the answer. Hajnal, Maas and Turan [7] showed a deterministic lower bound of Ω(nlogn) for connectivity111In fact, they show that the bound holds in a stronger arbitrary partitioning model of the edges.. For randomized protocols with an unbounded number of rounds of communication, closing this gap between the upper and lower bounds has remained a major open question since the work of [4], and has been mentioned [1, 8] as one of the outstanding open questions in communication complexity. An optimal lower bound on 𝖢𝗈𝗇𝗇n also implies the optimality of some query algorithms [1]. A randomized lower bound of Ω(nlogn) was shown for one-round protocols by [19]; however, no lower bound better than Ω(n) was known for randomized protocols with more than one round of communication222Following convention, we only require one of the players to know the answer for 𝖢𝗈𝗇𝗇n at the end of the protocol..

We show an optimal randomized lower bound of Ω(nlogn) on 𝖢𝗈𝗇𝗇n against protocols with two rounds of communication, by relating it to the pointer chasing problem introduced by Papdimitrou and Sipser [14] to study tradeoffs between the number of rounds of communication, and number of bits communicated. In the two-party pointer chasing problem333Our definition differs from the standard definition, for we insist that the pointers given to Alice and Bob correspond to bijections – our reduction will make crucial use of this property. Note, however, that our lower bounds with this restriction on the inputs imply similar lower bounds for the problem without any restriction., the edges of a directed bipartite graph G=(𝒱A𝒱B,E) (where |𝒱A|,|𝒱B|=n) are distributed between Alice and Bob in the following way: Alice receives a one-to-one function fA:𝒱A𝒱B and Bob receives a one-to-one function fB:𝒱B𝒱A, which we view as (directed) matchings EA and EB respectively. A fixed starting vertex v0𝒱A and a parameter k are known to both players. We consider two versions of the problem: the full-version and the bit-version. In the full-version of the problem, PC(n,k), the goal is to find the identity of the unique vertex vk found by following the k-length path v0v1:=fA(v0)v2:=fB(v1)vk:=f(vk1), where f=fA, if k is odd, and f=fB, if k is even. In the bit-version of the problem, bPC(n,k), Alice and Bob need to determine just the last bit or the parity of vk. At the end, both players must know the answer.444The extra round that is needed for both players to know the answer in pointer chasing is often crucial.

There is a simple deterministic k-round protocol that exchanges O(klogn) bits to find vk, where Alice starts by sending v1, and in the i-th round the appropriate player sends vi since they already know vi1. A major focus of previous work has been to show a separation between the amount of communication needed in the above case versus if only k1 rounds of communication are allowed, or k rounds are allowed but Bob (the wrong player, since he does not know fA) starts the communication. With this restriction, for a general k, a sequence of works [13, 20, 11] have resulted in a Ω(n/k) lower bound against randomized protocols. For constant k, Ponzio, Radhakrishnan and Venkatesh [15] show a Ω(nlog(k1)n) lower bound for the full version. As for upper bounds, Nisan and Wigderson [13] gave a randomized protocol with communication O((k+n/k)logn) , and Damm, Jukna and Sgall [5] design a deterministic protocol with communication O(nlog(k1)n), for constant k.

Our reduction from to 𝖢𝗈𝗇𝗇n in fact passes through another related problem. In the Hamiltonian cycle problem, 𝖧𝖺𝗆𝖢𝗒𝖼𝗅𝖾(n), the inputs to Alice and Bob are the same as in PC(n,k), that is, matchings represented as one-to-one functions fA and fB. Alice and Bob need to determine if the union of fA and fB is a Hamiltonian Cycle, or it consists of more than one cycle. This particular problem was studied earlier by Raz and Spieker [17], who showed a lower bound of Ω(nloglogn) for non-deterministic protocols even with unbounded rounds of communication. It is easy to see that a protocol for 𝖢𝗈𝗇𝗇2n immediately yields a protocol with the same complexity for 𝖧𝖺𝗆𝖢𝗒𝖼𝗅𝖾(n).

1.1 Our Results

To state our results, we will need the following notation; detailed problem definitions and notation are presented in Section 2. Let CδB,r(f) be the δ-error, r-round randomized communication complexity of the function f when Bob sends the first message; let CδA,r(f) be the corresponding complexity when Alice sends the first message. We write Cδr(f), omitting the superscript, to denote the complexity when either party is allowed to start. log(i)n denotes the i-th iterated logarithm of n.

Our lower bound for Cδ2(𝖢𝗈𝗇𝗇n) is derived as follows. We show that the pointer chasing problem reduces to the Hamiltonian cycle problem; more precisely, we show (see Theorem 7 in Section 3) that for all δ[0,1), we have

Cδ2(𝖧𝖺𝗆𝖢𝗒𝖼𝗅𝖾(n))CδB,3(bPC(16n+2,7)).

We then show that

CδB,3(bPC(n,7))=Ω(nlogn). (Theorem 13 in Section 5.1)

Theorem 13 is the main technical contribution of this work. Combining these with our earlier observation that 𝖧𝖺𝗆𝖢𝗒𝖼𝗅𝖾(n) reduces to 𝖢𝗈𝗇𝗇2n, we obtain the following.

Theorem 1 (Main Result).

For all δ[0,1/2), we have

Cδ2(𝖢𝗈𝗇𝗇n),Cδ2(𝖧𝖺𝗆𝖢𝗒𝖼𝗅𝖾(n))=Ω(nlogn).

Typically, pointer chasing has been considered in the setting the number of rounds of communication is r=k1; we know of only a few prior works [15, 3] that have considered r<k1. However, the techniques used in these do not seem to extend to yield an optimal Ω(nlogn) lower bound.

Furthermore, our reduction can be used to obtain superlinear lower bounds for 𝖢𝗈𝗇𝗇n for randomized protocols with more than two rounds of communication. By extending the result [15, Theorem 6] to the randomized setting, we show that for all δ[0,1/2), constant k and rk/21

CδB,r(bPC(n,k))=Ω(nlog(k1)n);

our reduction then implies that for all constant r,

Cδr(𝖢𝗈𝗇𝗇n)=Ω(nrlog(2r+1)nr).

The proof details of the above statement are deferred to the extended version of the paper.

For our main result we used the lower bound for the bit version of the pointer chasing problem, where Alice and Bob need to determine the last bit for v7 with just three rounds of communication. We show that for the full version determining v5 is already equally hard.

Theorem 2.

For all constants δ[0,1/2), CδB,3(PC(n,5))=Ω(nlogn).

Complementing the above, we show that there are deterministic protocols that can chase a larger pointer depth with few rounds, by extending the protocols in [5, 15].

Theorem 3.

Let kk+53. Then CB,k(PC(n,k))=O(nkloglogn). For the bit-version: kk+64. Then CB,k(bPC(n,k))=O(nkloglogn).

1.2 Organization of the paper

In Section 3, we describe the reduction from PC(n,k) to 𝖢𝗈𝗇𝗇2n (and in particular, the 𝖧𝖺𝗆𝖢𝗒𝖼𝗅𝖾(n) problem).

Section 4 proves Theorem 2, a strong lower bound for the full-version of the pointer chasing problem. Section 5 builds upon the result in Section 4 to prove the main technical result: a strong lower bound for the bit-version Theorem 13. Proofs are detailed in Appendices D and E. Appendix F proves the upper bounds in Theorem 3.

Each technical result is preceded by a proof outline that emphasizes key steps and conceptual insights.

2 Preliminaries

2.1 Problem Definitions

Definition 4 (Pointer Chasing, full-version).

PC(n,k) is the 2-player pointer chasing problem defined as follows: The input instance is a (k+1)-layered, directed graph G. The vertex set is 𝒱0,𝒱1,,𝒱k, with |𝒱i|=n for all i{0,1,2,,k}. The edges of G are bijections from 𝒱i to 𝒱i+1 for each i{0,1,2,,k1}. The edges are distributed between Alice and Bob as follows:

Alice gets bijections FA,i:𝒱2i2𝒱2i1 Bob gets bijections FB,i:𝒱2i1𝒱2i

There is a fixed vertex v0𝒱0 known to both, and they need to find the unique vertex vk𝒱k at distance k from v0.

Definition 5 (Pointer Chasing, bit-version).

bPC(n,k) is the problem of finding the least significant bit of vk (as opposed to vk in PC(n,k)). We call the least significant bit of vk the parity of vk (having identified the vertex set with [n]). This is also called the bit-version of the pointer-chasing problem.

For pointer chasing problems, we require that any valid protocol has both players decide on the same answer and output it after the protocol concludes.

We note that Definition 4 differs slightly from the description given in the introduction: in our formal definition above, we allow a different function (bijection) at every step. It is not difficult to see that the two formulations are equivalent up to a factor-k blowup in the vertex-set size. In particular, any instance satisfying Definition 4 is a special case of the introduction’s definition with |𝒱A|=|𝒱B|=n(k+1)/2, obtained by taking 𝒱A=i:imod2=0𝒱i and 𝒱B=i:imod2=1𝒱i.

Definition 6.

The 𝖧𝖺𝗆𝖢𝗒𝖼𝗅𝖾(n) problem is defined as follows. Let VA and VB be two disjoint sets of n vertices each. Alice receives a bijection map σA:VAVB, and Bob receives a bijection σB:VBVA. The goal is to decide whether the composition σAσB (equivalently, union of the two perfect matchings) forms a Hamiltonian cycle covering all 2n vertices.

For the problems 𝖢𝗈𝗇𝗇n, 𝖧𝖺𝗆𝖢𝗒𝖼𝗅𝖾(n) under consideration, following convention, we require only that one of the players knows the answer after r rounds.

2.2 Notation

We use [n] to denote the set {1,2,,n}. Random variables will typically be denoted by capital letters: e.g. V5 denotes a random variable, and V5=v5 will be its instantiation to the value v5. Most random variables we deal with will be over a finite, discrete domain. The support of a random variable X is denoted by supp(X).
All logarithms are to the base 2, and log(i)n denotes the i-th iterated logarithm of n.
Our proofs make significant use of information theoretic concepts. We describe some preliminaries in Section A.
We assume throughout the proofs that when used, ε,δ are sufficiently small constants.

2.3 Communication Complexity preliminaries

We assume some familiarity with basic notions of two-party communication complexity, and refer the reader to standard books [10, 16] for the basics. Let f:𝒳×𝒴Z be a function.

Rectangles, Subrectangles.

A combinatorial rectangle is a subset of inputs of the form A×B, where A𝒳 and B𝒴. A deterministic protocol Π=(M1,M2,,Mk) splits the input space into a disjoint partition of rectangles, each rectangle corresponding to a certain transcript (M1=m1,M2=m2,,Mk=mk) of the protocol.

We call a rectangle R a subrectangle of a rectangle R if RR. R is a subrectangle of the protocol Π, if it is a subrectangle of some rectangle in the partition induced by the protocol Π. If R=A×B is a rectangle, then a subrectangle of the form A×B for AA is called a horizontal slice of R. Similarly, A×B, for BB is called a vertical slice.

Let R=A×B be a rectangle. Let Z be a random variable that depends on the inputs. We denote Z|R to be the random variable having distribution of Z conditioned on inputs restricted to the rectangle R.

3 Lower bounds on 𝗛𝗮𝗺𝗖𝘆𝗰𝗹𝗲(𝒏)

Theorem 7.

Let n be a multiple of 4. The bPC(n,k) problem reduces to the 𝖧𝖺𝗆𝖢𝗒𝖼𝗅𝖾(N) problem with N=2+2n(k+1) in the two-party communication setting.

Proof.

For notational simplicity, we describe the reduction with only two functions σA:[n][n], and σB:[n][n] in the pointer chasing instance. It will be clear that the reduction can be modified to accommodate the case of different bijections at every level. We specify the reduction for even k, a similar one works for odd k with minor changes.

Let G be the graph for the pointer chasing instance, with permutations σA:VAVB and σB:VBVA as edges, and a fixed starting vertex v0VA. We identify VA and VB with [n]. For notational reasons, we assume that v0 is the vertex 2 in VA. Let vk be the pointer at depth k. We show that Alice and Bob can produce a graph G without communication, such that G is a single cycle if and only if vkmod2=1.

Though the graph G described will be a directed graph, the underlying undirected graph of G (undirected version of G) will be a Hamiltonian cycle iff vkmod2=1

We first describe a reduction that produces a graph G~ satisfying the following: if vkmod2=1, then G~ is the union of two cycles. If vkmod2=0, then G~ is the union of four cycles. A slight modification to the edges in this instance will produce an appropriate YES/NO instance of 𝖧𝖺𝗆𝖢𝗒𝖼𝗅𝖾(N), concluding the reduction.

Vertex set for 𝑮~.

We define symmetric vertex sets VA and VB forming the bipartite graph G~=(VAVB,E~):

VA ={A0α,A0β}{At,α,At,β:t[k+1],[n]},
VB ={B0α,B0β}{Bt,α,Bt,β:t[k+1],[n]}

We have (k+1) blocks of vertices. Each vertex v in the original G has two copies in each block: an α-copy and a β-copy (denoted by superscripts). There are 4 additional special vertices labeled with block number 0 for technical reasons. At,α denotes the α-copy of the vertex VA in the t-th block.

We note again that v0 was the vertex labeled 2 in VA in G.

Hence, |VA|=|VB|=2+2n(k+1), and the total number of vertices in G~ (and G) is |VA|+|VB|=4+4n(k+1).

Alice’s Edges (based on 𝝈𝑨).
  • Fixed edges: A0αB0α,A0βB0β

  • Permutation (t[1,k+1],[n]):

    At,α{Bt,σA()α,if tk2,Bt,σA1()α,if t>k2+1,At,β{Bt,σA()β,if tk2,Bt,σA1()β,if t>k2+1
  • Critical (t=k2+1):     At,α{Bt,α,if mod2=0,Bt,β,if mod2=1,At,β{Bt,β,if mod2=0,Bt,α,if mod2=1

Bob’s Edges (based on 𝝈𝑩).
  • Fixed: B0αA1,v0α,B0βA1,v0β

  • Permutation (t[1,k],[n]):

    Bt,α{At+1,σB()α,if tk2,At+1,σB1()α,if t>k2,Bt,β{At+1,σB()β,if tk2,At+1,σB1()β,if t>k2
  • Block k+1 to Alice’s first block:  (v02)

    Bk+1,α{A1,+1α,if {v01,v0,n},A1,v0+1α,if =v01,A0α,if =v0,A1,1α,if =n,Bk+1,β{A1,+1β,if {v01,v0,n},A1,v0+1β,if =v01,A0β,if =v0,A1,1β,if =n
Claim 8.

If vkmod2=0, G~ is the union of 4 cycles.
If vkmod2=1, G~ is the union of 2 cycles.

Proof of Claim.

We defer the analysis of the cases to Appendix C.

Figure 1: An illustration of typical paths for k=2. The solid path is from v0, and the dashed path from an arbitrary v0. In the above case, both α and β copies of v0 will lie in the same cycle.
Modification.

The following slight modification that replaces two constant edges in G~ gives us the desired G:

(Old edges):

Bk+1,nαA1,1αandB0αA1,v0α, we now map

(New edges):

Bk+1,nαA1,v0αandB0αA1,1α.

These edges are present in different cycles in the Yes instances of G~. Splicing merges the two to a single Hamiltonian cycle (and keeps the NO instances disconnected, since at most two edges are modified).

Assuming our main technical result (Theorem 13), the proof of Theorem 1 follows immediately from the above reduction. Note that since 𝖧𝖺𝗆𝖢𝗒𝖼𝗅𝖾(N) is symmetric between Alice and Bob, we can assume without loss of generality that Bob initiates the communication in any protocol solving it.

4 Randomized lower bound for PC(𝒏,𝟓)

Theorem 9 (Theorem 2 restated).

For any constant δ[0,1/2) :

Cδ(B,3)(PC(n,5))=Ω(nlogn)
Proof outline.

For context, we first describe how pointer chasing lower bounds are typically shown via round elimination [13, 20, 15]. For odd k, consider a k-round protocol Π=(M1,,Mk) with low distributional error, where Bob sends M1. Fixing typical values of M1=m1 and V1=v1, one shows that FB(v1) remains highly variable if |Π| is small. Iterating this k1-times, alternating roles between Alice and Bob each time, we can fix M1,,Mk1 and V1,,Vk1 while preserving variability in FA(vk1). Yet Bob must now infer vk with high confidence, contradicting the protocol’s correctness.

The strategy of fixing vi (i.e., looking at a subrectangle with fixed vi) after round i that helps isolate entropy in Vi+1 fails when Ω(nlogn) bits of communication are allowed. For our lower bound for PC(n,5), we avoid fixing v2, instead allowing it to vary and analyzing a good assignment σ:𝒜2𝒱B, where 𝒜2supp(V2). Under high-entropy FBm1, we show that V4 remains variable over Ω(n) candidates, each with probability Ω(1/n). Since FAm2 also retains entropy (since |m2| is small), Bob cannot reliably infer the next pointer across these V4’s. Using an unfixed V2 is what allows us to infer that V4 is spread out. However, this requires a careful probabilistic analysis in choosing a good σ while accounting for correlations across layers. The proof relies on the permutation structure of the functions. We now present the proof.

Proof.

We show that no 3-round deterministic protocol Π=(M1,M2,M3), with Bob starting the communication and using at most εnlogn bits exchanged per message for a small constant ε>0, can correctly solve the problem on more than a (small) constant fraction of inputs drawn from the uniform distribution. The theorem then follows by an application of Yao’s lemma [10, Sec 3.4, Theorem 3.20], followed by standard error reduction techniques via repetition.

Recall notation from Section 2: Alice has functions fA,1 on 𝒱0𝒱1, fA,2 on 𝒱2𝒱3, and fA,3 on 𝒱4𝒱5. Bob has fB,1 and fB,2 for 𝒱1𝒱2 and 𝒱3𝒱4 respectively. We use A and B to denote the set of all possible inputs to Alice and Bob respectively.

Let Bob send M1, with |M1|=εBnlogn, and Alice send M2, with |M2|=εAnlogn. After M2, Bob should know the answer, which he conveys to Alice, using M3. We therefore consider just the first two messages, and show that Bob cannot know the correct answer for a constant fraction of the inputs.

Initially, with no messages passed, we have:

  • H[FA,1]=H[FA,2]=H[FA,3]=H[FB,1]=H[FB,2]=logn!=nlognO(n)

  • H[Vi]=logn, for all 1i5

We prove the theorem in a sequence of three claims, whose proofs we defer to Appendix D.

Claim 10 (Round 1).

With probability 1/2 over the choice of inputs, (M1,V1) satisfy the following:

  1. 1.

    H[FB,1(v1)|M1=m1,V1=v1](112εB)logn

  2. 2.

    H[FA,2]nlognO(n)

  3. 3.

    H[FB,2M1=m1](14εB)nlogn

Proof.

Deferred to Appendix D.

We note that fixing such a pair of (m1,v1) gives us a subrectangle of the protocol after m1. For any such subrectangle, we show the following:

Claim 11 (Main Claim).

For a subrectangle RA×B formed by fixing (m1,v1) satisfying the conditions in Claim 10, the following is true: There exists a set 𝒜4:={b1,b2,,br}𝒱4, for rn/6, and a constant c1 such that with probability 1nexp(c1n1O(εB))=1o(1) over choice of FA,2R:

i[r]:PrfBR[V4=bifA,2]120n (1)
Proof.

Deferred to Appendix D.

Let σFA,2 restricted to the domain supp(V2). As mentioned in the outline, the proof of Claim 11 goes via analyzing a random assignment σ:supp(V2)𝒱3 under FA,2. We proceed to analyze what happens in the subrectangle of a fixed σ that satisfies (1) from Claim 11, after Round 2 (i.e. after Alice sends M2). We note that once we fix a σ, it corresponds to a class of FA’s, and hence a subrectangle within the rectangle R in Claim 11.

Claim 12 (Error after Round 2).

Consider a subrectangle RRA×B caused by fixing a σ=σ satisfying (1) in Claim 11. In any such rectangle R, Bob makes an error in knowing the answer with probability at least 1/400, after receiving Alice’s message M2.

Proof.

Deferred to Appendix D.

We finally collate the errors across the subrectangles from Claims 10, 11 and 12. The overall error of the protocol is at least:

Pr(Error )12(1nexp(c1n1O(εB)))1400=Ω(1)

5 Randomized lower bound for bPC(𝒏,𝟕)

5.1 Proof of main result

Theorem 13.

For any constant δ[0,1/2),

Cδ(B,3)(bPC(n,7))=Ω(nlogn)
Proof Outline.

Before giving the proof, we give a rough outline of the steps involved.

We start with a sub-rectangle R of the protocol as described in Claim 12, where m1,v1,σ are fixed. We have Ω(n) possibilities for V4 at this point, called b1,,br. From Bob’s point of view, within the rectangle R, he exactly knows V4; suppose that V4=b1.

On receiving m2, Bob will know the shadow of b1: the set Δ(b1){fA,3(b1):fA,3R}, which typically has size n1εA. For the full-version, this was sufficient to prevent Bob from knowing V5, but it is not clear that the parity of V7 reachable from potential V4’s will vary within the shadow. Indeed, if the goal was to find parity of V5, Alice could explicitly send this information to Bob in O(n) bits with m2 (by sending parity(FA,3(i)) for every i[n]). The extra layers in the instance come into play here: when looking at the parity of V7, even if Alice sends the last bit (or more) of fA,(i) to Bob, due to the high entropy of intermediate layers, we show that the composed function parity(FA,4(fB,3(FA,3(bj))) is still unpredictable to Bob within the shadow Δ(bj).

Following [15], we view the parity of a vertex in the set 𝒱7 as a two-coloring χ7:𝒱7{0,1}. For a given set of inputs, fA,fB, this induces two-colorings on higher levels: χ6χ7fA,4, χ5χ6(fB,3). If the intermediate functions are viewed as random, the resulting coloring at each layer is also a random coloring.

In terms of two-colorings, it is sufficient to show that within R, Δ(b1)𝒱5 is two-colored by χ5 in a roughly balanced way while varying over the inputs consistent with the current subrectangle. We formally show this in the proof via a composition lemma. This is a major technical contribution of our work, and we elaborate on it in Appendix E.

Figure 2: Fixing (m1,v1,σ), Bob knows V4 exactly, say V4=bj. Δ(bj) will be two-colored by χ6FB,3|bj by most FB,3|bj (by composition lemma), preventing Bob from knowing the answer.
Proof.

We assume, as before, that all input functions come from the uniform distribution over respective domains and show that any protocol communicating εnlogn bits will err on a (small) constant fraction of the inputs (for small enough ε).

We follow the proof of Theorem 9 and arrive at a rectangle R as in Claim 11. This gives a set 𝒜4={b1,b2,,br}𝒱4, of size rn/6 for some fixed m1,v1,σ.

Compared to the full-version, we have two extra layers of functions: FB,3,FA,4. The fixing of m1,v1,σ does not affect FA,4, and the effect on FB,3 is described by H[FB,3R](14εB)nlogn from Claim 10.

Due to correlations induced by the message m1, the possibilities for function fB,3 between 𝒱5,𝒱6 are influenced by what V4 is. In particular, let FB,3m1,bjFB,3|R,V4=bj. We first show that many of the bj𝒜4 ensure a high entropy of FB,3m1,bj.

Claim 14.

There exists 𝒜4𝒜4 and constants c1>1,α1(0,1) satisfying:

  1. 1.

    bj𝒜4:H[FB,3|R,V4=bj](1c1εB)nlogn

  2. 2.

    |𝒜4|α1n

Proof.

Deferred to Appendix E.

We now invoke the following Composition Lemma, whose proof is given in the extended version of this paper.

Lemma 15 (Composition Lemma).

Let F:[n][n] be a (random) function satisfying H[F](1ε)nlogn. Let χ:[n]{0,1} be a random, balanced two-coloring of [n]. Then, J[n] with |J|(1ε)n such that for any distribution D on J , with H[D]k, with probability 11/n3 over choice of χ, the composed function XiχF(i) satisfies:

PrF(|iD(i)Xi12|10ε1/32+t)4ε1/32+1ε1/32exp(Ω(t22k))
Proof.

Deferred to the extended version of this paper.

Invoke the lemma for each bi𝒜4 in turn, setting FFB,3m1,bi (and ε=c1εB, where c1 is the constant from Claim 14), and random choice of χχ6:𝒱6{0,1}. Call a χ6 nice, if it ensures the result of Lemma 15 for every bi𝒜4. By a union bound, a random χ6 is nice with probability 11n.

Fix a nice χ6, this forms a subrectangle R′′ of R, being a restriction of FA,4. For each such nice χ6, we get sets J1,,Jr𝒱5, with |Ji|(1c1εB)n, for every i[r]. It now remains to show that a typical bi has a large shadow within Ji after m2. The following claim shows that this happens with good probability. The proof is deferred to Appendix E.

Claim 16.

Let γ:=c1εB. Let 𝒜4{b1,,br}𝒱4 with rα1n, as in ˜14. For each i[r], let Ji𝒱5 be sets with |Ji|(1γ)n. Define the events i:={FA,3(bi)Ji}, for each i[r].

Then, there exist a partitioning of R′′ into subrectangles {R(m2,z)′′}m2supp(M2),z{0,1}r (where m2 is Alice’s message and z is a function of FA,3 alone) and constant α2>0 such that with probability at least 1/8 over choice of M2=m2 and z, we have: at least α2n elements bi𝒜 satisfy:

H[FA,3(bi)|M2=m2,Z=z,i](14εA)logn,

Choose a (m2,z) to fix a sub-rectangle R~:=R(m2,z)′′R′′ (a horizontal slice in R′′) that satisfies the conclusion of ˜16. (χ6,m2,z) are fixed here. Let S{bi1,bi2,bi} be the indices in 𝒜4 arising out of Claim 16. We have Pr(S)α2n×1/20n=α2/20.

Let Dj be the distribution FA,3(bij)|R~; we have H[Dj](14εA)logn. Set δ to be εA1/32; by Lemma 17 (Appendix B), Dj is within δ in 1 distance of a distribution Dj such that H[Dj](14εA/δ)logn.
Fix a good bj𝒜4 (a vertical slice in R~).
Set t=10ε1/32, δ4εA/δ in Lemma 15. We conclude that χ6FB,3bj|R~ two-colors Dj with bias η20ε1/32 with probability (over FB,3bj|R~) of at least β(15ε1/32). (Recall that ε=c1εB.) This means it colors Dj with bias at most η2η+δ=η+εA1/32

Call an fB,3 good if it ensures a bias of at most η2 in R′′. On such fB,3, a deterministic 0 or 1 answer from Bob makes an error with probability at least 1/2η2.

The overall probability of error, is, therefore:

Pr[Error](12η2)×α220×18×(11n)×β=Ω(1)

This concludes the proof of Theorem 13.

References

  • [1] Simon Apers, Yuval Efron, Paweł Gawrychowski, Troy Lee, Sagnik Mukhopadhyay, and Danupon Nanongkai. Cut query algorithms with star contraction. In 2022 IEEE 63rd Annual Symposium on Foundations of Computer Science (FOCS), pages 507–518. IEEE, 2022.
  • [2] Sepehr Assadi. Recent advances in multi-pass graph streaming lower bounds. ACM SIGACT News, 54(3):48–75, 2023. doi:10.1145/3623800.3623808.
  • [3] Sepehr Assadi, Gillat Kol, Raghuvansh R Saxena, and Huacheng Yu. Multi-pass graph streaming lower bounds for cycle counting, max-cut, matching size, and other problems. In 2020 IEEE 61st Annual Symposium on Foundations of Computer Science (FOCS), pages 354–364. IEEE, 2020. doi:10.1109/FOCS46700.2020.00041.
  • [4] László Babai, Peter Frankl, and Janos Simon. Complexity classes in communication complexity theory. In 27th Annual Symposium on Foundations of Computer Science (sfcs 1986), pages 337–347. IEEE, 1986.
  • [5] Carsten Damm, Stasys Jukna, and Jiří Sgall. Some bounds on multiparty communication complexity of pointer jumping. Computational Complexity, 7(2):109–127, 1998. doi:10.1007/PL00001595.
  • [6] Devdatt P Dubhashi and Alessandro Panconesi. Concentration of measure for the analysis of randomized algorithms. Cambridge University Press, 2009.
  • [7] András Hajnal, Wolfgang Maass, and György Turán. On the communication complexity of graph properties. In Proceedings of the twentieth annual ACM symposium on Theory of computing, pages 186–191, 1988. doi:10.1145/62212.62228.
  • [8] Gábor Ivanyos, Hartmut Klauck, Troy Lee, Miklos Santha, and Ronald de Wolf. New bounds on the classical and quantum communication complexity of some graph properties. In 32nd International Conference on Foundations of Software Technology and Theoretical Computer Science, page 148, 2012.
  • [9] Bala Kalyanasundaram and Georg Schintger. The probabilistic communication complexity of set intersection. SIAM Journal on Discrete Mathematics, 5(4):545–557, 1992. doi:10.1137/0405044.
  • [10] Eyal Kushilevitz. Communication complexity. In Advances in Computers, volume 44, pages 331–360. Elsevier, 1997. doi:10.1016/S0065-2458(08)60342-3.
  • [11] Xinyu Mao, Guangxu Yang, and Jiapeng Zhang. Gadgetless lifting beats round elimination: Improved lower bounds for pointer chasing. In Raghu Meka, editor, 16th Innovations in Theoretical Computer Science Conference (ITCS 2025), volume 325 of Leibniz International Proceedings in Informatics (LIPIcs), pages 75:1–75:14, Dagstuhl, Germany, 2025. Schloss Dagstuhl – Leibniz-Zentrum für Informatik. doi:10.4230/LIPIcs.ITCS.2025.75.
  • [12] Colin McDiarmid. On the method of bounded differences. Surveys in combinatorics, 141(1):148–188, 1989.
  • [13] Noam Nisan and Avi Widgerson. Rounds in communication complexity revisited. In Proceedings of the twenty-third annual ACM symposium on Theory of computing, pages 419–429, 1991. doi:10.1145/103418.103463.
  • [14] Christos H Papadimitriou and Michael Sipser. Communication complexity. In Proceedings of the fourteenth annual ACM symposium on Theory of computing, pages 196–200, 1982. doi:10.1145/800070.802192.
  • [15] Stephen J Ponzio, Jaikumar Radhakrishnan, and Srinivasan Venkatesh. The communication complexity of pointer chasing. Journal of Computer and System Sciences, 62(2):323–355, 2001. doi:10.1006/JCSS.2000.1731.
  • [16] Anup Rao and Amir Yehudayoff. Communication complexity: and applications. Cambridge University Press, 2020.
  • [17] Ran Raz and Boris Spieker. On the “log rank”-conjecture in communication complexity. Combinatorica, 15(4):567–588, 1995.
  • [18] Alexander A Razborov. On the distributional complexity of disjointness. In International Colloquium on Automata, Languages, and Programming, pages 249–253. Springer, 1990. doi:10.1007/BFB0032036.
  • [19] Xiaoming Sun and David P Woodruff. Tight bounds for graph problems in insertion streams. In Approximation, Randomization, and Combinatorial Optimization. Algorithms and Techniques (APPROX/RANDOM 2015), pages 435–448. Schloss Dagstuhl – Leibniz-Zentrum für Informatik, 2015. doi:10.4230/LIPIcs.APPROX-RANDOM.2015.435.
  • [20] Amir Yehudayoff. Pointer chasing via triangular discrimination. Combinatorics, Probability and Computing, 29(4):485–494, 2020. doi:10.1017/S0963548320000085.

Appendix A Information Theory Preliminaries

Let X be a discrete random variable over a domain 𝒳. The entropy of X is defined by H[X]xsupp(X)Pr(X=x)logPr(X=x) It is always true that H[X]log|supp(X)|. If σ is a random permutation, then H[σ]=logn!=nlognO(n).

The conditional entropy of a random variable Y given X is defined by

H[Y|X]xsupp(X)Pr[X=x]H[Y|X=x]=𝔼x[H[Yx]]

Here, Yx denotes the random variable with distribution Pr[Yx=y]=Pr[Y=y|X=x]

The min-entropy of a random variable X is defined as: H[X]=maxxsupp(X)log1Pr[X=x].

Properties of Entropy.
  1. 1.

    Non-negativity: H(X)0,  H(X)0.

  2. 2.

    Bounds: H(X)H(X)log|𝒳|.

  3. 3.

    Chain rule: H(X,Y)=H(X)+H(YX).

  4. 4.

    Conditioning: H(YX)H(Y).

  5. 5.

    Data processing: If Y=f(X) then H(Y)H(X) and H(Y)H(X).

  6. 6.

    Subadditivity: H(X,Y)H(X)+H(Y).

Fact 16.0.1 (Fano’s Inequality).

Let X be a random variable taking values in a finite set 𝒳, and let X^ be an estimate of X based on some observation Y. Define the probability of error as

Pe=Pr[X^X].

Then,

H(X|Y)h(Pe)+Pelog(|𝒳|1),

where H(X|Y) is the conditional entropy of X given Y, and h(p)=plogp(1p)log(1p) is the binary entropy function.

Appendix B Statements and proofs of lemmas used

Lemma 17.

Let μ be a distribution over [n], and Xμ. If H[X]lognA for some A0, Then for any δ(0,1), there exist distributions μ1,μ2 over [n] such that

μ=δμ1+(1δ)μ2,

where

H(μ2)lognA+1δ
Proof.

Set

θ:=2(A+1)/δn,B:={i[n]:μ(i)>θ}.

We claim μ(B)δ. Suppose, for the sake of contradiction, that μ(B)>δ. Let 𝟙[B] denote the indicator random variable of the event that XB. We have:

H[X] H[X,𝟙[B]]=H[𝟙[B]]+H[X|𝟙[B]]
1+μ(B)(i[B]μ(i)μ(B)logμ(B)μ(i))+(1μ(B))logn
1+μ(B)(lognA+1δ)+(1μ(B))logn+μ(B)logμ(B)
=logn+1μ(B)(A+1δ)+μ(B)logμ(B)
<lognA,if μ(B)>δ

contradicting the assumption H[X]lognA.

Let μ(B)δ<δ. Let μ1μ|B, and μ2μ|Bc. Then, clearly, μ=δμ1+(1δ)μ2.
Since δ>δ, we can write:

μ =(1δ)μ2+(δδ)μ2+δμ1
=(1δ)μ2+δμ1

Where μ1 is the normalized version of (δδ)μ2+(δ)μ1.

Lemma 18 (Entropy vs. Min-Entropy [15]).

Let D be a distribution on [n] with entropy at least lognA. Then, for all ε>0, D is within ε of a distribution with min-entropy at least

lognA+1ε.
Theorem 19 ((The method of averaged bounded differences),[12], [6, Corollary 5.1]).

Let X1,,Xn be random variables and f a function such that for each i[n] there exists ci0 with

|𝔼[fX1,,Xi1,Xi=ai]𝔼[fX1,,Xi1,Xi=ai]|ci

for all choices of ai,ai. Then, for any t0,

Pr(|f𝔼[f]|t)2exp(2t2i=1nci2).

Appendix C Missing proofs from Section 3

Proof of Claim 8.

We begin with the following observation:

Observation 20.

For a vertex vVA, and j, let wj(v) denote the unique vertex at distance j from v. Then, for all i[n],

w2k+1(A1,iα) ={Bk+1,iβ,if wk(A1,iα)mod2=1,Bk+1,iα,if wk(A1,iα)mod2=0,
w2k+1(A1,iβ) ={Bk+1,iα,if wk(A1,iβ)mod2=1,Bk+1,iβ,if wk(A1,iβ)mod2=0

Given Obs 20, we analyze each case:

  • YES Case: (When vk𝐦𝐨𝐝𝟐=𝟏)

    Chasing the pointer starting at the α-copy vertex A1,v0α, if vkmod2=1, then after the k/2-th layer, it switches to a path on the β block vertices, ending up at Bk+1,v0β. This switch triggers a path back such that A1,v0α and A1,v0β end up on the same cycle, call it C1.

    Now consider following the outgoing edges sequentially (chasing the pointer) starting any of the remaining vertices A1,α. Since n/2 is even and all functions are permutations, an odd number of them will switch from α to a copy vertex β in the layer k/2 when following outgoing edges. The edges are set up for the vertex A1, to lead back to A1,+1, so we eventually visit all the vertices.

    Due to this, the remaining edges will form a single second cycle C2. In this cycle, the vertices of A1, appear in the order A1,1A1,3A1,4.

  • NO Case: (When vk𝐦𝐨𝐝𝟐=𝟎)
    In this case, A1,v0α and A1,v0β end up in 2 separate cycles since they do not switch from α to β or vice versa.

    By a similar argument as above, chasing pointers from the remaining vertices A1,α, an even number of them switch from α to a β copy at layer k/2 when following pointers. This implies that A1,α and A1,β cannot fall in the same cycle.

Appendix D Missing proofs from Section 4

Proof of Claim 10

Proof of Claim 10.

Since |M1|εBnlogn, H[FBM1](2εB)nlogn. 555We take the lower order O(n) terms due to the functions being permutations as subsumed in the larger εBnlogn term due to M1 for clarity.

By Markov’s inequality, we have

Prm1(H[FBM1=m1](24εB)nlogn)34,

so for at least 34 of all fB, the first message M1=m1 satisfies

H[FBm1]H[FBM1=m1](24εB)nlogn.

For such any m1, we therefore have: H[FB,jM1=m1](14εB)nlogn, for j=1,2.

Since H[FB,1m1](14εB)nlogn, subadditivity of entropy implies that 𝔼i[n][H[FB,1|M1=m1]](14εB)logn. Applying Markov’s inequality yields that at least 2n/3 of i[n] satisfy H[FB,1m1(i)](112εB)logn.

Call a v1𝒱1 good for m1, if H[FB,1m1(i)](112εB)logn. Since the distribution of V1 is uniform, rectangles corresponding to pairs (m1,v1) with M1=m1 and good v1 cover at least a fraction 34×23=12 of the total inputs.

Proof of Claim 11

Proof of Claim 11.

Define 𝒜2supp(V2)={a1,,a}. Let FB,2m1,aj denote a random variable following the distribution of FB,2 conditioned on M1=m1,V2=aj.

Let μB be the distribution induced over 𝒜2 (equivalently, V2) in R. We know from Claim 10 that H[μB]=H[V2M1=m1,V1=v1](112εB)logn.

Since H[FB,2|M1=m1](14εB)nlogn , we have that H[FB,2|M1=m1,V2](14εB)nlognlogn(14.5εB)nlogn.

By Markov’s inequality, with probability 1/2 over choices for V2, it holds that H[FB,2m1,aj](19εB)nlogn.

Call 𝒜2:={aj𝒜2:H[FB,2m1,aj](19εB)nlogn}; the above implies μB(𝒜2)1/2.

Figure 3: Most σ’s confer good probability mass on i, if FB,21(i) between 𝒱4, 𝒱3 has high entropy.
Definition 21.
  • Call a vertex iV4 is locally good with respect to aj𝒜2 if

    H[FB,21(i)M1=m1,V2=aj](136εB)logn
  • iV4 is globally good if it is locally good for at least 14 of 𝒜2 under μB.

Claim 22.

Each aj𝒜2 has at least 3n/4 locally good vertices i𝒱4 with respect to it.

Proof.

Since H[FB,21m1,aj](19εB)logn, by sub-additivity we have:

𝔼i[n][H[FB,21(i)m1,aj]](19εB)logn

The statement follows by Markov’s inequality on lognH[FB,21(i)m1,aj].

Claim 23.

The number of globally good i𝒱4 is at least n6.

Proof.

Let s be the number of globally good i𝒱4. Since every aj𝒜2 has at least 3n/4 locally good vertices, and 𝒜2 has measure at least 1/2 under μB, we have:

s1+(ns)143n412

Rearranging gives the stated bound on s.

We now make a probabilistic argument to continue the proof of Claim 11. Let σ:𝒜2𝒱3 be a random one-to-one mapping induced by FA,2. Define:

νB(rij)PrFB,2[FB,21(i)=rm1,v1,V2=aj]

For each globally good i𝒱4, define the random variable:

Z(i):=j=1μB(aj)r=1n𝟏{σ(aj)=r}νB(rij),

Z(i) captures the probability mass (under FB) that fixing a σ leads to V4=i. We note that Z(i) is random purely due to the random assignment σ, which is completely determined by FA,2. Furthermore, FA,2 is a uniformly random permutation in the subrectangle R under consideration, so σ is a uniformly random one-to-one mapping.

Claim 24.

For any constant γ(0,1), and any globally good i𝒱4 as in Definition 21,

Prσ[Z(i)1γ16n]nexp(2γ2n1O(εB)),
Proof.

Partition the indices j[] into

Jgood:={j[]:i is locally good for aj},Jbad:=[]Jgood,

with μB(aJgood)1/4, since i is globally good.

Let S={j[]:μB(aj)n(1100εB)}. From a calculation as in Lemma 17 (Appendix B) with δ=1/8, A=12εBlogn applied on the distribution μB (recall that H[μB](112εB)logn) , we infer that μB(aj:jSc)1/8. This implies that:

jJgoodSμB(aj)1/41/8=1/8 (2)

We can thus write:
Z(i)=jJgoodSμB(aj)r=1n𝟏{σ(aj)=r}νB(rij)+jJgoodSμB(aj)r=1n𝟏{σ(aj)=r}νB(rij)

We now concentrate on the contribution from the good indices JgoodS and apply the decomposition lemma (Lemma 17) to the νB distributions therein, since they have high entropy.

For νB, from the entropy bound H[FB1(i)|m1,aj](136εB)logn, we apply Lemma 17 with A36εBlogn and δ12, giving:

νB=δνB(1)+(1δ)νB(2),H[νB(2)]logn72εBlognO(1).

Using this decomposition, we can write Z(i) as follows:

Z(i)=Z1(i)+Z2(i),

where

Z2(i):=12jJgoodSμB(aj)r=1n𝟏{σ(aj)=r}νB(2)(rij)

and Z1(i) has the remaining terms of Z(i), and is a non-negative random variable. Intuitively, Z2(i) captures components of the distribution νB having high min-entropy.

We now bound the probability of deviation of Z2(i) from its mean. Clearly, for any β, Pr(Z(i)β)Pr(Z2(i)β). We compute the expectation:

𝔼[Z2(i)] =12jJgoodSμB(aj)r=1n𝔼[𝟏{σ(aj)=r}]νB(2)(rij)
=12jJgoodSμB(aj)r=1n1nνB(2)(rij)116n (3)

The last step follows from Eq. 2. We now apply a deviation bound due to McDiarmid Theorem 19:

Theorem 25 ([12], [6, The method of averaged bounded differences, Corollary 5.1]).

Let X1,,Xn be random variables and f a function such that for each i[n] there exists ci0 with

|𝔼[fX1,,Xi1,Xi=ai]𝔼[fX1,,Xi1,Xi=ai]|ci

for all choices of ai,ai. Then, for any t0,

Pr(|f𝔼[f]|t)2exp(2t2i=1nci2).

We apply the above, for f being Z2(i) and the random variable Xjσ(aj). We bound c:=j=1ncj2 for Z2(i): each term in Z2(i) is of the form μB(aj)νB(2)(rij), and is non-zero only for one r=σ(aj). Thus, each contributes at most:

maxμB(aj)maxνB(2)(rij)n1+100εBn1+72εB+o(1)=n2+172εB+o(1).

Each cj2 is at most n4+O(εB), and we have at most n terms. This gives us:

cnn4+O(εB)=n3+O(εB).

Applying the concentration bound with t=γ/16n, we get:
Prσ[Z(i)1γ16n]Prσ[Z2(i)1γ16n]exp(2γ2256n2c)=exp(1128γ2n 1O(εB)) Setting b1,br, to be the set of globally good indices i, γ=1/5, and a union bound, we conclude the proof of Claim 11.

Proof of Claim 12

Proof of Claim 12.

Let Alice’s message M2 have |M2|εAnlogn. We know that before M2 is passed, for every σ, since FA,3 is independent of σ (and v1)   H[FA,3σ]=nlognO(n).

In particular, for σ=σ, with probability 3/4 over the choice of M2, H[FA,3σ,M2=m2](14εA)nlogn. Thus, within R on fixing such an m2, we get a sub-rectangle R′′ where, by subadditivity of entropy:

𝔼i[n][H[FA,3(i)|R′′]] (14εA)nlogn
S𝒱4:|S|(12εA)n such that
iS:H[FA,3(i)|R′′]] (12εA)logn

Call S the uncertain vertices, since Bob cannot predict the next pointer (V5) starting at these with reasonable certainty after receiving M2=m2. Intuitively, it is sufficient to show that in R′′, V4 occurs as an uncertain vertex with constant probability. We formalize this next.

Within R′′, the value of V4 is known exactly to Bob: it is the same across FA,3, for any fixed fB. This is because V1=v1,σ=σ are fixed. Thus, for a fixed fB, if v4S, H[V5](12εA)logn. Applying Fano’s inequality (Fact 16.0.1), the probability of Bob making an error in finding the value of V5 is at least Pe(12εA1logn)0.9, for εA being small enough.

Since σ satisfies Eq. (1), we get a set ={b1,,br} of globally good vertices satisfying Claim 11. Then |S|(1/62εA)nn/10, and each has probability at least 120n. Thus, the probability of Bob seeing an uncertain vertex as v4 in R′′ is at least 1/200.

Overall, the probability of error within R is:

3412000.91400

This concludes the proof of Claim 12.

Appendix E Missing proofs from Section 5

Proof of Claim 14

Proof of Claim 14.

Let F:=FB,3. Since |𝒜4|n/6, H[FR](14εB)nlogn, and H(V4)logn, we obtain

H[FR,V4](14εB)nlognlogn(15εB)nlogn.

There are at least n/12 elements in 𝒜4 satisfying Pr[V4=i](120n,110n); call this set 𝒟. Let I[F]nlognH[F]. From the above, 𝔼vV4[I[F|R,V4=v]]5εBnlogn. Let c1 be a large constant to be set later, and 𝒟good={i𝒟:I[F|R,V4=i]c1εBnlogn}.

Let μ be the induced distribution on V4 in R. We have μ(𝒟)1/240. By Markov’s inequality, μ(𝒟good)μ(𝒟)5c112405c11480, for c15×480.

But this means that we should have at least 1480110n=n48=:α1n elements in 𝒟good, since each element has probability at most 1/10n.

Proof of Claim 16

Proof of Claim 16.

Define {Zi𝟙[i]}i[r] to be indicator variables for the events i under a random choice of FA,3. For any i, we have 𝔼[Zi](1γ). Thus, 𝔼[i=1rZi](1γ)r. Let iZi(110γ)r.

Claim 26.

Pr(i=1rZi(110γ)r)1exp(Ω(γ2n))

Proof.

We use McDiarmid’s inequality (see Theorem 19) on the function fi[r]Zi. The average bounded difference at i satisfy cir/(nr)2α1.

Pr(Z(110γ)r)exp(Ω(81γ2(nr)2))=exp(α2γ2n) for some constant α2

Let FFA,3 for convenience. Let Z=(Z1,,Zr). Note that |Z|(110γ)r. Since H[F]nlognO(n), we have:

H[FM,Z] (1εA)nlognO(n)
H[FM,Z,] (12εA)nlognPr(c)nlognPr()
H[FM,Z,] (14εA)nlogn

Hence, conditioned on , with probability at least 3/4 over choices of (M,Z)=(m,z), we have that H[FM=m,Z=z,](116εA)nlogn. Fix such a (m,z) pair, and let z(i1)=1,,z(i)=1 for some (110γ)r. By subadditivity of entropy:

𝔼i[n]H[F(i)|M=m,Z=z,] (116εA)logn

By Markov’s inequality, there are at least (14εA)n indices i where H[F(i)|M=m,Z=z,](14εA)logn. From these, at least (α14εA)n coordinates have to be from 𝒜4. For each of these, we have that H[F(bi)M=m,i](14εA)logn. Let α2α14εA.

Thus, the probability (over M,Z, under the distribution of FA) that we get at least α2n indices from 𝒜, such that H[FA,3(bi)|M=m,Z=z,i](14εA)logn is at least:

Pr[]341/8

 

Appendix F Upper bounds

We adapt the protocols from [5, 15] to infer what depth of pointer can be chased with O(nloglogn) bits. For simplicity of notation, while describing protocols, we let fA,ifA,j and fB,ifB,j all i,j. We will incur a multiplicative factor of k in the communication cost if the functions are different.

Theorem 27.

Let kk+53. Then CB,k(PC(n,k))=O(nkloglogn).

Proof.

Let nnlogn.

  • Round 1 (Bob Alice): Bob sends the last loglogn bits of fB(vb) for every vbVB.

  • Round 2 (Alice Bob): Alice sends the last loglogn bits of fA(va) for every vaVA, and also sends v1=fA(v0) along with fA(v) for every possible vertex in Bob’s list that v1 could point to.

  • Round 3 (Bob Alice): Bob computes v2=fB(v1). From Alice’s message, he knows v3=fA(v2). He then computes v4=fB(v3) and sends v4 along with fB(v) for every possible vertex in Bob’s list that v4 could point to.

  • Round 4 (Alice Bob): Alice computes v5=fA(v4). From Bob’s message, she knows v6=fB(v5). She then computes v7=fA(v6) and sends v7 along with fA(v) for every possible vertex in Bob’s list that v7 could point to.

  • Round i: If i is even, Alice computes v3i7=fA(v3i8), knows the possibilities for v3i6=fB(v3i7) from Bob’s previous message, computes v3i5=fA(v3i6), and sends v3i5 along with fA(v) for all n possible vertices;

    if i is odd, Bob performs the analogous computation and sends v3i5 along with fB(v) for all n possible vertices.

  • Round k: If k is even, Alice knows v3k5 and sends it to Bob; if k is odd, Bob knows v3k5 and sends it to Alice, where kk+53.

Intuitively, Alice knowing the pointer values for one layer ahead from Bob allows her to “jump” forward 3 pointers per round: two from her layers and one from Bob.

Rounds 1 and 2 together take O(nloglogn) bits. All other rounds (except the last round) take O(logn+nlognlogn)=O(n) bits per round. Since the number of rounds k is constant, the total communication is O(nloglogn) bits.

Theorem 28.

Let kk+64. Then CB,k(bPC(n,k))=O(nkloglogn).

Proof.

The protocol follows the same communication structure as in the PC(n,k) protocol, with each party in every round (starting from round 2), each party additionally sending the following n bits every round:

  • Round 1 (Bob Alice): Bob sends the last loglogn bits of fB(v) for every vertex v, which also implicitly communicates the bit vectors χ1(v)=parity(fB(v)) across all vertices.

  • Round 2 (Alice Bob): The bit vector χ2(v):=χ1(fA(v)) for all vVA.

  • Round 3 (Bob Alice): The bit vector χ3(v):=χ2(fB(v)) for all vVB.

  • Round 4 (Alice Bob): The bit vector χ4(v):=χ3(fA(v)) for all vVA.

  • Round 5 (Bob Alice): The bit vector χ5(v):=χ4(fB(v)) for all vVB.

  • Round i: If i is even, Alice sends χi(v) for all vVA, else Bob sends χi(v) for all vVB.

  • Round k: If k is even, Alice knows v3k5 from the base PC(n,k) protocol, and she learns χk from the extra parity information exchanged from Round 2. Thus, she sends χkδ(v3k5)=parity(v4k6δ) to Bob, where δ{0,1,2,3} is chosen so that the resulting vertex is exactly vk. If k is odd, Bob analogously computes and sends parity(vk) to Alice.

Rounds 1 and 2 together take O(nloglogn) bits. All other rounds (except the last round) have n additional bits compared to the PC(n,k) protocol, but since the number of rounds k is constant, we still have the same total communication of O(nloglogn) bits.