Optimal Two-Round Communication Lower Bound for Graph Connectivity via Pointer Chasing
Abstract
We consider the communication complexity of the graph connectivity problem, where the edges of an -vertex undirected graph are distributed between two parties Alice and Bob, who are then required to communicate to determine if is connected. We show that in any randomized protocol with two-rounds of communication, Alice and Bob must exchange bits; such a lower bound for one-round protocols was shown by Sun and Woodruff (APPROX/RANDOM 2015). A one-round deterministic protocol, where Alice sends bits and Bob determines the answer, was observed by Hajnal, Maass and Turan (STOC 1988); they also showed a matching lower bound of bits for deterministic protocols with unbounded rounds of communication. For randomized protocols, a reduction from the set disjointness problem due to Babai, Frankl and Simon (FOCS 1986) implies a randomized lower bound of even with unbounded rounds of communication. Whether this lower bound can be improved to has been an outstanding open question, whose algorithmic implications were recently emphasized by Apers, Efron, Gawrychowski, Lee, Mukopadhyay and Nanongkai (FOCS 2022). Our lower bound for randomized two-round protocols is based on a reduction from a restricted version of the two-player pointer chasing problem originally studied by Papadimitriou and Sipser (JCSS 1984). Using this reduction, we show an lower bounds on graph connectivity for any constant number of rounds by extending deterministic lower bounds shown by Ponzio, Radhakrishnan and Venkatesh (JCSS 2001) to the randomized setting.
Keywords and phrases:
Communication complexityFunding:
Jaikumar Radhakrishnan: Department of Atomic Energy, Government of India, under project number RTI4001.Copyright and License:
2012 ACM Subject Classification:
Theory of computation Communication complexityEditor:
Shubhangi SarafSeries and Publisher:
Leibniz International Proceedings in Informatics, Schloss Dagstuhl – Leibniz-Zentrum für Informatik
1 Introduction
Graph problems have been extensively studied in various computational settings – algorithms, decision trees, streaming algorithms, distributed algorithms and communication protocols. In this paper, we focus on the two-party communication setting: the edges of the -vertex undirected graph are distributed between two parties Alice and Bob (some of the edges may be given to both), who then need to exchange messages in order to compute some function of ; the goal is to minimize the number of bits exchanged. Early works [4, 7] investigated properties such as connectivity, st-connectivity, bipartiteness, proving both lower and upper bounds on the amount of communication needed in this model. Other properties such as triangle-freeness, existence of Eulerian tours, -edge connectivity, and cycle freeness among others have been studied and new lower bounds have been obtained in subsequent works [8, 19]. More recently, graph problems have been extensively studied for trade-offs between space requirements and the number of passes for streaming algorithms for estimating quantities like matchings, cuts, etc., in various settings (see [2] and the references therein). Lower bounds in these settings are also typically shown by considering the communication complexity of some associated graph problem.
A fundamental problem in this domain is the connectivity problem : Alice and Bob need to determine if is connected. The randomized complexity of this problem was studied by Babai, Frankl and Simon [4], who showed a reduction from the set disjointness problem; when combined with the lower bound for the set disjointness problem [18, 9], this implies an lower bound for the randomized communication complexity of . There exists an -bit -round deterministic communication protocol: Alice sends a spanning forest of her edges to Bob, who then determines the answer. Hajnal, Maas and Turan [7] showed a deterministic lower bound of for connectivity111In fact, they show that the bound holds in a stronger arbitrary partitioning model of the edges.. For randomized protocols with an unbounded number of rounds of communication, closing this gap between the upper and lower bounds has remained a major open question since the work of [4], and has been mentioned [1, 8] as one of the outstanding open questions in communication complexity. An optimal lower bound on also implies the optimality of some query algorithms [1]. A randomized lower bound of was shown for one-round protocols by [19]; however, no lower bound better than was known for randomized protocols with more than one round of communication222Following convention, we only require one of the players to know the answer for at the end of the protocol..
We show an optimal randomized lower bound of on against protocols with two rounds of communication, by relating it to the pointer chasing problem introduced by Papdimitrou and Sipser [14] to study tradeoffs between the number of rounds of communication, and number of bits communicated. In the two-party pointer chasing problem333Our definition differs from the standard definition, for we insist that the pointers given to Alice and Bob correspond to bijections – our reduction will make crucial use of this property. Note, however, that our lower bounds with this restriction on the inputs imply similar lower bounds for the problem without any restriction., the edges of a directed bipartite graph (where ) are distributed between Alice and Bob in the following way: Alice receives a one-to-one function and Bob receives a one-to-one function , which we view as (directed) matchings and respectively. A fixed starting vertex and a parameter are known to both players. We consider two versions of the problem: the full-version and the bit-version. In the full-version of the problem, , the goal is to find the identity of the unique vertex found by following the -length path , where , if is odd, and , if is even. In the bit-version of the problem, , Alice and Bob need to determine just the last bit or the parity of . At the end, both players must know the answer.444The extra round that is needed for both players to know the answer in pointer chasing is often crucial.
There is a simple deterministic -round protocol that exchanges bits to find , where Alice starts by sending , and in the -th round the appropriate player sends since they already know . A major focus of previous work has been to show a separation between the amount of communication needed in the above case versus if only rounds of communication are allowed, or rounds are allowed but Bob (the wrong player, since he does not know ) starts the communication. With this restriction, for a general , a sequence of works [13, 20, 11] have resulted in a lower bound against randomized protocols. For constant , Ponzio, Radhakrishnan and Venkatesh [15] show a lower bound for the full version. As for upper bounds, Nisan and Wigderson [13] gave a randomized protocol with communication , and Damm, Jukna and Sgall [5] design a deterministic protocol with communication , for constant .
Our reduction from to in fact passes through another related problem. In the Hamiltonian cycle problem, , the inputs to Alice and Bob are the same as in , that is, matchings represented as one-to-one functions and . Alice and Bob need to determine if the union of and is a Hamiltonian Cycle, or it consists of more than one cycle. This particular problem was studied earlier by Raz and Spieker [17], who showed a lower bound of for non-deterministic protocols even with unbounded rounds of communication. It is easy to see that a protocol for immediately yields a protocol with the same complexity for .
1.1 Our Results
To state our results, we will need the following notation; detailed problem definitions and notation are presented in Section 2. Let be the -error, -round randomized communication complexity of the function when Bob sends the first message; let be the corresponding complexity when Alice sends the first message. We write , omitting the superscript, to denote the complexity when either party is allowed to start. denotes the -th iterated logarithm of .
Our lower bound for is derived as follows. We show that the pointer chasing problem reduces to the Hamiltonian cycle problem; more precisely, we show (see Theorem 7 in Section 3) that for all , we have
We then show that
Theorem 13 is the main technical contribution of this work. Combining these with our earlier observation that reduces to , we obtain the following.
Theorem 1 (Main Result).
For all , we have
Typically, pointer chasing has been considered in the setting the number of rounds of communication is ; we know of only a few prior works [15, 3] that have considered . However, the techniques used in these do not seem to extend to yield an optimal lower bound.
Furthermore, our reduction can be used to obtain superlinear lower bounds for for randomized protocols with more than two rounds of communication. By extending the result [15, Theorem 6] to the randomized setting, we show that for all , constant and
our reduction then implies that for all constant ,
The proof details of the above statement are deferred to the extended version of the paper.
For our main result we used the lower bound for the bit version of the pointer chasing problem, where Alice and Bob need to determine the last bit for with just three rounds of communication. We show that for the full version determining is already equally hard.
Theorem 2.
For all constants , .
Complementing the above, we show that there are deterministic protocols that can chase a larger pointer depth with few rounds, by extending the protocols in [5, 15].
Theorem 3.
Let . Then . For the bit-version: . Then .
1.2 Organization of the paper
In Section 3, we describe the reduction from to (and in particular, the problem).
Section 4 proves Theorem 2, a strong lower bound for the full-version of the pointer chasing problem. Section 5 builds upon the result in Section 4 to prove the main technical result: a strong lower bound for the bit-version Theorem 13. Proofs are detailed in Appendices D and E. Appendix F proves the upper bounds in Theorem 3.
Each technical result is preceded by a proof outline that emphasizes key steps and conceptual insights.
2 Preliminaries
2.1 Problem Definitions
Definition 4 (Pointer Chasing, full-version).
is the 2-player pointer chasing problem defined as follows: The input instance is a -layered, directed graph . The vertex set is , with for all . The edges of are bijections from to for each . The edges are distributed between Alice and Bob as follows:
Alice gets bijections Bob gets bijections
There is a fixed vertex known to both, and they need to find the unique vertex at distance from .
Definition 5 (Pointer Chasing, bit-version).
is the problem of finding the least significant bit of (as opposed to in ). We call the least significant bit of the parity of (having identified the vertex set with ). This is also called the bit-version of the pointer-chasing problem.
For pointer chasing problems, we require that any valid protocol has both players decide on the same answer and output it after the protocol concludes.
We note that Definition 4 differs slightly from the description given in the introduction: in our formal definition above, we allow a different function (bijection) at every step. It is not difficult to see that the two formulations are equivalent up to a factor- blowup in the vertex-set size. In particular, any instance satisfying Definition 4 is a special case of the introduction’s definition with , obtained by taking and .
Definition 6.
The problem is defined as follows. Let and be two disjoint sets of vertices each. Alice receives a bijection map , and Bob receives a bijection . The goal is to decide whether the composition (equivalently, union of the two perfect matchings) forms a Hamiltonian cycle covering all vertices.
For the problems , under consideration, following convention, we require only that one of the players knows the answer after rounds.
2.2 Notation
We use to denote the set . Random variables will typically be denoted by capital letters: e.g. denotes a random variable, and will be its instantiation to the value . Most random variables we deal with will be over a finite, discrete domain. The support of a random variable is denoted by .
All logarithms are to the base 2, and denotes the -th iterated logarithm of .
Our proofs make significant use of information theoretic concepts. We describe some preliminaries in Section A.
We assume throughout the proofs that when used, are sufficiently small constants.
2.3 Communication Complexity preliminaries
We assume some familiarity with basic notions of two-party communication complexity, and refer the reader to standard books [10, 16] for the basics. Let be a function.
Rectangles, Subrectangles.
A combinatorial rectangle is a subset of inputs of the form , where and . A deterministic protocol splits the input space into a disjoint partition of rectangles, each rectangle corresponding to a certain transcript of the protocol.
We call a rectangle a subrectangle of a rectangle if . is a subrectangle of the protocol , if it is a subrectangle of some rectangle in the partition induced by the protocol . If is a rectangle, then a subrectangle of the form for is called a horizontal slice of . Similarly, , for is called a vertical slice.
Let be a rectangle. Let be a random variable that depends on the inputs. We denote to be the random variable having distribution of conditioned on inputs restricted to the rectangle .
3 Lower bounds on
Theorem 7.
Let be a multiple of . The problem reduces to the problem with in the two-party communication setting.
Proof.
For notational simplicity, we describe the reduction with only two functions , and in the pointer chasing instance. It will be clear that the reduction can be modified to accommodate the case of different bijections at every level. We specify the reduction for even , a similar one works for odd with minor changes.
Let be the graph for the pointer chasing instance, with permutations and as edges, and a fixed starting vertex . We identify and with . For notational reasons, we assume that is the vertex in . Let be the pointer at depth . We show that Alice and Bob can produce a graph without communication, such that is a single cycle if and only if .
Though the graph described will be a directed graph, the underlying undirected graph of (undirected version of ) will be a Hamiltonian cycle iff
We first describe a reduction that produces a graph satisfying the following: if , then is the union of two cycles. If , then is the union of four cycles. A slight modification to the edges in this instance will produce an appropriate YES/NO instance of , concluding the reduction.
Vertex set for .
We define symmetric vertex sets and forming the bipartite graph :
We have blocks of vertices. Each vertex in the original has two copies in each block: an -copy and a -copy (denoted by superscripts). There are additional special vertices labeled with block number for technical reasons. denotes the -copy of the vertex in the -th block.
We note again that was the vertex labeled in in .
Hence, and the total number of vertices in (and ) is
Alice’s Edges (based on ).
-
Fixed edges:
-
Permutation ():
-
Critical ():
Bob’s Edges (based on ).
-
Fixed:
-
Permutation ():
-
Block to Alice’s first block:
Claim 8.
If , is the union of cycles.
If , is the union of cycles.
Proof of Claim.
We defer the analysis of the cases to Appendix C.
Modification.
The following slight modification that replaces two constant edges in gives us the desired :
- (Old edges):
-
we now map
- (New edges):
-
.
These edges are present in different cycles in the Yes instances of . Splicing merges the two to a single Hamiltonian cycle (and keeps the NO instances disconnected, since at most two edges are modified).
4 Randomized lower bound for
Theorem 9 (Theorem 2 restated).
For any constant :
Proof outline.
For context, we first describe how pointer chasing lower bounds are typically shown via round elimination [13, 20, 15]. For odd , consider a -round protocol with low distributional error, where Bob sends . Fixing typical values of and , one shows that remains highly variable if is small. Iterating this -times, alternating roles between Alice and Bob each time, we can fix and while preserving variability in . Yet Bob must now infer with high confidence, contradicting the protocol’s correctness.
The strategy of fixing (i.e., looking at a subrectangle with fixed ) after round that helps isolate entropy in fails when bits of communication are allowed. For our lower bound for , we avoid fixing , instead allowing it to vary and analyzing a good assignment , where . Under high-entropy , we show that remains variable over candidates, each with probability . Since also retains entropy (since is small), Bob cannot reliably infer the next pointer across these ’s. Using an unfixed is what allows us to infer that is spread out. However, this requires a careful probabilistic analysis in choosing a good while accounting for correlations across layers. The proof relies on the permutation structure of the functions. We now present the proof.
Proof.
We show that no 3-round deterministic protocol , with Bob starting the communication and using at most bits exchanged per message for a small constant , can correctly solve the problem on more than a (small) constant fraction of inputs drawn from the uniform distribution. The theorem then follows by an application of Yao’s lemma [10, Sec 3.4, Theorem 3.20], followed by standard error reduction techniques via repetition.
Recall notation from Section 2: Alice has functions on , on , and on . Bob has and for and respectively. We use and to denote the set of all possible inputs to Alice and Bob respectively.
Let Bob send , with , and Alice send , with . After , Bob should know the answer, which he conveys to Alice, using . We therefore consider just the first two messages, and show that Bob cannot know the correct answer for a constant fraction of the inputs.
Initially, with no messages passed, we have:
-
-
, for all
We prove the theorem in a sequence of three claims, whose proofs we defer to Appendix D.
Claim 10 (Round 1).
With probability over the choice of inputs, satisfy the following:
-
1.
-
2.
-
3.
Proof.
Deferred to Appendix D.
We note that fixing such a pair of gives us a subrectangle of the protocol after . For any such subrectangle, we show the following:
Claim 11 (Main Claim).
For a subrectangle formed by fixing satisfying the conditions in Claim 10, the following is true: There exists a set , for , and a constant such that with probability over choice of :
| (1) |
Proof.
Deferred to Appendix D.
Let restricted to the domain . As mentioned in the outline, the proof of Claim 11 goes via analyzing a random assignment under . We proceed to analyze what happens in the subrectangle of a fixed that satisfies (1) from Claim 11, after Round 2 (i.e. after Alice sends ). We note that once we fix a , it corresponds to a class of ’s, and hence a subrectangle within the rectangle in Claim 11.
Proof.
Deferred to Appendix D.
5 Randomized lower bound for
5.1 Proof of main result
Theorem 13.
For any constant ,
Proof Outline.
Before giving the proof, we give a rough outline of the steps involved.
We start with a sub-rectangle of the protocol as described in Claim 12, where are fixed. We have possibilities for at this point, called . From Bob’s point of view, within the rectangle , he exactly knows ; suppose that .
On receiving , Bob will know the shadow of : the set , which typically has size . For the full-version, this was sufficient to prevent Bob from knowing , but it is not clear that the parity of reachable from potential ’s will vary within the shadow. Indeed, if the goal was to find parity of , Alice could explicitly send this information to Bob in bits with (by sending parity for every ). The extra layers in the instance come into play here: when looking at the parity of , even if Alice sends the last bit (or more) of to Bob, due to the high entropy of intermediate layers, we show that the composed function is still unpredictable to Bob within the shadow .
Following [15], we view the parity of a vertex in the set as a two-coloring . For a given set of inputs, , this induces two-colorings on higher levels: , . If the intermediate functions are viewed as random, the resulting coloring at each layer is also a random coloring.
In terms of two-colorings, it is sufficient to show that within , is two-colored by in a roughly balanced way while varying over the inputs consistent with the current subrectangle. We formally show this in the proof via a composition lemma. This is a major technical contribution of our work, and we elaborate on it in Appendix E.
Proof.
We assume, as before, that all input functions come from the uniform distribution over respective domains and show that any protocol communicating bits will err on a (small) constant fraction of the inputs (for small enough ).
We follow the proof of Theorem 9 and arrive at a rectangle as in Claim 11. This gives a set , of size for some fixed .
Compared to the full-version, we have two extra layers of functions: . The fixing of does not affect , and the effect on is described by from Claim 10.
Due to correlations induced by the message , the possibilities for function between are influenced by what is. In particular, let . We first show that many of the ensure a high entropy of .
Claim 14.
There exists and constants satisfying:
-
1.
-
2.
Proof.
Deferred to Appendix E.
We now invoke the following Composition Lemma, whose proof is given in the extended version of this paper.
Lemma 15 (Composition Lemma).
Let be a (random) function satisfying . Let be a random, balanced two-coloring of . Then, with such that for any distribution on , with , with probability over choice of , the composed function satisfies:
Proof.
Deferred to the extended version of this paper.
Invoke the lemma for each in turn, setting (and , where is the constant from Claim 14), and random choice of . Call a nice, if it ensures the result of Lemma 15 for every . By a union bound, a random is nice with probability .
Fix a nice , this forms a subrectangle of , being a restriction of . For each such nice , we get sets , with , for every . It now remains to show that a typical has a large shadow within after . The following claim shows that this happens with good probability. The proof is deferred to Appendix E.
Claim 16.
Let . Let with , as in ˜14. For each , let be sets with . Define the events , for each .
Then, there exist a partitioning of into subrectangles (where is Alice’s message and is a function of alone) and constant such that with probability at least over choice of and , we have: at least elements satisfy:
Choose a to fix a sub-rectangle (a horizontal slice in ) that satisfies the conclusion of ˜16. are fixed here. Let be the indices in arising out of Claim 16. We have .
Let be the distribution ; we have . Set to be ; by Lemma 17 (Appendix B), is within in distance of a distribution such that .
Fix a good (a vertical slice in ).
Set , in Lemma 15. We conclude that two-colors with bias with probability (over of at least . (Recall that .) This means it colors with bias at most
Call an good if it ensures a bias of at most in . On such , a deterministic or answer from Bob makes an error with probability at least .
References
- [1] Simon Apers, Yuval Efron, Paweł Gawrychowski, Troy Lee, Sagnik Mukhopadhyay, and Danupon Nanongkai. Cut query algorithms with star contraction. In 2022 IEEE 63rd Annual Symposium on Foundations of Computer Science (FOCS), pages 507–518. IEEE, 2022.
- [2] Sepehr Assadi. Recent advances in multi-pass graph streaming lower bounds. ACM SIGACT News, 54(3):48–75, 2023. doi:10.1145/3623800.3623808.
- [3] Sepehr Assadi, Gillat Kol, Raghuvansh R Saxena, and Huacheng Yu. Multi-pass graph streaming lower bounds for cycle counting, max-cut, matching size, and other problems. In 2020 IEEE 61st Annual Symposium on Foundations of Computer Science (FOCS), pages 354–364. IEEE, 2020. doi:10.1109/FOCS46700.2020.00041.
- [4] László Babai, Peter Frankl, and Janos Simon. Complexity classes in communication complexity theory. In 27th Annual Symposium on Foundations of Computer Science (sfcs 1986), pages 337–347. IEEE, 1986.
- [5] Carsten Damm, Stasys Jukna, and Jiří Sgall. Some bounds on multiparty communication complexity of pointer jumping. Computational Complexity, 7(2):109–127, 1998. doi:10.1007/PL00001595.
- [6] Devdatt P Dubhashi and Alessandro Panconesi. Concentration of measure for the analysis of randomized algorithms. Cambridge University Press, 2009.
- [7] András Hajnal, Wolfgang Maass, and György Turán. On the communication complexity of graph properties. In Proceedings of the twentieth annual ACM symposium on Theory of computing, pages 186–191, 1988. doi:10.1145/62212.62228.
- [8] Gábor Ivanyos, Hartmut Klauck, Troy Lee, Miklos Santha, and Ronald de Wolf. New bounds on the classical and quantum communication complexity of some graph properties. In 32nd International Conference on Foundations of Software Technology and Theoretical Computer Science, page 148, 2012.
- [9] Bala Kalyanasundaram and Georg Schintger. The probabilistic communication complexity of set intersection. SIAM Journal on Discrete Mathematics, 5(4):545–557, 1992. doi:10.1137/0405044.
- [10] Eyal Kushilevitz. Communication complexity. In Advances in Computers, volume 44, pages 331–360. Elsevier, 1997. doi:10.1016/S0065-2458(08)60342-3.
- [11] Xinyu Mao, Guangxu Yang, and Jiapeng Zhang. Gadgetless lifting beats round elimination: Improved lower bounds for pointer chasing. In Raghu Meka, editor, 16th Innovations in Theoretical Computer Science Conference (ITCS 2025), volume 325 of Leibniz International Proceedings in Informatics (LIPIcs), pages 75:1–75:14, Dagstuhl, Germany, 2025. Schloss Dagstuhl – Leibniz-Zentrum für Informatik. doi:10.4230/LIPIcs.ITCS.2025.75.
- [12] Colin McDiarmid. On the method of bounded differences. Surveys in combinatorics, 141(1):148–188, 1989.
- [13] Noam Nisan and Avi Widgerson. Rounds in communication complexity revisited. In Proceedings of the twenty-third annual ACM symposium on Theory of computing, pages 419–429, 1991. doi:10.1145/103418.103463.
- [14] Christos H Papadimitriou and Michael Sipser. Communication complexity. In Proceedings of the fourteenth annual ACM symposium on Theory of computing, pages 196–200, 1982. doi:10.1145/800070.802192.
- [15] Stephen J Ponzio, Jaikumar Radhakrishnan, and Srinivasan Venkatesh. The communication complexity of pointer chasing. Journal of Computer and System Sciences, 62(2):323–355, 2001. doi:10.1006/JCSS.2000.1731.
- [16] Anup Rao and Amir Yehudayoff. Communication complexity: and applications. Cambridge University Press, 2020.
- [17] Ran Raz and Boris Spieker. On the “log rank”-conjecture in communication complexity. Combinatorica, 15(4):567–588, 1995.
- [18] Alexander A Razborov. On the distributional complexity of disjointness. In International Colloquium on Automata, Languages, and Programming, pages 249–253. Springer, 1990. doi:10.1007/BFB0032036.
- [19] Xiaoming Sun and David P Woodruff. Tight bounds for graph problems in insertion streams. In Approximation, Randomization, and Combinatorial Optimization. Algorithms and Techniques (APPROX/RANDOM 2015), pages 435–448. Schloss Dagstuhl – Leibniz-Zentrum für Informatik, 2015. doi:10.4230/LIPIcs.APPROX-RANDOM.2015.435.
- [20] Amir Yehudayoff. Pointer chasing via triangular discrimination. Combinatorics, Probability and Computing, 29(4):485–494, 2020. doi:10.1017/S0963548320000085.
Appendix A Information Theory Preliminaries
Let be a discrete random variable over a domain . The entropy of is defined by It is always true that . If is a random permutation, then .
The conditional entropy of a random variable given is defined by
Here, denotes the random variable with distribution
The min-entropy of a random variable is defined as: .
Properties of Entropy.
-
1.
Non-negativity: , .
-
2.
Bounds: .
-
3.
Chain rule: .
-
4.
Conditioning: .
-
5.
Data processing: If then and .
-
6.
Subadditivity: .
Fact 16.0.1 (Fano’s Inequality).
Let be a random variable taking values in a finite set , and let be an estimate of based on some observation . Define the probability of error as
Then,
where is the conditional entropy of given , and is the binary entropy function.
Appendix B Statements and proofs of lemmas used
Lemma 17.
Let be a distribution over , and . If for some , Then for any , there exist distributions over such that
where
Proof.
Set
We claim . Suppose, for the sake of contradiction, that . Let denote the indicator random variable of the event that . We have:
contradicting the assumption .
Let . Let , and . Then, clearly, .
Since , we can write:
Where is the normalized version of .
Lemma 18 (Entropy vs. Min-Entropy [15]).
Let be a distribution on with entropy at least . Then, for all , is within of a distribution with min-entropy at least
Appendix C Missing proofs from Section 3
Proof of Claim 8.
We begin with the following observation:
Observation 20.
For a vertex , and , let denote the unique vertex at distance from . Then, for all ,
Given Obs 20, we analyze each case:
-
YES Case: (When )
Chasing the pointer starting at the -copy vertex , if , then after the -th layer, it switches to a path on the block vertices, ending up at . This switch triggers a path back such that and end up on the same cycle, call it .
Now consider following the outgoing edges sequentially (chasing the pointer) starting any of the remaining vertices . Since is even and all functions are permutations, an odd number of them will switch from to a copy vertex in the layer when following outgoing edges. The edges are set up for the vertex to lead back to , so we eventually visit all the vertices.
Due to this, the remaining edges will form a single second cycle . In this cycle, the vertices of appear in the order .
-
NO Case: (When )
In this case, and end up in 2 separate cycles since they do not switch from to or vice versa.By a similar argument as above, chasing pointers from the remaining vertices , an even number of them switch from to a copy at layer when following pointers. This implies that and cannot fall in the same cycle.
Appendix D Missing proofs from Section 4
Proof of Claim 10
Proof of Claim 10.
Since , . 555We take the lower order terms due to the functions being permutations as subsumed in the larger term due to for clarity.
By Markov’s inequality, we have
so for at least of all , the first message satisfies
For such any , we therefore have: , for .
Since , subadditivity of entropy implies that . Applying Markov’s inequality yields that at least of satisfy .
Call a good for , if . Since the distribution of is uniform, rectangles corresponding to pairs with and good cover at least a fraction of the total inputs.
Proof of Claim 11
Proof of Claim 11.
Define . Let denote a random variable following the distribution of conditioned on .
Let be the distribution induced over (equivalently, ) in . We know from Claim 10 that .
Since , we have that .
By Markov’s inequality, with probability over choices for , it holds that .
Call ; the above implies .
Definition 21.
-
Call a vertex is locally good with respect to if
-
is globally good if it is locally good for at least of under .
Claim 22.
Each has at least locally good vertices with respect to it.
Proof.
Since , by sub-additivity we have:
The statement follows by Markov’s inequality on .
Claim 23.
The number of globally good is at least .
Proof.
Let be the number of globally good . Since every has at least locally good vertices, and has measure at least under , we have:
Rearranging gives the stated bound on .
We now make a probabilistic argument to continue the proof of Claim 11. Let be a random one-to-one mapping induced by . Define:
For each globally good , define the random variable:
captures the probability mass (under ) that fixing a leads to . We note that is random purely due to the random assignment , which is completely determined by . Furthermore, is a uniformly random permutation in the subrectangle under consideration, so is a uniformly random one-to-one mapping.
Claim 24.
For any constant , and any globally good as in Definition 21,
Proof.
Partition the indices into
with , since is globally good.
Let . From a calculation as in Lemma 17 (Appendix B) with , applied on the distribution (recall that ) , we infer that . This implies that:
| (2) |
We can thus write:
We now concentrate on the contribution from the good indices and apply the decomposition lemma (Lemma 17) to the distributions therein, since they have high entropy.
For , from the entropy bound , we apply Lemma 17 with and , giving:
Using this decomposition, we can write as follows:
where
and has the remaining terms of , and is a non-negative random variable. Intuitively, captures components of the distribution having high min-entropy.
We now bound the probability of deviation of from its mean. Clearly, for any , . We compute the expectation:
| (3) |
Theorem 25 ([12], [6, The method of averaged bounded differences, Corollary 5.1]).
Let be random variables and a function such that for each there exists with
for all choices of . Then, for any ,
We apply the above, for being and the random variable . We bound for : each term in is of the form , and is non-zero only for one . Thus, each contributes at most:
Each is at most , and we have at most terms. This gives us:
Applying the concentration bound with , we get:
Setting , to be the set of globally good indices , , and a union bound, we conclude the proof of Claim 11.
Proof of Claim 12
Proof of Claim 12.
Let Alice’s message have . We know that before is passed, for every , since is independent of (and ) .
In particular, for , with probability over the choice of , . Thus, within on fixing such an , we get a sub-rectangle where, by subadditivity of entropy:
Call the uncertain vertices, since Bob cannot predict the next pointer () starting at these with reasonable certainty after receiving . Intuitively, it is sufficient to show that in , occurs as an uncertain vertex with constant probability. We formalize this next.
Within , the value of is known exactly to Bob: it is the same across , for any fixed . This is because are fixed. Thus, for a fixed , if , . Applying Fano’s inequality (Fact 16.0.1), the probability of Bob making an error in finding the value of is at least , for being small enough.
Since satisfies Eq. (1), we get a set of globally good vertices satisfying Claim 11. Then , and each has probability at least . Thus, the probability of Bob seeing an uncertain vertex as in is at least .
Overall, the probability of error within is:
This concludes the proof of Claim 12.
Appendix E Missing proofs from Section 5
Proof of Claim 14
Proof of Claim 14.
Let . Since , , and , we obtain
There are at least elements in satisfying ; call this set . Let . From the above, . Let be a large constant to be set later, and .
Let be the induced distribution on in . We have . By Markov’s inequality, , for .
But this means that we should have at least elements in , since each element has probability at most .
Proof of Claim 16
Proof of Claim 16.
Define to be indicator variables for the events under a random choice of . For any , we have . Thus, . Let .
Claim 26.
Proof.
We use McDiarmid’s inequality (see Theorem 19) on the function . The average bounded difference at satisfy .
Let for convenience. Let . Note that . Since , we have:
Hence, conditioned on , with probability at least over choices of , we have that . Fix such a pair, and let for some . By subadditivity of entropy:
By Markov’s inequality, there are at least indices where . From these, at least coordinates have to be from . For each of these, we have that . Let .
Thus, the probability (over , under the distribution of ) that we get at least indices from , such that is at least:
Appendix F Upper bounds
We adapt the protocols from [5, 15] to infer what depth of pointer can be chased with bits. For simplicity of notation, while describing protocols, we let and all . We will incur a multiplicative factor of in the communication cost if the functions are different.
Theorem 27.
Let . Then .
Proof.
Let .
-
Round 1 (Bob Alice): Bob sends the last bits of for every .
-
Round 2 (Alice Bob): Alice sends the last bits of for every , and also sends along with for every possible vertex in Bob’s list that could point to.
-
Round 3 (Bob Alice): Bob computes . From Alice’s message, he knows . He then computes and sends along with for every possible vertex in Bob’s list that could point to.
-
Round 4 (Alice Bob): Alice computes . From Bob’s message, she knows . She then computes and sends along with for every possible vertex in Bob’s list that could point to.
-
…
-
Round : If is even, Alice computes , knows the possibilities for from Bob’s previous message, computes , and sends along with for all possible vertices;
if is odd, Bob performs the analogous computation and sends along with for all possible vertices.
-
…
-
Round : If is even, Alice knows and sends it to Bob; if is odd, Bob knows and sends it to Alice, where .
Intuitively, Alice knowing the pointer values for one layer ahead from Bob allows her to “jump” forward pointers per round: two from her layers and one from Bob.
Rounds 1 and 2 together take bits. All other rounds (except the last round) take bits per round. Since the number of rounds is constant, the total communication is bits.
Theorem 28.
Let . Then .
Proof.
The protocol follows the same communication structure as in the protocol, with each party in every round (starting from round 2), each party additionally sending the following bits every round:
-
Round 1 (Bob Alice): Bob sends the last bits of for every vertex , which also implicitly communicates the bit vectors across all vertices.
-
Round 2 (Alice Bob): The bit vector for all .
-
Round 3 (Bob Alice): The bit vector for all .
-
Round 4 (Alice Bob): The bit vector for all .
-
Round 5 (Bob Alice): The bit vector for all .
-
…
-
Round i: If is even, Alice sends for all , else Bob sends for all .
-
…
-
Round : If is even, Alice knows from the base protocol, and she learns from the extra parity information exchanged from Round 2. Thus, she sends to Bob, where is chosen so that the resulting vertex is exactly . If is odd, Bob analogously computes and sends to Alice.
Rounds 1 and 2 together take bits. All other rounds (except the last round) have additional bits compared to the protocol, but since the number of rounds is constant, we still have the same total communication of bits.
