Lower Bounds on FSS from Dynamic Data Structures
Abstract
In Function Secret Sharing (FSS), a dealer with a given function from bits to a commutative group such that is in a function class shares succinct keys with two properties. Evaluating each key separately on a common input results in additive shares of and any subset of the keys does not provide information on . Two-party FSS schemes which are reducible to One-way Functions (OWF) have applications in cryptography, complexity, and in practical data security systems.
We establish a two-way transformation between a two-party FSS scheme for a function class , which is black-box reducible to an OWF, or even black-box reducible to a family of Pseudo-Random Functions (PRF) and a dynamic data structure that supports range queries on . A data structure of this type enables dynamically adding functions to a multiset of functions , and answering range queries on the output of , i.e., returning for a query . The data structures are defined in one of several models which abstract RAM.
The correspondence together with known lower bounds on the update time and the query time in data structures leads to the first non-trivial lower bounds on FSS schemes which are black-box reducible to PRF. These lower bounds apply to FSS schemes with polynomial key size and include:
-
For , the class of all functions which assign a constant group element to any input in a specified -dimensional box and to all other inputs: if the key sharing function, , runs in time polynomial in and the evaluation function is then:
-
–
If and then ’s running time is .
-
–
If and is cyclic such that then ’s running time is .
-
–
If is a constant and further, and correspond to operations on data structures in the Oblivious Group Model (this includes all known FSS from OWF techniques), then the product of ’s time and the key size is .
-
–
-
For , the class of all monomials such that , assuming : if runs in polynomial time, then ’s running time is .
Keywords and phrases:
FSS, Data Structures, Lower Bounds, Black-Box ReductionsCopyright and License:
2012 ACM Subject Classification:
Theory of computation Cell probe models and lower bounds ; Theory of computation Cryptographic protocolsEditor:
Shubhangi SarafSeries and Publisher:
Leibniz International Proceedings in Informatics, Schloss Dagstuhl – Leibniz-Zentrum für Informatik
1 Introduction
In this work, we show a surprising connection between two seemingly very different objects: Function Secret Sharing (FSS) and dynamic data structures which support range queries.
FSS [11, 13] is a generalization of secret sharing [32, 3], which enables a dealer to distribute succinct and secret shares of a function in a given function class . In more detail, FSS is made up of a pair of probabilistic, polynomial-time algorithms such that on input and a number of parties , produces keys with three properties. First, the sum of executing on each key with a common input results in for all in the domain of . In addition, an adversary that has access to at most keys does not obtain information on , beyond its inclusion in . Finally, the keys are succinct, i.e. much smaller than the size of the truth table of , preventing the trivial FSS scheme in which is secret-shared independently for every possible input .
In this work, as in most works on FSS, we are interested in FSS for two parties, i.e. . In this setting, the class of functions that admits FSS schemes depends on the computational power of the adversary. At one extreme, if the adversary is computationally unbounded, then even very simple function classes such as the class of all point functions, which assign to a single point in the domain and to all the others, require exponential key size [18, 11].
At the other extreme, assuming the hardness of some problems that imply additively homomorphic public-key encryption such as Decision Diffie-Hellman [12], variants of Learning With Errors (LWE) [15] or Decisional Composite Residuosity (DCR) [28] leads to FSS for the class of all logarithmic depth circuits with constant fan-in, while stronger lattice assumptions imply FSS for all circuits of polynomial size [21]. While these FSS schemes that require public-key encryption have small keys and asymptotically efficient evaluation time, the concrete evaluation time is typically too high for use in practical applications of FSS.
Arguably, the most useful type of FSS schemes for practical applications are those whose security relies on the existence of One-way Functions (OWF). The classes of functions that have FSS from OWF are, as far as we currently know, far more limited than classes of functions that have FSS from LWE or DCR. Prominent examples of such classes include the class of point functions, the class of intervals, i.e. all functions that are on a contiguous interval of and on the rest, and more generally, the class of functions that are computed by a polynomial size decision tree, while leaking the topology of that tree [13].
FSS schemes that are reducible to OWF are typically defined based on cryptographic primitives that are polynomially equivalent to OWF, but are more convenient to use and in a sense more powerful such as length-doubling Pseudo-Random Generators (PRG) or Pseudo-Random Functions (PRF). Thus, the key size of the FSS scheme is determined by the number of PRF keys (or PRG seeds) it uses and the computational complexity of its generation and evaluation algorithms are determined by the number of calls to a PRF (or a PRG).
FSS schemes from OWF have found a range of applications due to their simplicity, their small key size and their concretely efficient generation and evaluation times. These applications include privately reading from and writing to databases [11], anonymous messaging [19], Multi-Party Computation (MPC) with non-arithmetic gates [14, 7], Pseudo-Correlation Generators (PCG), which are used to locally generate long, correlated pseudorandom strings for use in MPC [8, 9, 27], and others.
All of these applications of FSS from OWF depend on a number of useful schemes, especially in the two-party setting. However, a rapid string of successes in construction of schemes after FSS was defined in [11], has recently been followed by much slower progress. At this point it is unclear whether we have reached the limit on the class of functions that admit FSS from OWF and on the efficiency of such FSS schemes or whether significant progress is still possible.
A large part of the problem is our lack of lower bounds on FSS schemes. The only known lower bounds on key size and computational complexity of any two-party FSS are trivial bounds that are implied by the communication complexity and circuit complexity of functions in the class without the security requirement of FSS.
The main goal of this paper is to improve our understanding of the FSS landscape by proving non-trivial lower bounds on the efficiency measures of FSS. We achieve this by connecting FSS with the second topic of this work which is dynamic data structures for range queries.
In its simplest form, the problem of range queries considers points in some domain, e.g. , which are inserted into the data structure by operations. The data structure supports operations on a range, which return data on all the points in the data structure that are included in the range, e.g., the number of points in the range.
A more general form of the problem, which is required in this work, associates a weight with each input point. The weight is an element of a commutative group . An operation updates the weight of a point, while a operation returns the sum of all the weights of points that fall in the queried range. For example, to support counting the number of points in a given range the weight of every point is .
Data structures for range searching have been an active research area for quite some time [4, 1]. In particular, work on lower bounds for range searching improved our understanding of what is feasible in this domain [22, 16, 17, 30, 29, 20, 24, 25, 26]. Demonstrating lower bounds on data structures requires a formal model in which such lower bounds can be proved.
An important model for lower bounds is Yao’s cell probe model [33]. This model, which is an abstraction of Random Access Memory, views the memory as a set of cells that each store bits. An update probes a number of cells and updates them, while a query again probes a number of cells and returns a response. In both the update and the query operations the location of the cells read or written to can be an arbitrary function of the operation’s input and the content of previously probed cells. The time of the update and query operations is measured by the number of probed cells, disregarding any other computation. The cell probe model is not restrictive and can therefore model almost any conceivable data structure. On the other hand this permissiveness results in relatively weak lower bounds.
Some of the lower bounds proved in the cell probe model are directly useful for our work. These bounds include a bound111Following FSS literature, the input domain in this work is , while in most data structure works the input domain is , leading to an exponential difference between the bound we report and the bound in the original works, which in this case is e.g. . from [24] on the query time of any data structure in the cell probe model for cells and weights with description size that supports weighted orthogonal range counting in two-dimensional space for any polynomial update time. In the version of the same problem in which the goal is to return the parity of the number of points in a range (i.e. a rectangle), the lower bound on the query time is only [26]. The same work [26] discusses a data structure for a different problem in which the data structure represents a polynomial over a binary field with degree at most for some . An update operation changes a coefficient in the polynomial and a query on point returns the last bit of . Then, the query time has a lower bound , where is the update time.
A more restrictive model in which lower bounds have been proved is the Oblivious Group Model [23]. In this model, the weights are again elements in a commutative group, but the data structure is not aware of the particular group. The memory cells store linear combinations of the weights of each item in the data structure. The data structure responds to range queries by computing linear combinations of the weights of the cells it probes until it achieves the sum of the weights of all the stored items that intersect the desired range. An update operation consists of reevaluating every cell with a linear combination that contains the weight of the updated point with non-zero coefficient. In this model, orthogonal range counting in -dimensional space for a constant has a lower bound on the update time and the query time which is (see the full version of [25]).
1.1 Our Contribution
The first contribution of this work is proof of a correspondence between two-party FSS that is black-box reducible to OWF, or even FSS that has black-box access to a family of PRF, and data structures for range queries.
One side of this correspondence is a transformation from an FSS scheme for a function class to a data structure that answers range queries related to . Initially, the data structure is populated by one of the keys that creates for the constant function (which can be assumed to be part of ) and the truth table of a random oracle, which plays the part of a PRF. Subsequently, each update to the data structure corresponds to an invocation of for a function that uses a modified version of the oracle, replacing the key in the data structure, and a subset of the possible values in the truth-table of the new oracle. Each query with input requires two calls to to return where are all the functions that have previously been inserted into the data structure.
A lower bound on the update time or query time in the data structure translates to a lower bound on time or time, respectively, in the FSS scheme.
However, the transformation outlined above does not create a data structure in the cell probe model, but in a generalization of it which we refer to as the write probe model. In this model, the memory cells can be initialized to arbitrary values prior to the first update and query operations, and further, the cost of updates is taken to be the number of cells to which the operation writes, essentially allowing an update operation to read all the cells for free. We show that a number of lower bounds in the cell probe model carry over with almost no change to the write probe model, enabling the translation of lower bounds on range queries to lower bounds on FSS.
An additional issue is that the natural form of the data structures stores points in a domain and provides a range in a query asking for the number of stored points in the range (or some other statistic on all the points in the range). However, the data structure that the transformation creates exchanges points and ranges, storing ranges (i.e. functions) in the data structure and providing a point as a query asking for the number of ranges in which this point appears. This inversion between domain points and ranges in the two-dimensional orthogonal range query case is called rectangle stabbing. A folklore result shows that in general for orthogonal range searching the inversion is possible with small cost in query and update time.
The second contribution of this work is the first non-trivial lower bounds for efficiency measures of two-party FSS from OWF on any function class. We use the correspondence between FSS and data structures together with minor updates to lower bounds from the literature on data structures in the cell probe model to prove the following:
-
For , the class of all functions which assign a constant group element to any input in a specified -dimensional box and to all other inputs: if the security of the FSS scheme is black-box reducible to PRG, the key sharing function, , runs in time polynomial in and the evaluation function is then:
-
–
If and then ’s running time is .
-
–
If and is cyclic such that then ’s running time is .
-
–
-
For , the class of all monomials such that , assuming : if again the security of the FSS scheme is black-box reducible to PRG and runs in polynomial time, then ’s running time is .
The second side of the above correspondence is a transformation from a data structure for range queries in the Oblivious Group Model to two-party FSS. The transformation requires developing an incremental variant of known schemes for Distributed Multi-Point Functions (DMPF) [10], i.e., FSS for the class of multi-point functions. DMPF is the class of functions that are across the whole domain except for points such that for all . The incremental variant of DMPF is a generalization of Incremental Distributed Point Functions [6] which enables the generation and evaluation of secret shares of values for all prefixes of the points with the same cost as DMPF.
Despite the restriction of the transformation to data structures in the oblivious group model, this transformation covers all the FSS schemes we currently know. Therefore, any lower bound for data structures in the oblivious group model applies to all FSS we know, and breaking such lower bounds will require substantially new FSS techniques
The final contribution of this paper is a lower bound on two-party FSS from OWF for the class of for . The lower bound applies only to FSS that can be viewed as the result of our transformation from a data structure for range searching boxes in -dimensional space in the oblivious group model and states that the product of the key size and time is .
1.2 Our Techniques
1.2.1 FSS to Data Structure
When constructing the data structure from an FSS scheme, our goal is to maintain a multiset of functions from a function class , allow updates that add functions to , and efficiently answer queries on input to which the response is .
Let be a two-party FSS scheme for the class , where and denote the FSS generation and evaluation algorithms with access to an oracle . takes as input a security parameter and a function and returns two keys . takes as input , a party index , the associated key , and a point and returns the evaluation result at . The correctness of FSS ensures that . In many cases, it is convenient to use the vectors .
In the FSS literature, a length-doubling Pseudo-Random Generator (PRG) is most commonly used as the oracle . In this work, we assume that the FSS has access to a more powerful primitive: Pseudo-Random Functions (PRF), since it strengthens our results. We use for the number of oracle queries makes and for the number of oracle queries makes.
We work under the assumption of fully black-box security [2, 13], where if there is an algorithm that breaks the FSS with probability , then there is an algorithm that uses as an oracle and breaks the PRF in polynomial time with success probability polynomial in .
The transformation we show requires a single invocation of for each update operation and two invocations of to respond to each query. Storing a single function could easily be done using the FSS by randomly sampling an oracle and generating . When answering queries, we compute .
To add more functions, we will use a procedure we call side-switching. In this procedure, we take a function , a key and have an oracle , we return a new key and a new oracle which is only different from in very few positions such that .
From [11] we can assume WLOG that the all-zero function is in the function class . Using two side-switching operations with and then we can therefore obtain a key and oracle with .
This gives us a way to build the data structure. We maintain two keys and two oracles. At the start they are initialized such that , and when doing updates we use the side-switching operations to update and to such that .
The only thing left to explain is how to achieve the side-switching operation. Note that because we are working in the write probe model the computation required does not matter.
Given a key and the function , we want to efficiently find an oracle and an execution trace (e.g. the random choices taken) of which returns , and from it we can obtain . We shall then use the FSS’s security to argue that the success of this procedure cannot significantly depend on being originally generated from , and this procedure must work with high probability for all keys.
We cannot just use , as that requires breaking the security of the FSS. To bypass this obstacle, we utilize the fact that the required efficiency is in the context of a reduction that is fully black-box. This means that the algorithm is allowed to do exponential work, as long as it invokes the oracle a polynomial number of times. It can therefore search for a second oracle and execution trace of which produces . The execution of only queries oracle locations, so it suffices to change at these locations.
This gives a similar oracle and a new key such that . This is close to the desired outcome, but what we actually want is to have with the original oracle in place of with the second oracle . To resolve this, we search for that does not “contradict” on the values that the FSS requires. That is, we start by computing and keep track of the locations it queries. When searching for , we ensure that it returns the same values as when querying these locations. Now, the side-switching procedure makes oracle queries – up to queries. By using instead of the original in the FSS, we get from the reduction that if this new side-switching method does not work with high probability, we can break the oracle PRF with good probability using just queries, which is impossible for a random oracle.
We can thus conclude that side-switching works correctly with high probability, and we have our data structure. There are a few details involved in proving it works correctly when it is used many times, as the oracle and key distribution may shift from the original distribution, which are worked out precisely in Lemma 17.
1.2.2 Data Structure to FSS
Recall that a data structure in the oblivious group model consists of adding constant multiples of the weight to some cells in and returning a linear combination of cells in .
We can consider the updates as a matrix , whose rows are indexed by the possible input points and columns are indexed by the memory cells. The content of the matrix at is the coefficient of input point in the -th memory cell.
The complexity of an update is the number of non-zero elements in the -th row of , and is the maximum number of non-zero elements in a row.
We can also regard the queries as a matrix , whose rows are indexed by memory cells and whose columns are indexed by the possible queries. The content of the matrix at is the coefficient of the -th memory cell in the linear combination for query .
The complexity of a query is the number of non-zero elements in the -th column of .
The weight of input point in query can be found by looking at the content of the product at .
We can now construct an FSS for the function family whose functions are the input points of the range queries problem, and the functions’ domain is the queries of the range queries problem solved by the data structure. We do this by having share a DMPF of the appropriate row of , . In we will compute a linear combination of the values of shared in the DMPF, . When summing both sides we have , so we get which is exactly the desired result.
While for some problems like DPF or DMPF this approach gives the best known result (up to constant factors), there are also many problems (DCF, prefix DPF) where this solution is worse by a factor of . Essentially, this is due to costs incurred by the DMPF that the specific schemes can avoid by utilizing the specific patterns in the matrices.
We construct a combination of DMPF and incremental DPF [6], called incremental DMPF. This variant enables sharing a binary tree with up to nodes and values at its nodes, instead of having any points, but its cost is proportional to instead of . The evaluation is also incremental, which means we can traverse the tree and evaluate as we walk, instead of having to traverse a path from the root to a leaf for each evaluation.
FSS schemes based on Incremental DMPF obtain for all function classes key size and evaluation time that are equivalent to the best known direct FSS constructions up to constant factors. The correspondence of such FSS schemes with data structures in the oblivious group model means that the lower bounds on the data structures hold for FSS based on Incremental DMPF.
2 Preliminaries
Notation 1.
is the uniform distribution on functions . is the uniform distribution on functions .
2.1 FSS
We adapt the definition of FSS to add an oracle and specialize it to two parties. As in [13], we assume that a function description contains the input size .
Definition 2 (FSS, adapted from [13]).
A (1-secure 2-party) FSS scheme for a function family with an oracle is a pair of
-
1.
a PPT algorithm : takes a description of a function , and returns two keys, .
-
2.
a deterministic polynomial algorithm : takes a key and a point to do the evaluation and returns a group element.
We define the key size as the worst-case number of -bit cells required to represent a key returned by . We define the evaluation efficiency of an FSS as the maximum number of oracle queries and key accesses (to -bit cells) done by . Finally, we define the generation efficiency as the maximum number of oracle queries done in .
Definition 3 (FSS correctness, from [13]).
A (1-secure 2-party) FSS scheme for a function family with an oracle is correct if for all oracles , with input size and input , if then
Another useful function is , which executes in all locations at the same time. For our purposes we can simply regard it as being implemented by calling for each value of , but in many schemes it can be implemented more efficiently.
We also define
Definition 4 (FSS observation).
Given an FSS scheme , a party index and a key , we define as a partial function containing all values of that are queried while running .
Note that if then .
We further define which is a modified version of . In addition to the two keys it also outputs an execution trace , which is a partial function containing input and output pairs for all oracle invocations that performed.
As for secrecy we will specialize the definition of fully black-box reductions (or BBB-reductions) [2, 31] to the case of FSS being secure from the oracle being a PRF.
The usual definition of security (without an oracle) states that there is a simulator such that is indistinguishable from for an acceptable leakage function . When adding an oracle to the definition, we can consider two variants: a weaker one in which the simulator has access to the same oracle that and call, and a stronger one in which the simulates the correct distribution without access to the oracle. Our proof requires the stronger variant.
In particular, cases where keys are pseudorandom fit under this more restrictive definition.
Definition 5 (FSS fully black-box secrecy, adapted from [2, 13]).
We say that an FSS scheme for a function family with an oracle is fully black-box reducible to PRF with reduction constant if for any party there exists a PPT simulator and a PPT PRF-distinguisher such that for every security parameter , function with description size , adversary (not necessarily polynomial-time) and oracle , if
then
Example 6.
The DPF from [13] is fully black-box secure (when simulating length-doubling PRG via two calls to the PRF)
Proof.
In the security proof of [13] there is a sequence of hybrid distributions such that is the same distribution as , is the same distribution as . It also gives algorithms such that the distribution of is and of is . The distinguisher will randomly choose and call on the result of . In the case we will invoke on one of and in the case we will invoke it on one of . The difference is exactly versus which is versus , so this distinguisher has advantage .
A particular type of FSS which we require in this paper is DMPF [10], which is an FSS which can represent all functions , but leaks an upper bound on the number of non-zero points in the function, and whose runtime depends on .
Using [10]’s DMPF construction based on Oblivious Key-Value Store (OKVS), instantiated with the OKVS from [5], gives a DMPF secure from One-way Functions with key size , generation efficiency and evaluation efficiency , as decoding the OKVS of [5] requires reading continuous bits, so with the key being stored in -bit cells we only need to read cells per decoding.
2.2 Data Structures
In dynamic data structures, we consider the problem of maintaining a data structure which is able to perform updates and answer queries.
We are interested in particular in the problem of (weighted) range counting [25, 24]. This problem has two similar variants:
-
1.
In the first, we have a function family with an abelian group as a range and integer weights , and we need to support changing the weight of a particular function for updates and computing for a given for queries.
-
2.
In the second presentation, the range of the functions is , and the weights come from an abelian group. Here too we need to support changing weights and computing .
There are multiple models in which these data structures can be considered. We are generally interested in two efficiency parameters, , the update cost, and , the query cost. One of the most restricted is the oblivious group model [25] where we assume the weights come from a black-box abelian group , and the data structure is a long array of group elements. This model fits the second presentation of weighted range counting. Queries consist of summing at most array elements with integer weights, and an update of adding to the weight of consists of adding integer multiples of to at most array elements. In the oblivious model the integer multipliers are independent of the preexisting data structure. Formally, we can define it as:
Definition 7 (Oblivious group model, adapted from [25]).
A data structure in the oblivious group model for with domain and range is a factorization of the matrix representing the function family to an update matrix and a query matrix , such that each row of has at most non-zero values, and each column of has at most non-zero values. We call the internal size of the data structure.
Another commonly considered model is the cell probe model. In this computational model the data structure is a long array of -bit cells. Updates can read and modify at most cells, and queries can read cells.
One data structure problem we consider multiple times is the 2D range counting problem, which we define as storing points on an grid for , potentially with weights, and answering queries of the number/sum of weights of points stored in the data structure which are dominated by a given query point – that is, points that are smaller than in both coordinates.
In the rectangle stabbing problem we maintain rectangles contained in a grid, potentially with weights, and want to find the number/sum of weights of rectangles containing a given point.
These two problems are equivalent by the folklore reduction outlined in the full version of [26].
These problems can be defined similarly in an arbitrary dimension , and the same reduction applies (when we treat as a constant).
3 Write Probe Model
The target of our reduction will be a new model, even more general than the cell probe model, which we call the write probe model.
In the write probe model we have an unrestricted initialization step which occurs before all updates and queries. Updates can modify at most cells given by an arbitrary function of the update and the current content of the data structure. Queries can adaptively read up to cells – the location of the next cell to read can be an arbitrary function of the contents of previous read cells and the query.
We first observe that as long as errors are detectable, we can allow for a small error probability:
Proposition 8.
Let there be a data structure in the write probe model for a problem with worst-case update time and worst-case query time . Suppose each update requires cells to describe, and fails with total failure probability for updates. Suppose further that if an update operation fails then it returns an error value, notifying the caller of this failure. Then, we have a data structure in the write probe model for the same problem with worst-case update time and expected query time for updates, and no failure probability.
Proof.
The modified data structure will additionally keep a list of all updates which were performed, and whether any update operation failed. In query, if the data structure has detected any failure, it will instead directly compute the answer from the list of updates. This requires reading cells of updates, but only with probability , so in expectation it only changes the query complexity by .
We can show that while the write probe model is theoretically stronger than the cell probe model, in practice they are very similar, and in particular many cell probe lower bounds transfer without any modifications to the proof. The underlying reason for this is the use of the chronogram method:
Our description is based on the overview of [24, 26]. In the chronogram method of Fredman and Saks [22], we have updates which are batched to batches (for some parameter , slightly larger than ), with batch consisting of performing updates to the data structure. The batches are performed from big to small. Very broadly, the argument then follows that for any the batches can contain no information about update , as they occur before it. Additionally, batches have total size , so they cannot contain much information about batch .
The argument then roughly follows information theoretically by showing that, together with some partial information about the data saved in the data structure while performing , one can recover more information about than is possible.
Remarkably, this methodology doesn’t care at all about which cells the data structure reads during an update, only about the modifications it makes to it. In essence, this is because each batch is only treated as statically, where we look at the overall modifications it made to the data structure. Additionally, this methodology is not affected if the data structure is initialized from whatever distribution we wish. This means that results following it apply as-is to the write probe model, and in particular:
Theorem 9 (Adapted from [24] Theorem 1).
Any data structure for dynamic weighted orthogonal range counting in the write probe model for updates must satisfy . Here is the cell size, is the expected average query time and is the worst case update time. This lower bound holds when the weights of the inserted points are -bit integers.
Proof Sketch.
We argue that Larsen’s proof works as-is for the write probe model case.
As described previously, [24] uses the chronogram methodology. To fill in more details, the information given about is a set of cells which is a subset of the cells modified by , such that many queries are resolved by – that is, all of the cells with last update time that are read when performing those queries are in . When encoding we describe and a set of resolved queries (as well as the batches ), and we condition on the batches . In the decoding we can then evaluate all queries in : when reading a location, if it was modified in batches we have the correct value, else if it is in we also know the stored value, and otherwise, since resolves , this must be from a batch which we know.
The main challenge here is showing that if a query does probes to the cells modified by for then there exist sets , , such that is resolved by , and the queries in it are sufficiently well-spread that their answers provide more information about than it takes to specify and , yielding the desired contradiction.
This existence is shown in cell probe mode by Lemma 3 in [24], and as the proof of the lemma doesn’t depend on the initialization nor does it utilize a bound on the number of cells an update reads, only the amount of cells it modifies, it also applies to the write probe model. Using the equivalence to rectangle stabbing mentioned in Section 2.2 we also get a lower bound for this problem.
Corollary 10.
Any data structure for dynamic weighted rectangle stabbing in the write probe model for updates must satisfy . Here is the cell size, is the expected average query time and is the worst case update time. This lower bound holds when the weights of the inserted rectangles are -bit integers.
Theorem 11 (One-way weak simulation theorem for the write probe model, adapted from [26] Theorem 1).
Let be a dynamic boolean data structure problem, with random updates grouped into epochs , such that , followed by a single query . If admits a dynamic data structure in the write probe model with cell size , worst-case update time and average (over ) expected query time , satisfying , then there exists some epoch for which there is a one-way randomized communication protocol in which Alice sends Bob a message of only bits, and after which Bob successfully computed with probability at least .
Proof Sketch.
The proof of Larsen, Weinstein and Yu also works without modification for the write probe model. One should note, however, that the epoch associated with a cell changes its meaning from the last epoch in which it was written or read to the last epoch in which it was written. Looking at how their proof uses associated cells, however, this is completely fine.
As an overview, the proof works by having Alice send Bob a random sample of the cells written during epoch . The probability of this sample sufficing to resolve a query is sufficiently large that if Bob could detect this random event we would finish, by outputting a random bit if it doesn’t happen and simulating the data structure if it does.
However, Bob cannot detect that event. Instead, Bob will proceed as if this event occurred, and consider the set of locations he probes when answering the query. By the Peak-to-Average Lemma from [26], there exists a small subset , such that conditioned on the values of the data structure in , the maximum-likelihood answer to query, conditioned on Bob’s information, has sufficient advantage over random guessing, and Bob can compute this privately. However, Alice does not know . To resolve this, she will use public randomness to randomly sample some cells, and out of those send the cells associated with epoch to Bob. Using the public randomness Bob can determine whether is contained in the sampled cells. If it is not he will abort and return a random bit, otherwise he knows the values of and can return the more likely query answer conditioned on his information.
The proof in [26] shows that the described protocol succeeds with sufficient probability, and this proof works identically in the write probe model.
Note that no step in the argument depends on the cells read during updates, or on the initialization, so it works as-is for the write probe model.
Since the weak simulation theorem also applies for the write probe model, so do the results from [26] which depend on it, and in particular we have:
Theorem 12 (Adapted from [26] Theorem 2).
Any write probe data structure for least-bit polynomial evaluation over supporting degree polynomials and updates,222Compared to [26] we have different parameters for the degree and number of updates. We reduce the larger one to the smaller to obtain the result. having cell size , worst case update time and expected query time must satisfy .
Theorem 13 (Adapted from [26] Corollary 1).
Any write probe data structure for 2D range parity, having cell size , worst case update time and expected query time must satisfy .
And using the reduction to 2D rectangle parity we also get a lower bound for it:
Corollary 14 (Adapted from [26]).
Any write probe data structure for 2D rectangle parity, having cell size , worst case update time and expected query time must satisfy .
4 FSS to Data Structure
For the following section, let be a function family that admits an with key size , generation time and evaluation time which is fully black-box reducible to PRF with reduction constant . We also assume that the leakage consists only of and , and will ignore it in this section. We use for the functions in with input size and assume that the function description size is bounded by a polynomial in , .
We start by proving a mostly technical result that uses the FSS scheme’s black-box security to show that an adversary which makes at most queries cannot distinguish between a random oracle and a key generated from it and a random oracle and a key sampled from with significantly higher advantage than .
Lemma 15.
Let . There is a polynomial such that for any , any algorithm with oracle that makes oracle queries and any ,
Proof.
Let be a constant such that makes at most oracle queries. Let be an increasing polynomial bounding the runtime of from Definition 5. We will set .
Set . By the triangle inequality, it suffices to prove .
Suppose by way of contradiction that .
By Definition 5,
Applying Jensen’s inequality, we get
This is impossible, however – makes at most oracle queries (counting indirect queries using the oracle). This means that it can query with as the first coordinate only with probability , and otherwise there is no way for it to distinguish between and a random oracle, so its advantage is bounded by .
By the triangle inequality we also get immediately that one cannot distinguish between keys of different functions:
Corollary 16.
Let . There is a polynomial such that for any , any algorithm with oracle which makes oracle queries and any ,
We now prove the main technical result behind the construction of the data structure: the FSS side-switching lemma. Informally, this lemma states that for any function , given a random oracle and a key for it, after observing only oracle locations, we can produce a modification of locations in the random oracle and a second key , such that:
-
1.
The modification does not change the evaluation of at all.
-
2.
After applying the modification, are a valid pair of FSS keys for with respect to the modified oracle.
-
3.
The distribution of the oracle after applying the modification together with is hard to distinguish from that of a random oracle together with a key for it.
The first two properties are baked into the algorithm, and the third is the content of Lemma 17.
FSS side-switching
(, , , ):
Lemma 17 (FSS side-switching correctness).
Let . Then there is a polynomial such that for any , any algorithm with oracle which makes oracle queries and any ,
For from Algorithm 1.
Proof.
We shall argue this result by showing a sequence of hybrid distributions of with the first one being , and the last one being . We will show that cannot distinguish between any pair of adjacent distributions better than , and the result will follow by the triangle inequality.
Let us start by writing the first distribution in more detail:
-
1.
Sample
-
2.
Sample
-
3.
Sample
Because only queries locations of to calculate , and then any query to requires at most one query to , we have by Lemma 15 that cannot distinguish with probability greater than between this distribution and
-
1.
Sample
-
2.
Sample
-
3.
Sample
Because steps 1,2 are independent, we can swap them. We also add a step sampling which we don’t use yet:
-
1.
Sample
-
2.
Sample
-
3.
Sample conditioned on
-
4.
Sample
Because 2 and 3 sample a pair of values from the same distribution conditioned on some function of them being equal, we obtain the same joint distribution by first sampling uniformly and then sampling conditioned . We then swap the sampling of and , as they are independent:
-
1.
Sample
-
2.
Sample
-
3.
Sample conditioned on
-
4.
Sample
Note that step 3 queries values of , and step 4 doesn’t use , we can apply Lemma 15 again to see that cannot distinguish better than between the previous distribution and:
-
1.
Sample
-
2.
Sample
-
3.
Sample conditioned on
-
4.
Sample
For here we will apply a long sequence of information-theoretical transformation, which don’t change the distribution at all. To begin with, we only care about so we can fold it into the sampling in step 2:
-
1.
Sample for a random oracle .
-
2.
Sample conditioned on
-
3.
Sample
Now let us unfold the definition of :
-
1.
Sample for a random oracle .
-
2.
Sample conditioned on
-
3.
Sample and conditioned on and
-
4.
Define
We can swap 2 and 3 as they are independent:
-
1.
Sample for a random oracle .
-
2.
Sample and conditioned on and
-
3.
Sample conditioned on
-
4.
Define
And then combine 3 and 4 together:
-
1.
Sample for a random oracle .
-
2.
Sample and conditioned on and
-
3.
Sample
We will now merge steps 1 and 2 together. We can think of them together as sampling from quintuples . The first step samples from the correct marginal distribution, and then the second step samples the rest of the value conditioned on the values sampled in the previous step, so we can merge them to sample a quintuple directly. Because is not used in step 3 we will remove it and only sample :
-
1.
Sample , for a random oracle .
-
2.
Sample
Finally, we can regard these two steps again as sampling from quintuples . The first step samples from their marginal distribution, and we see that is sampled from the correct distribution, that is all oracles which agree with are equally likely, because no relevant execution of can query any locations outside of (precisely by the definition of and ). This means that we simply sample from the quintuples, which we can also do by first sampling the oracle. This gives the desired distribution:
-
1.
Sample
-
2.
Sample
For the data structure itself we won’t use the full , but instead a reduced version . Instead of returning the full oracle, the reduced version will return the changed locations .
The data structure mostly functions by using the side-switching algorithm to do a small change to the oracle and key and change the result of the evaluation by the function .
FSS data structure
: oracle
: key, represented using -bit cells
: initial oracle
: initial key, represented using -bit cells
():
():
():
Theorem 18.
Let be a function family with an with key size , generation time and evaluation time for input size and security parameter which is fully black-box reducible to PRF. Let be the number of desired queries, and suppose
Then Algorithm 1 is a data structure for ’s range counting problem in the write probe model with worst case writes per update and probes per query with -bit cells which is correct for updates with probability .
Proof.
We assume WLOG that is one of the functions in using Section 3.2 of [11].
If the data structure doesn’t abort during an update, we show that the queries will be answered correctly: Call the initial oracle , the oracle after step 4 , and the oracle after step 8 . Since by the definition of , we have , and because come from with , and so are a possible output of for , we have . Combining the last two equations we have
Similarly for we have , and , so
This shows that increases by . In we output the -th element of , which starts at and is increased by with .
Left to show is an upper bound on the probability of returning during an update sequence. We will ignore and here, as these are only used for answering queries.
We will use a sequence of hybrids to bound the success probability for an update sequence , defined in Figure 2:
Hybrid number for :
Hybrid number for :
Hybrid number for :
Consider the difference in probability between hybrid and . We unfold in hybrid and use Lemma 17. The number of oracle queries made by the rest of the hybrid is , so by the lemma the distinguishing probability is bounded by .
Now consider the difference between and . Again we can apply Lemma 17, and get a bound on the difference of .
Finally, for the difference between hybrid number and hybrid we can apply Corollary 16 to get a bound of .
Now from the triangle inequality
And since we are finished.
Setting and we get error probability , so we apply Proposition 8 to get
Corollary 19.
Let be a function family with an with key size , generation time and evaluation time for input size and security parameter which is fully black-box reducible to PRF. Suppose , for some constant depending on the FSS and all functions in can be described with bits.
Then there exists a data structure for ’s range counting problem in the write probe model with worst case writes per update and expected query time which is correct for updates.
Specializing to use Corollary 10, we get
Corollary 20.
Let be the function family of functions which assign a constant group element for points in a given 2D rectangle and 0 to other points, with a cyclic group , and suppose we have an FSS scheme fully black-box reducible to PRF where the key size and time are polynomial for , then the efficiency of must be .
Proof.
By Corollary 19 we have a data structure in the write probe model for the rectangle stabbing problem with weights in . Assuming , we can solve also the rectangle stabbing problem with -bit integer weights, because the result fits in . The bound now follows from Corollary 10.
Remark 21.
The above bound also holds in any constant dimension larger than , as we can simply solve the 2D problem in the first two dimensions and ignore the rest.
Alternatively, specializing to apply Corollary 14:
Corollary 22.
Let be the function family of functions which assigns a for points in a given 2D rectangle and 0 to other points, with output in , and suppose we have an FSS scheme fully black-box reducible to PRF where the key size and time are polynomial for , then the efficiency of must be .
And lastly, applying Theorem 12 for , ,
Corollary 23.
Let be the function family of all monomials such that , and suppose we have an FSS scheme fully black-box reducible to PRF where the key size and time are polynomial for , then the efficiency of must be .
5 Data Structure to FSS
We also have a reduction from a data structure to an FSS secure from OWF, assuming the data structure is in the oblivious group model. Recall that a data structure in the oblivious group model can be represented as a factorization of the matrix representing the function family to an update matrix and a query matrix , such that each row of has at most non-zero values, and each column of has at most non-zero values.
The reduction will use the DMPF to share the appropriate row of , and during will run on the DMPF based on the column of the corresponding to the query, computing . The sum of the two shares will give .
This reduction using plain DMPF incurs, for many problems, an extra factor of compared to building the FSS directly. The source of this factor is that DMPF costs for storing points with bits. We can use a tree memory model to alleviate this problem, where we have an infinite binary tree with values in the nodes, and we can access these values by walking in the tree. This memory model can be implemented more efficiently by FSS constructions based on the GGM tree [13]. Simulating random access with cells in this memory structure (like what’s done by DMPF) inherently costs per operation, but for some problems we can use the tree memory better directly.
For example, consider the problem of prefix DPF, where the function we want to share is for all prefixes of some binary string and otherwise. Using the tree memory model, we can just write ones on the path from the root to , and in evaluation access the node corresponding to the query point. This costs for query and evaluation. If we use a flat structure instead, however, we still need to store points, which will cost a key size of using DMPF.
To abstract this, we define the tree cost for updates and queries in the oblivious group model.
For a given vector , the tree size of it is . If we label the root , its children , etc, is the number of nodes we need to access to access all non-zero values of .
We can now define the tree update cost and tree query cost of a data structure in the oblivious group model as the maximum tree size of a row in and column in respectively.
We also define incremental DMPF, which is a variant of DMPF which can take advantage of this structure. This is a merge of DMPF and incremental DPF [6]. In incremental DPF shares a single path in the infinite binary tree and values along it (this can also be thought of as sharing a binary string and a value for each prefix). There is an function which takes the key and the current state as well as a single bit, and moves in that direction, returning the new state as well as the value stored in the current node.
For incremental DMPF shares values on a binary tree, filling the rest of the infinite binary tree with zeros, instead of sharing only a single path in the infinite binary tree. functions in the same way as in incremental DPF.
The formal construction of the incremental DMPF will be given in the full version of this paper and gives us the theorem:
Theorem 24.
Let () be a data structure for ’s range searching problem in the oblivious group model. Then there exists an FSS for with a length-doubling PRG oracle , key size or bits, executions of in and executions of in .
From the trivial bounds and the lower bound for [25], we get the corollary:
Corollary 25.
If is an FSS for constructed using Theorem 24 with evaluation time and key size then .
References
- [1] Pankaj K Agarwal. Range searching. In Handbook of discrete and computational geometry, pages 1057–1092. Chapman and Hall/CRC, 2017. doi:10.1201/9781315119601.
- [2] Paul Baecher, Christina Brzuska, and Marc Fischlin. Notions of black-box reductions, revisited. In Kazue Sako and Palash Sarkar, editors, Advances in Cryptology - ASIACRYPT 2013, pages 296–315, Berlin, Heidelberg, 2013. Springer Berlin Heidelberg. doi:10.1007/978-3-642-42033-7_16.
- [3] Amos Beimel. Secret-sharing schemes: A survey. In International conference on coding and cryptology, pages 11–46. Springer, 2011. doi:10.1007/978-3-642-20901-7_2.
- [4] Jon Louis Bentley and Jerome H Friedman. Data structures for range searching. ACM Computing Surveys (CSUR), 11(4):397–409, 1979. doi:10.1145/356789.356797.
- [5] Alexander Bienstock, Sarvar Patel, Joon Young Seo, and Kevin Yeo. Near-Optimal oblivious Key-Value stores for efficient PSI, PSU and Volume-Hiding Multi-Maps. In 32nd USENIX Security Symposium (USENIX Security 23), pages 301–318, Anaheim, CA, August 2023. USENIX Association. URL: https://www.usenix.org/conference/usenixsecurity23/presentation/bienstock.
- [6] Dan Boneh, Elette Boyle, Henry Corrigan-Gibbs, Niv Gilboa, and Yuval Ishai. Lightweight techniques for private heavy hitters. In 2021 IEEE Symposium on Security and Privacy (SP), pages 762–776, 2021. doi:10.1109/SP40001.2021.00048.
- [7] Elette Boyle, Nishanth Chandran, Niv Gilboa, Divya Gupta, Yuval Ishai, Nishant Kumar, and Mayank Rathee. Function secret sharing for mixed-mode and fixed-point secure computation. In Annual International Conference on the Theory and Applications of Cryptographic Techniques, pages 871–900, 2021. doi:10.1007/978-3-030-77886-6_30.
- [8] Elette Boyle, Geoffroy Couteau, Niv Gilboa, Yuval Ishai, Lisa Kohl, and Peter Scholl. Efficient pseudorandom correlation generators: Silent ot extension and more. In Advances in Cryptology–CRYPTO 2019: 39th Annual International Cryptology Conference, Santa Barbara, CA, USA, August 18–22, 2019, Proceedings, Part III 39, pages 489–518. Springer, 2019. doi:10.1007/978-3-030-26954-8_16.
- [9] Elette Boyle, Geoffroy Couteau, Niv Gilboa, Yuval Ishai, Lisa Kohl, and Peter Scholl. Efficient pseudorandom correlation generators from ring-lpn. In Annual International Cryptology Conference, pages 387–416. Springer, 2020. doi:10.1007/978-3-030-56880-1_14.
- [10] Elette Boyle, Niv Gilboa, Matan Hamilis, Yuval Ishai, and Yaxin Tu. Improved Constructions for Distributed Multi-Point Functions . In 2025 IEEE Symposium on Security and Privacy (SP), pages 2414–2432, Los Alamitos, CA, USA, May 2025. IEEE Computer Society. doi:10.1109/SP61157.2025.00044.
- [11] Elette Boyle, Niv Gilboa, and Yuval Ishai. Function secret sharing. In Annual international conference on the theory and applications of cryptographic techniques, pages 337–367. Springer, 2015. doi:10.1007/978-3-662-46803-6_12.
- [12] Elette Boyle, Niv Gilboa, and Yuval Ishai. Breaking the circuit size barrier for secure computation under ddh. In Annual International Cryptology Conference, pages 509–539. Springer, 2016. doi:10.1007/978-3-662-53018-4_19.
- [13] Elette Boyle, Niv Gilboa, and Yuval Ishai. Function secret sharing: Improvements and extensions. In Edgar R. Weippl, Stefan Katzenbeisser, Christopher Kruegel, Andrew C. Myers, and Shai Halevi, editors, Proceedings of the 2016 ACM SIGSAC Conference on Computer and Communications Security, Vienna, Austria, October 24-28, 2016, pages 1292–1303. ACM, 2016. doi:10.1145/2976749.2978429.
- [14] Elette Boyle, Niv Gilboa, and Yuval Ishai. Secure computation with preprocessing via function secret sharing. In Theory of Cryptography: 17th International Conference, TCC 2019, Nuremberg, Germany, December 1–5, 2019, Proceedings, Part I 17, pages 341–371. Springer, 2019. doi:10.1007/978-3-030-36030-6_14.
- [15] Elette Boyle, Lisa Kohl, and Peter Scholl. Homomorphic secret sharing from lattices without fhe. In Annual International Conference on the Theory and Applications of Cryptographic Techniques, pages 3–33. Springer, 2019. doi:10.1007/978-3-030-17656-3_1.
- [16] Bernard Chazelle. Lower bounds for orthogonal range searching: I. the reporting case. Journal of the ACM (JACM), 37(2):200–212, 1990. doi:10.1145/77600.77614.
- [17] Bernard Chazelle. Lower bounds for orthogonal range searching II. the arithmetic model. J. ACM, 37(3):439–463, July 1990. doi:10.1145/79147.79149.
- [18] Benny Chor, Eyal Kushilevitz, Oded Goldreich, and Madhu Sudan. Private information retrieval. Journal of the ACM (JACM), 45(6):965–981, 1998. doi:10.1145/293347.293350.
- [19] Henry Corrigan-Gibbs, Dan Boneh, and David Mazières. Riposte: An anonymous messaging system handling millions of users. In 2015 IEEE Symposium on Security and Privacy, pages 321–338. IEEE, 2015. doi:10.1109/SP.2015.27.
- [20] Mark de Berg, Marc van Kreveld, Mark Overmars, and Otfried Schwarzkopf. Orthogonal Range Searching, pages 95–120. Springer Berlin Heidelberg, Berlin, Heidelberg, 2008. doi:10.1007/978-3-540-77974-2_5.
- [21] Yevgeniy Dodis, Shai Halevi, Ron D Rothblum, and Daniel Wichs. Spooky encryption and its applications. In Annual International Cryptology Conference, pages 93–122. Springer, 2016. doi:10.1007/978-3-662-53015-3_4.
- [22] Michael Fredman and Michael Saks. The cell probe complexity of dynamic data structures. In Proceedings of the twenty-first annual ACM symposium on Theory of computing, pages 345–354, 1989. doi:10.1145/73007.73040.
- [23] Michael L Fredman. The complexity of maintaining an array and computing its partial sums. Journal of the ACM (JACM), 29(1):250–260, 1982. doi:10.1145/322290.322305.
- [24] Kasper Green Larsen. The cell probe complexity of dynamic range counting. In Howard J. Karloff and Toniann Pitassi, editors, Proceedings of the 44th Symposium on Theory of Computing Conference, STOC 2012, New York, NY, USA, May 19 - 22, 2012, STOC ’12, pages 85–94, New York, NY, USA, 2012. ACM. doi:10.1145/2213977.2213987.
- [25] Kasper Green Larsen. On range searching in the group model and combinatorial discrepancy. SIAM Journal on Computing, 43(2):673–686, 2014. doi:10.1137/120865240.
- [26] Kasper Green Larsen, Omri Weinstein, and Huacheng Yu. Crossing the logarithmic barrier for dynamic boolean data structure lower bounds. SIAM Journal on Computing, 49(5):STOC18–323–STOC18–367, 2020. doi:10.1137/18M1198429.
- [27] Zhe Li, Chaoping Xing, Yizhou Yao, and Chen Yuan. Efficient pseudorandom correlation generators for any finite field. In Annual International Conference on the Theory and Applications of Cryptographic Techniques, pages 145–175. Springer, 2025. doi:10.1007/978-3-031-91092-0_6.
- [28] Claudio Orlandi, Peter Scholl, and Sophia Yakoubov. The rise of Paillier: Homomorphic secret sharing and public-key silent ot. In Advances in Cryptology–EUROCRYPT 2021: 40th Annual International Conference on the Theory and Applications of Cryptographic Techniques, Zagreb, Croatia, October 17–21, 2021, Proceedings, Part I 40, pages 678–708. Springer, 2021. doi:10.1007/978-3-030-77870-5_24.
- [29] Mihai Patrascu. Lower bounds for 2-dimensional range counting. In Proceedings of the thirty-ninth annual ACM symposium on Theory of computing, pages 40–46, 2007. doi:10.1145/1250790.1250797.
- [30] Mihai Patrascu and Erik D Demaine. Logarithmic lower bounds in the cell-probe model. SIAM Journal on Computing, 35(4):932–963, 2006. doi:10.1137/S0097539705447256.
- [31] Omer Reingold, Luca Trevisan, and Salil P. Vadhan. Notions of reducibility between cryptographic primitives. In Theory of Cryptography, First Theory of Cryptography Conference, TCC 2004, Cambridge, MA, USA, February 19-21, 2004, Proceedings, volume 2951 of Lecture Notes in Computer Science, pages 1–20. Springer, 2004. doi:10.1007/978-3-540-24638-1_1.
- [32] Adi Shamir. How to share a secret. Communications of the ACM, 22(11):612–613, 1979. doi:10.1145/359168.359176.
- [33] Andrew Chi-Chih Yao. Should tables be sorted? Journal of the ACM (JACM), 28(3):615–628, 1981. doi:10.1145/322261.322274.
