Abstract 1 Introduction 2 Main Result 3 Bounds for Delta Functions 4 From Fourier Sparsity to Matching Vector PIRs References Appendix A A more sensitive lower bound Appendix B Covering the cube by affine hyperplanes over 𝔽𝒑 Appendix C Sparsity of delta functions on 𝑮=𝒎𝟏×𝒎𝟐 over

Fourier Sparsity of Delta Functions
and Matching Vector PIRs

Fatemeh Ghasemi ORCID Department of Mathematics, University of Toronto, Canada Swastik Kopparty ORCID Department of Mathematics and Department of Computer Science, University of Toronto, Canada
Abstract

In this paper we study a basic and natural question about Fourier analysis of Boolean functions, which has applications to the study of Matching Vector based Private Information Retrieval (PIR) schemes.

For integers m,r, define a delta function on {0,1}rmr to be a function f:mr if f(0)=1 and f(x)=0 for all nonzero Boolean x. The basic question that we study is how small can the Fourier sparsity of a delta function be; namely, how sparse can such an f be in the Fourier basis?

In addition to being intrinsically interesting and natural, such questions arise naturally while studying “S-decoding polynomials” for the known matching vector families. Finding S-decoding polynomials of reduced sparsity – which corresponds to finding delta functions with low Fourier sparsity – would improve the current best PIR schemes.

We show nontrivial upper and lower bounds on the Fourier sparsity of delta functions. Our proofs are elementary and clean. These results imply limitations on improvements to the Matching Vector PIR schemes simply by finding better S-decoding polynomials. In particular, there are no S-decoding polynomials which can make Matching Vector PIRs based on the known matching vector families achieve polylogarithmic communication for constantly many servers.

Many interesting questions remain open.

Keywords and phrases:
Fourier Sparsity, Matching Vectors, Private Information Retrieval
Funding:
Swastik Kopparty: Research supported by an NSERC Discovery Grant.
Copyright and License:
[Uncaptioned image] © Fatemeh Ghasemi and Swastik Kopparty; licensed under Creative Commons License CC-BY 4.0
2012 ACM Subject Classification:
Theory of computation Communication complexity
; Mathematics of computing Coding theory ; Mathematics of computing Mathematical analysis
Editor:
Shubhangi Saraf

1 Introduction

In this paper we study some basic and natural questions about Fourier analysis of Boolean functions. Specifically, we study the Fourier transform of functions f defined on the whole group mr given constraints on the values that f takes on the Boolean hypercube {0,1}r. These questions were originally motivated by a connection to Matching Vector based Private Information Retrieval (PIR) schemes, but we feel that they are intrinsically interesting in their own right.

The relationship of the subset {0,1}r with the superset mr is of central importance in many aspects of complexity theory. Many fundamental results about {0,1}r begin by viewing it as a subset of mr (or 𝔽pr), and using the more powerful algebra of the larger domain. This includes the Razborov-Smolensky lower bounds for AC0, arithmetization as a tool for interactive and probabilistically checkable proofs, and many more.

Let m,r be integers. Our main concern is the Fourier behaviour of the Boolean hypercube {0,1}r contained within the abelian group G=mr. Specifically, we will be interested in estimating the minimum possible Fourier sparsity of a delta function f:G.

  • Fourier Sparsity: For i[mr], let ψi:G be the Fourier characters of G. Every function f from G to can be expressed uniquely as a linear combination of the ψi. The number of ψi that get a nonzero coefficient is called the Fourier sparsity of f.

  • Delta function: A function f:G is called a delta function if f(0)=1, and for all x{0,1}r{0}, we have f(x)=0 (informally, it behaves like “the delta function” on {0,1}r).

On first reading, it may be useful to think about the asymptotic where m is a fixed constant, even 3, and r.

The set of all delta functions is a (mr2r)-dimensional affine subspace of the mr-dimensional space of all functions, and thus by general linear-algebra principles, there is always a subset of 2r Fourier characters whose span contains some delta function. Can this 2r be reduced?

1.1 Overview of Results

When m=2, there is only one delta function; and its Fourier sparsity is 2r.

Already for m=3, the situation turns out to be quite interesting, with both nontrivial upper and lower bounds. We show that there are delta functions with Fourier sparsity O((3)r), while any delta function must have Fourier sparsity at least Ω((1.5)r).

The situation drastically changes with growing m. We show that once mr, there are delta functions with Fourier sparsity r+1, and this is tight!

For general m (with r), we show an upper bound of O(mr/(m1))=exp(logmm1r) and a lower bound of Ω((mm1)r)=exp(1m1r).

More general groups

For applications to PIRs (discussed below), we actually need to study a slightly generalized problem.

  • The abelian group G is now taken to be of the more general form i=1rmi, where m1,,mr are positive integers,

  • Instead of -valued Fourier characters, for a field 𝔽 containing mi distinct mi’th roots of unity, we work with 𝔽-valued Fourier characters.

Even in this generalized setting, we can talk about the Boolean hypercube {0,1}rG (which is simply the product of each {0,1}mi), delta functions, and the Fourier expansion of functions f:G𝔽.

Appropriate generalizations of our lower bounds from the case where all the mi are equal to m continue to hold in this generalized setting. However our upper bounds break down, and for the setting relevant to PIRs, this leaves open a much bigger gap, which would be very interesting to close.

Delta functions on {𝟏,𝟎,𝟏}𝒓

One could also talk about delta functions on sets more general that {0,1}r. For a subset BG with 0B, a B-delta function is a function f with f(0)=1 and f(b)=0 for all bB{0}.

A natural and well known example is the case B=G. The unique G-delta function is the average of all its Fourier characters: this means that its Fourier sparsity is |G|.

A another natural set to consider is B={1,0,1}r (when G=i=1rmi). One would expect that the Fourier sparsity of B-delta functions to behave very similar to the case of {0,1}r. Surprisingly, there is a big difference!

Again from general principles, since |B|=3r, there is always a B-delta function of sparsity at most 3r. Are there B-delta functions with smaller sparsity?

We show that for all G, the Fourier sparsity of a B-delta function is at least 2r. This is in sharp contrast to the {0,1}r case, where the sparsity decays sharply with m in the group G=mr, and there are are delta functions of sparsity as small as r+1 once m>r.

Techniques

Our proofs are all elementary and simple. The lower bounds for {0,1}r are based on studying the product of a purported Fourier sparse delta function with a suitably constructed auxiliary function with desirable Fourier and primal support. The upper bounds for {0,1}r are by explicitly giving the functions. The lower bounds for {1,0,1}r are proved using the linear algebra method of combinatorics.

We feel that closing the gaps between our upper and lower bounds will need some new insights about the Fourier analysis of Boolean functions, which may potentially be useful in the many other situations in theoretical computer science where {0,1}r is viewed as a subset of the larger domain 𝔽pr.

Another related theme we would like to mention is the uncertainty principle in finite abelian groups, which gives lower bounds on the Fourier sparsity of functions f:G with small support. See [6, 13, 16] for various statements of this principle.

1.2 Matching Vector PIRs

The original motivation for studying Fourier-sparse delta functions is from questions about Private Information Retrieval (PIR). A PIR scheme111PIR schemes are very closely related to Locally Decodable Codes (LDCs). Many breakthroughs happened for both these notions together, but we only talk about PIRs in this paper. In particular, there are implications of our results for the LDCs known as Matching Vector codes. is an interactive protocol between a user and t non-communicating servers, which allows the user to access any desired bit ai of a database (a1,,an){0,1}n without revealing any information about i. PIRs have been extensively studied ever since their introduction by Chor, Goldreich, Kushilevitz and Sudan [5] in 1995. The main question is to achieve this with as little communication as possible. Very little is known on the lower bounds front; it is consistent with our knowledge that PIR can be achieved with 2-servers and 𝗉𝗈𝗅𝗒log(n) communication complexity!

When the number of servers t is constant, it was originally believed that the amount of communication needed to be at least polynomial in n, of the form Ω(nεt). However, many years later, surprising results of Yekhanin [17] and Efremenko [9] strongly refuted this belief. These protocols are the Matching Vector PIRs, and achieve a communication complexity of O(exp((logn)εt)) – and these are the only known way to achieve communication complexity subpolynomial in n with a constant number of servers.

There are two key ingredients of Matching Vector PIRs:

  • An S-matching vector family: for a subset Sm with 0S, an S-matching vector family is a collection of 2n vectors ui,vimk, for i[n], such that:

    • ui,vj=0 if i=j,

    • ui,vjS{0} if ij.

    We would like n as large as possible.

  • An S-decoding polynomial: for a subset Sm and a field 𝔽 containing an m-th root of unity γm, an S-decoding polynomial is a polynomial P(Z)𝔽[Z] such that:

    • P(γ0)=1,

    • P(γs)=0 for each sS{0}.

    We would like P(Z) to be as sparse as possible.

The best known PIR protocols [9, 7, 8, 10] use the S-matching vector families coming from the work of Barrington-Biegel-Rudich [2] and Grolmusz [11]. Here m is taken to be the product of primes m1,,mr, and S is taken to be a product set i=1r{0,1}i=1rmim. Such S is called a canonical S for m.

Even with trivial S-decoding polynomials, the above S-matching vector family alone is enough to achieve the dramatic improvement of communication complexity from polynomial to subpolynomial. It achieves communication exp((logn)εt). A clever choice of S-decoding polynomial further improves the communication complexity. The theory of sparse S-decoding polynomials was developed and advanced by Yekhanin [17], Raghavendra [14], Efremenko [9], Itoh-Suzuki [12] and Chee-Feng-Ling-Wang-Zhang [4], to achieve significant improvements for small t. Under plausible number theoretic conjectures, [4] reduces εt by an absolute constant factor for all t.222The improvement is more stark for fixed t: the communication complexity using a clever S-decoding polynomial is a further subpolynomial in the communication complexity achieved using trivial S-decoding polynomials.

To improve Matching Vector PIR parameters further, there are two immediate avenues. The first is to get even larger S-matching vector families – this is an extremely interesting and basic question about linear algebra mod m, and has attracted quite a bit of interest [3, 1]. To summarize the current status: it is wide open whether these exist.

The other approach to improved PIRs, is to get sparser S-decoding polynomials for the known S-matching vector families. This approach, a priori, could also be capable of achieving polylogarithmic communication for a constant number of servers: this would be possible if there exist S-decoding polynomials of constant sparsity modulo any m. This is the approach that this paper addresses.

Our results imply that for the S relevant for the known big matching vector family over m, any S-decoding polynomial has sparsity at least r+1. This means that for a Matching Vector PIR using the known matching vector families and the best possible S-decoding polynomial, the communication complexity for t server PIR is at least exp((logn)1/t). In particular it cannot achieve polylogarithmic communication for a constant number of servers.

1.3 Questions

There are two very nice and basic open questions that we would like to highlight.

  1. 1.

    Let m1,,mr be distinct primes, and let 𝔽 be a field containing all the mi’th roots of 1. We work with the 𝔽-valued Fourier characters. How small, as a function of r, can the Fourier sparsity of delta functions on {0,1}ri=1tmi?

    We showed that it cannot be smaller than r+1, while the best upper bounds are exponential in r (these come from known sparse S-decoding polynomials [4]).

    Can it be r+1? This would improve PIR communication to exp((logn)1/t).

    Surprisingly, if we drop the assumption that the mi are distinct, then r+1 is achievable (see Lemma 16).

  2. 2.

    We have the same setup as above (with the mi being distinct primes), but now we fix 𝔽 to be the complex numbers . We conjecture that every delta function on {0,1}ri=1rmi has Fourier sparsity at least 2r: there is no improvement on the trivial bound!

    We prove the r=2 case in Appendix C. The proof uses ideas related to the classical Schwarz lemma from complex analysis, and boils down to studying certain Möbius transformations and their behavior on the unit disk. For larger r it seems to be a challenging yet basic question.

    Again, for identical mi, Fourier sparsity r+1 is achievable, so the distinctness of the primes mi has to be used.

Finally, the question of determining the Fourier sparsity of delta functions on mr for fixed m is a very cleanly stated question which we do not fully understand. We know that it is exponential is r, but the base of the exponent is somewhere between m1m1 and mm1.

2 Main Result

In order to state our main results, we need the following definitions.

Definition 1.

Let G be a group and 𝔽 be a field, 0B,BG. We say:

f:G𝔽

is a delta function on B (or f is delta on B, or f is a B-delta function) if:

  • f(0)=1

  • For all bB{0}, we have f(b)=0.

For 𝔽-valued functions f on the abelian group G, we will be working with the 𝔽-valued Fourier transform f^ (generalizing the usual -valued Fourier transform), discussed in Section 3.

In the special case where the domain is mr and the function is delta on {0,1}r, the following theorem provides explicit bounds on the Fourier sparsity. Parts (i) and (ii) are special cases of the general lower bounds proved later in Section 3.2, namely Theorems 11 and 14, while parts (iii) and (iv) correspond to the upper bounds established in Lemma 16 and Claim 18. The proofs of all four parts appear in Section 3.2.

Theorem 2.

Let f:mr𝔽 be delta on {0,1}rmr. We have:

  1. (i)

    |Supp(f^)|r+1

  2. (ii)

    |Supp(f^)|(mm1)r

In the other direction:

  1. (iii)

    For m>r, there exists f:mr𝔽 which is delta on {0,1}r with |Supp(f^)|=r+1.

  2. (iv)

    For m,r with (m1)r, there exists f:mr𝔽 which is delta on {0,1}r with |Supp(f^)|mrm1.

One application of our results is to S-decoding polynomials, a tool in the construction of Matching Vector PIRs. We describe this now.

Definition 3 (S-decoding polynomial for the canonical set S).

Let m=p1pr and define the canonical set

S ={xm:x2=x}
={xm(xmodpi){0,1} for each i}.

Let 𝔽 be a field containing a primitive m-th root of unity γm. A polynomial P(Z)𝔽[Z] is called a S-decoding polynomial if

  • P(γms)=0 for all sS{0},

  • P(γm0)=P(1)=1.

If P has exactly t monomials, we say it is t-sparse.

We restrict here to the canonical set S; a more general definition for arbitrary subsets Sm will be given in Section 4.1.

S-decoding polynomials play a central role in the construction of Matching Vector PIR schemes: sparser polynomials yield the same communication complexity with fewer servers. For the canonical set Sm, our results show that such polynomials cannot be very sparse.

The following theorem gives the precise lower bound; its proof appears in Section 4.1 as Corollary 24.

Theorem 4.

Let m=p1pr, and let S be the canonical set for m. If P(Z)𝔽[Z] is an S-decoding polynomial, then the number of its monomials is at least r+1.

The consequences for Matching Vector PIRs are discussed in Section 4.

Our final main result concerns delta functions on a set different from the Boolean hypercube, namely {1,0,1}r. For such functions we obtain a much stronger lower bound on Fourier sparsity, that does not decay with m. The proof will be given in Section 3.3 as a corollary of Claim 19.

Theorem 5.

If f:mr𝔽 is delta on {1,0,1}r, then we have:

|Supp(f^)|2r

3 Bounds for Delta Functions

In this section, we first derive bounds for delta functions on the Boolean hypercube {0,1}r, and then we establish a lower bound for delta functions over {1,0,1}r.

3.1 Basic Facts from Fourier Analysis

We collect the Fourier-analytic facts used in this paper. Throughout, G is a finite abelian group of size m, written additively.

For a function h on a finite set X, we define its support as

Supp(h):={xX:h(x)0}.

For subsets A,B of an additive group, we write

A+B:={a+b:aA,bB}

for their sumset.

We now define the Fourier transform and state the basic Fourier inversion formula.

Definition 6 (Fourier transform over a finite abelian group, 𝔽-valued).

Let G be a finite abelian group, and let e denote the exponent of G (i.e., the least common multiple of the orders of elements of G). Let 𝔽 be a field with char(𝔽)|G| such that 𝔽 contains all e-th roots of unity (equivalently, a primitive e-th root of unity, and hence all its powers).

A character of G is a group homomorphisms from G to 𝔽×, where 𝔽× is the multiplicative group of 𝔽. Let G^ be the group of all characters of G.

For a function f:G𝔽, its Fourier transform is the function f^:G^𝔽 defined by

f^(ψ)=1|G|xGf(x)ψ(x)1,ψG^.

Whenever we talk about the Fourier transform of 𝔽-valued functions on G, we assume that char(|𝔽|)∤|G|, and that 𝔽 has the required roots of unity in it. The first assumption is non-negotiable. The second assumption is without loss of generality, by replacing 𝔽 with a finite extension containing the required roots of unity (this is possible because of the first assumption).

Next we state the basic fact about the Fourier transform of the product of functions.

Lemma 7 (Character basis and explicit parametrization).

Let 𝔽 be a field and Gi=1rmi be a finite abelian group such that char(𝔽)|G| and 𝔽 contains primitive mi-th roots of unity ωmi for 1ir.

For any a=(a1,,ar)i=1rmi, we have a character ψa given by:

ψa(x1,,xr)=Πi=1rωmiaixi.

every character ψ:G𝔽 is of the form ψa for some a.

Thus the character group G^ is isomorphic to i=1rmi, and we identify them.

With this notation, every function f:G𝔽 can be written as

f=aG^f^(a).ψa
Lemma 8.

Let G be a finite abelian group and 𝔽 be a field. Suppose f,g:G𝔽 are functions and define h=fg as their pointwise product. Then the Fourier transform of h is given by the convolution of the Fourier transforms of f and g:

h^(a)=(f^g^)(a)=bG^f^(b)g^(ab)

This implies:

Supp(h^)Supp(f^)+Supp(g^)
Corollary 9.

Let f,g:G𝔽 be functions on a finite abelian group G, and set h=fg. Then

|Supp(h^)||Supp(f^)||Supp(g^)|.

Finally, we recall the Fourier expansion of the (unique) G-delta function, which we call the total delta function to avoid ambiguity.

Lemma 10 (Fourier expansion of the total delta function).

Let G be a finite abelian group and let 𝔽 be a field with char(𝔽)|G|. Let δ denote the total delta function on G, i.e.

δ(x)={1if x=0,0otherwise.

Then its Fourier transform is given by

δ^(a)=1|G|for all aG^.

In particular, δ^ is nonzero on every character, and so δ has Fourier sparsity exactly |G|.

3.2 Bounds for Delta Functions on the Boolean Hypercube

In what follows, we first prove two lower bounds for functions

f:m1××mr𝔽

that are delta on {0,1}r. We then turn to the more specific setting

f:mr𝔽,

and establish an upper bound. In particular, when m>r, this upper bound matches the lower bound r+1, showing that the bound is tight.

First lower bound

Let

f:m1××mr𝔽

be a delta function on {0,1}r. We present two proofs showing that its Fourier sparsity is at least r+1.

The following theorem formalizes this bound.

Theorem 11.

Let

f:m1××mr𝔽

be a delta function on {0,1}rm1××mr, where m1,,mr are (not necessarily distinct) positive integers. Assume further that 𝔽 contains mi-th roots of unity ωmi for 1ir. Then

|Supp(f^)|r+1.
First proof (of the first lower bound)

Our first proof proceeds by constructing an auxiliary function g supported on {0,1}r. Multiplying f by g will produce a global delta function, and from this we can deduce a structural relation between the Fourier supports of f and g. The following lemma makes this precise.

Lemma 12.

Let

f:m1××mr𝔽

be a delta function on {0,1}r, where m1,,mr are (not necessarily distinct) positive integers. Suppose g:m1××mr𝔽 is a function such that

  • g(x)=0 for all x{0,1}r,

  • g(0)0.

Then

Supp(f^)+Supp(g^)=m1××mr.
Proof.

Let h=fg. By Lemma 8,

Supp(h^)Supp(f^)+Supp(g^).

Since g is supported on {0,1}r and f vanishes on {0,1}r{0} with f(0)0, we see that h is supported only at 0. Thus h is a nonzero scalar multiple of the total delta function on the whole group, and so by Lemma 10,

Supp(h^)=m1××mr.

This gives

Supp(f^)+Supp(g^)=m1××mr.

To apply Lemma 12, we need an explicit function g satisfying the required conditions and whose Fourier support we can describe precisely. The following example provides such a construction.

Example 13.

Let 𝔽 be a field containing primitive mi-th roots of unity ωmi for 1ir. Define

g:m1××mr𝔽

by

g(x1,,xr)=i=1r(ωmixij=2mi1(ωmixiωmij)).

For each coordinate i, define

hi(xi)=ωmixij=2mi1(ωmixiωmij)

Hence hi lies in the span of the characters {ωmibixi:1bimi1}. Consequently, g lies in the span

gSpan{i=1rωmibixi:  1bimi1}.

It follows that the Fourier support of g is exactly

Supp(g^)=i=1r{1,,mi1},

and therefore

|Supp(g^)|=i=1r(mi1).

We are now ready to give the first proof of Theorem 11.

Proof of Theorem 11.

By Lemma 12 and Example 13, we know that

Supp(f^)+i=1r{1,,mi1}=i=1r{0,1,,mi1}.

This implies the following property: for every element

ui=1r{0,1,,mi1},

there must exist some aSupp(f^) such that uiai for all coordinates i.

Now assume, for the sake of contradiction, that

Supp(f^)={b1,,bt},tr.

That is, suppose the Fourier support has size at most r. Construct a vector ui=1r{0,1,,mi1} by setting

u=(b11,b22,,btt,0,,0),

where the first t coordinates are taken from the corresponding elements b1,,bt in the support, and the remaining coordinates are set to 0.

By the mentioned property, there must exist some bSupp(f^) such that bjuj for every coordinate j. But by construction, for each it we have ui=bii, which forces bbi for all i=1,,t. This contradicts the assumption that Supp(f^)={b1,,bt}.

Therefore, our assumption was false, and we conclude that

|Supp(f^)|r+1.

Second proof (of the first lower bound)

Unlike the first proof, which relied on support-sum arguments, our second proof of Theorem 11 uses a polynomial identity arising from the delta property of f on {0,1}r.

Proof.

By assumption, there exists ωmi𝔽 where each ωmi is an mi-th root of unity. By Lemma 7, we can expand f as

f(x1,,xr)=j=1tbji=1rωmiaj(i)xi.

For convenience, define the vectors αi𝔽t by

αi=(αi(1),,αi(t)),αi(j):=ωmiaj(i)for 1jt.

With this notation we can rewrite f as

f(x1,,xr)=j=1tbji=1rαi(j)xi.

Since f is a delta function on {0,1}r, we have

f(x)=0for all x{0,1}r{0},f(0,,0)0.

Equivalently,

j=1tbjiBαi(j)= 0for every nonempty B[r],j=1tbj 0.

Observe that, introducing formal variables T1,,Tr, expanding

i=1r(1+Tiαi(j))

produces terms of the form iBTiαi(j) for subsets B[r]. Therefore

j=1tbji=1r(1+Tiαi(j))=B[r](iBTi)(j=1tbjiBαi(j)).

By the vanishing property established above, every inner sum with B equals zero, while for B= we get j=1tbj0. This proves the polynomial identity

j=1tbji=1r(1+Tiαi(j))=j=1tbj.

If rt, we may choose Ti=1/αi(i) for 1it. With this choice, each term in the left-hand side vanishes, so the entire left-hand side equals 0, while the right-hand side is nonzero. This yields a contradiction.

Second lower bound

We again use Lemma 12 and Example 13, but this time in combination with a counting argument, to derive a multiplicative lower bound.

Theorem 14.

Let

f:m1××mr𝔽

be a delta function on {0,1}rm1××mr, where m1,,mr are (not necessarily distinct) positive integers. Then

|Supp(f^)|i=1rmimi1.
Proof.

By Lemma 12 and Example 13, we have

Supp(f^)+B=i=1r{0,1,,mi1},B:=i=1r{1,,mi1}.

Since A+B has size at most |A||B| for any sets A,B, we obtain

|Supp(f^)||Supp(f^)+B||B|.

Here |Supp(f^)+B|=i=1rmi, while |B|=i=1r(mi1). Therefore

|Supp(f^)|i=1rmii=1r(mi1)=i=1rmimi1.

Corollary 15.

If f:mr𝔽 is delta on {0,1}r, then:

  1. (i)

    |Supp(f^)|r+1

  2. (ii)

    |Supp(f^)|(mm1)r

Upper bound

We next study upper bounds for the Fourier sparsity of delta functions on {0,1}r. The following construction illustrates how to achieve small support when m>r, and will serve as a building block for further results in this section.

Lemma 16.

Let m>r and let ω𝔽 be an m-th root of unity. Then there exists a function

f:mr𝔽

which is a delta function on {0,1}rmr and has exactly r+1 nonzero Fourier coefficients.

Proof.

Define

f(x1,,xr)=i=1r(ωx1++xrωi).

We first check the delta property. For any x{0,1}r{0} we have 1x1++xrr, hence one of the factors vanishes, and thus f(x)=0. On the other hand, f(0,,0)=i=1r(1ωi)0. Therefore f is indeed a delta function on {0,1}r.

Finally, observe that f can be expressed as a linear combination of the r+1 characters

ψ(i,,i)(x1,,xr)=ωi(x1++xr),0ir.

Thus f has exactly r+1 nonzero Fourier coefficients, as claimed.

Corollary 17.

When m>r, the above claim gives a construction of a delta function with Fourier sparsity r+1. By Corollary 15, we also have the lower bound r+1. Thus, in this case, the upper and lower bounds coincide, yielding a tight result.

Theorem 18.

Suppose m1 divides r. Then, there exists a function

f:mr𝔽

which is a delta function on {0,1}r and satisfies

|Supp(f^)|mrm1.
Proof.

Partition the index set {1,,r} into

=rm1

disjoint subsets, denoted A1,,A, each of size m1. For each 1k, define a function fk depending only on the coordinates indexed by Ak, as in Lemma 16:

fk((xj)jAk)=i=1m1(ωjAkxjωi),

where ω is a fixed primitive m-th root of unity. By Lemma 16, each fk is delta on {0,1}Ak and satisfies

|Supp(fk^)|=m.

Now define the overall function

f(x1,,xr)=k=1fk((xj)jAk).

Clearly, f is delta on {0,1}r. Applying Corollary 9 , we obtain

|Supp(f^)|k=1|Supp(fk^)|=m=mrm1.

3.3 Bounds for Delta Functions on {𝟏,𝟎,𝟏}𝒓

We now turn to delta functions on {1,0,1}r. Using a general structural claim, we derive lower bounds on their Fourier sparsity. In particular, we show that such functions must have support size at least 2r.

Claim 19.

Let f:G𝔽 be a delta function on a set BG with 0B. Suppose DB is such

DDB,

where DD is the difference set {d1d2d1,d2D}. Then

|Supp(f^)||D|.
Proof.

Write f as a linear combination of characters supported on some set TG^:

f(x)=aTcaψa(x).

For dG, define the translate fd:G𝔽 by:

fd(x):=f(xd).

Then we compute the Fourier expansion of fd as follows:

fd(x)=aTcaψa(xd)=aT(caψa1(d))ψa(x),

and thus:

fdSpan{ψa:aT}.

We now show that the functions {fd:dD} are all linearly independent. Enumerate D={d1,,d|D|}. For each dD, consider the evaluation vector

vd:=(fd(d1),,fd(d|D|))𝔽|D|.

By construction,

fd(di)=f(ddi).

If ddi, then ddiB{0}, so f(ddi)=0. If d=di, then fd(di)=f(0)0. Thus vd is a nonzero scalar multiple of the i-th standard basis vector ei. Hence the vectors {vd:dD} are linearly independent.

Therefore, the functions {fd:dD} are linearly independent, all lying inside Span{ψa:aT}. It follows that

|T||D|.

Finally, since T=Supp(f^), we conclude that |Supp(f^)||D|, as claimed.

Corollary 20.

If f:m1××mr𝔽 is delta on {1,0,1}r, then we have:

|Supp(f^)|2r
Proof.

Apply the previous claim with B={1,0,1}r and D={0,1}r. For distinct d1,d2D, the difference d1d2 lies in B{0}, so the conditions of the claim are satisfied. Since |D|=2r, the result follows.

4 From Fourier Sparsity to Matching Vector PIRs

In this section, we first establish the connections between the Fourier sparsity of delta functions and S-decoding polynomials. We then recall how the sparsity of S-decoding polynomials is related to Matching vector PIRs. Combining this with our results from the previous section we derive new bounds on the number of servers required in such PIR schemes.

4.1 Fourier Sparsity and 𝑺-decoding Polynomials

For Matching Vector PIRs, there are two key ingredients: a large S-matching vector family in mk, and a sparse S-decoding polynomial, where Sm.

For all the superpolynomially large S-matching vector families that we know333They all come from the Barrington-Biegel-Rudich method involving the Chinese remainder theorem., S has to have a special form: it must essentially be a canonical set for m, defined below444More precisely, S must contain a product set of the form S1×S2×Sr, where §ipi has size at least 2, and m is viewed as ipi for distinct primes p1,,pr. S-decoding polynomials for such S easily translate into S-decoding polynomials for the canonical S..

Definition 21 (Canonical set).

Let m be a positive integer. We define the canonical set for m to be the following set:

S={xms.t.x2xmodm}

Note that when m=p1pr is a product of r distinct primes, the canonical set consists of the following 2r values:

S={sm,pismodpi{0,1}}

For example, for m=6 we get:

S={0,1,3,4}

which are precisely the four values being 0 or 1 mod 2,3.

Definition 22 (t-sparse S-decoding polynomial).

Let Sm be a set containing 0, 𝔽 be a field containing an element γm which is a primitive m-th root of 1: namely γmm=1 and γmi1 for i=1,2,,m1.

A polynomial P(Z)𝔽[Z] is called an t-sparse S-decoding polynomial if P has t monomials, and the following conditions hold:

  • sS{0}:P(γms)=0

  • P(γm0)=P(1)=1

For m=p1××pr, for distinct primes p1,,pr, we use ϕ to denote the Chinese remainder isomorphism:

ϕ:mp1××pr
ϕ(a)=(a(1),,a(r)),a(i)=amodpi

Also, we use ϕ(a)i to denote a(i)=amodpi.

The next claim shows the relation between Fourier sparsity and S-decoding polynomials for canonical sets S.

Claim 23.

Let m=p1pr be a product of r distinct primes and S be the canonical set for m, 𝔽 be a field containing an m-th root of unity. Then, having a t-sparse S-decoding polynomial in 𝔽[Z] is equivalent to having a function:

f:p1××pr𝔽

such that

f|{0,1}r

is a delta function, |Supp(f^)|=t.

Proof.

Let P(Z)𝔽[Z] be an S-decoding polynomial,

P(Z)=j=1tbjZaj

Let γm𝔽 be a primitive m-th root of unity. Then, since pi’s are pairwise coprime, we can write it as:

γm=i=1rωpi

for ωpi𝔽, 1ir, with each ωpi a primitive pi-th root of unity. Therefore,

γms=(i=1rωpi)s=i=1rωpiϕ(s)i

Observe that:

P(γms)=0j=1tbj(i=1rωpiϕ(s)i)aj=0j=1tbji=1rωpiϕ(aj)iϕ(s)i=0

Now note that

S={sm:smodpi{0,1}}={sm:ϕ(s){0,1}r}

Hence, having such an S-decoding polynomial is equivalent to constructing a function f such that:

f(x1,,xr)=j=1tbji=1rωpiaj(i)xi,

and:

f|(ϕ(S){0})=0,
f(0)0.

By Lemma 7, i=1rωpiaj(i)xi=ψaj(x), and so the above expression for f equals:

f(x)=j=1tbjψaj(x).

Thus having such an S-decoding polynomial is exactly equivalent to having a function f:mp1××pr𝔽 such that f|ϕ(S) is delta and |Supp(f^)|=|{bj:j[t]}|=t.

By applying Theorem 11 to the above claim, we obtain the following corollary:

Corollary 24.

Let m=p1××pr, S be the canonical set for m, P(Z)𝔽[Z] be an S-decoding polynomial. Then, the number of monomials of P is at least r+1.

4.2 Applications to Matching Vector PIRs

We now discuss the relationship between S-decoding polynomials and Matching Vector based PIRs.

Definition 25 (S-Matching Vector Family).

Let Sm,0S and =(𝒰,𝒱) where 𝒰=(u1,,un),𝒱=(u1,,un) with ui,vimk. Then is said to be an S-matching vector family of size n and dimension k if the following conditions hold:

  • ui,vi=0 for every i[n]

  • ui,vjS{0} for every ij

The connection between decoding polynomials and matching vector PIR schemes is made explicit by the following result:

Theorem 26 (Theorem 2.3 in [10]).

Let m be a positive integer with r distinct prime factors, and let p be a prime not dividing m. Set M=mp, and let

=(𝒰,𝒱)

be an SM-matching vector family over M of size n and dimension k, where SM denotes the canonical set for M.

Suppose 𝔽 is a field of characteristic p such that, for the canonical set Smm, there exists an Sm-decoding polynomial over 𝔽 with sparsity t.

Then there exists a t-server PIR scheme with communication complexity O(k).

The above theorem highlights the role of S-decoding polynomials in such PIR schemes. In particular, designing sparser S-decoding polynomials leads directly to improved PIR constructions: The same communication complexity can be achieved with fewer servers. In other words, the sparsity of decoding polynomials dictates the number of servers required.

Theorem 27 gives the best known construction of large matching vector families, where the set S is taken to be the canonical set.

Theorem 27 (Large Matching Vector Families, Theorem 1.4 in [11]).

Let m=p1p2pr where p1,p2,,pr are distinct primes and r2. Then, there exists an explicitly constructible S-matching vector family in mk of size nexp(Ω((logk)r(loglogk)r1)) where S is the canonical set for m.

Combining this with Theorem 26, we obtain that any S-decoding polynomial for the canonical set directly yields a PIR scheme with parameters determined by the sparsity of the polynomial. In particular, a t-sparse decoding polynomial for the canonical S gives rise to a t-server PIR scheme with communication complexity exp(O~((logn)1r+1)) and database size n.

For the canonical set Sm, Corollary 24 shows that every S-decoding polynomial has sparsity at least r+1. Consequently, the Matching Vector PIR scheme based on the above matching vector family requires at least r+1 servers.

Stating this another way, the communication complexity of a t server Matching Vector PIR scheme based on the this matching vector family and an arbitrary S-decoding polynomial is bounded below by

exp(((logn)1/t)).

Hence these PIR schemes cannot achieve polylogarithmic communication with a constant number of servers merely by improving the S-decoding polynomials.

It would be extremely interesting to find S-decoding polynomials matching this sparsity bound. Today, the best known sparsity comes from the results of [4], and is approximately is (3)r under a plausible number theoretic hypothesis.

References

  • [1] Divesh Aggarwal, Pranjal Dutta, Zeyong Li, Maciej Obremski, and Sidhant Saraogi. Improved lower bounds for 3-query matching vector codes. In Raghu Meka, editor, 16th Innovations in Theoretical Computer Science Conference, ITCS 2025, January 7-10, 2025, Columbia University, New York, NY, USA, volume 325 of LIPIcs, pages 2:1–2:19. Schloss Dagstuhl – Leibniz-Zentrum für Informatik, 2025. doi:10.4230/LIPIcs.ITCS.2025.2.
  • [2] David A. Mix Barrington, Richard Beigel, and Steven Rudich. Representing Boolean functions as polynomials modulo composite numbers. Comput. Complex., 4:367–382, 1994. doi:10.1007/BF01263424.
  • [3] Abhishek Bhowmick, Zeev Dvir, and Shachar Lovett. New bounds for matching vector families. In Proceedings of the Forty-Fifth Annual ACM Symposium on Theory of Computing, STOC ’13, pages 823–832, New York, NY, USA, 2013. Association for Computing Machinery. doi:10.1145/2488608.2488713.
  • [4] Yeow Meng Chee, Tao Feng, San Ling, Huaxiong Wang, and Liang Feng Zhang. Query-Efficient Locally Decodable Codes of Subexponential Length. Computational Complexity, 22:159–189, 2013. doi:10.1007/S00037-011-0017-1.
  • [5] Benny Chor, Oded Goldreich, Eyal Kushilevitz, and Madhu Sudan. Private information retrieval. Journal of the ACM, 45:965–981, 1998. doi:10.1145/293347.293350.
  • [6] David L. Donoho and Philip B. Stark. Uncertainty principles and signal recovery. SIAM Journal on Applied Mathematics, 49(3):906–931, 1989. doi:10.1137/0149053.
  • [7] Zeev Dvir, Parikshit Gopalan, and Sergey Yekhanin. Matching Vector Codes. In 2010 IEEE 51st Annual Symposium on Foundations of Computer Science, pages 705–714. IEEE, 2010. doi:10.1109/FOCS.2010.73.
  • [8] Zeev Dvir and Sivakanth Gopi. 2-Server PIR with Sub-Polynomial Communication. In STOC ’15: Proceedings of the forty-seventh annual ACM symposium on Theory of Computing, pages 577–584. Association for Computing Machinery, 2015. doi:10.1145/2746539.2746546.
  • [9] Klim Efremenko. 3-query locally decodable codes of subexponential length. In STOC ’09: Proceedings of the forty-seventh annual ACM symposium on Theory of Computing, pages 39–44. Association for Computing Machinery, 2009. doi:10.1145/1536414.1536422.
  • [10] Fatemeh Ghasemi, Swastik Kopparty, and Madhu Sudan. Improved pir schemes using matching vectors and derivatives. In STOC ’25: Proceedings of the 57th Annual ACM Symposium on Theory of Computing, pages 1648–1656, 2025. doi:10.1145/3717823.3718313.
  • [11] Vince Grolmusz. Superpolynomial size set-systems with restricted intersections mod 6 and explicit Ramsey graphs. Combinatorica, 20:71–86, 2000. doi:10.1007/S004930070032.
  • [12] Toshiya Itoh and Yasuhiro Suzuki. Improved Constructions for Query-Efficient Locally Decodable Codes of Subexponential Length. IEICE Trans. Inf. Syst., 93-D:263–270, 2008.
  • [13] Roy Meshulam. An uncertainty inequality for finite abelian groups. European Journal of Combinatorics, 27(1):63–67, 2006. doi:10.1016/j.ejc.2005.01.004.
  • [14] Prasad Raghavendra. A Note on Yekhanin’s Locally Decodable Codes. Electron. Colloquium Comput. Complex., TR07, 2007. URL: https://eccc.weizmann.ac.il/eccc-reports/2007/TR07-016/index.html.
  • [15] Walter Rudin. Real and Complex Analysis. McGraw-Hill, New York, 3 edition, 1987.
  • [16] Terence Tao. An uncertainty principle for cyclic groups of prime order. Mathematical Research Letters, 12(1):121–127, 2005. doi:10.4310/MRL.2005.v12.n1.a10.
  • [17] Sergey Yekhanin. Towards 3-query locally decodable codes of subexponential length. J. ACM, 55(1), 2008. doi:10.1145/1326554.1326555.

Appendix A A more sensitive lower bound

Define F(m1,,mr) to be the smallest size of a subset Simi such that:

S+i(mi{0})=imi.

Our proof in Section 3 actually shows that this quantity is a lower bound on the Fourier sparsity of delta functions on {0,1}rimi. It is a common generalization of the bounds r+1 and imimi1. Here we note a simple recursive inequality for it that is stronger than both the above bounds. It implies, for example, that when r=2m, that the Fourier sparsity of delta functions on mr is at least em (while the best upper bound we know for this case is that there exist delta functions with Fourier sparsity m2).

Lemma 28.
F(m1,,mr)mrmr1F(m1,,mr1).

In particular, we have

F(m1,,mr)max(F(m1,,mr1)+1,mrmr1F(m1,,mr1)),

and

F(m,m,,m){r+1rm1m(mm1)rm+1rm
Proof.

Let G=imi. Let H=i(mi{0}). Suppose SG is such that S+H=G.

For xm, let Sx be the set of all vS with vr=x. Choose x with |Sx| as large as possible: then |Sx|1mr|S|.

Observe that for any vSx and any wv+H, we have wrx.

Thus:

((SSx)+H)(i<rmi)×{x}.

Projecting to the first r1 coordinates, we see that the projection S of SSx to the first r1 coordinates satisfies:

S+i<r(mi{0})=i<rmi.

Thus |S|F(m1,,mr1). Combined with the inequality:

|S||SSx|=|S||Sx|mr1mr|S|,

we get the result.

In fact, the proof even shows that

F(m1,,mr1)F(m1,,mr)1mrF(m1,,mr)

(since in the notation of the proof, we get |S||S|1mr|S|).

Appendix B Covering the cube by affine hyperplanes over 𝔽𝒑

A classical result of Alon and Furedi shows that any set H of affine hyperplanes in r that cover all but one of the points of the Boolean hypercube must have size at least r. Specifically, if the hyperplanes of H cover ({0,1}r{0})r using affine hyperplanes, none of which contain 0, then |H|r.

Our results imply something nontrivial about the analogous covering question for ({0,1}r{0})𝔽pr for a prime p. Specifically, we show that if H is a set of hyperplanes, and:

H=H1H2Ht

is the partition of H into subsets of parallel hyperplanes, then:

i(|Hi|+1)max(r+1,(pp1)r).

On the other hand, there exist such sets Hi with

i(|Hi|+1)(p1/(p1))r

We now prove the lower bound. If W is a set of w parallel hyperplanes, then there is a function fW:𝔽pr with Fourier sparsity w+1 and which vanishes exactly on the union of those hyperplanes. Taking the product of fHi over all the i and scaling by a constant, we get a delta function with Fourier sparsity i(|Hi|+1). Then our Theorem 2 implies the result.

The upper bound follows by observing that the Fourier sparse function constructed in Claim 18 is of the above form.

Appendix C Sparsity of delta functions on 𝑮=𝒎𝟏×𝒎𝟐 over

In this appendix we consider the special case of delta functions on G=m1×m2 over the complex numbers. We show that when m1 and m2 are coprime, there is no -valued delta function on {0,1}2G with Fourier sparsity 3, and thus the minimum Fourier sparsity of such a function is 4. The argument uses complex-analytic tools.

By [9, 12, 4], there are some G of the form m1×m2 (with m1,m2 coprime and odd) for which there exists a delta function f:G𝔽¯2 (the algebraic closure of 𝔽2) with Fourier sparsity 3. Thus our result needs to use some special property of complex numbers.

In brief, we express the property of being a delta function on G in terms of vanishing of multilinear polynomials P(X,Y) at roots of unity. Vanishing of the multilinear polynomial P(X,Y) can be expressed in terms of an associated Möbius transformation Y=AP(X). Finally, we use properties of Möbius transformations on the unit disk in to get the result.

Proposition 29.

Let m1,m2 be coprime positive integers and let G=m1×m2. Suppose f:G is a function that is delta on {0,1}2, i.e.

f(0,0)=1,f(1,0)=f(0,1)=f(1,1)=0.

Then the Fourier sparsity of f is at least 4.

Proof.

Write G=m1×m2 and fix primitive m1-th and m2-th roots of unity ωm1,ωm2. Characters of G have the form

ψ(a,b)(x,y)=ωm1axωm2by,(a,b)m1×m2.

Assume for contradiction that f is a delta function on {0,1}2 with Fourier sparsity at most 3. Assume that f has a Fourier expansion of the form

f(x,y)=c1ψ(a1,b1)(x,y)+c2ψ(a2,b2)(x,y)+c3ψ(a3,b3)(x,y),

with (ai,bi) pairwise distinct and c1,c2,c3. Since f(0,0)=1, not all the ci are zero.

Set

αi:=ωm1ai,βi:=ωm2bi(i=1,2,3),

so that each αi is an m1-th root of unity and each βi is an m2-th root of unity. The values of the three characters at the four points (0,0),(1,0),(0,1),(1,1) are:

ψ(ai,bi)(0,0)=1,ψ(ai,bi)(1,0)=αi,ψ(ai,bi)(0,1)=βi,ψ(ai,bi)(1,1)=αiβi.

Thus, if we form the 4×3 matrix

M=(111α1α2α3β1β2β3α1β1α2β2α3β3),

the i-th column of M is the column vector of values of ψ(ai,bi) on {0,1}2.

The vector of values of f on {0,1}2 is then

Mc=(1000), where c:=(c1c2c3)

In other words, the existence of a 3-sparse delta function with these three characters is equivalent to the statement that

e1:=(1000)

lies in the column space of M.

By elementary linear algebra, e1 lies in the column space of M if and only if every row vector v4 that is orthogonal to all the columns of M is also orthogonal to e1, i.e.

vM=0ve1=0.

We now interpret such row vectors v in terms of multilinear polynomials evaluated at the (αi,βi). Write

v=(v1,vX,vY,vXY)4

and associate to v the multilinear polynomial

Pv(X,Y)=v1+vXX+vYY+vXYXY.

Observe that for i=1,2,3 we have

v(column i of M)=Pv(αi,βi).

Thus vM=0 is equivalent to

Pv(α1,β1)=Pv(α2,β2)=Pv(α3,β3)=0.

On the other hand

ve1=v1=Pv(0,0).

We conclude that the existence of a (3)-sparse delta function with support {ψ(ai,bi)}i=13 is equivalent to the following statement:

Every multilinear polynomial P(X,Y) that vanishes at the three points (αi,βi), i=1,2,3, must also vanish at (0,0).

We now show that this statement is false.

Lemma 30.

Let m1,m2 be relatively prime integers. Let α1,α2,α3 be m1-th roots of 1, and β1,β2,β3 be m2-th roots of 1.

Then there exists a multilinear polynomial P(X,Y) such that: P(αi,βi)=0 for each i, and P(0,0)0.

Proof.

We take cases on whether the αi and βi are all distinct or not.

  • Case 1: Some two (or more) of the αi are equal. Without loss of generality, assume α1=α2. Then P(X,Y)=(Xα1)(Yβ3) is a multilinear polynomial that vanishes at all the (αi,βi) and is nonzero at (0,0).

  • Case 2: Some two (or more) of the βi are equal. Without loss of generality, assume β1=β2. Then P(X,Y)=(Xα3)(Yβ1) is a multilinear polynomial that vanishes at all the (αi,βi) and is nonzero at (0,0).

  • Case 3: All the αi are distinct and all the βi are distinct. In this case, we use the basic fact that Möbius transformations can take any 3 distinct points to any 3 distinct points. Thus, there is a Möbius transformation

    A(X)=v1+vXXvY+vXYX

    with A(αi)=βi for each i. Rearranging this, we get that for the multilinear polynomial P(X,Y) given by:

    P(X,Y)=v1+vXX+vYY+vXYXY (1)

    satisfies, for each i:

    P(αi,βi)=0.

    We now prove that P(0,0) is nonzero. This is equivalent to showing that A(0)0. Suppose not; namely suppose A(0)=0. Then the map A: satsifies:

    • A(αi)=βi for each i. In particular, since all αi are distinct, all βi are distinct, they lie on the unit circle |z|=1, and Möbius transformations take circles to circles, we get that A maps the unit circle to itself.

    • A(0)=0.

    • Thus A maps the open unit disc 𝐃={z|z|<1} to itself.

    It is well known [15, Chapter 12] that Möbius transformations mapping 𝐃 to 𝐃 are of the form:

    A(z)=azb¯bza¯.

    If in addition we have A(0)=0, then we get:

    0=A(0)=b¯a¯,

    which means that b=0. Thus A(z) is of the form

    A(z)=λz,

    for some λ=aa¯.

    Now A(αi)=βi for i=1,2,3 implies βi=λαi. In particular, for any ij,

    βiβj=λαiλαj=αiαj.

    The left-hand side is an m2-th root of unity, and the right-hand side is an m1-th root of unity. Since gcd(m1,m2)=1, the only complex number that is simultaneously an m1-th and an m2-th root of unity is 1. So we must have βi/βj=1 and hence βi=βj, contradicting the fact that we are in the case where the βi are all distinct.

    Thus our assumption that P(0,0)=0 is wrong, and we conclude that the multilinear polynomial P(X,Y) specified by Equation (1) is as desired.

This concludes the proof of Lemma 30.

By the discussion preceding Lemma 30, we conclude that there are no delta functions on G with Fourier sparsity 3.