Abstract 1 Introduction 2 Preliminaries 3 Polynomials in 𝑿~ 4 Relative Rank-Bias Property 5 Regularization Relative to 𝑿~ 6 Radius of Reed-Muller over 𝑿~ 7 List Decoding Reed Muller Over 𝑿~ References Appendix A Equidistribution of Functions Appendix B Comparing Ranks

List Decoding Quotient Reed-Muller Codes

Omri Gotlib ORCID Department of Computer Science, Bar-Ilan University, Ramat-Gan, Israel Tali Kaufman Department of Computer Science, Bar-Ilan University, Ramat-Gan, Israel Shachar Lovett ORCID Department of Computer Science and Engineering, UC San Diego, CA, USA
Abstract

Reed-Muller codes consist of evaluations of n-variate polynomials over a finite field 𝔽 with degree at most d. Much like every linear code, Reed-Muller codes can be characterized by constraints, where a codeword is valid if and only if it satisfies all degree-d constraints.

For a subset X~𝔽n, we introduce the notion of X~-quotient Reed-Muller code. A function F:X~𝔽 is a valid codeword in the quotient code if it satisfies all the constraints of degree-d polynomials lying in X~. This gives rise to a novel phenomenon: a quotient codeword may have many extensions to original codewords. This weakens the connection between original codewords and quotient codewords which introduces a richer range of behaviors along with substantial new challenges.

Our goal is to answer the following question: what properties of X~ will imply that the quotient code inherits its distance and list-decoding radius from the original code?
We address this question using techniques developed by Bhowmick and Lovett [8], identifying key properties of 𝔽n used in their proof and extending them to general subsets X~𝔽n. By introducing a new tool, we overcome the novel challenge in analyzing the quotient code that arises from the weak connection between original and quotient codewords. This enables us to apply known results from additive combinatorics and algebraic geometry [34, 35, 37] to show that when X~ is a high rank variety, X~-quotient Reed-Muller codes inherit the distance and list-decoding parameters from the original Reed-Muller codes.

Keywords and phrases:
Reed-Muller Codes, Quotient Code, Quotient Reed-Muller Code, List Decoding, High Rank Variety, High-Order Fourier Analysis, Error-Correcting Codes
Funding:
Tali Kaufman: Supported by ISF.
Shachar Lovett: supported by NSF award 2425349 and a Simons investigator award.
Copyright and License:
[Uncaptioned image] © Omri Gotlib, Tali Kaufman, and Shachar Lovett; licensed under Creative Commons License CC-BY 4.0
2012 ACM Subject Classification:
Theory of computation Error-correcting codes
; Mathematics of computing Coding theory ; Theory of computation Pseudorandomness and derandomization
Related Version:
Full Version: https://arxiv.org/abs/2502.15650 [25]
Editors:
Srikanth Srinivasan

1 Introduction

Let 𝔽 be a finite field, n, and let X~𝔽n be a subset111As a convention, we use ~ to denote properties of the subset, and thus also the subset itself. . We begin by introducing a new definition applicable to any linear code over 𝔽: the X~-quotient code. We then illustrate this novel definition using Reed-Muller codes, and present a property of X~ which we use to show that X~-quotient Reed-Muller code inherits its distance and list decoding radius from the original Reed-Muller code. Finally, leveraging known results from additive combinatorics and algebraic geometry, we establish as a corollary that this inheritance holds when X~ is a high-rank variety.

The Quotient Code

Let be a linear code over 𝔽. Each codeword of can be described as a function F:𝔽n𝔽 that is in the span of the columns of the code’s generator matrix. An equivalent way to describe is using a parity check matrix, where a function F is a codeword if and only if it satisfies the constraints represented by parity-check matrix. Each such constraint can be thought of as a requirement over a few inputs of F from 𝔽n: the requirement that their weighted sum will equal 0.

The first novel definition we introduce is the definition of the X~-induced code:

Definition 1 (The X~-Induced Code).

We define the X~-induced code X~ to be the set of all functions f:X~𝔽 222By convention, we use uppercase letters to denote functions with domain 𝔽n and lowercase letters to denote functions with domain X~. that satisfy all the constraints of that lie in X~, i.e. constraints that are supported only on points from X~ 333We note that this definition is in fact a property of the constraints of the code, such as other well-studied desired code properties (e.g. local testability). .

Let us briefly describe the connection between codewords in 𝔽n and X~-induced codewords. One can easily verify that each original codeword restricted to X~ is a valid codeword in the induced code. This is because each original codeword satisfies all the constraints in 𝔽n by definition, and the constraints that words need to satisfy to be considered induced-codewords are only a subset of those constraints.
We call an extension of an X~-induced codeword f:X~𝔽 to valid codeword in the original code (extending its domain to 𝔽n), a lift of f. When each induced codeword has a unique lift, there is a natural 1-to-1 correspondence between the original and induced codeword. This becomes substantially more interesting for subsets X~ in which induced codewords have multiple lifts. This non-uniqueness weakens the connection between the original codewords and induced codewords, and leads to a richer range of phenomena (and interesting new challenges).

We also note that the other direction is not always true: For a general subset X~, there might be an induced codeword (a valid codeword in the induced code) that cannot be lifted to a valid codeword in 𝔽n. We are interested to better understand X~ using and vice-versa, and therefore we introduce a new notion, which is the notion of the X~-quotient code:

Definition 2 (The X~-Quotient Code).

Let be a linear code, and let X~ be the X~-induced code of . We say X~ is a X~-quotient code if every quotient codeword fX~ has a lift to 𝔽n.

In the case described above, we also say that X~ is a lift-enabler for and that the code is a covering code for the code X~.
The novelty of this definition is that it captures subsets in which there is a correspondence between codewords in X~ and in 𝔽n, and the correspondence may be 1-to-many.

Importance of Definition

This timely definition extends a fundamental and useful concept previously introduced for graphs and complexes – namely, the notion of a covering graph or alternatively, the quotient graph. This concept gained an increasing prominence in theoretical computer science, where it was recently employed to construct high dimensional expanders [18, 6] and achieve improved local testing results [26, 19, 3], where the latter also played a crucial role in constructions of PCPs. Consequently, the study of covering spaces for graphs has found usages in theoretical computer science and specifically in development of PCPs with enhanced properties. We believe our question, which explores the analogous question for codes, will similarly lead to meaningful applications in theoretical computer science.

In addition to that, the question of puncturing of codes has caught much attention recently, in a line of work [14, 2, 12, 13], followed by the resolution of the GM-MDS conjecture [39, 45]. Where the question of puncturing is focused exclusively on the case where the lift is unique, the study of quotient codes also tackles subsets X~𝔽n where the lift is not unique. Notably, in the unique-lift case there are well-established lower-bounds for the size of X~ such as [20, Theorem 1.1]. In contrast, the size of X~ in quotient codes may be much smaller than its lower-bound in punctured code (for example in Reed-Muller codes), suggesting the potential for new insights and improved results.

Our Question

Our goal is to answer the following question: what properties of X~ will imply that the quotient code inherits its distance and list-decoding radius from the original code?

This question is analogous to the study of quotients of expander graphs – just as not all quotients of an expander necessarily preserve expansion, not all subsets X~ necessarily yield a well-behaved quotient code. Understanding the conditions under which expansion is preserved has been a fundamental problem in the study of expanders, and similarly, identifying the conditions under which a quotient code retains key properties of the original code is a central challenge in our work. Given this parallel, we believe our question may have broader implications for future research in both coding theory and theoretical computer science.

We answer this question in the context of Reed-Muller codes. Notably, our approach does not only address the case of where there are multiple lifts, but also introduces a novel framework for analyzing unique-lift (puncturing) setting when the field size is constant-a scenario that is typically considered more challenging.

Reed-Muller Codes

Let 𝔽 be a finite field, and let n,d be integers. We focus on prime fields (𝔽=𝔽p) and assume this setting unless explicitly stating otherwise. This assumption also applies to all fields considered in the works we reference 444We believe that our techniques may extend to non-prime fields as well, but we do not pursue this direction in the current work. .

Each codeword in Reed-Muller code RM𝔽,𝔽n(d), is defined by a polynomial over 𝔽 in n variables with total degree d 555We focus on the regime where d,|𝔽| are considered constants and n is considered very large. . The message that one wishes to encode is represented in the code as a polynomial P:𝔽n𝔽, whose coefficients are the different message characters. The encoding of the message is a vector of the different evaluation of P over all possible points in 𝔽n.

Alternatively, one can describe Reed-Muller codes using a set of local constraints. A function F:𝔽n𝔽 is a polynomial of degree d if and only if the (alternating) sum of each possible cube, which is a set of points of the form {x+iSyi}S[d+1] for x,y1,,yd+1𝔽n, equals 0. Thus each cube represents a constraint, and we refer to the set of all cubes the set of constraints of degree-d polynomials. See Section 2.2 for more information in this regard.

Next, we present our notations for the induced Reed-Muller code:

Notation 3 (The X~-Induced Reed-Muller Code).

We say a function F:X~𝔽 is a polynomial of degree d in X~ if it satisfies all the constraints of degree-d polynomials that lie in X~ 666That is, the set of all cubes that their points are in X~. .
We denote the X~-induced Reed-Muller code:

RM𝔽,X~(d)={p:X~𝔽|p is a polynomial of degree d in X~}
Properties of Induced Reed-Muller Codes

A study of Ziegler and Kazhdan [34, 35, 36] shows that if X~ is a high rank variety 777Under some conditions we describe later. , then X~ is a lift-enabler for RM𝔽,𝔽n(d). In other words, the authors showed that the X~-induced Reed-Muller code is in fact a X~-quotient Reed-Muller code. We rely on this property of X~ as a black-box. See Section 3 for more details in this regard.

An additional property of X~𝔽n we rely on is the connection between algebraic structure and random behavior (equidistribution) of polynomials in X~.
For 𝔽n, this connection is a well-studied result [28, 33, 9]. It lies in the heart of many results in higher-order Fourier analysis, and specifically was used in [8] to analyze the list decoding radius of Reed-Muller code in 𝔽n.
The equivalent of this relation for subsets X~𝔽n was studied in [37, 27]. These works captured the measure of algebraic-structure in X~ by a definition called relative rank, and captured the lack of random behavior in X~ by a definition called relative bias. We note that for subsets, the definition of algebraic structure of a polynomial in X~ considers the algebraic structure of all its possible lifts. It was shown in [37] that when X~ is a high-rank variety, high relative rank implies low relative bias 888 Note that even though Gowers and Karam [27] also acheived a similar relation for a type of subsets, the definition of rank they used is slightly different than the standard definition of rank. While this difference may seem unharmful at first, it is, to our knowledge, does not allow to do a regularization process (note that a generalization of this process is the heart of our proof). .
We use this property as a black box as well. When a subset X~𝔽n has such property for polynomials of degree d, we say that it has the d-relative rank-bias property. See Section 4 for more details.

Our Results

Next, let us present our main theorem more concretely. Our work focuses on the regime where d<|𝔽|. Denote the minimum normalized distance of RM𝔽,𝔽n(d) by δ𝔽,𝔽n(d), shorthand by δ𝔽(d). We have:

δ𝔽(d)=1d/|𝔽|

Moreover, we define the list decoding count of RM𝔽,𝔽n(d) by:

𝔽,𝔽n(d,τ)maxF:𝔽n𝔽|{PPolyd(𝔽n𝔽)|dist(P,F)τ}|

Let LDR𝔽,𝔽n(d) be the list decoding radius of RM𝔽,𝔽n(d), which is the maximum τ for which 𝔽,𝔽n(d,τϵ) is bounded by a constant depending only on ϵ,|𝔽|,d.
In the paper [8] it was shown that for constant field size and degree, the list decoding radius reaches the distance of the code, as conjectured earlier by [24] 999Note that it is known that LDR𝔽,𝔽n(d)δ𝔽(d), and therefore, in a sense, their result is optimal in 𝔽n assuming d,|𝔽| are considered as constants. .

For X~, we define the X~-list decoding count:

𝔽,X~(d,τ)maxF:X~𝔽|{PPolyd(X~𝔽)|dist(P,F)τ}|

We denote the distance parameters of X~𝔽n by δ𝔽,X~(d) and LDR𝔽,X~(d) respectively,

We next present our main theorem, which establishes that the list decoding radius of the quotient Reed-Muller code is at least as good as the that of the original code:

Theorem (List Decoding Quotient Reed-Muller Code).
101010Informal, for formal see Theorem 76.

Let 𝔽 be a finite field of constant size, let d be a constant such that d<|𝔽|, and let n be an integer.
Let X~𝔽n be a subset that is a lift-enabler for RM𝔽,𝔽n(d) and has the d-relative rank-bias property.
Then, RM𝔽,X~(d) inherits its list decoding radius from RM𝔽,𝔽n(d), i.e:

LDR𝔽,X~(d)LDR𝔽,𝔽n(d)

In addition, we also achieve a (simpler) result regarding the distance of the quotient Reed-Muller code (Theorem 68): Under the conditions described above, RM𝔽,X~(d) also inherits its distance from RM𝔽,𝔽n(d), i.e δ𝔽,X~(d)δ𝔽,𝔽n(d) 111111Our techniques also show that also the other direction is true, which yields an equality in the distance of the two codes. .

As a corollary, using results studied in [34, 35, 37] regarding high-rank varieties, we obtain the following:

Corollary (List Decoding Quotient Reed-Muller Code: High Rank Variety).

Let X~𝔽n be a high rank variety, that is, X~ is the set of common zeros of a collection of polynomials ~=(L1,,Lc~) that is of high rank 121212To be more precise, the greater or equals in the decoding parameters wirtten in the theorem are not exact, but they are true up to some ϵ that depend on the rank of the collection. Thus the higher the rank of the collection is, the more similar the quotient Reed-Muller code and the original Reed-Muller code in terms of distance and list-decoding radius. 131313We also note that for this result some assumptions are needed regarding the field size or the degree of the polynomials in the collection. , i.e. X~=Z(~)={x|i:Li(x)=0}.
Then, RM𝔽,X~(d) inherits its distance parameters from RM𝔽,𝔽n(d), i.e:

  1. 1.

    δ𝔽,X~(d)δ𝔽,𝔽n(d).

  2. 2.

    LDR𝔽,X~(d)LDR𝔽,𝔽n(d).

Example 4.

Let d,n, and denote n=dn. Define Ln:𝔽n𝔽 to be:

Ln(x1,,xn)i=1nj=1dx(i1)d+j

It was shown [35, Theorem 1.9] that rank(Ln) as n. Thus for sufficiently large n, the variety X~Z(Ln)={x|Ln(x)=0}𝔽n satisfies the necessary conditions so that X~-quotient Reed-Muller code inherits its distance parameters from the original Reed-Muller code 141414More accuratly, it was shown that its schmidt rank goes to infinity as n goes to infinity. This is a sufficient condition for applying Theorem 76. .

Main Technical Challenge

We achieve these results by combining the two black-box properties of subsets X~𝔽n we presented. Analysis of the polynomials in X~ raises a new challenge, as previous techniques that were used to analyze low-degree polynomials, both regarding 𝔽n [28] and regarding subsets X~ [37], were focused on maintaining the behavior of polynomials within the relevant domain, without maintaining a connection between polynomials over X~ and their extensions to 𝔽n.
The novelty of our new technique is that it uses a similar approach to analyze polynomials in X~ as commonly used in 𝔽n, while simultaneously maintaining a connection between polynomials in X~ to polynomials in 𝔽n. This connection allows us to deduce that polynomials in X~ behave similarly to polynomials in 𝔽n. Informally, given a question about a polynomial defined over X~, our technique enables us to identify a suitable lift of the polynomial to a polynomial over 𝔽n, and answer the question using properties of that lift. Crucially, the appropriate lift depends on the nature of the question, meaning that no single canonical lift suffices for all purposes.

Next we describe this challenge in more detail.
Analyses of polynomials in 𝔽n were commonly based on the structure-randomness connection of polynomials in 𝔽n. A central tool that leverages this property of 𝔽n is the regularization process [28], which transforms any collection of polynomials into a new one that:

  1. 1.

    Is equidistributed in 𝔽n.

  2. 2.

    Captures the same functions as the original collection.

This capturing is formalized via measurability: a function F:𝔽n𝔽 is said to be measurable with respect to a collection 𝒫=(P1,,Pc) if it can be expressed as a function of the polynomials in 𝒫. The regularization process guarantees that any function measurable by the original collection remains measurable by the new one.

Equidistribution is obtained by enforcing the collection to be of high rank, which informally means the polynomial has extremly low algebraic structure, that is, it cannot be approximated or predicted by a small number of lower-degree polynomials. In 𝔽n, this structural condition implies low bias in 𝔽n, where a low-biased polynomial is equidistributed.

To generalize this process to X~𝔽n, while being able to deduce properties of polynomials in X~ using polynomials in 𝔽n, we aim to construct a new collection of polynomials that has additional requirements:

  1. 3.

    Is equidistributed restricted to X~.

  2. 4.

    Captures the same functions as the original collection restricted to X~.

The requirement of the conditions together is unique to our setting and introduces a key difficulty. Equidistribution over X~ requires the collection to have high relative rank, which requires one to eliminate structure not just from a polynomial, but from all of its X~-equivalent polynomials – those over 𝔽n that agree with it on X~ and have the same degree bound 151515This is the same as considering all lifts of the polynomial P|X~, assuming such lift exist. . Ensuring this while maintaining measurability in 𝔽n is nontrivial.
To achieve equidistribution in 𝔽n, the regularization process replaces structured polynomials by a small collection of high-rank 161616More accuratly, the replacement is done recursively unitl the collection is of high rank. polynomials that capture them. Over X~, avoiding structure in all X~-equivalents may require replacing a polynomial with polynomials that capture a different – but X~-equivalent – function. This risks breaking measurability in 𝔽n and thus losing the connection to the polynomials in 𝔽n.
In summary, our main technical challenge is to achieve equidistribution over X~ while preserving measurability over both X~ and 𝔽n, despite the need to eliminate structure across all X~-equivalent polynomials.

Introducing New Tools

We overcome this challenge by presenting a new definition that relaxes the notion of measurable we required for functions in 𝔽n, which we call X~-measurable. This enables us to describe a relaxed version of the regularization process, in which we require that every function in 𝔽n that was X~-measurable by the old collection will still be X~-measurable by the new collection. In contrast to the original regularization process, which mandated that functions that were measurable by the old collection will be measurable by the collection, this relaxed definition only requires such functions to be X~-measurable by the new collection.

Even though we no longer need to capture all previously captured functions in 𝔽n, it is important that the new relaxed-definition is strict enough to keep the connection between polynomials in X~ and in 𝔽n. Therefore, maintaining the X~-measurable functions throughout the regularization process cannot be done trivially, and this is handled in a procedure we call the X~-relative regularization process which is a stronger-version of the regularization process that is used in 𝔽n. This new definition and procedure are thoroughly described in Section 5.

We note that these new definition and procedure are a novel contribution of this work, and we believe they can be useful in future research of the quotient Reed-Muller code.

1.1 Comparison to Related Work

In [8] the authors studied the list decoding radius of Reed Muller codes 𝔽n. They proved that, for prime fields, the list decoding radius reaches the distance of the code, as conjectured earlier by [24] 171717Note that it is known that LDR𝔽,𝔽n(d)δ𝔽(d), and therefore, in a sense, their result is optimal in 𝔽n assuming d,|𝔽| are considered as constants. 181818We also note that their work also apply to the regime d|𝔽|. . Formally, they showed the following theorem:

Theorem 5 ([8, Theorem 1]).

Let 𝔽 be a prime field. Let ϵ>0 and d,n. There exists a constant 191919It is important to note that c is independent of n. cc(|𝔽|,d,ϵ) such that:

𝔽,𝔽n(d,δ𝔽(d)ϵ)c

Our work gives new tools for analyzing polynomials in X~𝔽n, which we later use to follow their line of proof and show an equivalent result in X~.

We next present related work regarding the study of polynomial codes in subsets X~𝔽n. Before presenting them specifically, we note that our work has a fundamental difference than that of the previous study of polynomials in subsets. Most works which studied polynomials over subsets X~𝔽n were focused on subsets in which every polynomial has a unique lift. This ensures that there is a 1-to-1 correspondence between polynomials in X~ and in 𝔽n and therefore allows easier connection between polynomials in X~ and in 𝔽n.
We note that our work is non-trivial even in this case: it extracts the properties of 𝔽n that were used in [8], in a way they can be used to analyze quotient Reed-Muller codes. However, as described earlier, our work addresses an additional substantial challenge which arise when the lift is not unique. Thus our work is only comparable to other works in the unique-lift case, which is the less-challenging case we address.

The first line of work we mention is this regard is the study of hitting sets for low degree polynomials [40, 15, 31], and a stronger variant of it which is the study of pseudorandom-generators against low degree polynomials  [10, 11, 38, 44, 16, 17, 22] Both definitions capture subsets 202020Sometimes this subset is allowed to be a multiset. X~𝔽n such that every polynomial over 𝔽n has a non-negligible distance from 0 when restricted to X~. This requirement implicitly implies that every low degree polynomial over X~ has at most a single lift.

Another line of work worth mentioning in this regard is [21, 30], which studied puncturing of Reed-Muller codes. This line of work studied the construction of sets X~𝔽n, such that puncturing Reed-Muller codes over X~, that is, taking every original codeword and restricting it to X~, will yield a good error-correction code. To perform their analysis, it was important that every polynomial in X~ has at most a single lift, and therefore their work was focused on subsets where there is a unique lift.

The papers [14, 2, 12] also studied similar questions. This line of work is followed by the resolution of the GM-MDS conjecture, which was proved by [39, 45].
We note that these works were focused on the regime where the field is large. More specifically, they require that the field is large in respect of n, i.e. Ω(n). We emphasize that our work is focused on constant fields. Moreover, their results were regarding random puncturing, while our result makes an explicit puncturing.

We also note that most studies presented above also achieved results regarding the rate of the punctured code. This property of the code can be analyzed naturally when each polynomial over X~ has a unique lift, as such assumption implies that the number of polynomials remains the same in X~ as of in 𝔽n. In contrast, our work does not rely on such a uniqueness assumption, and therefore does not address the rate of the resulting code. As our work does not assume such uniqueness, the rate of the code we consider is not analyzed in our work. Nonetheless, we note that the Hilbert function of a subset X~𝔽n corresponds to the rate of the X~-quotient Reed-Muller code, and that some great progress has been made in analyzing this function [5, 4, 41, 23, 1].

1.2 Proof Overview

Our main technical contribution is a generalization of the regularization process to subsets X~𝔽n, which we call the relative regularization process. This tool addresses the core difficulty of non-unique lift, and relies on a new notion we introduce: X~-measurability.

Measurablity and Regularization in 𝔽𝒏

Measurablility is a mathematical-analysis notion, which was first used in a similar context in [29]. It is defined as follows:

Definition 6 (Measurable).

Let 212121In this context we think of c as a small (constant for example). 𝒫=(P1,,Pc) be a collection of polynomials of degree d. A function F:𝔽n𝔽 is measurable in respect of 𝒫 if it can be determined by the values of P1,,Pc:

F(x)=ΓF(P1(x),,Pc(x))

for some function ΓF:𝔽c𝔽 222222One can think of this definition as a generalization of linear span: the collection spans the function, where Γ is some notion of a span. .

Intuitively, 𝒫 captures the information required to compute F. If 𝒫 is a high-rank collection, the tuple (P1(x),,Pc(x)) is equidistributed over x𝔽n as 𝔽n has the property that high-rank implies equidistribution. This allows one to analyze F through the simpler function ΓF.

The regularization process is a fundamental tool presented in [28] that constructs a high-rank collection 𝒫 refining 𝒫, meaning that all 𝒫-measurable functions remain measurable with respect to 𝒫.

𝑿~-Relative Rank

We remind the reader that rank is a notion that measures the algebraic structure of polynomials, where high rank implies extremly low algebraic structure. In addition, X~-relative rank is a notion that measures the algebraic structure of a polynomial in a subset X~𝔽n, by considering the structure of all of its X~-equivalent polynomials. This notion was presented by [27, 37], and is used to achieve equidistribution in X~ assuming X~ has relative rank-bias property. It is defined as follows:

Definition 7 (Relative Rank, informal. See definition 48).

Let X~𝔽n be a subset, let d, and let P:𝔽n𝔽 be a polynomial of degree =d. The X~-relative rank of P is defined as follows:

rankX~(P)min{rank(PPmissing¯)|Pmissing¯Polyd(𝔽n𝔽),Pmissing¯|X~0}

1.2.1 𝑿~-Measurablity and The 𝑿~-Relative Regularization Process

In this subsection we discuss the generalization of the regularization process to subsets X~𝔽n using the equivalent of rank-bias relation in X~. We name this tool the relative regularization process.

Practically, we use this tool to show that given a specific question in mind, every p:X~𝔽 has some polynomial P:𝔽n𝔽 that behave “similarly” in respect to this question. This allows us to pull properties of P to better understand p. The perfect candidate for such P is a lift of p.
In order to use P to deduce properties of p, we use the well-studied properties of polynomials in 𝔽n to acheive properties of P, and relate these to properties of p. More specifically, assume that p and P are measurable in respect of a collection of polynomials 𝒫 (each in its domain). Our strategy is to use P to deduce properties of ΓP, and then use the properties of ΓP to deduce properties of p.

Now let us describe the extra challenge. We start by following the ideas of the regularization process we described for 𝔽n. Assuming the collection is not a collection of X~-relative high rank, then there must exist a polynomial in the collection that has low relative rank, which we denote by P 232323More precisely, some linear combination of polynomials has low relative rank. . Note that in relative rank, this does not necessarily mean that P is of low rank, but that there exists another X~-equivalent polynomial that has a low rank. Thus, even if we remove the low-rank X~-equivalent polynomial and add to the collection all the polynomials that decomposed it, we cannot require that every function that was measurable by the old collection will still be measurable by the new collection: even the polynomial we removed is not necessarily measurable by the new collection!
To allow such regularization process to still apply, we note that while P might not be measurable in respect of the new collection, a X~-equivalent polynomial of P is measurable with respect of it. Therefore, we relax the notion of being measurable to being X~-measurable.
We say a function F is X~-measurable in respect of 𝒫 if it can be determined by the polynomials of 𝒫 up to a valid X~-remainder. We first describe an incomplete definition, then present the challenge that rises with it, and finally present its resolution.

Definition 8 (X~-measurable, Incomplete Definition).
242424This incomplete definition lacks the requirement of the validity of the X~-remainder

We say a function F is X~-measurable in respect of 𝒫=(P1,,Pc) if there exists a function Γ:𝔽c𝔽 and a X~-remainder, i.e. a function Fmissing¯:𝔽n𝔽 with Fmissing¯|X~0 such that:

a𝔽n:F(a)=Γ(P1(a),,Pc(a))+Fmissing¯(a)

Previous works analyzing polynomials in 𝔽n were able to deduce two things from F being measurable by a high-rank collection 𝒫. The first, is that the structure of Γ is similar to the structure of F: for example, if one is a polynomial of bounded degree, so is the other. The second, is that a random input of Γ behave similarly to a random input of F: that is, the output distribution of Γ (over its inputs 𝔽c) is close to the output distribution of F (over its inputs 𝔽n).
To study polynomials in X~, we wish to connect p to P (which is a lift of p). Thus, we think of F=P, and require two similar things. Firstly, we want the structure of Γ to be similar to the structure of F (in this case, P), which we understand as F is a polynomial in 𝔽n. Secondly, we want a random input of Γ to behave similarly to a random input of p, as p is the polynomial we wish to understand. The latter is easily achieved using the fact high X~-relative rank implies equidistribution in X~. The former, however, might be damaged by the remainder as we defined it: we can only learn the structure of Γ using the structure of FFmissing¯ using the equality FFmissing¯=Γ(P1,,Pc). However, the structure of FFmissing¯ can be very different from the structure of F, as we did not require any structure of the X~-remainder Fmissing¯. Thus, we can not deduce the structure of Γ via the structure of F using the incomplete definition described above.

To handle this issue, we add one more requirement regarding the X~-remainder, which ensures that the structure of F can be understood via the structure of Γ:

deg(FFmissing¯)deg(F)

If the X~-remainder also has this property, we say it is a valid X~-remainder for F. This can be summarized by the following (complete) definition:

Definition 9 (X~-measurable).

We say a function F is X~-measurable in respect of 𝒫=(P1,,Pc) if there exists a function Γ:𝔽c𝔽 and a valid X~-remainder, i.e. a function Fmissing¯:𝔽n𝔽 with Fmissing¯|X~0 and deg(FFmissing¯)deg(F) such that:

a𝔽n:F(a)=Γ(P1(a),,Pc(a))+Fmissing¯(a)

We use this new definition the following way: Instead of using F to understand Γ, we use FFmissing¯ to do so. We choose FFmissing¯ as it has the same structure as F, but it is “closer” to the function Γ as FFmissing¯=Γ(P1,,Pc) 252525One can think of this step as “taking the right X~-equivalent” in respect of 𝒫. . Finally, as Γ behaves similarly to p for random inputs, we can use Γ to deduce properties regarding p.
With this in hand, let us finish describing the relative-regularization process. The requirement on the validity of the X~-remainder raises a new challenge in the X~-relative regularization process: we need to somehow control the structure of the X~-remainder, even though this “error” is substituted in Γ each time we wish to replace a polynomial in our collection. We address this challenge using a Lemma proved in [9] called the “faithful composition lemma”, which allows us to deduce strong properties regarding the structure of Γ given the collection was of a high (regular) rank in the first place. Therefore, we add to each step of the relative-regularization process a (regular) regularization, which ensures Γ is very structured. This strong structure of Γ is later used to control the error and deduce it is in the form of a valid X~-remainder. For the exact details, see Theorem 64. We conclude this by informally stating our main technical theorem, which is the relative regularization process we just described:

Theorem 10 (Relative Regularization Process, Informal, See Theorem 64).

Let r,d be integers that represents a requested rank and degree respectively, and let P1,,Pc be a collection of polynomials of degree d. Then, there is another collection P1,,Pc of polynomials of degree d, such that:

  1. 1.

    Every function that is X~-measurable in respect to the first collection is also X~-measurable in respect to the new collection.

  2. 2.

    The new collection is of X~-relative rank r.

  3. 3.

    The new collection is of bounded size, i.e. cCr,d,c.

1.2.2 List Decoding in 𝑿~ via 𝑿~-Relative Regularization

In this subsection, we demonstrate how to use the relative regularization process to achieve our main theorem: analysis of the list decoding radius of RM𝔽,X~(d).

We follow the line of proof of [8], but this time, we are interested in bounding the amount of polynomials in X~ around every function in X~. More specifically, we wish to show that there is a constant number of words that are (δ𝔽(d)ϵ)-close to any fixed function in X~.

Let f:X~𝔽 be a received word. First, we apply a lemma proved in [8, Corollary 3.3]. The lemma shows that there is a constant-sized (depending on ϵ) collection of polynomials in X~, denoted by 𝔥, such that the distance of f to any polynomial can be approximated by the distance of f to some function that is measurable by 𝔥 in X~. This means that instead of bounding the number of polynomials in the radius of f, one can bound the number of polynomials in the radius of some function measurable by 𝔥. Thereby, every polynomial-specific measurable function can be thought of as a low complexity proxy for f in respect to the polynomial.

Next, we lift each polynomial from 𝔥 and apply the relative regularization process. This yields a new collection of polynomials in 𝔽n that is constant sized and randomly-behaving (in 𝔽n). Denote this new collection by 262626We use the same notations as the original proof for clearannce. . Thereby, the question of list decoding is reduced to the following question: We have a specific constant-sized randomly-behaving collection of polynomials ={H1,,Hc} that was constructed using the function f. We need to bound the amount of polynomials in X~ that are (δd(𝔽)ϵ/2)-close to be measurable by this collection in X~. Note that the randomly-behaving property was achieved using the relative rank-bias property of X~. Additionally, we note the collection is a collection of polynomials in 𝔽n which we obtained by using the lift-enabler property of X~.

From there (and similarly to the analysis in 𝔽n), the strategy is to show that polynomials that are that close to being measurable by the randomly-behaving collection , are in fact measurable by it. This will bound the number of such polynomials by the amount of possible functions that are measurable by , which is constant as the collection is of constant size.

Let p:X~𝔽 be a polynomial of degree d, and consider a lift of it P:𝔽n𝔽. Consider the collection {P}. Surely, P is measurable by this collection in 𝔽n. Applying X~-relative-regularization to this collection yields a new collection ′′ that is equidistributed in X~, such that every X~-measurable function by the old collection is X~-measurable by the new collection. By a reason we have not explained in this brief explanation, we can ensure this collection is of the form ′′={H1′′,,Hc′′′′}.

As P was X~-measurable by {P} (it was even measurable), P is X~-measurable by the new collection ′′: That is, P is measurable by ′′ up to a valid remainder, denoted by Pmissing¯.
This means there exists Φ:𝔽c+c′′𝔽 such that:

a𝔽n:P(a)=Φ(H1(a),,Hc(a),H1′′(a),,Hc′′′′(a)))+Pmissing¯(a)

In 𝔽n, the proof would follow by studying the structure of the function Φ and use it to induce that Φ does not depend on its last c′′ variables. This implies that P is measurable by the original collection which concludes the proof 272727Note that in 𝔽n there is no remainder, so the equation above (with the last c′′ variables as constants) implies measurability by . .

More accurately, the analysis in 𝔽n used the fact that substituting randomly behaving polynomials in Φ yields a structured function 282828In our notations, this structured function is P, which is a polynomial of degree d and thus structured . This is used to show that Φ as a function by itself, with inputs from 𝔽c+c′′, is a very structured function. The strong structure of Φ, with the fact that Φ (with inputs substitued to be the functions of ′′) is close to the function f, are then combined to deduce that Φ does not depend on its last c′′ variables.

This paradigm can not be extended effortlessly to our case. In X~, deducing that Φ is very structured requires a one-more major step. This is because we do not know any correspondence in the behavior of Φ (which we want to understand) with the behavior of P (which we know is structured). We only know there is a correspondence between Φ to another function PPmissing¯, which apriori we do not know is structured!

Fortunately, the relative regularization process (Theorem 64) mandates that the remainder of the measurement is valid. That is, if P was structured (a polynomial of degree d), then so does PPmissing¯. This is crucial, as it allows us to use the relation between Φ and PPmissing¯ to deduce that Φ is structured, and continue the original outline of the proof of [8]. For more details in this regard, see Theorem 76.

1.3 Organization

In Section 2 we present some basic notations and conventions, and define the preliminaries we have regarding high-order Fourier analysis in 𝔽n: polynomials, rank and regularization. We later generalize each component we presented in Section 2 to study polynomials in 𝔽n to also study polynomials in X~: in Section 3 we present the set of polynomials in X~ and present the lift-enabler property; in Section 4 we present the X~-relative rank-bias property; and in Section 5 we present the X~-measurable notion, and our main tool, which is the X~-relative regularization process. Next, we present two applications regarding the distance parameters of Reed-Muller codes in X~: In Section 6 we prove the inheritance of the distance of the code; and in Section 7 we prove the inheritance of the list decoding distance of it (which is much more involved).

2 Preliminaries

2.1 Basic Definitions and Notations

We denote by the set set of integers, i.e. natural numbers (excluding 0). For an integer k we denote [k]{1,2,,k}. We use y=x±ϵ to denote y[xϵ,x+ϵ], and similarly y=xλ to denote y[x+λ,xλ] (usually when λ<0).
Fix a prime field 𝔽=𝔽p. Denote by || the natural map of 𝔽 to {1,,p1}. We denote the character from 𝔽 by e[x]e2πi|x|

Generally speaking and unless stated otherwise, we use the following conventions: We use n to denote the number of variables in Reed-Muller code. We use d to denote a degree (typically the degree of the polynomials in our code), and X~ to denote the subset of 𝔽n we work in i.e. X~𝔽n. Properties of the subset X~ will usually be denoted with ~. We use F,G,H to denote general functions with domain 𝔽n, and f,g,h to denote functions with domain X~. We use 𝔉,𝔊, and 𝔣,𝔤,𝔥 respectively to denote sets of such functions. Similarly, we use P,Q,H to denote polynomials with domain 𝔽n, and p,q,h polynomials with domain X~ (polynomials as defined in Section 3). We use 𝒫,𝒬, and 𝔭,𝔮,𝔥 respectively to denote sets of such polynomials.

2.2 Polynomials in 𝔽𝒏

We start by presenting a standard definition for a polynomial over a finite field.

Definition 11 (Polynomial: Global Definition).

Let d be a constant. A function P:𝔽n𝔽 is called a polynomial of degree d if it is of the following form:

P(x1,,xn)=0d1,,dn:i=1ndidcd1,,dni=1nxidi

We denote the set of all polynomials of degree d by Polyd(𝔽n𝔽). The value d in the definition above is called the global degree of the function P, shorthand by the degree of P, and it is denoted by deg(P)=d.
Additionally, the set of all polynomials from 𝔽n to 𝔽 of degree d is denoted by:

Polyd(𝔽n𝔽)
 Note 12.

Note that it is a folklore that every function F:𝔽n𝔽 is a polynomial function, that is, can be written in the representation stated above for some degree. This follows from the representation of Dirac delta function over 𝔽n as a polynomial:

𝟙0(x1,,xn){1x1==xn=00otherwise=i=1n(1(xi)|𝔽|1)

Therefore the definition of total degree is meaningful for all functions F:𝔽n𝔽.

Next, we present a known equivalent definition for a polynomial using derivatives. To do so, we first define a derivative in the case of finite fields.

Definition 13 (Derivative).

Given a function F:𝔽n𝔽 and a𝔽n, we define the derivative of F in direction a as a function DaF:𝔽n𝔽 defined as follows:

DaF(x)F(x+a)F(x)
Lemma 14.

Let d. A function F:𝔽n𝔽 is a polynomial of degree d if and only if DaF is a polynomial of degree d1 for all a𝔽n.

This leads us to a natural definition of a degree of a function using derivatives.

Definition 15 (Local Degree).

For a function F:𝔽n𝔽, we define its local degree, to be the least integer d such that for all a1,,ad+1,x𝔽n:

Dad+1Da1F(x)=0

In 𝔽n, the two definitions of degree coincide, and we get a single definition of a degree:

Lemma 16 (Equivalance of definitions of a degree).

Let F:𝔽n𝔽 be a function, and let d be an integer. Then, the global degree of a F equals its local degree.

 Remark 17.

We sometimes refer to the requirement that the local degree of a function is d, as the local criteria of degree d polynomials.

2.3 Rank-bias in 𝔽𝒏

We start by defining the notion of bias, which is a measure of how the function is far from being equidistributed (see Appendix A for the exact details).

Definition 18 (Bias).

Let F:𝔽n𝔽. The bias of the function F is defined in the following way:

bias(F)1/|𝔽n|x𝔽ne[F(x)]

Moreover, for a subset X~𝔽n, we define the bias of F in X~ to be:

biasX~(F)1/|X~|xX~e[F(x)]

Next, we present a standard definition of rank of a polynomial, which is a notion that measures how structured is the function. Note that low rank implies the polynomial is highly structured. Formally we have the following definition:

Definition 19 (Rank of a Polynomial).

Given a constant d and a polynomial P, the d-rank of P, denoted as rankd(P) is defined to be the smallest integer r such that P can be computed given r polynomials of degree <d. In other wards, we say rankd(P)=r if r is the smallest integer such that there exists r polynomials Q1,,QrPolyd1(𝔽n𝔽) and a function Γ:𝔽n𝔽 such that:

P(x)=Γ(Q1(x),,Qr(x))

If d=1, then 1-rank is defined to be if P is non constant, and 0 otherwise.
Moreover, for a polynomial P of degree deg(P)=d we denote rank(P)rankd(P).
We call such function Γ a decomposition or a computation of P using lower-degree polynomials.

Let us now define a factor. Note that we focus our discussion to factors in 𝔽n, but define the basic definitions over a general set U so they will apply for factors over a general sets. This is necessary as we will later use them also for other sets such as X~𝔽n.

Definition 20 (Factor).

Let U𝔽n be a set. Let F1,,Fc:U𝔽 be a collection of functions. A factor defined by F1,,Fc over U is the map:

F1,,Fc(u)(F1(u),,Fc(u))

By an abuse of notation, we also use to denote the partition of the set U defined by the map. We call each subset in the partition is an atom:

{uU|F1(u)=b1,,Fc(u)=bc}

for all b1,,bc𝔽. By an abuse of notation, sometimes refers to the set of all atoms (which is a partition of U).

Notation.

Let F1,,Fc:U𝔽 be a collection of functions. For a factor F1,,Fc, we denote by || the amount of functions that define it, i.e. ||=c. Moreover, we denote |𝔽|c, which is the maximal amount of (possibly empty) atoms.

Definition 21 (Polynomial Factor).

We say a factor over 𝔽n is a polynomial factor if it is defined by a collection of polynomials P1,,Pc:𝔽n𝔽, i.e. =P1,,Pc. The degree of the factor, denote as deg() is the maximal degree of the polynomials P1,,Pc.

 Note.

We emphasize that every function F:𝔽n𝔽 is a polynomial function for some degree, thus the phrase “polynomial factor“ is used to emphasize that there is a degree bound on the functions defining the factor. Also note that the notion of degree (and polynomial) are defined only for functions over 𝔽n, therefore this definition is well-defined only for U=𝔽n.

Definition 22 (Rank of a Factor).

Let 𝒫 be a collection of polynomials P1,,Pc:𝔽n𝔽. The rank of the polynomial collection is defined as:

rank(𝒫)min{rankd(i=1cλiPi)|0λ𝔽c,d=maxi[c]deg(λiPi)}

For a factor defined by a collection of polynomials 𝒫, we define its rank to be the rank of the collection of polynomials defining it. For a non-decreasing function r:, a factor is called r-regular if its rank is at least r(||).

 Note.

Note that in the definition above, the rank of each linear combination is calculated as the d-rank, where d is the maximal degree of a polynomial that participates in the linear combination non-trivially. This is crucial as it ensures that a high rank factor do not have linear dependence in the largest-degree homogenous component of any of its polynomials.

We now present a fundamental property of high rank polynomials, that was first proved by [28] when d<|𝔽|, later extended to general fields by [33], and further extended also to large fields by [9]. This property of high rank polynomials is that they have low bias:

Theorem 23 (Rank-bias in 𝔽n).

Let 𝔽 be a finite field. Let ϵ>0 and d. There exists r23r23(𝔽,d,ϵ), such that for every degree-d polynomial P:𝔽n𝔽: if rank(P)r23 then bias(P)<ϵ.

 Remark 24.

This property implies that a collection of polynomials that have high rank is equidistributed. See Appendix A for more details in this regard.

2.4 Regularization in 𝔽𝒏

In this subsection we define the regularization process in 𝔽n. Before doing so, let us present some definitions in this regard. Note that we define the basic definitions over a general set U so they will apply for factors over a general sets, as this is necessary as we will later use them also for other sets such as X~𝔽n.

Definition 25 (Measureable).

Let U be a set, and let AU. Let 𝔉={F1,,Fc} be a collection of functions Fi:U𝔽. We say a function G:U𝔽 is measurable in respect of 𝔉 in A, shorthand by 𝔉-measurable in A, if there exists a function Γ:𝔽c𝔽 such that:

aA:g(a)=Γ(F1(a),,Fc(a))

When discussing the factor over A defined by =F1,,Fc, we also say G is measurable in resepct of . The function Γ will be denoted as the measurement function of G in respect of 𝔉. Additionally, when A=U, we sometimes omit the specification of the domain, and say G is measurable in respect of 𝔉.
Note that in this paper, we usually think of U=𝔽n, and A is either 𝔽n or X~𝔽n.

 Remark 26.

If G is 𝔉-measurable in A, then every value of G in A can be determined by the values of F1,,Fc. In other words, the function G is constant inside every atom of the factor defined by 𝔉.

Definition 27 (Syntactic Refinement).

Let and be polynomial factors over U𝔽n. We say a factor is a syntactic refinement of the factor , if the collection of functions defining is a subset of the set of functions defining . We denote this property of by syn.

We now present a standard generalized definition of refinement, where we only require the atoms induced by the refined factors are sub-atoms of those that are induced by the original factor. Note that in this refinement, we allow the refined factor to include completely different polynomials than the original factor.

Definition 28 (Semantic Refinement).

Let and be polynomial factors on U defined by 𝒫 and 𝒫 respectively. We say the factor is a semantic refinement of the factor in AU, if x,yA with (x)=(y) implies that (x)=(y). We denote this property of by sem|A. When A=U, we sometimes omit A from the syntax and denote it with sem
Note that syn implies sem|A for every AU.

 Remark 29.

A handy property of semantic refinement is that if F:A𝔽 is 𝒫-measurable, then it is also 𝒫-measurable in A. Moreover, the other direction is also true: If every 𝒫-measurable function F:A𝔽 in A is also 𝒫-measurable in A, then sem|A.

Next, we recall a lemma that was presented in [7, Theorem 4.1], that allows us, given a polynomial that is measurable by a high rank factor in 𝔽n, to replace the polynomials in the measurement function to any collection of polynomials with smaller or equal degree, and preserve the degree of the original polynomial. Note that we state the lemma under the constraint that d<|𝔽|, but it is also valid for when d|𝔽| with proper generalization of definitions to this case (See [7, Theorem 4.1] for the exact statement).

Lemma 30 (Preserving Degree in 𝔽n).

Let d>0 an integer such that d<|𝔽|, and let P1,,Pc:𝔽n𝔽 be polynomials of degree at most d, that form a factor of rank r30(𝔽,d,c). Assume that for Γ:𝔽c𝔽, the function γ:𝔽n𝔽 defined as γ(a)Γ(P1(a),,Pc(a)) is of deg(γ)=d.
Then, for every collection of polynomials Q1,,Qc:𝔽n𝔽 that satisfy deg(Qi)deg(Pi), the function γ defined as γ(a)=Γ(Q1(a),,Qc(a)) is a polynomial of deg(γ)d.

Next, we restate a useful lemma from [9, Lemma 4.17] that shows that under the conditions above, Γ is as a low-degree polynomial (with even stronger conditions). Formally, they showed:

Lemma 31 (Faithful Composition).

In the case discussed above, the structure of Γ is as follows:

Γ(z1,,zc)=α[p1]cCαi=1cziαi

where Cα=0 whenever i=1c(αideg(Pi))>d.
In other words, this means that Γ as a function Γ:𝔽c𝔽, is a polynomial of degree d, even when substituting its i-th input by any polynomial of degree deg(Pi).

Next, we restate the regularization process, that was first presented by [28, Lemma 2.3], with the second part of the lemma presented in [28, Lemma 9.3] (a statement the combines the two can be found [32, Lemma 7.29]).
We begin with a definition:

Definition 32.

Let 𝒫=(P1,,Pc) be a collection of polynomials of degree d that defines a factor . Define M()(Md,,M1)d, where Mi denotes the number of polynomials in 𝒫 that have degree exactly i. Thus, i=1dMi=c.
We define the lexicographical order on d where M>M if and only if Mi>Mi for some 1id, and Mj=Mj for all j>i.

The regularization process shows that every factor have a high-rank factor that semantically refines it, without increasing the size of the factor too much (its new size is independent of n).

Lemma 33 (Regularization in 𝔽n).

Let r: be a non-decreasing function and let d. There exists Cr,d33: such that the following holds: Let be a factor on 𝔽n defined by polynomials 𝒫=(P1,,Pc) where for all i[c]: Pi:𝔽n𝔽 and deg(Pi)d Then, there is an r-regular factor defined by polynomials 𝒬=(Q1,,Qc) where for all i[c]: Qi:𝔽n𝔽 such that sem, M()M() and cCr,d33(c).
Moreover, if syn¯ for some polynomial factor ¯ with rank at least r(c)+c+1, then we can require that syn¯.

 Note.

Note that in the definitions above implicitly assume there are no constants in the collections of polynomials we discuss (no polynomials of degree 0). This is a valid assumption as we are interested in the set of functions that are measurable in respect to the collections, and this property is unaffected by constant polynomials in the collection. Therefore, we can always assume there are no such polynomials in any collection we consider in this context.

3 Polynomials in 𝑿~

In this section we wish to generalize the definition of degree-d polynomials for functions f:X~𝔽. Note that we wish to define it using a property of f that is intrinsic to X~: given a function f:X~𝔽, we wish be able to determine its degree only using values of X~, without considering any value outside of X~ (such as values of 𝔽nX~).
To define such property, we generalize the local definition of a degree that is defined for polynomials in 𝔽n. We remind the reader that in 𝔽n, we said a function over 𝔽n is a polynomial of degree d if and only if its (d+1)-derivative in every direction is 0. Thus, in order to determine the (d+1)-derivative of a function in directions y1,,yd+1, one needs to evaluate the function over all the points of the cube generated by x,y1,,yd+1, which is the set of points {x+iSyi}S[d+1]. This raises a challnge in extending this definition for functions defined over X~𝔽n: depending on X~, the function f:X~𝔽 is not be defined to all points in all the cubes of 𝔽n, because some of those points do not lie in X~.
Therefore, to generalize the definition of a polynomial to X~, we start by giving the formal definition and notation of the set of cubes in X~:

Definition 34 (Cubes).

Let k be an integer and let x,y1,,yk𝔽n. We define the cube (x|y1,yk) as follows:

(x|y1,,yk){x+iSyi}S[k]

We refer to x as the offset of the cube, and y1,,yk as the directions of the cube.
Moreover, let X~𝔽n be a subset. We define the set of cubes of X~ of size k as follows:

Ck(X~){(x|y1,,yk)|S[k]:(x+iSyi)X~}

Using this definition, we can define a polynomial of degree d for subsets of 𝔽n:

Definition 35 (Polynomials in X~).

Let d be an integer, and let X~𝔽n. We say the degree of a function f:X~𝔽 is d if d is the smallest integer such that f vanishes over all cubes of size (d+1), i.e:

(x|y1,,yd+1)Cd+1(X~):Dyd+1Dy1p(x)=0

A function over X~ of degree d is also called a polynomial of degree d over X~. We sometimes also refer to such functions as polynomials in X~, and use the two interchangeably. We denote the set of polynomials of degree d over X~ by Polyd(X~𝔽).

 Note.

For X~=𝔽n, the definition above coincides with the local definition of polynomials.

3.1 Lifting Polynomials

Our goal to achieve good properties for polynomials over X~. To do so, we wish to connect the desired properties of polynomials defined over X~, to properties known for polynomials over 𝔽n. Following such strategy raises a question: given a polynomial p:X~𝔽, which polynomial over 𝔽n should we consider to deduce properties of p? To find such a polynomial over 𝔽n, it would have been useful that all polynomials over X~ actually “came from“ polynomials over 𝔽n. More formally, it would have been useful that all polynomials p:X~𝔽 would be equal to a restriction of some polynomial P:𝔽n𝔽 of degree d, to the set X~. This would give us a “good candidate“ (or candidates) to polynomials over 𝔽n, that using their known properties, we could achieve the properties we desire for polynomials over X~.
Generally speaking, the existence of such polynomial P:𝔽n𝔽 is not trivial by itself, and it mapy depend on the polynomial p and the set X~. In this subsection, we discuss sets X~𝔽n that have this property for every polynomial p:X~𝔽. Before formulating the notion above, we start by a simple remark:

 Remark 36.

By the local criteria for 𝔽n, we have that a restriction of a polynomial of degree d over 𝔽n to X~ is a polynomial of degree d over X~. Therefore, the other direction is true: every restriction of a polynomial over 𝔽n to X~ is a polynomial over X~.

Next, let us define subsets X~𝔽n that have the desired property, which we call d-lift-enabler variety.

Definition 37 (d-lift-enabler Subset).

Let 𝔽 be a field, and n>0 be an integer. For an integer d>0, we say a subset X~𝔽n is d-lift-enabler if for every dd, for every polynomial pPolyd(X~𝔽) there exist a polynomial p^Polyd(𝔽n𝔽) such that p|X~=p^|X~.

 Remark 38.

Using the local criterion of polynomials and the fact that that Cd+1(X~)Cd+1(𝔽n), one can see that for a polynomial p:X~𝔽 with deg(p)=d, every extension P:𝔽n𝔽 with p=P|X~ holds the bound deg(p^)d. The other direction is not true in the general case, but it is specifically promised when the variety is d-lift-enabler.

This definition naturally raises the following definition:

Definition 39 (The Lift Operator).

Let d be an integer. Let X~𝔽n be a d-lift-enabler subset. We define the d-lift operator to be an operator ^:Polyd(X~𝔽)Polyd(𝔽n𝔽) the following way:
Let dd. Given a polynomial p:X~𝔽 of degree d, the operator ^ returns a polynomial p^:𝔽n𝔽 of degree d such that p=p^|X~. Note that we did not require the lift to be unique. Thus, in case there are multiple valid lifts for a polynomial pPolyd(X~𝔽), the lift operator picks a single (consistent) one of them. Moreover, the lift always exists because the subset X~ is d-lift-enabler.
In addition, for a collection 𝔭=(p1,,pc) of polynomials piPolyd(X~𝔽), we denote 𝔭^(p1^,,pc^)

In the following subsections, we give example to two concrete sets X~𝔽n that are d-lift-enablers. Before doing so, we define an algebraic variety:

Definition 40 (Algebraic Variety).

For a collection of functions 𝔉{F1,Fc} such that Fi:𝔽n𝔽, we denote Z(𝔉){x𝔽n|i:Fi(x)=0}.
If the collection is a collection of polynomials, we call Z(𝔉) an algebraic variety, shorthand by variety.
The degree of the variety which is a complete intersection is the product of the degrees of the polynomials in the collection that defines it.

3.2 High Rank Varieties of High Minimal Degree

We now present a theorem proved in [34, Corollary 1.10], that shows that high rank varieties are d-lift-enabler when the polynomials defining the variety are of degree >d:

Theorem 41.

Let 𝔽 be a finite field, and let d~, c~>0 representing parameters of a variety. Let d<d~ a positive integer representing a degree of a polynomial which we wish to lift. There exists r¯=r¯(𝔽,d~,c~)>0 such that for all n, any variety X~=Z(~)𝔽nfor ~=(L1,,Lc~) which is a complete intersection such that rank(~)>r¯ 292929The definition of rank used in thier proof is slightly different than our definition of rank. This is addressed in Appendix B. , degree deg(~)=d~, with all defining polynomials of degree deg(Li)>d, it holds that X~ is a d-lift-enabler subset.

 Remark 42.

Under the conditions stated above, it was proved in  [34] that the lift is in fact unique. Formally, if p:X~𝔽 is a polynomial of degree d, then there exists a unique polynomial P:𝔽n𝔽 such that P|X~p.

3.3 High Rank Varieties on a Large Field

In this subsection, we recall a theorem proved by [35, Theorem 1.7] regarding high rank varieties that are defined on “large” fields. We note that the fields are large in respect of the degree d one wish to lift, but still does not depend on n.

Next we define a weakly polynomial, which generalizes our definition of a polynomial in X~, that was used in [35, Definition 1.1]:

Definition 43.

Let X~𝔽n be a set. We say a function F:X~𝔽 is a weakly polynomial of degree d if for any affine subspace LX~, the restriction F|L is a polynomial of degree d.

 Remark 44.

By the local criteria of a polynomial, it is easy to see that every PPolyd(X~𝔽) is a weakly polynomial of degree d.

And now, we can present the lifting theorem for large fields, as proved in [36, Theorem 2.17].

Theorem 45 ([36, Theorem 2.17]).

Let d,d~, and let 𝔽 be a finite field such that |𝔽|>dd~. There exists r45=r45(d~,d) such that for any variety X~𝔽n of degree d~ which is a complete intersection, defined by a collection of polynomials with rank 303030The definition of rank used in thier proof is slightly different than our definition of rank. This is addressed in Appendix B. r45, have the following property: Every weakly polynomial function p:X~𝔽 of degree d can be lifted to a polynomial function P:𝔽n𝔽 of degree d.

 Note.

Note that we stated the theorem above to finite fields, but it is also valid for infinite algebraically closed fields.

The theorem above implies the following corollary:

Corollary 46.

Let d,d~, and let 𝔽 be a finite field such that |𝔽|>dd~. There exists r45=r45(d~,d) such that for any variety X~𝔽n of degree d~ and rank r45 is a d-lift-enabler.

4 Relative Rank-Bias Property

In this section, we generalize the relation between rank and bias that is known for 𝔽n also for X~𝔽n. Specifically, in Theorem 23, it was shown that high rank factors have low bias in 𝔽n. We wish to define an alternative definition of rank for X~𝔽n, called X~-relative rank, such that high X~-relative rank implies low bias in X~. This type of relation (and definition) was shown previously to a few sets; in [37, Theorem 1.8] for sets X~=Z(𝒬) where 𝒬 is a collection of polynomials of high rank; and in [27, Theorem 1.4] for sets X~=Sn for S𝔽.

To understand this notion, we first introduce a simple example that demonstrates the need for a different definition of rank to achieve equidistribution properties in subsets of 𝔽n.

Example 47.

Let X~={x𝔽n|x1=0}. Define P:𝔽n𝔽 by P(x)x1.
In 𝔽n, P has rank as it can not be decomposed polynomials of degree <1 (constants). Additionally, it is perfectly equidistributed. This is the simplest example of the rank-bias relation in 𝔽n.
However, when restricting P to X~, we get P|X~0. As 0 is a constant function, it is the least equidistributed possible in X~. Therefore, we see that the way we defined rank in 𝔽n does not imply the desired equidistribution in X~: we found a polynomial with high rank (infinity) that has a very high bias in X~ (the maximal).

The reason the definition of rank in 𝔽n fails to capture equidistribution even on subsets that are really similar to 𝔽n (isomorphic to 𝔽k), is because of the following reason: Even though our polynomial P does not have a decomposition to a few lower-degree polynomials by itself, there exists a X~-equivalent polynomial that has such structured decomposition. Here, by X~-equivalent we mean a polynomial in 𝔽n that is bounded by the same degree bound, and is equal to P in X~. In the example described above, this equivalent polynomial is the constant function 0, and its decomposition is the trivial one (any function decomposes a constant function). An alternative perspective which we use throughout this paper to X~-equivalence is that both polynomials are equal up to a valid X~-remainder: a bounded degree polynomial that is 0 in X~.

Generally speaking, high X~-relative rank may not imply low bias in X~. Therefore, this structure-randomness relation is not true for a general subset X~𝔽n, but is a property of the subset X~. Thus, we say that a subset has the relative rank-bias property if this relation holds, i.e. if high X~-relative rank implies equidistribution in X~.

Let us now formally define our definition for relative rank, inspired by the two different definitions of relative rank presented in [37, Definition 1.6] and in [27, Definition 1.3]:

Definition 48 (Relative Rank of a Polynomial).

Let X~𝔽n and let d. For an integer d and a polynomial P:𝔽n𝔽, we define its d-relative rank in respect of X~ as:

rankd,X~(P)min{rankd(PPmissing¯)|Pmissing¯Polydeg(P)(𝔽n𝔽),Pmissing¯|X~0}

For a polynomial P of degree deg(P)=d we denote rankX~(P)rankd,X~(P).

Definition 49 (X~-equivalent and X~-remainder).

Moreover, we say a polynomial is X~-equivalent to P if its restriction to X~ is P|X~. We say it is valid X~-equivalent to P if it is X~-equivalent to P and it is of the same degree of P.
Similarly, we say a polynomial is X~-remainder if its restriction to X~ is 0. We say it is valid X~-remainder P if it is X~-remainder and it is of degree smaller or equal of the degree of P.
We typically denote such polynomial as Pmissing¯.

In other words, the d-relative rank of a polynomial P is the smallest d-rank of all valid X~-equivalents of P.

 Note 50.

Note that [27, Definition 1.3] defines rank in a substantially different way than our definition, and consequentially our results will not apply to the sets they presented. One of the main differences in the definition of rank occurs for d=1. In the definition we use for rank, the rank of every (non-constant) degree-1 polynomial is , where in the definition used in [27] it is a finite number (which is possibly very small). This difference is crucial, as for example, it makes regularization according to their definition not-trivially possible, where it is known to be possible when rank is defined by the definition we use (Lemma 33).

Definition 51 (Relative Rank of a Factor).

Let X~𝔽n. Let 𝒫 be a set of polynomials 𝒫={P1,,Pc}. The rank of the polynomial set 𝒫 relative to the subset X~ is defined as:

rankX~(𝒫)min{rankd,X~(i=1cλiPi)|0λ𝔽c,d=maxi[c]deg(λiPi)}

For a factor defined by a collection of polynomials, we define its relative rank relative to X~ to be the relative rank of the collection of polynomials defining it, relative to the set X~.
For a non-decreasing function r:, a factor is called r-X~-regular if its relative rank in respect to X~ is at least r(||).

4.1 Relative Rank-Bias Property

Definition 52 (Relative Rank-Bias property).

Let 𝔽 be a finite field, and let d be an integer. Let r~:+ be a function that represents the rank-bias relation for a fixed d,𝔽.
We say a set X~𝔽n has the (r~,𝔽,d)-relative rank-bias property if for every ϵ>0, for every polynomial P of degree d with rankX~(P)r~(ϵ) we have:

biasX~(P)<ϵ

As an immediate corollary of Theorem 23 that shows that high rank implies low bias, we have that X~=𝔽n has the relative rank-bias property.

Corollary 53 (𝔽n has the relative rank-bias property).

For every finite field 𝔽 and d, let r~:+ defined as r~(ϵ)r23(𝔽,d,ϵ). Then, we have that the set X~=𝔽n has the (r~,𝔽,d)-relative rank-bias property.

Proof.

This is a simple usage of Theorem 23: Note that when X~=𝔽n, we have that rankX~(P)=rank(P). Now, if P is a polynomial of degree d and rankX~(P)=rank(P)r~(ϵ)=r~23(𝔽,d,ϵ), then:

biasx𝔽n(P(x))<ϵ

4.2 Limited-Relative Rank-Bias Property

Sometimes, however, we can not request X~ to be such that high relative rank implies low bias for every ϵ>0, but only for ϵϵ for some constant ϵ>0. This leads to defining the limited relative rank-bias property, which will be used to discuss such sets X~𝔽n.
As we will later see, this definition raises naturally where X~ is a high rank variety, in which for the relative rank-bias property to hold for some ϵ>0, the rank of the variety should be greater than a value that is dependent of ϵ. Thus, to have the relative rank-bias property for a high rank variety but without requiring an infinitely large rank, we must limit the relative rank-bias property for ϵϵ We formulate the definition of this property as follows:

Definition 54 (Limited Relative Rank-bias property).

Let 𝔽 be a finite field, let d be an integer, and let ϵ>0 be a constant. Let r~:[ϵ,] be a function that represents the limited-relative-rank-bias relation.
We say a set X~𝔽n has the (r~,𝔽,d,ϵ)-limited-relative-rank-bias property if for every ϵϵ, for every polynomial P of degree d with rankX~(P)r~(ϵ) we have:

biasX~(P)<ϵ

As a convention, we denote by ϵ~ the ϵ such that the limited-relative-rank-bias property holds for X~.

4.2.1 High Rank Varieties

In this subsection, we are discussing specifically X~𝔽n that are in the form X~=Z(𝒬) for a set of polynomials 𝒬 that form a high rank factor. Let us present some known results of the relative rank-bias relation for high rank varieites: In the scenario when we are working relative to X~, the equivalent for Theorem 23 is also known when we assume d<char(𝔽), as shown in [37, Theorem 1.8]:

Theorem 55 (High relative rank implies low bias in high rank varieties).

Let 𝔽 be a finite field and let 0d<char(𝔽). Let ϵ>0 be a constant, and let c~. There exist r¯55=r¯55(𝔽,d,c~,ϵ) and r55=r55(𝔽,d,ϵ) such that the following holds:
Let ~=(L1,,Lc~) be a collection of polynomials of degrees d with rank(~)r¯55 and let P be a polynomial of degree d.
Then, if rankX~(P)r55, we have:

biasX~(P)<ϵ
 Note.

Note that the original statements in [37] are stated for a different definition of rank, noted as schmidt rank. In the appendix B we compare the two different definitions, and show that our definition of rank is comprehensive enough in a sense that a polynomial with high rank also has high schmidt rank. Additionally, we show that for a given r, the lower bound of rank required for a polynomial to be of schmidt rank r, is only cr for some constant c.

 Remark 56.

Note that in the original statement of theorem 55 as stated in [37, Theorem 1.8], there are good bounds on the rank needed for ~ and P for the theorem to hold.
Specifically, there exist constants A(d),B(d) such that for an error ϵ=|𝔽s|, if r¯55=A(c~+s)B and r55=A(1+s)B, then we have:

biasZ(~)(P)<|𝔽|s

In our proof, it is enough that the bounds on r and r¯ are independent of n, thus we omit the exact bounds stated above and use the statement as stated in Theroem 55.

 Remark 57.

Note that both r55(𝔽,d,ϵ) and r¯55(𝔽,d,c~,ϵ) are decreasing when ϵ is increasing. This means for example, that for all ϵϵ, a variety that satisfies the theorem’s rank condition for ϵ also satisfies the theorem’s rank condition for ϵ. Therefore, a polynomial with rank r55(𝔽,d,ϵ) will have a bias <ϵ.

As a corollary of Theorem 55 and Remark 88, we have that high rank varieties has the limited-relative-rank-bias property. Formally, we have:

Corollary 58 (High Rank Varieties Have the Limited-Relative Rank-Bias Property).

Let 𝔽 be a finite field, and let d~ such that 0<d~<|𝔽|. Let ϵ~>0 be a constant which represents the desired relative rank-bias limit. There exists r~58:[ϵ~,] with r~58r~58(𝔽,d~) such that the following holds:
Let c~ be an integer. There exists r¯58r¯58(𝔽,d~,c~,ϵ~) such that for every ~=(L1,,Lc~) collection of polynomials with rank(~)r¯, defining a variety X~=Z(~) of degree d~ which is a complete intersection, we have:
The variety X~ has the (r~58,𝔽,d~,ϵ~)-limited-relative-rank-bias property.

Proof.

Let 𝔽 be a finite field, and let d~ such that 0<d~<|𝔽|. Let ϵ~>0. We choose:

r~58(ϵ)r55(𝔽,d~,ϵ)

Note that for every ϵ in its domain, r~58 does not depend on ϵ~. Let c~. Now, we choose:

r¯58(𝔽,d~,c~,ϵ~)r¯55(𝔽,d~,c~,ϵ~)

Using Theorem 55 that shows high rank implies low bias in X~, and the assumption that ϵϵ~ (specifically Remark 57) concludes the proof.

5 Regularization Relative to 𝑿~

In this section, we generalize the definitions and statements regarding factors and regularization in 𝔽n, to their corresponding definitions and statements to relative rank in respect of X~𝔽n.
Note that in oppose to the previous chapter that we discussed a general U and AU, in this chapter we discuss only U=A=𝔽n. This is done for clearance and to avoid defining definitions we will not use in our main proof.

Definition 59 (Measurable Relative to X~).

Let 𝔉={F1,,Fc} be a set of functions Fi:𝔽n𝔽. We say a function G:𝔽n𝔽 is measurable in respect of 𝔉 relative to X~, or X~-relative 𝔉-measurable, if there exists a function G¯:𝔽n𝔽 with G¯|X~0 and a function Γ:𝔽c𝔽 such that:

a𝔽n:G(a)=Γ(F1(a),,Fc(a))+G¯(a)

And:

deg(GG¯)deg(G)

We sometimes refer to Γ as the X~-relative measurement function.

 Note.

Note that if deg(GG¯)deg(G) as discussed above, then the same bound also bounds the degree of the remainder, i.e. deg(G¯)deg(G). Therefore G¯ is a valid X~-remainder of G. Moreover, this requirement is equivalent to the definition above, as if deg(G¯)deg(G), then we also have deg(GG¯)deg(G).

 Note.

Also note that without the bound on the degree of the remainder, being measurable relative to X~ is in fact equivalent for being a measurable in A=X~. This is true because under these conditions, the remainder G¯ has no constraints but G¯|X~0, thus the condition left on the measurement is just being a measurement to G in X~.

 Remark 60.

If G is a function that it is 𝔉-measurable relative to X~, then every value of G can be determined by the values of F1,,Fc up to a remainder G¯ of degree d. Thus, perhaps we do not know that the function G is constant inside every atom of 𝔉 as in a regular semantic refinement, but we do know that there exists a function (GG¯) that equals to G on X~, is constant on every atom of 𝔉 and it is a function with a bounded degree i.e. deg(GG¯)deg(G).

Next, we present a new type of refinement, which is a relaxation of semantic refinement. This relaxation will allow us to discuss the corresponding claim of the polynomial regularity lemma (Lemma 33) for relative rank (instead of rank).

Definition 61 (Semantic Refinement Relative to X~).

Let and be polynomial factors on 𝔽n, defined by sets of polynomials 𝒫,𝒫 respectively, and let d. We say a factor is a semantic refinement relative to X~ of the factor , or X~-relative semantic refinement, if the following holds: Every function F:𝔽n𝔽 that is 𝒫-measurable relative to X~, is also 𝒫-measurable relative to X~. If the definition above holds, we denote semX~.

 Note.

It is easy to see that this relation is transitive, i.e. if semX~ and ′′semX~, then ′′semX~.

 Remark 62.

In X~, semantic refinements relative to X~ behave the same as regular semantic refinements in the perspective of being measurable: every function that is 𝒫-measurable in X~ is also 𝒫-measurable in X~. However, the two definitions behave differently in the perspective of being measurable in 𝔽n. Specifically, in relative semantic refinements, if G is a 𝒫-measurable function it is not necessarily 𝒫-measurable. However, it is measurable up to a remainder G¯ of degree deg(G) such that G¯|X~0.

Corollary 63.

If semX~, then in X~ it is a regular semantic refinement, i.e. sem|X~.

Next, we present a new regularization process that allows us to increase the relative rank of a factor without increasing the size of the factor too much (independent of n). This regularization process generalizes the regularization process in 𝔽n, that we stated in Lemma 33. We call this type of regularization process a relative-regularization process relative to X~, shorthand by X~-regularization For a specific function r, we will sometimes call applying this lemma a r-X~-regularization. Note that to allow such a relative-regularization process to hold, we must use the relaxed definition of semantic refinement that is presented above.

Theorem 64.

Let r: be a non-decreasing function and let d. There exists Cr,d64: such that the following holds: Let be a factor defined by polynomials 𝒫=(P1,,Pc) where for all i[c]: Pi:𝔽n𝔽 and deg(Pi)d. Then, there is an r-X~-regular factor defined by polynomials 𝒫=(P1,,Pc) where for all i[c]: Pi:𝔽n𝔽 and deg(Pi)d such that semX~ and cCr,d64(c).
Moreover, if syn¯ for some polynomial factor ¯ with relative rank of at least r(c)+c+1 and rank of at least r30(𝔽,d,c)+c+1, then we can require that syn¯.

Proof.

We follow the lines of the proof given by [32][Lemma 7.29], but here, we wish to increase the relative rank of the factor instead of its rank. We present an iterative process, which will eventually lead us to a factor of size c with relative rank higher than r(c), that is a semantic refinement relative to X~. Let d, and let be a polynomial factor defined by 𝒫=(P1,,Pc) such that Pi:𝔽n𝔽 of degree d. We remind the reader definition 32, where we defined M()(Md,,M1)d, where Mi denotes the number of polynomials in 𝒫 that have degree exactly i, and the lexicographical order on d where M>M if and only if Mi>Mi for some 1id, and Mj=Mj for all j>i. This proof will be by transfinite induction on M under the lexicographical order. Next we describe a step of the regularization process.
Let be a polynomial factor defined by 𝒫=(P1,,Pc). Note that this is an abuse of notations: the factor and the set 𝒫 refer to the original factor in the first step, and also to the current factor in the middle of the relative-regularization process. If is r-X~-regular, then we are done. Otherwise, we change as follows: First, we denote r30𝔽,d(c)r30(𝔽,d,c), and we r30-regularize 𝒫 using lemma 33 to get a set of polynomials 𝒫1=(P11,,Pc11) of degree d, which defines a factor 1 and has a rank r30𝔽,d(c1). Note that M(1)M(). Then, again, if somehow 1 is now r-X~-regular, we are done.
Otherwise, by definition, there exists some linear combination of the polynomials in 𝒫1 that has d-relative rank less than r(c1), where d is the maximal degree that participates in the linear combination. Let P(x)=i=0c1λiPi1(x) where 0λ𝔽c1, be the linear combination with rankd,X~(P)r(c1) where dmaxi[c1]deg(λiPi1). By definition of relative rank, there exists Pmissing¯Polydeg(P)(𝔽n𝔽) with Pmissing¯|X~0 such that rankd(PPmissing¯)r(c1). Note that deg(Pmissing¯)d. By definition of d-rank, we have that we can decompose PPmissing¯ as a function of r(c1) polynomials of degree d1. In other words, there exist a measurement function Γ:𝔽r(c1)𝔽 and polynomials Q1,,Qr(c1) with deg(Qi)d1 such that:

a𝔽n:P(a)Pmissing¯(a)=Γ(Q1(a),,Qr(c1)(a))

Now, let 𝒫𝒫1 be the set of all such maximal-degree polynomials, and let i be chosen such that Pi1𝒫. Note that the set 𝒫 is non empty, as by definition, d is the maximal degree of polynomial in the expression i=1c1λiPi1 such that λi0.
For the next step, define the polynomial factor 2 be the polynomial factor defined by the set:

𝒫2𝒫1{Pi1}{Q1,,Qr(c1)}

Finally, the factor 2 will be the factor returned from the relative-regularization step.
It is easy to see that if the process above halts, we get a r-X~-regular factor. Now, we prove the first part of the lemma by showing the following claims:

Claim 65.

The factor generated from the regularization above is of bounded size: a bound that may depend on r,d,c, but does not depend on n. Formally, we claim that there exists Cr,d64: such that we have cCr,d64(c).

Proof.

It is enough to prove the following:

  1. 1.

    In each step, the amount of polynomials there are in 𝒫1,𝒫2 are bounded by a bound that depend only on r,d,c (independent of n).

  2. 2.

    The number of steps of the relative-regularization process is also bounded by a bound that depends only on r,d,c (independent of n).

The combination of these two will obtain the desired bound of the amount of polynomials in the last-step regularized factor, which is Cr,d64(c). Note that the bound on the last-step relative-regularized factor in not simply the multiplication of the two bounds, but a recursively-substitution of the bound in 1, a bounded amount of times (bounded by the bound in 2).
For 1, we first notice that the number of polynomials in the regular regularization process is bounded, specifically we have |𝒫1|=c1Cr30,d33(c). Moreover, the polynomial factor 2 is generated by adding at most r(c1) polynomials to the factor, and thus we have |𝒫2|c1+r(c1) which is also bounded by substituting the bound on c1.
For 2, we use the transfinite induction on M we mentioned earlier to show that the process must halt after a bounded number of steps. Formally, we show that there exist M which depends only on M() such that M(2)M<M(). This will bound the number of steps by a value that depend only on M(), which depends only on r,d,c. To do so, we first notice that the regular regularization does not increase the value of M, i.e. M(1)M(). Thus, we can focus on the second part of the relative-regularization. In this part, we replace a single degree d polynomial by at most r(c1) polynomials of degree d1. Therefore, by choosing M(Md,,Md+1,Md1,Md1+r(c1),,M1+r(c1)) we get that M(2)M<M(1)M(), which concludes 2.

Claim 66.

The factor generated from the regularization above is a X~-relative semantic refinement of the original factor, i.e. semX~.

Proof.

It is enough to show that in each step, the factors generated by the relative-regularization process are semantic refinements relative to X~ of the previous step’s factor. Specifically, we show 2semX~1semX~ and the claim will follow from transitivity of relative semantic refinements.
We start by proving 1semX~. Let F:𝔽n𝔽 be a function that is 𝒫-measurable relative to X~. We denote dFdeg(F). By definition, there exists Γ:𝔽c𝔽, Fmissing¯:𝔽n𝔽 where deg(Fmissing¯),deg(FFmissing¯)dF and Fmissing¯|X~0, such that:

a𝔽n:F(a)=Γ(P1(a),,Pc(a))+Fmissing¯(a)

Clearly, the function Γ(P1(a),,Pc(a)) is 𝒫-measurable in 𝔽n, and because we have sem1, it is also 𝒫1-measurable in 𝔽n. Thus there exists Γ1:𝔽c1𝔽 such that:

a𝔽n:F(a)=Γ1(P11(a),,Pc11(a))+Fmissing¯(a)

And therefore we have 1semX~.
Now, we prove 2semX~1. Let F:𝔽n𝔽 be a function that is 𝒫1-measurable relative to X~. Again, we denote dFdeg(F), and by definition there exists Γ1:𝔽c1𝔽, F1¯:𝔽n𝔽 where deg(FF1¯)dF and F1¯|X~0, such that:

a𝔽n:F(a)=Γ1(P11(a),,Pc11(a))+Fmissing¯1(a) (1)

Note that we also have deg(F1¯)dF. We will refer this equation, and its simplifications we do throughout the proof, as the 𝒫1-decomposition of F.
We wish to show that there exists Γ2:𝔽c2𝔽 and Fmissing¯2:𝔽n𝔽 where deg(FFmissing¯2)dF and Fmissing¯2|X~0, such that:

a𝔽n:F(a)=Γ2(P11(a),Pi11(a),Pi+11(a),,Pc1(a),Q1(a),,Qr(c1)(a))+Fmissing¯2(a)

We will do so using the 𝒫1-decomposition of F. Note that showing deg(FF2¯)dF is equivalent of showing deg(Fmissing¯2)dF.
First, by the way we built 𝒫2, using the same notations in the regularization step, we have:

a𝔽n:Pi1(a)=Γ(Q1(a),,Qr(c1)(a))+Pmissing¯(a)iiλiPi1(a)

Next, we substitute the value of Pi1 in the 𝒫1-decomposition of F (1), and get another decomposition of F that does not depend on Pi1. Specifically we have:

a𝔽n:F(a) =Γ1(P11(a),,(Γ(Q1(a),,Qr(c1)(a))+Pmissing¯(a)iiλiPi1(a)),,Pc11(a)) (2)
+Fmissing¯2(a) (3)

We wish to use the equation above to show that F is 𝒫2-measurable relative to X~. However, in order to show that the equation above is in the desired structure that proves that F is 𝒫2-measurable, the expression inside Γ1 must not depend on Pmissing¯ because Pmissing¯𝒫2. Note that this is enough as the rest of the polynomials in the expression above are in 𝒫2, and therefore without Pmissing¯ the expression is 𝒫2-measurable.
To do so, we start by simplifying some of the notations. We denote:

P2(a)Γ(Q1(a),,Qr(c1)(a))iiλiPi1(a)

This is the part of the sum that decomposes Pi1(a) that is 𝒫2-measurable, thus the following equality applies:

Pi1(a)=P2(a)+Pmissing¯(a)

where deg(P2),deg(Pmissing¯)d. Using this notation, we write the 𝒫1-decomposition of F (2), and get:

a𝔽n:F(a)=Γ1(P11(a),,(P2(a)Pmissing¯(a)),,Pc11(a)))+Fmissing¯1(a) (4)

Now, we use the following key observation: rank(𝒫1)r30(𝔽,d,c1), and as deg(Γ1(P11,,Pc11))dF we can use Lemma 30 to achieve that Γ1 is a polynomial of the form:

Γ1(z1,,zc1)=α[p1]c1Cαi=1c1ziαi ()

where Cα=0 whenever i=1c1(αideg(Pi1))>dF.
Next, we substitute the polynomial structure of Γ1 (5) in the 𝒫1-decomposition of F (4), and observe what happens to each summand monomial with non-zero coefficients of Γ1 in the expression after the substitution.
We will show that each such monomial is either 𝒫2-measurable, or a sum of a 𝒫2-measurable function with a valid X~-remainder, i.e. a polynomial of degree dF that is 0 in X~. Note that if this is true for each monomial, every linear combination of such monomials is also a sum of 𝒫2-measurable function with a valid X~-remainder. Thus, this will also be true for the entire decomposition of F, as it is a linear combination of such monomials summed with a valid remainder Fmissing¯1. This will conclude the proof.
Let α=(α1,,αc1) be a vector of degrees that represents such a monomial. If αi=0, then the monomial is in the form:

i[c1]Piαi=i[c1]{i}Piαi

and therefore it is clearly 𝒫2-measurable as all the polynomials in the expression above are in 𝒫2.
Next, if αi0, then the monomial is in the form:

i[c1]Piαi=(P2+Pmissing¯)αi(i[c1]{i}Piαi) (5)

where i[c1](αideg(Pi1))dF. As deg(P2+Pmissing¯)=deg(Pi)=d, we have:

deg(i[c1]{i}Piαi)=i[c1]i(αideg(Pi1))dFαid

Now, we open the left brackets in (5), i.e. (P2+Pmissing¯)αi. This enables us to separate the monomial to the part that only depend on P2 summed with a polynomial with bounded degree multiplied by Pmissing¯ (and therefore a valid remainder). To be more specific, the monomial is in the form:

(P2+Pmissing¯)αi=P2αi+Pα¯

for some polynomial Pα¯ such that:

  1. 1.

    Pα¯ is of degree deg(Pα¯)max{deg(P2),deg(P)¯}αiαid

  2. 2.

    Pα¯ is a multiple of Pmissing¯, and therefore Pα¯|X~0

Therefore, by substituting the left brackets back to the equation (5) and as P2 and Pi for ii are 𝒫2-measurable, one can see that the monomial is a sum of a 𝒫2-measurable polynomial with a valid remainder. Specifically, the remainder 0 in X~, and its degree is αid+dFαid=dF. This concludes the proof of the claim. Now, it remains to prove the second part of the Theorem 64.

Claim 67.

If syn¯ for some polynomial factor ¯ with relative rank of at least r(c)+c+1 and rank of at least r30(𝔽,d,c)+c+1, then we can require that syn¯.

Proof.

We will show claim step-by-step. We denote by 𝒫,𝒫¯,𝒫1,𝒫2 the polynomial sets that generate the factors ,¯,1,2. Note that 1,2 are the factors in the current step of the regularization process, and thus change in each step of the proof. We show that in each step, if syn¯ for some polynomial factor ¯ with relative rank of at least r(c)+c+1 and rank of at least r30(𝔽,d,c)+c+1, then we can require that 1syn¯, and also that 2syn¯.
For the first part, we have 1syn¯ by a simple usage of the second part of lemma 33, as:

rank(𝒫¯)>r30(𝔽,d,c)+c+1r30(𝔽,d,c1)+c1+1

Now we prove the second part. We show that in the current regularization step, we could replace Pi1𝒫1 such that Pi1𝒫¯. Note that this is possible whenever 𝒫𝒫¯ as the choice of i is arbitrary in polynomials which are in 𝒫.
Assume that is not possible and the factor 𝒫1 is still not r-X~-regular. Then, we have a linear combination P(x)i=0c1λiPi1(x) with rankd,X~(P)r(c1) where d=maxi[c1]deg(λiPi1). We denote by I[c1] the set of indexes of such maximal-degree polynomials. By this notation, our assumption states that for all iI we have Pi1𝒫¯. Additionally, note that for all iI we have deg(Pi1)<d. Therefore, as the linear combination is of d-relative rank r(c1), there exists a polynomial Pmissing¯ of degree deg(P)d with Pmissing¯|X~0 such that rankd(PPmissing¯)r(c1). In other words, there exist a measurement function Γ:𝔽r(c1)𝔽 and polynomials Q1,,Qr(c1) with deg(Qi)d such that:

a𝔽n:P(a)Pmissing¯(a)=Γ(Q1(a),,Qr(c1)(a))

By a simple calculation we have:

a𝔽n:iIPi1(a)Pmissing¯(a)=Γ(Q1(a),,Qr(c1)(a))+iIPi1(a)

and by this we found a linear combination of polynomials in 𝒫¯ with maximal degree d, that has d-relative-rank r(c1)+c1+1. This is a contradiction to our assumptions on ¯, which completes the proof of the claim. This completes the proof of the lemma.

6 Radius of Reed-Muller over 𝑿~

We recall that the normalized distances of Reed-Muller codes over 𝔽n and over X~ are denoted by δ𝔽(d) and δ𝔽,X~(d) respectively. We present a theorem that shows that Reed-Muller codes over a subset X~𝔽n that is d-lift-enabler and has the (limited) relative rank-bias property, has (approximately) an equal normalized distance as Reed-Muller codes over 𝔽n.

Theorem 68.

There exist a function ϵ1(𝔽,d,r~,ϵ~) such that the following holds: Let 𝔽 be a finite field, and let d be an integer that represents a degree. Let ϵ~>0, and let r~:[ϵ~,] be a limited-relative-rank-bias function.
Let X~𝔽n be a set with the following properties

  1. 1.

    X~ is d-lift-enabler with a lift operator ^.

  2. 2.

    X~ has the (r~,𝔽,d,ϵ~)-relative-rank-bias property.

Then, for ϵ1ϵ1(𝔽,d,r~,ϵ~) we have that for all n:

δ𝔽,X~(d)δ𝔽(d)ϵ1
Proof.

We wish to do a reduction of our question regarding the radius of Reed-Muller in X~ to the same question about Reed-Muller in 𝔽n. Let 𝔽 be a finite field, and let d be an integer that represents a degree. Let ϵ~>0, and let r~:[ϵ~,] be a limited-rank-relative-bias function. Let ϵ1ϵ1(𝔽,d,r~,ϵ~) be a function we will specify later. Let X~𝔽n be a set with the properties defined above.
Moreover, let ϵ>ϵ1 be some positive value. We will show that:

δ𝔽,X~(d)>δ𝔽(d)ϵ

This will be enough as if the above holds for every ϵ>ϵ1, we get that in fact δ𝔽,X~(d)δ𝔽(d)ϵ1.
For start, we note a simple observation: as Reed-Muller over X~ is a linear code, we have

δ𝔽,X~(d)=min{PrxX~[p(x)0]|pPolyd(X~𝔽)}

Now, let pPolyd(X~𝔽) be a polynomial over X~, and denote dpdeg(p). We wish to lower-bound the value of PrxX~[p(x)0]. To do so, we will equivalently upper-bound the value of PrxX~[p(x)=0]. Precisely, to complete the proof all we need to show is:

PrxX~[p(x)=0]1δ𝔽(d)+ϵ

Now we begin the proof itself. First, we lift the polynomial p and get a polynomial p^:𝔽n𝔽 such that p^|X~p and deg(p^)=dp. Next, denote by p^ the factor defined by the set of single polynomial 𝒫={p^}. Trivially, the polynomial p^ is measurable in respect of 𝒫.
We define the rank function:

r(m)max{r~(ϵ/2|𝔽|m),r23(𝔽,d,ϵ/2|𝔽|m)}

Then, we r-X~-regularize 𝒫 using Lemma 64. This gives us a r-X~-regular factor , which is defined by a set of polynomials 𝒫{P1,,Pc} of degree d such that semX~p^ with rankX~(𝒫)r and with bounded amount of polynomials defining it i.e,cCr,d64(1). Therefore, from definition we have that p^ is 𝒫-measurable relative to X~. Thus, there exists a measurement function Γ:𝔽c𝔽 and a remainder Γ¯:𝔽n𝔽 with Γ¯|X~0 and degree bounded by dp, such that:

a𝔽n:p^(a)=Γ(P1(a),,Pc(a))+Γ¯(a)

Next, we denote Pp^Γ¯. By definition of remainder function, we have that P|X~p. Additionally, note that P is a polynomial over 𝔽n of degree deg(P)=dpd, and hence by the definition of δ𝔽(d):

Pra𝔽n[P(a)=0]1δ𝔽(d) (6)

For the next step, we claim that P equals 0 in 𝔽n approximately with the same probability it equals 0 in X~. Note that this is the heart of the proof: it allows use properties known in 𝔽n to new properties in X~. This is formulated as follows:

Claim 69.

We have:

|Pra𝔽n[P(a)=0]PrxX~[P(x)=0]|ϵ
Proof.

Denote S𝔽c, and for all sS, denote:

p1(s)Pra𝔽n[(P1(a),,Pc(a))=s]

As of our choice of r, we have rank(𝒫)r23(𝔽,d,ϵ/2|𝔽|c). By combining Theorem 23 with Lemma 81, we have that p1 is (ϵ/2|S|)-equidistributed, i.e:

p1(s)=1±ϵ/2|S|

Similarly, denote:

p2(s)PrxX~[(P1(x),,Pc(x))=s]

As of our choice of r, we have rankX~(𝒫)r~(ϵ/2|S|). Now, we wish to use the relative rank-bias relation with Lemma 81 to conclude similarly that p2 is (ϵ/2|S|)-equidistributed, i.e:

p2(s)=1±ϵ/2|S|

However, in order to so, we must first ensure that (ϵ/2|S|)ϵ~. This is done by choosing a correct ϵ1, and formulated in the following claim:

Claim 70.

One can choose ϵ1ϵ1(𝔽,d,r~,ϵ~) such that if ϵϵ1 we have that ϵ/2|S|ϵ1.

Proof.

We need that:

ϵ2|𝔽|cϵ~

As cCr,d64(1), for the term above to hold it is enough that the following will be true:

ϵϵ~2|𝔽|Cr,d64(1)

and as r and thus also Cr,d64(1) are independent of n, we can pick ϵ1=ϵ1(𝔽,d,r~,ϵ~) and get what we aimed for. Now, under that assumption of ϵ1 written above, we have that p2 is (ϵ/2|S|)-equidistributed. This allows us to use the similar distributions of 𝒫 in 𝔽n and in X~ to conclude that P behaves similar in 𝔽n and in X~:

Pra𝔽n[P(a)=0] =sSp1(s)1Γ(s)=0
=sSp2(s)1Γ(s)=0±ϵ
=PrxX~[P(x)=0]±ϵ

which concludes the proof of the claim.

Finally, as P|X~p, we have that PrxX~[P(x)=0]=PrxX~[p(x)=0]. Thus, the claim above combining with (6) shows that the probability we wished to bound is bounded as we aimed for:

PrxX~[p(x)=0]1δ𝔽(d)+ϵ

This concludes the proof of the theorem.

 Remark 71.

Under the same conditions, the distance of Reed-Muller codes in X~ is also bounded from above by the distance of Reed-Muller codes in 𝔽n, and we have:

δ𝔽,X~(d)δ𝔽(d)+ϵ1
Proof.

Let P:𝔽n𝔽 be the polynomial in 𝔽n with the smallest distance from 0 as possible, that is δ𝔽(d). Denote pP|X~. Note that p is a polynomial in X~. Now repeat the proof using these two polynomials, and by Claim 69, we have that a random input of P yields 0 (approximately) the same as a random input of p yields 0. Thus as we have Prx𝔽n[P(x)=0]=1δ𝔽(d) we also get:

PrxX~[p(x)=0]1δ𝔽(d)ϵ1

This bounds from above the distance of Reed-Muller code in X~ and we have:

δ𝔽,X~(d)δ𝔽(d)+ϵ1

Corollary 72.

If we assume X~ has the limited-relative rank-bias property to any extent (or just the relative rank-bias property), then the theorem above proves an exact equality δ𝔽,X~(d)=δ𝔽(d).

7 List Decoding Reed Muller Over 𝑿~

In this section, we prove our main theorem: we prove the list decoding radius of Reed-Muller codes in X~ is at least the list decoding radius of Reed-Muller codes in 𝔽n, assuming X~ is lift-enabler and has the relative rank-bias property. We start by presenting formally the list decoding radius in X~.

Definition 73 (List Decoding in X~).

Let 𝔽 be a finite field. Let d,n, and let X~𝔽n.
We define the Reed-Muller list-decoding count in X~ at distance τ as follows:

𝔽,X~(d,τ)maxF:X~𝔽|{PPolyd(X~𝔽)|dist(P,F)τ}|

Additionally, we define LDR𝔽,X~(d) to be the list decoding radius, which is the maximum τ for which 𝔽,X~(d,τϵ) is bounded by a constant depending only on ϵ,|𝔽|,d.

We recall that it was shown in [8, Theorem 1] that the list decoding radius of Reed Muller is δ𝔽(d). To be more precise, it was shown that for every ϵ>0, the list-decoding count is constant (independent of n) in distance τ=δ𝔽(d)ϵ. Formally, they have shown the following theorem:

Theorem 74 (List Decoding RM in 𝔽n).

There exists a function c(𝔽,d,ϵ) such that the following holds: Let 𝔽 be a finite field, let ϵ>0, and let d,n. Then, we have:

𝔽,𝔽n(d,δ𝔽(d)ϵ)c(𝔽,d,ϵ)

Additionally, we recall a lemma that was presented in [8, Corollary 3.3], and was used in the analysis of the list decoding radius of Reed-Muller codes in 𝔽n:

Lemma 75 (Low Complexity Approximation).

[8, Corollary 3.3] Let G:AB, and let ϵ>0. Let 𝔉BA be a collection of functions from A to B. Then there exists c1/ϵ2 functions F1,,Fc𝔉 such that for every F𝔉, there is a function ΓF:BcB such that:

PrxA[ΓF(F1(x),,Fc(x))=F(x)]PrxA[G(x)=F(x)]ϵ

The lemma shows that G can “estimated” by a only a few functions from 𝔉. Note that the estimation is close to G compared to every F𝔉 and not necessarily close to G itself.

Finally, we present our main theorem, which shows that under assumptions on the subset X~𝔽n, the list decoding radius of polynomials in X~ will be similar to the list decoding radius in 𝔽n.
In more details (and informally), we show that if X~𝔽n is lift-enabler, has the limited-relative-rank-bias-property, the list-decoding count is constant (independent of n) for every valid ϵ in distance τ=δ𝔽(d)ϵ. Note that not every ϵ>0 will be valid: the valid values of ϵ will depend on the limitations of the rank-bias property. Formally, we show the following:

Theorem 76 (List Decoding RM in X~).

There exist functions c1(𝔽,d,r~,ϵ~) and c2(𝔽,d,r~,ϵ) such that the following holds: Let 𝔽 be a finite field, and let d be an integer that represents a degree. Let ϵ~>0, and let r~:[ϵ~,] be a limited-relative-rank-bias function.
Let X~𝔽n be a set with the following properties

  1. 1.

    X~ is d-lift-enabler with a lift operator ^.

  2. 2.

    X~ has the (r~,𝔽,d,ϵ~)-relative-rank-bias property.

Then, for every ϵc1(𝔽,d,r~,ϵ~) it holds:

𝔽,X~(d,δ𝔽,𝔽n(d)ϵ)c2(𝔽,d,r~,ϵ)
Proof.

We follow the lines of the proof of [8, Theorem 1]. Let 𝔽 be a finite field, and let d be an integer that represents a degree. Let ϵ~>0, and let r~:[ϵ~,] be a limited-rank-relative-bias function. Let c1(𝔽,d,r~,ϵ~) be a function we will specify later. Let X~𝔽n be a set with the properties defined above.
Finally, let ϵc1(𝔽,d,r~,ϵ~) for c1 that we will specify later, and let f:X~𝔽 be a received word. We wish to bound the amount of polynomials in Polyd(X~𝔽) that are (δ𝔽(d)ϵ)-close to f.
Apply Lemma 75 with A=X~, B=𝔽, G=f, 𝔉=Polyd(X~𝔽) and approximation parameter ϵ/2 to obtain 𝔥Polyd(X~𝔽), defined by 𝔥=(h1,,hc) where c4/ϵ2, such that for every pPolyd(X~𝔽) there is a function Γp:𝔽c𝔽 that approximates f in X~ relative to Polyd(X~𝔽) i.e.:

pPolyd(X~𝔽):PrxX~[Γp(h1(x),,hc(x))=p(x)]PrxX~[f(x)=p(x)]ϵ/2

Let r1,r2: be two non-decreasing functions that represents rank that we will specify later. For r1, we will require that for all m1:

r1(m)max{r2(Cr2,d64(m+1))+Cr2,d64(m+1)+1,r2(Cr30,d64(m+1))+Cr30,d64(m+1)+1}

Note that in the expression above, we denote r30:, as follows: r30(c)r30(𝔽,d,c).
The reason we chose this r1, is that by our choice of r1 we can use the second part of Lemma 64. Specifically, if we start with r1-X~-regular factor and we r2-X~-regularize it, we get that the r2-X~-regular factor that we received is a syntactic refinement of the r1-X~-regular factor we started with.
As a first step, we lift the polynomial factor to get 𝔥^. Note that because xX~:hi^(x)=hi(x), for all pF we have:

PrxX~[Γp(h1^(x),,hc^(x))=p(x)]PrxX~[f(x)=p(x)]ϵ/2

Next, we r1-X~-regularize the factor generated by the collection by Theorem 64. This gives us a r1-X~-regular factor , which is defined by a set of polynomials (H1,,Hc) of degree d such that semX~, with rankX~()r(c) and with bounded amount of polynomials defining it i.e. cCr1,d64(c). We apply Corollary 63 and get that sem|X~. We then use the fact that Γp(h1^(x),,hc^(x)) is measurable in respect of in X~, and deduce we have a similar approximation of p using as the approximation of p using . Formally, there exists a function Γp:𝔽c𝔽 such that:

PrxX~[Γp(H1(x)),,Hc(x))=p(x)]PrxX~[f(x)=p(x)]ϵ/2

Now we recall that we wished to bound the amount of polynomials pPolyd(X~𝔽) such that PrxX~[f(x)p(x)]δ𝔽(d)ϵ. Let pPolyd(X~𝔽) be a polynomial as we just described. We will show that such p is measurable with respect to in X~. This will upper bound the amount of possible polynomials p by the amount of possible different Γp:𝔽c𝔽, which is |𝔽|=p(pc), and thus c2(𝔽,d,r~,ϵ)p(pc).
By our choice of c we have that cCr1,d64(4/ϵ2), and thus c2 is bounded by a function of (𝔽,d,r1,ϵ). Note that we have not yet specified the value of r1, because it is determined by the choice of r2 that we will later define its exact values. The important thing about our future choice of r2 is that the value of r2 must be independent of n, but can depend on (𝔽,d,r~,ϵ). This will conclude the proof.
Now, consider a lift of p, i.e. Pp^. Note that by the definition of lift xX~:P(x)=p(x). We will show that P is measurable in respect of in X~.
We consider the factor P that is generated by P{P}. By using Theorem 64, we can r2-X~-regularize it and get the polynomial factor ′′ that relative-refines P. We denote the set of polynomials in the factor as ′′.
Next, notice that the factor ′′ is a r2-regular factor, therefore by our choice of r1 and the second part of Theorem 64, we in fact have ′′syn. This is true because by our choice of r1:

rankX~()r1(c)r2(Cr2,d64(c+1))+Cr2,d64(c+1)+1r2(|′′|)+|′′|+1

And as rank is always bigger than relative rank, we also have:

rank()r1(c)r2(Cr30,d64(c+1))+Cr30,d64(c+1)+1

Thus, the polynomials defining ′′ are in the form ′′{H1′′,,Hc′′′′}. Note that as promised in Theorem 64, we have |′′|=c+c′′C64r2,d(c).
Additionally, by the way we built P, the function P is measurable in respect of it. Therefore, as ′′semX~P, we have that P is ′′-measurable relative to X~. In other words, there exists Φ:𝔽c+c′′𝔽 and Pmissing¯:𝔽n𝔽 with deg(Pmissing¯),deg(PPmissing¯)deg(P)d and Pmissing¯|X~0 such that:

a𝔽n:P(a)=Φ(H1(a),,Hc(a),H1′′(a),,Hc′′′′(a)))+Pmissing¯(a)

And specifically in X~ we have:

xX~:P(x)=Φ(H1(x),,Hc(x),H1′′(x),,Hc′′′′(x)))

Denote PPPmissing¯. We will show the polynomial P does not depend on its last c′′ variables, and thus Φ does not depend on its last c′′ variables. This will imply that P is measurable in respect of in X~, which will conclude the proof.
Now, we choose r2 to be such that:

r2(m)max{r~(ϵ/4|𝔽|m),r23(ϵ/4|𝔽|m),r30(m)}

Note that in the expression above we are discussing fixed field and degree, i.e. 𝔽,d. Therefore we denote r30: as r30(c)r30(𝔽,d,c) and r23: as r23(ϵ)r23(𝔽,d,ϵ).
Next, we show that even if we change the polynomials in the factor to have a disjoint set of inputs in 𝔽n, we still obtain a polynomial in the same degree, which have an approximation close to the approximation we had in X~. Note that after this step, the proof becomes very similar to the proof of list decoding Reed Muller in 𝔽n [8, Theorem 1]: we omit the dependence of X~ and get the same approximation by functions of multiple variables, as we had in 𝔽n. This is done by the following lemma:

Lemma 77.

Let {ai,bj},i[c],j[c′′] be pairwise disjoint sets of n variables each. Let nn(c+c′′). Let P:𝔽n𝔽 and f:𝔽n𝔽 be functions of n variables defined as follows:

P(a)Φ(H1(a1),,Hc(ac),H1′′(b1),,Hc′′′′(bc′′))

and:

f(a)Γp(H1(a1)),,Hc(ac))

Note that f is a function that receives n variables, and ignores its last c′′ variables.
Then:

  1. 1.

    The degree of P remains bounded, i.e. deg(P)d.

  2. 2.

    The approximation of f to P in 𝔽n is close to the approximation of Γp to p in X~. Specifically, we show:

    |Pra𝔽n[f(a)=P(a)]PrxX~[Γp(H1(x)),,Hc(x))=p(x)]|ϵ/4
Proof.

We start by proving the first part of the lemma: bounding the degree of P by d. First, we recall that P=PPmissing¯ where Pmissing¯ is a valid remainder. Specifically, we have deg(P)=deg(PPmissing¯)deg(P)d. In addition, by the way we built Φ we have:

a𝔽n:P(a)=Φ(H1(a),,Hc(a),H1′′(a),,Hc′′′′(a)))

Thus the function above is of degree d. Moreover, we have:

rank(′′)rankX~(′′)r2(|′′|)r30(|′′|)

Therefore we can use Lemma 30 to get that deg(P)deg(P)d. Note that in order to use the lemma formally, we had to extend the input space of P to be of n variables (and make it depend only on the first n variables as it used to). Because lemma 30 require bounds independent of n, this is done smoothly.
Now we move to the second part of the lemma: bounding the approximation of f to P. Denote S𝔽c+c′′, and for each sS denote:

p1(s)PrxX~[(H1(x),,Hc(x),H1′′(x),,Hc′′′′(x))=s]

and as of our choice of r2, we have rank(′′)r~(ϵ/8|S|). Therefore, if we require that the relative rank-bias relation holds for ϵ/8|S|, we can use Lemma 81 with A=X~ to get that p1 is (ϵ/8|S|)-almost uniform, i.e:

p1(s)=1±ϵ/8|S|

We show that this can be done in the following claim by choosing a proper c1:

Claim 78.

One can choose c1c1(𝔽,d,r~,ϵ~) such that if ϵc1 we have that ϵ/8|S|c1.

Proof.

This is done by using the bound we already know. We need that:

ϵ~ϵ8|𝔽|c+c′′

As c+c′′Cr2,d64(c), for the term above to hold it is enough that the following will be true:

ϵϵ~8|𝔽|Cr2,d64(c)

and as r2,c and thus also Cr2,d64(c) are independent of n, we can pick c1=c1(𝔽,d,r~,ϵ~) and get what we aimed for. Thus, we can assume that p1 is (ϵ/8|S|)-almost uniform. Now, let:

p2(s)Pra𝔽n[(H1(a1),,Hc(ac),H1′′(b1),,Hc′′′′(bc′′))=s]

Note that the rank of ′′={H1(a1),,Hc(ac),H1′′(b1),,Hc′′′′(bc′′)}, as a factor defined over 𝔽n, can not be lower than the rank of ′′ and thus we have rank(′′)r23(ϵ/8|𝔽|m). By using Theorem 23, which shows the rank-bias relation for 𝔽n, we can similarly use Lemma 81 with A=𝔽n to get that p2 is also (ϵ/8|S|)-almost-uniform, i.e:

p2(s)=1±ϵ/8|S|

Now, we show the approximations are the same. Denote by s the restriction of s to its first c coordinates, and consider the approximation:

Pra𝔽n[f(a)=P(a)]=
=sSp2(s)1Φ(s)=ΓP(s)
=sSp1(s)1Φ(s)=ΓP(s)±ϵ/4
=PrxX~[Γp(H1(x)),,Hc(x))=p(x)]±ϵ/4

This completes the proof the lemma.

The proof is followed by the same methods used in [8]. We repeat if for completeness. We next restate a lemma proved in [8, Claim 4.2], which is a varaiant of the Schwartz-Zippel lemma [43, 46]:

Lemma 79.

Let d, n1, n2 be integers. Let P1Polyd(𝔽n1+n2𝔽), and let F1:𝔽n1𝔽 be a function. Assume the polynomial is δ𝔽(d)-close to the function, i.e:

Prx1,,xn1+n2𝔽[P1(x1,,xn1+n2)=F1(x1,,xn)]>1δ𝔽(d)

Then, P1 does not depend on xn1+1,,xn1+n2.

Now, apply Lemma 79 to P1=P, F1=f, n1=nc, n2=nc′′. We obtain that P does not depend on its last c′′ variables, and thus by denoting CiHi′′(0) for i[c′′] we have:

P(a)=Φ(H1(a1),,Hc(ac),C1,,Cc′′)

Now, for every a𝔽n, if we substitute a in the i-th component of a for every i[c] in the equation above, we get the following is true:

P(a)=Φ(H1(a),,Hc(a),C1,,Cc′′)

Hence P does not depend on its last c′′ variables. As explained earlier, this implies that P is measurable in respect of in X~. This completes the proof of the theorem. 

References

  • [1] Omar Alrabiah, Jesse Goodman, Jonathan Mosheiff, and João Ribeiro. Low-degree polynomials are good extractors, 2025. doi:10.48550/arXiv.2405.10297.
  • [2] Omar Alrabiah, Venkatesan Guruswami, and Ray Li. Randomly punctured reed–solomon codes achieve list-decoding capacity over linear-sized fields, 2024. doi:10.48550/arXiv.2304.09445.
  • [3] Mitali Bafna and Dor Minzer. Characterizing direct product testing via coboundary expansion, 2024. doi:10.48550/arXiv.2308.09668.
  • [4] Paul Beame, Shayan Oveis Gharan, and Xin Yang. On the bias of reed-muller codes over odd prime fields, 2018. arXiv:1806.06973.
  • [5] Ido Ben-Eliezer, Rani Hod, and Shachar Lovett. Random low degree polynomials are hard to approximate. In Proceedings of the 12th International Workshop and 13th International Workshop on Approximation, Randomization, and Combinatorial Optimization. Algorithms and Techniques, APPROX ’09 / RANDOM ’09, pages 366–377, Berlin, Heidelberg, 2009. Springer-Verlag. doi:10.1007/978-3-642-03685-9_28.
  • [6] Inbar Ben Yaacov, Yotam Dikstein, and Gal Maor. Sparse high dimensional expanders via local lifts, 2024. doi:10.48550/arXiv.2405.19191.
  • [7] Arnab Bhattacharyya, Eldar Fischer, Hamed Hatami, Pooya Hatami, and Shachar Lovett. Every locally characterized affine-invariant property is testable, 2013. arXiv:1212.3849.
  • [8] Abhishek Bhowmick and Shachar Lovett. List decoding reed-muller codes over small fields, 2014. arXiv:1407.3433.
  • [9] Abhishek Bhowmick and Shachar Lovett. Bias vs structure of polynomials in large fields, and applications in effective algebraic geometry and coding theory. CoRR, abs/1506.02047, 2015. arXiv:1506.02047.
  • [10] Andrej Bogdanov. Pseudorandom generators for low degree polynomials. In Proceedings of the Thirty-Seventh Annual ACM Symposium on Theory of Computing, STOC ’05, pages 21–30, New York, NY, USA, 2005. Association for Computing Machinery. doi:10.1145/1060590.1060594.
  • [11] Andrej Bogdanov and Emanuele Viola. Pseudorandom bits for polynomials. In 48th Annual IEEE Symposium on Foundations of Computer Science (FOCS’07), pages 41–51, 2007. doi:10.1109/FOCS.2007.42.
  • [12] Joshua Brakensiek, Manik Dhar, and Sivakanth Gopi. Generalized gm-mds: Polynomial codes are higher order mds, 2024. doi:10.48550/arXiv.2310.12888.
  • [13] Joshua Brakensiek, Manik Dhar, Sivakanth Gopi, and Zihan Zhang. Ag codes achieve list-decoding capacity over constant-sized fields, 2024. doi:10.48550/arXiv.2310.12898.
  • [14] Joshua Brakensiek, Sivakanth Gopi, and Visu Makam. Generic reed-solomon codes achieve list-decoding capacity, 2024. arXiv:2206.05256.
  • [15] Nader Bshouty. Testers and their applications. In Proceedings of the 5th Conference on Innovations in Theoretical Computer Science, ITCS ’14, pages 327–352, New York, NY, USA, 2014. Association for Computing Machinery. doi:10.1145/2554797.2554828.
  • [16] Gil Cohen and Amnon Ta-Shma. Pseudorandom generators for low degree polynomials from algebraic geometry codes. Electron. Colloquium Comput. Complex., TR13, 2013. URL: https://api.semanticscholar.org/CorpusID:13155686.
  • [17] Harm Derksen and Emanuele Viola. Fooling polynomials using invariant theory. In 2022 IEEE 63rd Annual Symposium on Foundations of Computer Science (FOCS), pages 399–406. IEEE, 2022. doi:10.1109/FOCS54457.2022.00045.
  • [18] Yotam Dikstein. New high dimensional expanders from covers, 2022. doi:10.48550/arXiv.2211.13568.
  • [19] Yotam Dikstein and Irit Dinur. Agreement theorems for high dimensional expanders in the small soundness regime: the role of covers, 2024. doi:10.48550/arXiv.2308.09582.
  • [20] Dean Doron, Amnon Ta-Shma, and Roei Tell. On hitting-set generators for polynomials that vanish rarely. Comput. Complex., 31(2):16, 2022. doi:10.1007/S00037-022-00229-2.
  • [21] Zeev Dvir and Amir Shpilka. Noisy interpolating sets for low degree polynomials. In 2008 23rd Annual IEEE Conference on Computational Complexity, pages 140–148, 2008. doi:10.1109/CCC.2008.14.
  • [22] Ashish Dwivedi, Zeyu Guo, and Ben Lee Volk. Optimal pseudorandom generators for low-degree polynomials over moderately large fields, 2024. doi:10.48550/arXiv.2402.11915.
  • [23] Alexander Golovnev, Zeyu Guo, Pooya Hatami, Satyajeet Nagargoje, and Chao Yan. Hilbert functions and low-degree randomness extractors, 2024. doi:10.48550/arXiv.2405.10277.
  • [24] Parikshit Gopalan, Adam R. Klivans, and David Zuckerman. List-decoding reed-muller codes over small fields. In Proceedings of the Fortieth Annual ACM Symposium on Theory of Computing, STOC ’08, pages 265–274, New York, NY, USA, 2008. Association for Computing Machinery. doi:10.1145/1374376.1374417.
  • [25] Omri Gotlib, Tali Kaufman, and Shachar Lovett. List decoding quotient reed-muller codes, 2025. doi:10.48550/arXiv.2502.15650.
  • [26] Roy Gotlib and Tali Kaufman. List agreement expansion from coboundary expansion, 2022. doi:10.48550/arXiv.2210.15714.
  • [27] W. T. Gowers and Thomas Karam. Equidistribution of high-rank polynomials with variables restricted to subsets of 𝔽p, 2022. arXiv:2209.04932.
  • [28] Ben Green and Terence Tao. The distribution of polynomials over finite fields, with applications to the gowers norms, 2007. arXiv:0711.3191.
  • [29] Ben Green and Terence Tao. The primes contain arbitrarily long arithmetic progressions, 2007. arXiv:math/0404188.
  • [30] Venkatesan Guruswami, Lingfei Jin, and Chaoping Xing. Efficiently list-decodable punctured reed-muller codes, 2017. arXiv:1508.00603.
  • [31] Venkatesan Guruswami and Chaoping Xing. Hitting sets for low-degree polynomials with optimal density. In 2014 IEEE 29th Conference on Computational Complexity (CCC), pages 161–168, June 2014. doi:10.1109/CCC.2014.24.
  • [32] Hamed Hatami, Pooya Hatami, and Shachar Lovett. Higher-order Fourier Analysis and Applications. Now Foundation and Trends, January 2019. doi:10.1561/9781680835939.
  • [33] Tali Kaufman and Shachar Lovett. Worst case to average case reductions for polynomials, 2008. arXiv:0806.4535.
  • [34] David Kazhdan and Tamar Ziegler. Polynomial functions as splines, 2018. arXiv:1712.09047.
  • [35] David Kazhdan and Tamar Ziegler. Extending weakly polynomial functions from high rank varieties, 2019. arXiv:1808.09439.
  • [36] David Kazhdan and Tamar Ziegler. Properties of high rank subvarieties of affine spaces, 2020. arXiv:1902.00767.
  • [37] Amichai Lampert and Tamar Ziegler. Relative rank and regularization, 2021. arXiv:2106.03933.
  • [38] Shachar Lovett. Unconditional pseudorandom generators for low degree polynomials. In Proceedings of the Fortieth Annual ACM Symposium on Theory of Computing, STOC ’08, pages 557–562, New York, NY, USA, 2008. Association for Computing Machinery. doi:10.1145/1374376.1374455.
  • [39] Shachar Lovett. MDS matrices over small fields: A proof of the GM-MDS conjecture. CoRR, abs/1803.02523, 2018. arXiv:1803.02523.
  • [40] Chi-Jen Lu. Hitting set generators for sparse polynomials over any finite fields. In 2012 IEEE 27th Conference on Computational Complexity, pages 280–286, 2012. doi:10.1109/CCC.2012.20.
  • [41] Shay Moran and Cyrus Rashtchian. Shattered sets and the hilbert function, 2020. arXiv:1511.08245.
  • [42] Wolfgang M. Schmidt. The density of integer points on homogeneous varieties. Acta Mathematica, 154(3-4):243–296, 1985. doi:10.1007/BF02392473.
  • [43] J. T. Schwartz. Fast probabilistic algorithms for verification of polynomial identities. J. ACM, 27(4):701–717, October 1980. doi:10.1145/322217.322225.
  • [44] Emanuele Viola. The sum of d small-bias generators fools polynomials of degree d. In 2008 23rd Annual IEEE Conference on Computational Complexity, pages 124–127, 2008. doi:10.1109/CCC.2008.16.
  • [45] Hikmet Yildiz and Babak Hassibi. Optimum linear codes with support constraints over small fields. CoRR, abs/1803.03752, 2018. arXiv:1803.03752.
  • [46] Richard Zippel. Probabilistic algorithms for sparse polynomials. In Symbolic and Algebraic Computation, 1979. URL: https://api.semanticscholar.org/CorpusID:15629042.

Appendix A Equidistribution of Functions

Assume we have a collection of functions (F1,,Fc), where Fi:A𝔽 for some finite set A. We are interested in showing that the functions are equidistributed, which means that their values behave close to independent random variables. We begin by formulating this definition:

Definition 80 (Equidistribution of Functions).

Given ϵ>0 and A𝔽n, we say a collection of functions 𝔉=(F1,,Fc) where Fi:A𝔽 is ϵ-equidistributed in A if for all α=(α1,,αc)𝔽c we have:

PrxA[(F1(x),,Fc(x))=α]=1|𝔽|c±ϵ

The following is a standard lemma that shows that if every linear combination of a collection of functions has low bias, the collection is equidistributed. We repeat the steps of the proof of [32, Lemma 7.24], but here, we think of A as any finite set (and not particularly 𝔽n):

Lemma 81.

Let ϵ>0, and let A be a finite set. Let 𝔉=(F1,,Fc) be a collection of functions defined over A, i.e. Fi:A𝔽. Assume each linear combination of the collection has low bias, i.e. for each λ=(λ1,,λc)𝔽c such that λ0 we have:

biasxA(i=1cλiFi)<ϵ

Then, the collection 𝔉 is ϵ-equidistributed over A.
In particular, for ϵ<1|𝔽|c, the lemma shows that each atom of 𝔉 is not empty i.e. for all α there is some xA such that (F1(x),,Fc(x))=α.

Proof.

We wish to show that for each α𝔽c we have:

PrxA[(F1(x),,Fc(x))=α]=1|𝔽|±ϵ

We express the fraction of inputs that are in the atom α the following way:

PrxA[(F1(x),,Fc(x))]=𝔼xA[i=1c1[Fi(x)=αi]]

We use the fact that for every 0x𝔽, we have λ=0p1e[λx]=0, and if x=0 we have λ=0p1e[λx]=p. Therefore, the expression above equals:

=𝔼xA[i=1c(1pλi=0p1e[λi(Fi(x)αi)])]=1pc𝔼xA[i=1cλi=0p1e[λi(Fi(x)αi)]]

By the definition of character functions, we have that e[a+b]=e[a]e[b], and therefore the expression above equals:

1pc(λ1,,λc)i=1c[0,p1](𝔼xA[e[i=0cλi(Fi(x)αi)]])

Now, we use the fact that:

biasxA(i=1c(λi(Fi(x)αi))=biasxA(i=1c(λiFi(x)))<ϵ

and get that:

PrxA[(F1(x),,Fc(x))=α]=1pc(1±ϵi=1cp)=1|𝔽|c±ϵ

Appendix B Comparing Ranks

In this section, we compare the definition of rank we used in this paper to another definition of rank used implicitly throughout this paper. This comparison is crucial, as there is no universally accepted definition of rank; different theorems presented throughout this paper employ distinct definitions. We demonstrate that our definition is sufficiently comprehensive, in that a polynomial (or a factor) classified as having high rank according to our criteria also exhibits high rank according to the second implicitly-used definition. While in many cases the comparison may appear straightforward, we include it for the sake of completeness.
Specifically, we compare our definition of rank with the definition established in  [37]. The paper [37] extended the original definition of rank that was presented in [42], to include also the concept of relative rank. It is important to note that this definition is specifically defined to subsets X~𝔽n that can be expressed as sets in the form X~=Z(~) for some set of polynomials ~, and not to a general set X~𝔽n.
First, we present a useful notation that is used in the definition presented in  [37]:

Notation (Largest Degree Homogenous Part).

For a polynomial P of degree d, we denote by h(P) its degree-d homogenous component. In other words, h(P) is the sum of all the monomials of P of degree exactly d. For a set of polynomials 𝒫={P1,,Pc}, we define h(𝒫){h(Pi)|i=1,,c}.

Next, we present the exact definition of rank for a polynomial:

Definition 82 (Schmidt Rank of a Polynomial).

The schmidt rank of a homogenous polynomial P:𝔽n𝔽, noted as schmrank(P), is the minimal r such that there exist (Qi,Hi)i[r] with degQi,degHi<degP such that:

P(x)=i=1r(Q(x)H(x))

For a general polynomial P of degree d, we set its rank to be the rank of its degree-d homogenous component, i.e. schmrank(P)schmrank(h(P)).

 Remark 83 (High rank implies high schmidt rank).

If rank(P)2r+1 for some constant r, then schmrank(P)r.

Proof.

For homogenous polynomial P, assume schmrank(P)<r. Then, there exist r<r such that there exist (Qi,Hi)i=1r with degQi,degHi<degP such that:

P(x)=i=1r(Q(x)H(x))

Then we can choose Γ:𝔽2r𝔽 to be a sum of multiples of each two consecutive variables to get that P(x)=Γ(Q1(x),H1(x),,Qr(x),Hr(x)), where the polynomials are from a degree <deg(P). This means that rank(P)2r<2r as we requested.
If we do not assume P is homogenous, by adding Ph(P) as an input to Γ, one can create a Γ:𝔽2r+1𝔽 which equals to P when substituting the inputs with some polynomials with degree <degP, which concludes the proof in a similar way.

Next, we present the definition of Schmidt rank of a factor as defined in  [37].

Definition 84 (Schmidt Rank of a Factor).

For a factor of homogenous polynomials 𝒫=(P1,,Pc), the schmidt rank of the factor is defined as:

schmrank(𝒫)min(schmrank(i=1cλiPi)|0(λ1,,λc)𝔽c)

Similarly, for a factor of general polynomials 𝒫, we set its rank to be the rank of its matching homogenous-factor, i.e. schmrank(𝒫)schmrank(h(𝒫)) For a factor generated by 𝒫, we define schmrank()schmrank(𝒫).

To establish the equivalence of this definition with the one employed throughout the paper, we must first acknowledge two key distinctions between the definitions. The first distinction is that this definition focuses on the largest-degree homogeneous components of the polynomials involved in the factor, rather than considering linear combinations of polynomials from the factor. The second distinction pertains to the treatment of d in the computation of d-rank of each linear combination. This definition uses the degree of the linear combination directly to calculate the rank that participates in the minimum, in contrast to our definition which uses maxideg(λiPi). Despite these differences, we will demonstrate that both definitions ultimately yield a similar rank assessment, thereby affirming their equivalence.

 Remark 85 (High Rank Implies High Schmidt Rank for Factors).

Let 𝒫=(P1,,Pc) be a set of polynomials and let r be a positive integer, i.e. r>0. If rank(𝒫)2r+1, then schmrank(𝒫)r.

Proof.

Assume that schmrank(𝒫)r for r>0. We will show that rank(𝒫)2r+1. By definition, there exists a linear combination of polynomials in h(𝒫) with rank r. In other words, there exists 0λ𝔽c such that schmrank(i=1cλih(Pi))r. Denote Phi=1cλih(Pi). As was shown in a previous remark, a rank of a polynomial is smaller than its schmidt rank up to a constant factor, thus rank(Ph)2r+1 (see Remark 83).
Next, we denote Pi=1cλiPi, and dMmaxi[c]λiPi. Note that deg(P)dM. We wish to show that rankdM(P)2r+1. First, we observe that the dM-degree homogenous component of Ph equals the dM-degree homogenous component of P. This is true because every highest-degree component of polynomials in the linear combination that generated P, also exists in the linear combination that generates Ph. In particular, all homogenous components of degree dM exists in both linear combinations Ph and P. Therefore, if the degree of P equals dM, we have that rankdM(P)=rank(P)2r+1. Otherwise, if deg(P)<dM, then rankdM(P)=12r+1. This completes the proof.

 Note.

In the case discussed above, if deg(P)<dM, then schmrank(𝒫)=0.

Proof.

Assume that deg(P)<dM. Therefore, the degree of the linear combination P=i=1cλiPi is strictly smaller than the degree of at least one of the polynomials participating in it. Denote by λ the sub-combination of λ that consists only the polynomials that participated in P that are of degree =dM. Trivially, λ0. Additionally, we have deg(i=1cλiPi)<dM. Now, we use the following observation: the linear combination above, when summing only the homogenous components of each polynomial, equals 0, i.e. i=1cλih(Pi)0. By this, we found a linear combination of h(𝒫) that is 0. Thus by definition, we have schmrank(𝒫)=0.

 Note.

This shows that if we compare only the differences in the definition of rank of a factor, i.e. the focus on linear combinations of the largest-degree homogenous components in contrast to the use of the maximal degree d-rank, the two definitions for a rank of a factor are equal up to ±1 (in case we use the same definition of rank for a single polynomial). To avoid confusion, we omit the exact definitions and respective proof.

We now present the definition of relative rank as stated in  [37, Definition 1.6]: We remind the reader that this definition is specifically defined to subsets X~𝔽n that can be expressed by X~=Z(~) for some set of polynomials ~, and not to a general set X~𝔽n.

Definition 86 (Relative Schmidt Rank of a Polynomial).

The relative schmidt rank of a homogeneous polynomial P relative to a collection of homogeneous polynomials ~=(L1,,Lc~) is

shcmrank~(P)min{schmrank(P+i=1c~RiLi)|deg(Li)+deg(Ri)deg(P),i[c~]}

Note that whenever degLi>degP, this implies Ri=0.
For general polynomial P and general collection of polynomials ~, we define the schmidt rank of the former in respect to the latter by the relative rank of their largest-degree homogenous component, i.e. shcmrank~(P)shcmrankh(~)(h(P)).

 Remark 87 (High Relative Rank High Relative Schmidt Rank).

Let P and ~={L1,,Lc~} be polynomials, and let X~𝔽n be defined as X~=Z(~).

If rankX~(P)2r+2 for some constant r, then shcmrank~(P)r.

Proof.

Let P and L1,,Lc~ be polynomials. Assume that shcmrank~(P)r. Then, there exists R1,,Rc~ with deg(Li)+deg(Ri)deg(P) for all i[c~] such that:

schmrank(h(P)+i=1c~Rih(Li))r

Denote Ph¯i=1c~Rih(Li). As we have shown earlier, a rank of a polynomial is smaller than its schmidt rank up to a constant factor (See Remark 83). Thus:

rank(h(P)+Ph¯)2schmrank(h(P)+Ph¯)+12shcmrankX~(P)+1=2r+1

Next, we denote the respective remainder polynomial for the non-homogenous analogue, i.e. Pmissing¯i=1c~RiLi. By observing the highest degree homogenous component of each summand, one can see that h(P+Pmissing¯)=h(h(P)+Ph¯). Therefore, by adding to the decomposition the non higest-degree-homogenous-component, one can see that:

rank(P+Pmissing¯)rank(h(P)+Ph¯)+12r+2

This completes the proof as rankX~(P)rank(P+Pmissing¯)2r+2.

 Remark 88 (Relative Schmidt Rank over Varieties of High Degree).

If the polynomials defining the variety ~=(L1,,Lc~) are of degree >deg(P), then, shcmrank~(P)=schmrank(P). This is true because in this case, in the calculation of the minimum in the definition of relative schmidt rank, we must have Ri=1 for all i[c~] and therefore the minimum above is simply rank(P).
Note that a similar statement holds for factors aswell. If 𝒫=(P1,,Pc) is a factor of degree d, then if all the polynomials in ~ have degree >d, then the statement above is also true i.e. shcmrank~(𝒫)=schmrank(𝒫). This is true because for every linear combination of 𝒫 has degree d and therefore its relative schimdt rank equals its rank.

Finally, we present the extension of the definition of relative rank for polynomials factors:

Definition 89 (Relative Schmidt Rank of a Factor).

The relative rank of a set of homogenous polynomials 𝒫={P1,,Pc} relative to another collection of polynomials ~={L1,,Lc~} is defined as:

shcmrank~(𝒫)min{shcmrank~(i=1cλiPi)|0(λ1,,λc)𝔽c}

If 𝒫 is a general collection of polynomials, then shcmrank~(𝒫)shcmrank~(h(𝒫)).
For a factor generated by a set of polynomials 𝒫, we define its schmidt rank relative to X~=Z(~) to be shcmrankX~()shcmrank~(𝒫).

 Remark 90.

Let 𝒫={P1,,Pc} and ~={L1,,Lc~} be sets of polynomials, and let X~𝔽n be defined as X~=Z(~). Additionally, let r such that r>0. If rankX~(𝒫)2r+2 for some constant r, then shcmrank~(P)r.

Proof.

Assume that shcmrank~(𝒫)r. We will show that rankX~(𝒫)2r+2. Let 0λ𝔽c be some vector of coefficients. Let Pi=1cλiPi and Phi=1cλih(Pi) be the linear combinations of polynomials in 𝒫 and h(𝒫) with coefficients λ respectively, and let dMmaxi[c]deg(λiPi). Additionally, denote r^shcmrank~(Ph)r. It is enough to show that rankdM,X~(P)2r^+2, If deg(P)<dM, then rankdM,X~(P)=12r+2. Otherwise, if deg(P)=dM, then the remark follows from Remark 87 as:

rankdM,X~(P)=rankX~(P)2shcmrank~(P)+2

Where:

shcmrank~(P)shcmrank~(h(P))=shcmrank~(h(Ph))=shcmrank~(Ph)=r^