List Decoding Reed-Solomon Codes in the Lee, Euclidean, and Other Metrics

Peikert, Chris; Hostetler, Alexandra Veliche

doi:10.4230/LIPIcs.ITCS.2026.106

List Decoding Reed–Solomon Codes in the
Lee, Euclidean, and Other Metrics

Chris Peikert

University of Michigan, Ann Arbor, MI, USA Alexandra Veliche Hostetler

University of Michigan, Ann Arbor, MI, USA

Abstract

Reed–Solomon error-correcting codes are ubiquitous across computer science and information theory, with applications in cryptography, computational complexity, communication and storage systems, and more. Most works on efficient error correction for these codes, like the celebrated Berlekamp–Welch unique decoder and the (Guruswami–)Sudan list decoders, are focused on measuring error in the Hamming metric, which simply counts the number of corrupted codeword symbols. However, for some applications, other metrics that depend on the specific values of the errors may be more appropriate. This work gives a polynomial-time algorithm that list decodes (generalized) Reed–Solomon codes over prime fields in $\ell_{p}$ (semi)metrics, for any $0<p\leq 2$ . Compared to prior algorithms for the Lee ( $\ell_{1}$ ) and Euclidean ( $\ell_{2}$ ) metrics, ours decodes to arbitrarily large distances (for correspondingly small rates), and has better distance-rate tradeoffs for all decoding distances above some moderate thresholds. We also prove lower bounds on the $\ell_{1}$ and $\ell_{2}$ minimum distances of a certain natural subclass of GRS codes, which establishes that our list decoder is actually a unique decoder for many parameters of interest. Finally, we analyze our algorithm’s performance under random Laplacian and Gaussian errors, and show that it supports even larger rates than for corresponding amounts of worst-case error in $\ell_{1}$ and $\ell_{2}$ (respectively).

Keywords and phrases:

Reed–Solomon codes, list decoding, unique decoding, Lee metric, Euclidean metric, Guruswami–Sudan algorithm

Copyright and License:

2012 ACM Subject Classification:

Theory of computation

\rightarrow

Error-correcting codes

Editor:

Shubhangi Saraf

Series and Publisher:

Leibniz International Proceedings in Informatics, Schloss Dagstuhl – Leibniz-Zentrum für Informatik

1 Introduction

Reed–Solomon codes [11] are among the most widely used families of error-correcting codes, with applications across computer and communication sciences. Their many virtues include: a very simple definition; the largest possible minimum distance as a function of rate; and efficient decodability from errors, via either unique decoding up to half the minimum distance (see, e.g., [6, Section 12.1]), or list decoding up to the larger Johnson bound, via the celebrated works of Sudan [14] and Gururswami–Sudan [7] (see also [6, Section 12.2]).

List decoding [3, 15] is the task of finding all codewords that are within some desired distance of a (potentially corrupted) received word. When this radius is more than half the code’s minimum distance, there can potentially be more than one codeword within range (hence the name “list decoding”). Despite this non-uniqueness, list decoding can suffice for many purposes (e.g., finding a nearest codeword within range), and indeed, it has found numerous applications.

Most work on decoding Reed–Solomon codes has measured errors in the Hamming metric, which simply counts the number of corrupted codeword symbols (regardless of how they are corrupted). However, there are many other natural metrics that depend on the specific values of the errors. Such metrics can be more appropriate for settings where introducing a “large” error at a coordinate is more costly than a “small” error, or where the communication channel might add some nonzero error to every coordinate. An example is a channel that adds error according to a Gaussian or other fairly concentrated distribution. When the code alphabet is $\mathbb{Z}_{q}$ (the integers modulo $q$ ) – in particular, a prime field $\mathbb{F}_{q}$ – one metric of frequent study is the Lee metric, which is merely the $\ell_{1}$ norm $\lVert\mathbf{x}\rVert_{1}=\sum_{i}\lvert x_{i}\rvert$ after lifting $\mathbb{Z}_{q}$ to its distinguished representatives in $[-q/2,q/2)$ . Other natural, analogously defined choices include the Euclidean ( $\ell_{2}$ ) or other $\ell_{p}$ metrics.

We know of only a few prior works on efficiently decoding Reed–Solomon codes in metrics others than Hamming. For the Lee ( $\ell_{1}$ ) metric, Roth and Siegel [12] gave an algorithm that uniquely decodes up to half of (a lower bound on) the minimum distance; their algorithm works for certain subclasses of (generalized) Reed-Solomon and BCH codes. In addition, Wu, Kuijper, and Udaya [16] gave a list-decoding algorithm for $\ell_{1}$ , built around Guruswami–Sudan [7], that decodes to larger distances than in [12] for all small enough rates. Finally, for the Euclidean ( $\ell_{2}$ ) metric, Mook and Peikert [10] recently gave a list-decoding algorithm that also uses [7] as a black box.

1.1 Contributions

This work gives a polynomial-time algorithm that list decodes any generalized Reed–Solomon (GRS) code over a prime field in the $\ell_{p}$ (semi)metric for any $0<p\leq 2$ ; in particular, this includes the Lee ( $\ell_{1}$ ) and Euclidean ( $\ell_{2}$ ) metrics.¹¹1A semimetric is just a metric that does not necessarily satisfy the triangle inequality (which we will not need). Our algorithm works for a broader range of parameters, and has a better distance-rate tradeoff for all decoding distances above some moderate thresholds, than the prior algorithms for $\ell_{1}$ and $\ell_{2}$ [12, 16, 10]; see below for elaboration and Figure 1 for a visual depiction. For ease of comparison across the various works and $\ell_{p}$ (semi)metrics, we use a suitably normalized version of distance: for code length $n$ , distance $d$ corresponds to relative distance $\delta\vcentcolon=d/n^{1/p}$ .

For $p=2$ , our algorithm can handle an arbitrarily large decoding distance, for a correspondingly small enough rate: specifically, as $\delta$ and the alphabet size grow, we can decode for rates rapidly approaching $1/(\delta\sqrt{2\pi e})$ . By contrast, the prior work [10] applies only for relative distance $\delta<1/\sqrt{2}\approx 0.7071$ (i.e., $\ell_{2}$ distance less than $\sqrt{n/2}$ ). In addition, our algorithm works for larger rates than the one in [10] whenever $\delta$ exceeds about $0.51797$ . (See Section 5.2 for a detailed comparison.) This is particularly interesting since the rates obtained in [10] were shown to be optimal (in a certain sense) for $\delta<1/2$ , but not for larger values.

For $p=1$ , again our algorithm (like the one from [16]) can handle an arbitrarily large decoding distance, whereas [12] is limited to relative distance $\delta<1$ (i.e., $\ell_{1}$ distance less than $n$ ). In addition, our algorithm works for larger rates than those of [12, 16] whenever the relative decoding distance exceeds about $0.78988$ , and in general, as $\delta$ and the alphabet size grow, we can decode for rates rapidly approaching $1/(2e\delta)$ . (See Section 6.2 for details.) Our algorithm is also qualitatively broader: it decodes from continuous (real-valued) error, whereas the ones from [12, 16] require discrete (integer) error. While continuous error can be discretized by rounding, this can increase the relative distance from the codeword by up to $1/2$ in $\ell_{1}$ , which significantly degrades the distance-rate tradeoffs of the prior works, making them worse than ours for all distances.

We also give several useful supplementary results. By adapting an argument of [12], we prove lower bounds on the $\ell_{1}$ and $\ell_{2}$ minimum distances for a certain natural subclass of GRS codes. These imply that for many parameters of interest, our list-decoding algorithm outputs at most one codeword, i.e., it is actually a unique decoder. (See Lemmas 5.7 and 6.4 and the discussions thereafter.) And in addition to worst-case errors added by an adversarial channel, we also consider our algorithm’s performance under average-case errors produced by “memoryless additive” channels. Such channels add independent identically distributed error, drawn from some specified distribution, to each coordinate of the transmitted codeword. For Laplacian and Gaussian errors (which roughly correspond to $\ell_{1}$ and $\ell_{2}$ , respectively), we show that our algorithm supports even larger rates than what we would get by merely applying concentration bounds on the error vector and invoking our worst-case results.

Figure 1: Plots of the adjusted rate

R^{*,(p)}

, as a function of the

\ell_{p}

relative decoding distance

\delta=d/n^{1/p}

or corresponding channel error width

r=p^{1/p}\cdot c_{p}\cdot\delta

, for which our algorithm can list decode prime-field GRS codes in the worst case (wc) or average case (ac), respectively, for

p=2

(left) and

p=1

(right). (For simplicity, these plots assume a field size

q\gg\delta,r

.) For comparison, also shown are the corresponding functions from the prior work on decoding GRS codes in these metrics: [10] is for list decoding in

\ell_{2}

, and [12, 16] are respectively for unique and list decoding in the

\ell_{1}

(Lee) metric, but only for discrete (integer) error. Also shown are rate bounds

R^{(p)}_{\text{uniq}}

for which decoding to

\ell_{p}

relative distance

\delta

is guaranteed to yield a unique codeword, for a certain natural subclass of GRS codes. (See Lemmas 5.7 and 6.4 and the discussions thereafter.)

1.2 Technical Overview

At the highest level, our algorithm for list decoding prime-field GRS codes in $\ell_{p}$ follows the basic approach of [10] for list decoding (G)RS codes in $\ell_{2}$ : we first translate the received word into a suitable weight (or reliability) vector, then invoke a soft-decision list-decoding algorithm [7, 5, 8] for GRS codes. Informally, a weight vector specifies, for each coordinate of the received word and each symbol in the code alphabet, a “confidence level” that the transmitted codeword had that symbol at that coordinate. Given such a weight vector, a soft-decision decoding algorithm then finds all codewords that are sufficiently correlated with it, as determined by the code rate. (For the formal definitions and theorem statement, see Definitions 3.1, 3.2, and 3.3.)

For our purposes, the principal challenge is in mapping a received word to an appropriate weight vector so that any codeword that is close enough to the received word (in the $\ell_{p}$ metric) has sufficient correlation with the weight vector. The prior work [10] uses very simple weights: given a real received value $r$ , only its floor $\lfloor r\rfloor$ and ceiling $\lceil r\rceil$ receive positive weights, of $1-(r-\lfloor r\rfloor)$ and $1-(\lceil r\rceil-r)$ , respectively. It was shown that with these weights, the cited soft-decision algorithms decode up to any $\ell_{2}$ relative distance $\delta<1/\sqrt{2}$ for any code rate up to $1-2\delta^{2}$ . Moreover, for $\delta\leq 1/2$ it was shown that this rate is optimal for those algorithms, i.e., no other weight assignment can work for a larger rate. However, [10] did not consider larger decoding distances than these, nor other metrics.

In this work, to handle large decoding distances and other metrics, we use “smoother” weights, which typically assign a positive weight to every alphabet symbol. Our overall approach (see Section 3) is quite general, and is parameterized by a function $f\colon\mathbb{R}\to[0,1]$ satisfying mild hypotheses, primarily that its Fourier transform $\widehat{f}$ is non-negative (see ˜2.12). This function can be seen as defining a weight – or a relative “likelihood,” in the case of a random channel – for every potential real-valued error.²²2For example, we take $f$ to be a Gaussian function for decoding in the $\ell_{2}$ metric, or under a Gaussian channel. For a prime-order code alphabet $\mathbb{F}_{q}\cong\mathbb{Z}_{q}\vcentcolon=\mathbb{Z}/q\mathbb{Z}$ , and a received real value $r\in\mathbb{R}/q\mathbb{Z}$ , we assign weight $f(r-w+q\mathbb{Z})=\sum_{e=r-w\mkern 4.0mu({\operator@font mod}\mkern 6.0muq)}% f(e)$ to each alphabet symbol $w\in\mathbb{Z}_{q}$ . Since the elements of the coset $r-w+q\mathbb{Z}$ are exactly those errors that would convert a transmitted symbol $w$ to the received value $r$ , this assignment captures the total weight of all such errors.

Our first main result, given in Theorem 3.5, lower bounds the correlation between the weight vector for a received word $\mathbf{r}$ over $\mathbb{R}/q\mathbb{Z}$ and any (code)word $\mathbf{c}$ over $\mathbb{Z}_{q}$ , by the ratio of two quantities determined by $f$ : its (arithmetic or geometric) mean over the coordinates of the error vector $\mathbf{r}-\mathbf{c}$ , and (the square root of) its sum over a certain two-dimensional integer lattice $\mathcal{L}_{q}$ . So, for a particular decoding distance in a metric of interest (or channel distribution, in the average case), the goal becomes to choose a suitable function $f$ that nearly maximizes this ratio. The proof of the theorem uses a mild generalization of Fourier-analytic results on lattices from [1, 9], and is the source of our requirement that $\widehat{f}$ is non-negative, and ultimately the restriction that $0<p\leq 2$ for $\ell_{p}$ (semi)metrics.

The bulk of the remaining work is then devoted to making a suitable choice of function $f$ for the $\ell_{p}$ (semi)metric (and corresponding channel distributions), and analyzing its summation over $\mathcal{L}_{q}$ . In Section 4 we consider scalings of the function $f^{(p)}(x)\vcentcolon=\exp(-\lvert x\rvert^{p})$ , which is known to have non-negative Fourier coefficients for $0<p\leq 2$ (but not for any other $p$ ). Then in Sections 5 and 6 we specialize to $p=2$ and $p=1$ , respectively, and give fairly tight upper bounds on $f(\mathcal{L}_{q})$ using Fourier-analytic techniques or direct analysis. Finally, we use these bounds to optimize the distance-rate tradeoffs for which we can list decode GRS codes in these $\ell_{p}$ metrics, and for Gaussian and Laplacian random channels as well.

2 Preliminaries

For a positive integer $n$ , let $[n]\vcentcolon=\{1,\ldots,n\}$ . For a positive integer $q$ , define the quotient ring $\mathbb{Z}_{q}\vcentcolon=\mathbb{Z}/q\mathbb{Z}$ and the additive quotient group $\mathbb{R}_{q}\vcentcolon=\mathbb{R}/q\mathbb{Z}$ . For a prime power $q$ , let $\mathbb{F}_{q}$ denote the finite field of size $q$ . When $q$ is prime, we identify $\mathbb{Z}_{q}$ with the finite field $\mathbb{F}_{q}$ in the natural way.

For any $x\in\mathbb{R}_{q}$ (which is a coset of $q\mathbb{Z}$ ), define its “lift” $\overline{x}\in[-q/2,q/2)$ to be the unique real number such that $\overline{x}=x\mkern 4.0mu({\operator@font mod}\mkern 6.0muq)$ , i.e., the “zero-centered” distinguished representative of $x$ . We also apply this notation entry-wise to vectors over $\mathbb{R}_{q}$ .

For any $p>0$ , define the $\ell_{p}$ (quasi)norm on $\mathbb{R}^{n}$ as $\lVert\mathbf{x}\rVert_{p}\vcentcolon=\big(\sum_{i=1}^{n}\lvert x_{i}\rvert^{p% }\big)^{1/p}$ . It is well known that this is a norm if and only if $p\geq 1$ , and is a quasinorm for any $p>0$ .³³3A quasinorm relaxes the triangle inequality axiom to require only that $\lVert\mathbf{x}+\mathbf{y}\rVert\leq K(\lVert\mathbf{x}\rVert+\lVert\mathbf{y% }\rVert)$ for some fixed $K$ . We do not use the triangle inequality, or even this relaxation, so we can consider $p<1$ . Similarly, we define the $\ell_{p}$ (semi)metric on $\mathbb{R}_{q}^{n}$ by lifting, i.e., via $\lVert\mathbf{x}\rVert_{p}\vcentcolon=\lVert\overline{\mathbf{x}}\rVert_{p}$ .⁴⁴4Formally, this is not a norm because it is not defined on a vector space (since $\mathbb{R}_{q}$ is not a field), and it does not satisfy homogeneity due to the mod- $q$ reduction. However, it does define a (semi)metric (where “semi” does not require the triangle inequality), with distance function $d(\mathbf{x},\mathbf{y})=\lVert\mathbf{x}-\mathbf{y}\rVert_{p}$ . For $p=1$ , this generalizes the Lee metric over $\mathbb{Z}_{q}$ to $\mathbb{R}_{q}$ .

For two groups $X, Y$ , their direct sum group $X\oplus Y$ is their Cartesian product with the group operation defined component-wise. This notation extends to the direct sum of group cosets, which is a coset of the direct sum of the groups.

For any two vectors $\mathbf{x}=(x_{1},\ldots,x_{n})$ and $\mathbf{y}=(y_{1},\ldots,y_{n})$ of the same dimension, their coordinate-wise (or Hadamard) product is denoted by $\mathbf{x}\odot\mathbf{y}\vcentcolon=(x_{1}\cdot y_{1},\ldots,x_{n}\cdot y_{n})$ .

For a finite sequence $X_{1},\ldots,X_{n}$ of real values, we denote their average by $\operatorname*{Avg}_{i}[X_{i}]\vcentcolon=\tfrac{1}{n}\sum_{i=1}^{n}X_{i}$ . We use the following special case of the well known Hoeffding (lower-)tail bound.

Lemma 2.1 (Hoeffding’s Inequality).

Let $X_{1},\ldots,X_{n}$ be independent identically distributed random variables in $[0,1]$ with common expectation $\mu=\operatorname*{\mathbb{E}}[X_{i}]$ . Then for any $\gamma\geq 0$ ,

\Pr\Big[\operatorname*{Avg}_{i}[X_{i}]\leq\mu-\gamma\Big]<\exp(-2\gamma^{2}n)% \;\text{.}

Operations on functions

For any function $f\colon D\to\mathbb{C}$ and countable subset $X\subseteq D$ , we define $f(X)\vcentcolon=\sum_{\mathbf{x}\in X}f(\mathbf{x})$ . We extend the domain to $D^{k}$ multiplicatively, as

f^{k}(\mathbf{x})\vcentcolon=\prod_{i=1}^{k}f(x_{i})\;\text{,}

(2.1)

often omitting the superscript $k$ when it is clear from context. When $D=\mathbb{R}^{n}$ , for any real $s\neq 0$ we define $f_{s}(\mathbf{x})\vcentcolon=f(\mathbf{x}/s)$ .

2.1 Linear Codes

A linear (error-correcting) code of (block) length $n$ over the alphabet $\mathbb{F}_{q}$ is a linear subspace of $\mathbb{F}_{q}^{n}$ . As a subspace, it has a dimension. In this paper, we consider the following family of codes.

Definition 2.2 ((Generalized) Reed–Solomon code).

Let $n\leq q$ be positive integers, with $q$ a prime power. For a non-negative integer $k$ , a vector $\boldsymbol{\alpha}\in\mathbb{F}_{q}^{n}$ with distinct entries, and a vector $\mathbf{t}\in(\mathbb{F}_{q}\setminus\{0\})^{n}$ with (not necessarily distinct) non-zero entries, the Generalized Reed–Solomon (GRS) code of dimension $k$ with evaluation points $\boldsymbol{\alpha}$ and twist factors $\mathbf{t}$ is defined as

\text{GRS}_{q,k}(\boldsymbol{\alpha},\mathbf{t})\vcentcolon=\{\mathbf{t}\odot f% (\boldsymbol{\alpha})=(t_{1}\cdot f(\alpha_{1}),\ldots,t_{n}\cdot f(\alpha_{n}% ))\ \colon f\in\mathbb{F}_{q}[x],\ \deg(f)<k\}\;\text{.}

A special case is a Reed–Solomon (RS) code, which is obtained by using trivial twist factors $\mathbf{t}=(1,\ldots,1)$ .

2.2 Lattices

Definition 2.3 (Lattice, Basis).

An ( $n$ -dimensional, full-rank) lattice $\mathcal{L}\subset\mathbb{R}^{n}$ is the set of all integer linear combinations of some $n$ linearly independent basis vectors $\mathbf{B}=\{\mathbf{b}_{1},\ldots,\mathbf{b}_{n}\}\subset$ $\mathbb{R}^{n}$ :

\mathcal{L}=\mathcal{L}(\mathbf{B})\vcentcolon=\Big\{\sum_{i=1}^{n}z_{i}% \mathbf{b}_{i}:z_{i}\in\mathbb{Z}\Big\}\;\text{.}

Equivalently, it is a discrete additive subgroup of $\mathbb{R}^{n}$ whose $\mathbb{R}$ -span is $\mathbb{R}^{n}$ ; as such, it defines the quotient group $\mathbb{R}^{n}/\mathcal{L}$ of lattice cosets $\mathbf{x}+\mathcal{L}$ for $\mathbf{x}\in\mathbb{R}^{n}$ . A sublattice of $\mathbb{Z}^{n}$ is called an integer lattice.

In this work, all lattices are implicitly full rank. A lattice basis can equivalently be seen as an invertible matrix $\mathbf{B}\in\mathbb{R}^{n\times n}$ whose columns are the vectors $\mathbf{b}_{1},\ldots,\mathbf{b}_{n}$ . Note that a given lattice has multiple different bases, which are all related by right-multiplication by unimodular matrices in $\mathbb{Z}^{n\times n}$ .

Definition 2.4 (Determinant).

The determinant of a lattice $\mathcal{L}$ generated by basis $\mathbf{B}$ is $\det(\mathcal{L})\vcentcolon=\lvert\det(\mathbf{B})\rvert$ .

Note that the determinant of a lattice is invariant under the choice of basis, by the above-mentioned relationship between the bases of a lattice.

Definition 2.5 (Dual lattice).

The dual lattice of a lattice $\mathcal{L}\subset\mathbb{R}^{n}$ is

\mathcal{L}^{*}\vcentcolon=\{\mathbf{x}\in\mathbb{R}^{n}:\forall\ \mathbf{v}% \in\mathcal{L},\,\langle\mathbf{v},\mathbf{x}\rangle\in\mathbb{Z}\}\;\text{.}

If $\mathbf{B}$ is a basis of $\mathcal{L}$ , then its dual basis $\mathbf{B}^{*}\vcentcolon=\mathbf{B}^{-t}$ is a basis of $\mathcal{L}^{*}$ , and hence $\det(\mathcal{L}^{*})=\det(\mathcal{L})^{-1}$ .

Lemma 2.6.

Let $f\colon D\to\mathbb{R}$ and $X,Y\subseteq D$ be countable subsets of its domain (e.g., lattice cosets). Then $f(X\oplus Y)=f(X)\cdot f(Y)$ .

Proof.

This follows directly from the definition of direct sum and Equation 2.1. $\hfill\blacktriangleleft$

2.3 Fourier Analysis

Let $f\colon\mathbb{R}^{n}\to\mathbb{C}$ be a (Borel) measurable function that satisfies $\int_{\mathbb{R}^{n}}\lvert f(\mathbf{x})\rvert\mathrm{d}\mathbf{x}<\infty$ . Its Fourier transform $\widehat{f}\colon\mathbb{R}^{n}\to\mathbb{C}$ is defined as

\widehat{f}(\mathbf{w})\vcentcolon=\int_{\mathbb{R}^{n}}f(\mathbf{x})\cdot\exp% (-2\pi i\langle\mathbf{x},\mathbf{w}\rangle)\mathrm{d}\mathbf{x}\;\text{.}

It satisfies the following standard properties, which follow by routine calculations.

Lemma 2.7 (Multiplicativity).

For any function $f$ as above, $\widehat{f^{k}}=\widehat{f}\,^{k}$ (where the exponent notation is as defined in Equation 2.1).

Lemma 2.8 (Time-scaling property).

For any function $f$ as above and real $s\neq 0$ , $\widehat{f_{s}}(\mathbf{w})=s^{n}\cdot\widehat{f}_{1/s}(\mathbf{w})$ .

Lemma 2.9 (Time-shift property).

For any function $f$ as above and $\mathbf{c}\in\mathbb{R}^{n}$ , let $g(\mathbf{x})=f(\mathbf{x}-\mathbf{c})$ . Then $\widehat{g}(\mathbf{w})=\widehat{f}(\mathbf{w})\cdot\exp(-2\pi i\langle\mathbf% {w},\mathbf{c}\rangle)$ .

We say that $f$ is nice if it satisfies conditions that are sufficient for the following formula to hold, e.g., those given in [13, pages 106–107]. All of the specific functions $f$ we use in this work are easily seen to be nice.

Lemma 2.10 (Poisson Summation Formula (PSF)).

For any lattice $\mathcal{L}$ and nice function $f$ ,

f(\mathcal{L})=\det(\mathcal{L}^{*})\cdot\widehat{f}(\mathcal{L}^{*})\;\text{.}

We will use a more general version of the PSF for lattice cosets.

Lemma 2.11 (Generalized PSF).

For any lattice $\mathcal{L}\subset\mathbb{R}^{n}$ , nice function $f$ , and $\mathbf{y}\in\mathbb{R}^{n}$ ,

f(\mathbf{y}+\mathcal{L})=\det(\mathcal{L}^{*})\cdot\sum_{\mathbf{w}\in% \mathcal{L}^{*}}\widehat{f}(\mathbf{w})\cdot\exp(2\pi i\langle\mathbf{w},% \mathbf{y}\rangle)\;\text{.}

2.4 Lattice Roughness

Continuing from Section 2.3, for the rest of this work we require the following properties of $f$ .

Assumption 2.12.

The function $f$ has range $[0,1]$ and is nice, and $\widehat{f}$ is non-negative real with $\widehat{f}(0)>0$ .

Because $f$ is real, its Fourier transform is conjugate symmetric, i.e., $\widehat{f}(-w)=\widehat{f}(w)^{*}$ for all $w$ , where the star denotes complex conjugation. Since $\widehat{f}$ is also real, this implies that it is symmetric, i.e., $\widehat{f}(-w)=\widehat{f}(w)$ . Finally, note that if $f$ satisfies this assumption, then so does its multiplicative extension $f^{k}$ .

We now define an important Fourier-analytic quantity that plays an important role in our analysis. We adopt the name “roughness” because it is the functional inverse of the “smoothing parameter” from [9], which is the smallest $s$ that makes the function $f_{s}(\mathbf{y}+\mathcal{L})$ sufficiently “smooth” as a function of $\mathbf{y}$ .

Definition 2.13.

For a function $f$ , lattice $\mathcal{L}\subset\mathbb{R}^{n}$ , and real $s>0$ , the roughness is defined as

\varepsilon_{\mathcal{L},s}\vcentcolon=\frac{\widehat{f_{s}}(\mathcal{L}^{*}% \setminus\{\mathbf{0}\})}{\widehat{f_{s}}(\mathbf{0})}=\frac{\widehat{f_{s}}(% \mathcal{L}^{*})}{\widehat{f_{s}}(\mathbf{0})}-1\geq 0.

More generally, for a (linear) subspace $H$ of $\mathbb{R}^{n}$ , the $H$ -roughness is defined as

\varepsilon_{\mathcal{L},s}(H)\vcentcolon=\frac{\widehat{f_{s}}(\mathcal{L}^{*% }\setminus H^{\perp})}{\widehat{f_{s}}(\mathcal{L}^{*}\cap H^{\perp})}=\frac{% \widehat{f_{s}}(\mathcal{L}^{*})}{\widehat{f_{s}}(\mathcal{L}^{*}\cap H^{\perp% })}-1\leq\varepsilon_{\mathcal{L},s}(\mathbb{R}^{n})=\varepsilon_{\mathcal{L},% s}.

(Both inequalities follow from the non-negativity of $\widehat{f_{s}}$ .)

Lemma 2.14 (adapted from [9, Lemmas 2.9 and 4.1]).

For any $f$ satisfying ˜2.12, lattice $\mathcal{L}\subset\mathbb{R}^{n}$ , real $s>0$ , subspace $H$ of $\mathbb{R}^{n}$ defining roughness $\varepsilon\vcentcolon=\varepsilon_{\mathcal{L},s}(H)$ , and $\mathbf{y}\in H$ ,

f_{s}(\mathbf{y}+\mathcal{L})\in\det(\mathcal{L}^{*})\cdot\widehat{f_{s}}(% \mathcal{L}^{*}\cap H^{\perp})\cdot[1-\varepsilon,1+\varepsilon]\;\text{,}

with equality against the upper bound when $\mathbf{y}=\mathbf{0}$ . In particular, $f_{s}(\mathbf{y}+\mathcal{L})\in f_{s}(\mathcal{L})\cdot[\tfrac{1-\varepsilon}% {1+\varepsilon},1]$ .

Due to space constraints, the proof is left to the full version.

3 List-Decoding Reed–Solomon Codes

3.1 Soft-Decision Decoding

To list-decode Reed–Solomon codes under various norms and probabilistic channel models, we use the “weighted,” or soft-decision, list decoder of Guruswami and Sudan (hereafter GS) [7], as elaborated upon in Guruswami’s thesis [5, Section 6.2.10] and the work of Koetter and Vardy [8]. A soft-decision decoder takes a “weight vector” as input, and outputs a set of codewords.

Definition 3.1.

A weight vector for a length- $n$ code over $\mathbb{F}_{q}$ is some $W\vcentcolon=(W_{1},\ldots,W_{n})\in[0,1]^{qn}$ where each block $W_{i}\in[0,1]^{q}$ is indexed by $\mathbb{F}_{q}$ ; equivalently, each block is a function $W_{i}\colon\mathbb{F}_{q}\to[0,1]$ .

Conceptually, each block $W_{i}$ of a weight vector may be thought of as specifying a (posterior) probability distribution $\Pi_{i}$ over $\mathbb{F}_{q}$ , where $\Pi_{i}(x)$ is proportional to the probability that the $i$ th transmitted symbol was $x\in\mathbb{F}_{q}$ , given what was received from the channel (which need not be an element of $\mathbb{F}_{q}$ ). At a formal level, this interpretation makes sense only when the channel is probabilistic (for average-case decoding), but it still serves as useful intuition when the channel is adversarial (for worst-case decoding). We consider both types of channels in our results below.

For $c\in\mathbb{F}_{q}$ , define $[c]\in[0,1]^{q}$ to be the binary indicator vector indexed by $\mathbb{F}_{q}$ that has a $1$ in coordinate $c$ and $0$ s elsewhere. Similarly, for any vector $\mathbf{c}=(c_{1},\ldots,c_{n})\in\mathbb{F}_{q}^{n}$ , define the weight vector $[\mathbf{c}]\vcentcolon=([c_{1}],\ldots,[c_{n}])\in[0,1]^{qn}$ . Observe that its Euclidean norm is $\lVert[\mathbf{c}]\rVert=\sqrt{n}$ .

Definition 3.2.

The correlation between a weight vector $W\in[0,1]^{qn}$ and a word $\mathbf{c}\in\mathbb{F}_{q}^{n}$ is defined as their length-normalized inner product (or the cosine of the angle between them):

\operatorname{corr}(W,\mathbf{c})\vcentcolon=\frac{\langle W,[\mathbf{c}]% \rangle}{\lVert W\rVert\cdot\sqrt{n}}\;\text{.}

Theorem 3.3 (adapted from [7, Theorem 18] and [5, Theorem 6.21]).

For a prime power $q$ , let $\mathcal{C}\subseteq\mathbb{F}_{q}^{n}$ be a Generalized Reed–Solomon code of dimension $k$ and adjusted rate $R^{*}\vcentcolon=(k-1)/n$ . There is a deterministic algorithm that, given a weight vector $W$ and a “tolerance” $\tau>0$ , outputs in time $\operatorname{poly}(n,q,1/(\tau\lVert W\rVert))$ the set of all codewords $\mathbf{c}\in\mathcal{C}$ that satisfy

\operatorname{corr}(W,\mathbf{c})\geq\sqrt{R^{*}}+\tau\;\text{.}

(3.1)

We remark that the above theorem is originally stated for rational weights, but the supporting argument (from [5, Lemma 6.20]) easily adapts to handle real-valued weights that can be lower bounded to any needed precision in polynomial time, as all of ours can be.

3.2 From Received Words to Weight Vectors

Here we describe a general approach for translating a received word to a weight vector. This translation is parameterized by a function that, conceptually, can be viewed as (proportional to) the channel’s probability density function, even if the channel is not actually probabilistic.

Let $f\colon\mathbb{R}\to[0,1]$ be a function that satisfies ˜2.12, extended multiplicatively to $\mathbb{R}^{n}$ as in Equation 2.1, and recall that $f_{s}(x)\vcentcolon=f(x/s)$ for any constant $s>0$ . Next let $q$ be a positive integer, and recall that we identify $\mathbb{Z}_{q}\vcentcolon=\mathbb{Z}/q\mathbb{Z}$ with $\mathbb{F}_{q}$ in the natural way when $q$ is prime. Let the set of possible received values be $\mathbb{R}_{q}=\mathbb{R}/q\mathbb{Z}$ , and for any such value $y\in\mathbb{R}_{q}$ , define the weight function $W_{s,y}\colon\mathbb{Z}_{q}\to[0,1]$ by

W_{s,y}(x)\vcentcolon=f_{s}(y-x+q\mathbb{Z})\;\text{.}

Notice that here $f_{s}$ is applied to a coset of $q\mathbb{Z}$ , which represents an infinite series; for all our concrete choices, these series converge and so the function $W_{s,y}$ is well defined. This function can also be seen as the vector $W_{s,y}\vcentcolon=(W_{s,y}(x))_{x\in\mathbb{Z}_{q}}\in[0,1]^{q}$ , indexed by $\mathbb{Z}_{q}$ .

In line with the probabilistic conception of weight vectors from Section 3.1 above, the function $W_{s,y}$ can be seen as follows. Suppose that a uniformly random symbol in $\mathbb{Z}_{q}$ is sent over a channel, which adds (modulo $q$ ) noise drawn from a distribution over $\mathbb{R}$ whose probability density function is proportional to $f_{s}$ . Then the probability that the sent symbol was $x\in\mathbb{Z}_{q}$ , conditioned on receiving $y$ , is proportional to $W_{s,y}(x)$ . This is because the coset $y-x\in\mathbb{R}_{q}$ is the set of all noise values that yield $y$ if $x$ is sent. Note that in the definition of $W_{s,y}$ we do not normalize by the total weight $W_{s,y}(\mathbb{Z}_{q})=f_{s}(y+\mathbb{Z})$ (which may vary based on the received value $y$ ); this turns out to yield simpler analyses and tighter results.

Definition 3.4.

For a function $f_{s}$ as above and any received vector $\mathbf{y}=(y_{1},\ldots,y_{n})\in\mathbb{R}_{q}^{n}$ , define the corresponding weight vector as

W_{s,\mathbf{y}}\vcentcolon=(W_{s,y_{1}},\ldots,W_{s,y_{n}})\in[0,1]^{nq}\;% \text{.}

In order to use the soft-decision algorithm (Theorem 3.3) for decoding under an adversarial channel, it suffices to show that we can choose a suitable $s$ so that for any received word $\mathbf{y}$ and any sufficiently close codeword $\mathbf{c}$ (in the norm of interest), the correlation $\operatorname{corr}(W_{s,\mathbf{y}},\mathbf{c})$ satisfies (3.1). Similarly, for decoding under a probabilistic channel, it suffices to show that with high probability over the channel noise $\mathbf{e}$ , the transmitted codeword $\mathbf{c}$ has large enough correlation with the weight vector $W_{s,\mathbf{y}}$ of the received word $\mathbf{y}=\mathbf{c}+\mathbf{e}$ (again, for some suitably chosen $s$ ). To this end, in what follows we give a lower bound on $\langle W_{s,\mathbf{y}},[\mathbf{c}]\rangle$ and an upper bound on $\lVert W_{s,\mathbf{y}}\rVert$ , in terms of $f_{s}$ and the difference $\mathbf{y}-\mathbf{c}$ between the received word and the codeword of interest.

3.3 Main Theorem

Here we state and prove the main result of this section. For this we define the two-dimensional integer lattice $\mathcal{L}_{q}$ that consists of all shifts of the lattice $q\mathbb{Z}^{2}$ by $(z,z)$ for an integer $z$ , i.e.,

\mathcal{L}_{q}\vcentcolon=\bigcup_{x\in\mathbb{Z}_{q}}(x\oplus x)=\bigcup_{z% \in\mathbb{Z}}((z,z)+q\mathbb{Z}^{2})\supset q\mathbb{Z}^{2}.

We have that $\det(\mathcal{L}_{q})=q$ , and so $\det(\mathcal{L}_{q}^{*})=1/q$ . We sometimes omit the $q$ subscript when it is clear from context or its value is unimportant.

Theorem 3.5.

Suppose that $f$ satisfies ˜2.12. For any $s>0$ and $\mathbf{y}\in\mathbb{R}_{q}^{n}$ defining $W=W_{s,\mathbf{y}}$ , and any $\mathbf{c}\in\mathbb{Z}_{q}^{n}$ ,

\operatorname{corr}(W,\mathbf{c})\geq\frac{\operatorname*{Avg}_{i\in[n]}[f_{s}% (y_{i}-c_{i})]}{\sqrt{f_{s}(\mathcal{L}_{q})}}\geq\frac{f_{s}(\mathbf{y}-% \mathbf{c})^{1/n}}{\sqrt{f_{s}(\mathcal{L}_{q})}}\;\text{.}

Proof.

This follows immediately from the following lower and upper bounds on the numerator and denominator of $\operatorname{corr}(W,\mathbf{c})=\frac{\langle W,[\mathbf{c}]\rangle/n}{% \lVert W\rVert/\sqrt{n}}$ . For the numerator, by the definitions of $W$ and $[\mathbf{c}]$ ,

\langle W,[\mathbf{c}]\rangle/n=\operatorname*{Avg}_{i\in[n]}\left[W_{s,y_{i}}% (c_{i})\right]=\operatorname*{Avg}_{i\in[n]}\left[f_{s}(y_{i}-c_{i})\right]% \geq f_{s}(\mathbf{y}-\mathbf{c})^{1/n}\;\text{,}

where the last step follows by the inequality of arithmetic and geometric means, and the non-negativity and multiplicativity of $f_{s}$ over direct sums of cosets (Lemma 2.6). For the denominator, the upper bound $\lVert W\rVert/\sqrt{n}\leq\sqrt{f_{s}(\mathcal{L}_{q})}$ is proved in Lemma 3.6 below. $\hfill\blacktriangleleft$

Lemma 3.6.

Adopting the setup from Theorem 3.5, and letting $\tilde{\varepsilon}=\varepsilon_{\mathcal{L}_{q},s}(H)$ where $H=\operatorname{span}(1,1)$ ,

\lVert W\rVert^{2}/n\in f_{s}(\mathcal{L}_{q})\cdot\Big[\frac{1-\tilde{% \varepsilon}}{1+\tilde{\varepsilon}},1\Big]\;\text{.}

Proof.

By definition of $W$ ,

\lVert W\rVert^{2}/n=\operatorname*{Avg}_{i\in[n]}\Big[\sum_{x\in\mathbb{Z}_{q% }}f_{s}(y_{i}-x)^{2}\Big]\;\text{.}

To bound this, let $y\in\mathbb{R}_{q}$ be arbitrary. By Lemma 2.6,

	$\displaystyle\sum_{x\in\mathbb{Z}_{q}}f_{s}(y-x)^{2}$	$\displaystyle=\sum_{x\in\mathbb{Z}_{q}}f_{s}((y-x)\oplus(y-x))$
		$\displaystyle=\sum_{x\in\mathbb{Z}_{q}}f_{s}((y\oplus y)-(x\oplus x))$
		$\displaystyle=f_{s}((\overline{y},\overline{y})+\mathcal{L}_{q})$
		$\displaystyle\in f_{s}(\mathcal{L}_{q})\cdot\Big[\frac{1-\tilde{\varepsilon}}{% 1+\tilde{\varepsilon}},1\Big]\;\text{,}$

where the last step follows by the latter part of Lemma 2.14 on the lattice $\mathcal{L}_{q}$ with subspace $H$ , and noting that $(\overline{y},\overline{y})\in H$ . The claim follows by averaging over $i\in[n]$ . $\hfill\blacktriangleleft$

3.4 Average-Case Decoding

Here we consider list-decoding in the average case, where the channel is probabilistic (not worst case) and the goal is to output a list of codewords that includes the transmitted one. We consider channels that add independent, identically distributed random error (drawn from some specified distribution) to each coordinate of the transmitted codeword; this is often known as a memoryless additive channel. Specifically, we assume that the channel’s error distribution (for each coordinate) is proportional to $f_{r}$ for some $r>0$ , i.e., it has probability density function

D_{r}(x)\vcentcolon=\frac{f_{r}(x)}{\widehat{f_{r}}(0)}\;\text{.}

For example, if $f_{r}$ is a Gaussian function, this is known as the additive white Gaussian noise (AWGN) channel model. In some settings one may also consider a discrete channel distribution, e.g., over $\mathbb{Z}$ , in which case its probability mass function is $D_{r}(x)\vcentcolon=f_{r}(x)/f_{r}(\mathbb{Z})$ . For any $s>0$ (which may differ from $r$ ), define

\mu_{r,s}\vcentcolon=\operatorname*{\mathbb{E}}_{e\leftarrow D_{r}}[f_{s}(e)]% \;\text{.}

In Section 4 we will use the following bound for a specific family of functions $f$ to show that the transmitted codeword is recovered with high probability over the channel error.

Lemma 3.7.

For any $r,s>0$ and $T$ defining $\gamma\vcentcolon=\mu_{r,s}-T\cdot\sqrt{f_{s}(\mathcal{L}_{q})}\geq 0$ , and any $\mathbf{c}\in\mathbb{Z}_{q}^{n}$ ,

\Pr_{\mathbf{e}\leftarrow D_{r}^{n}}\big[\operatorname{corr}(W_{s,\mathbf{c}+% \mathbf{e}},\mathbf{c})\leq T\big]<\exp(-2\gamma^{2}n)\;\text{.}

This follows from ˜2.12, 3.5, and 2.1. Due to space constraints, the details of the proof are left to the full version.

4 General $\ell_{p}$ (Semi)Metrics

In this section we define weight vectors via Definition 3.4 using the function $f\colon\mathbb{R}\to[0,1]$ defined as

	$\displaystyle f(x)=f^{(p)}(x)$	$\displaystyle\vcentcolon=\exp\big(-(c_{p}\lvert x\rvert)^{p}\big)$		(4.1)
	$\displaystyle\text{where }c_{p}$	$\displaystyle\vcentcolon=2\cdot\Gamma(1+1/p)\;\text{,}$

where the gamma function $\Gamma(z)=\int_{0}^{\infty}u^{z-1}\exp(-u)\mathrm{d}u$ for $z>0$ , and satisfies $\Gamma(1)=1$ and $\Gamma(1+z)=z\cdot\Gamma(z)$ . As two important examples, $c_{1}=2$ and $c_{2}=\sqrt{\pi}$ .

Note that by multiplicativity (Equation 2.1),

f(\mathbf{x})=\prod_{i=1}^{n}f(x_{i})=\exp\Big(-\sum_{i=1}^{n}(c_{p}\lvert x_{% i}\rvert)^{p}\Big)=\exp\big(-(c_{p}\lVert\mathbf{x}\rVert_{p})^{p}\big)=f(% \lVert\mathbf{x}\rVert_{p})\;\text{.}

Regarding the Fourier transform of $f$ , the “normalizing constant” $c_{p}$ has been defined to make $\widehat{f}(0)=1$ .

It is also known that $\widehat{f}$ is non-negative for $0<p\leq 2$ ; this follows immediately from an elegant lemma and proof due to Logan, given in [4, Lemma 5].⁶⁶6For $p>2$ , by contrast, $\widehat{f}$ can have negative values, which prevents our framework from supporting $\ell_{p}$ metrics for such $p$ . So, $f$ satisfies ˜2.12 for such $p$ . Another immediate consequence of Logan’s lemma is that as $s$ grows, $\widehat{f_{s}}(w)/s$ strictly decreases and approaches zero for every $w\neq 0$ .

We will need the following simple lemma.

Lemma 4.1.

For any $s>0$ and $\mathbf{y}\in\mathbb{R}_{q}^{n}$ defining $W=W_{s,\mathbf{y}}$ , we have that $\lVert W\rVert_{2}\geq\sqrt{n}/\exp(c_{p}^{p}/(2s)^{p})$ .

Due to space constraints, the proof is left to the full version.

4.1 Worst-Case Decoding

We now address list-decoding in the $\ell_{p}$ (semi)metric for $0<p\leq 2$ , under worst-case error. Consider decoding distance $d=\delta\cdot n^{1/p}$ , where $n$ is the code length, and $\delta$ can be seen as the relative decoding distance (relative to $n^{1/p}$ , which is the most natural normalization factor for $\ell_{p}$ ). For $s>0$ , relative distance $\delta\geq 0$ , and positive integer modulus $q$ , define

W^{(p)}_{q,\delta}(s)\vcentcolon=\frac{f_{s}(\delta)}{\sqrt{f_{s}(\mathcal{L}_% {q})}}=\frac{\exp\big(-(c_{p}\cdot\delta/s)^{p}\big)}{\sqrt{f_{s}(\mathcal{L}_% {q})}}\geq 0\;\text{.}

(4.2)

By Theorems 3.5 and 3.3, to decode a GRS code of adjusted rate $R^{*}$ over a prime field $\mathbb{F}_{q}$ to within $\ell_{p}$ distance $\delta\cdot n^{1/p}$ using the GS algorithm, it suffices to set $s>0$ so that $W^{(p)}_{q,\delta}(s)>\sqrt{R^{*}}$ . In other words, we can decode under relative distance $\delta$ for any $R^{*}$ less than

R^{*,(p)}_{\text{wc},q}(\delta)\vcentcolon=\sup_{s>0}W^{(p)}_{q,\delta}(s)^{2}% \;\text{.}

(4.3)

The following makes this formal.

Theorem 4.2.

For any $0<p\leq 2$ , $\delta\geq 0$ , and prime $q$ , the GS soft-decision algorithm using weight vector given by $f^{(p)}_{s}$ for any $s>0$ list-decodes, up to $\ell_{p}$ distance $d=\delta\cdot n^{1/p}$ , any GRS code $\mathcal{C}\subseteq\mathbb{F}_{q}^{n}$ with adjusted rate $R^{*}<W^{(p)}_{q,\delta}(s)^{2}$ , in time polynomial in $n$ , $q$ , and $\exp(1/s^{p})/(W^{(p)}_{q,\delta}(s)-\sqrt{R^{*}})$ .⁷⁷7We remark that in many cases, the bound on the polynomial running time can be improved using a better lower bound for $\lVert W\rVert_{2}$ , such as the one given by Lemma 3.6.

Proof.

We invoke the GS algorithm on the weight vector $W=W_{s,\mathbf{y}}$ given by the choice of $s$ and the received word $\mathbf{y}$ , and tolerance $\tau=W^{(p)}_{q,\delta}(s)-\sqrt{R^{*}}>0$ .⁸⁸8To be more precise, we can invoke GS on any approximation of $\tau$ in $[\tau/2,\tau]$ , say. This can be computed by approximating $f_{s}(\mathcal{L}_{q})$ from above to the needed precision, by enumerating sufficiently many points of $\mathcal{L}_{q}$ near the origin, and upper-bounding the contribution of the remaining points in the “tails” using, e.g., Lemma 5.3. The running time is polynomial in $n$ , $q$ , and $1/(\tau\lVert W\rVert_{2})\leq\exp(c_{p}^{p}/(2s)^{p})/(\tau\sqrt{n})$ , by Lemma 4.1.

Now let $\mathbf{c}\in\mathcal{C}$ be a codeword within distance $d$ of $\mathbf{y}$ , i.e., $\lVert\overline{\mathbf{y}-\mathbf{c}}\rVert_{p}\leq d$ . By Theorems 3.5, 2.12, and 4.2,

\operatorname{corr}(W,\mathbf{c})\geq\frac{f_{s}(\mathbf{y}-\mathbf{c})^{1/n}}% {\sqrt{f_{s}(\mathcal{L}_{q})}}\geq\frac{f_{s}(d)^{1/n}}{\sqrt{f_{s}(\mathcal{% L}_{q})}}=W^{(p)}_{q,\delta}(s)=\sqrt{R^{*}}+\tau\;\text{.}

So, by Theorem 3.3, the output of the GS algorithm includes $\mathbf{c}$ , as needed. $\hfill\blacktriangleleft$

$\blacktriangleright$ Remark 4.3.

Interestingly, as $\delta$ , $q/\delta$ , and $n$ grow (and the other parameters remain fixed), the product of the relative distance $\delta$ and the adjusted rate $R^{*}$ for which we can decode approaches the relative radius of a unit-volume $\ell_{p}$ ball. Due to space constraints, we defer this derivation to the full version.

4.2 Average-Case Decoding

We now consider average-case decoding under a memoryless additive (continuous or discrete) channel whose density function is proportional to a scaling of $f=f^{(p)}$ . Specifically, we consider the continuous distribution with probability density function $D_{r}(x)\vcentcolon=f_{r}(x)/r$ , and the discrete distribution over $\mathbb{Z}$ with probability mass function $D_{r}(x)\vcentcolon=f_{r}(x)/f_{r}(\mathbb{Z})$ . Following Section 3.4, for any $r,s>0$ define

\mu^{(p)}_{r,s}\vcentcolon=\mu_{r,s}=\operatorname*{\mathbb{E}}_{e\leftarrow D% _{r}}[f_{s}(e)]\;\text{.}

For these channel distributions we derive suitable bounds on $\mu^{(p)}_{r,s}$ , then reach the conclusion via Lemmas 3.7 and 3.3.

Lemma 4.4.

For any $0<p\leq 2$ , any $r>0$ defining a continuous or discrete distribution $D_{r}$ , and $s>0$ ,

\mu^{(p)}_{r,s}\geq\frac{s}{\lVert(r,s)\rVert_{p}}\;\text{,}

with equality in the continuous case and strict inequality in the discrete case.

Due to space constraints, the proof is left to the full version.

Now, for any channel parameter $r>0$ and for $s>0$ , define

A^{(p)}_{q,r}(s)\vcentcolon=\frac{\mu^{(p)}_{r,s}}{\sqrt{f_{s}(\mathcal{L}_{q}% )}}\geq\frac{s}{\lVert(r,s)\rVert_{p}\cdot\sqrt{f_{s}(\mathcal{L}_{q})}}\;% \text{,}

(4.4)

where the inequality is by Lemma 4.4. By Theorems 3.5 and 3.3, to decode (with high probability) a GRS code of adjusted rate $R^{*}$ over a prime field $\mathbb{F}_{q}$ under a channel with parameter $r$ , it suffices to set $s>0$ so that $A^{(p)}_{q,r}(s)>\sqrt{R^{*}}$ . In other words, we can decode under channel parameter $r$ for any $R^{*}$ less than

R^{*,(p)}_{\text{ac},q}(r)\vcentcolon=\sup_{s>0}A^{(p)}_{q,r}(s)^{2}\;\text{.}

(4.5)

The following makes this formal.

Theorem 4.5.

Let $0<p\leq 2$ , $r>0$ , $\alpha\in(0,1)$ , and $q$ be prime. Under a memoryless additive (continuous or discrete) channel with distribution $D_{r}$ , the GS soft-decision algorithm, using weight vector given by $f^{(p)}_{s}$ for any $s>0$ , list-decodes any GRS code $\mathcal{C}\subseteq\mathbb{F}_{q}^{n}$ with adjusted rate $R^{*}<A^{(p)}_{q,r}(s)^{2}$ , in time polynomial in $n$ , $q$ , and $\exp(1/s^{p})/(A^{(p)}_{q,r}(s)-\sqrt{R^{*}})$ , except with probability less than

\exp\big(-2n\cdot f_{s}(\mathcal{L}_{q})\cdot\alpha^{2}\cdot\big(A^{(p)}_{q,r}% (s)-\sqrt{R^{*}}\big)^{2}\big)\;\text{.}

The proof is similar to that of Theorem 4.2, but using Lemma 3.7 to bound the probability that $\operatorname{corr}(W,\mathbf{c})<\sqrt{R^{*}}+\tau$ . Due to space constraints, the details are left to the full version.

$\blacktriangleright$ Remark 4.6.

Theorem 4.5 outperforms Theorem 4.2 (for worst-case decoding) by a factor that approaches $(e/2)^{1/p}$ in the adjusted rate $R^{*}$ it can handle, as $r$ and $q/r$ grow. Specifically, consider a channel with parameter $r$ . A calculation reveals that its relative error (in $\ell_{p}$ , relative to $n^{1/p}$ ) is tightly concentrated around $\delta=r/(p^{1/p}\cdot c_{p})$ , so following the analysis in Remark 4.3, Theorem 4.2 applies for $R^{*}$ that approaches $1/(r\cdot e^{1/p})$ . By comparison, Theorem 4.5 applies for $R^{*}$ that approaches $1/(r\cdot 2^{1/p})$ .

5 The $\ell_{2}$ Metric and Gaussian Error

In the remainder of the paper we instantiate our general list-decoding results for $\ell_{p}$ (semi)metrics (Theorems 4.2 and 4.5) to specific metrics of interest and memoryless additive channels. In this section, we consider the $\ell_{2}$ metric and Gaussian channels.

We specialize Equation 4.1 to $p=2$ , i.e., the Gaussian function

f(x)\vcentcolon=f^{(2)}(x)=\exp(-\pi x^{2})\;\text{.}

By a straightforward calculation it can be seen that this function is its own Fourier transform: $\widehat{f}=f$ . Note that $\widehat{f_{s}}=s\cdot f_{1/s}$ by the time-scaling property of the Fourier transform (Lemma 2.8). Finally, recalling that $f(\mathbf{x})=f(\lVert\mathbf{x}\rVert_{2})$ , we get that $f$ is invariant under rotations.

5.1 Bounds

In this subsection we derive fairly tight bounds on the factor $f_{s}(\mathcal{L}_{q})$ that appears in the quantities that govern the adjusted rates under which we can decode in the worst and average cases (Equations 4.2 and 4.4, respectively). For this purpose we need to define a suitable “fudge factor.” For $r\geq r_{0}\vcentcolon=\sqrt{\ln(4)/\pi}\approx 0.66428$ , define

E(r)\vcentcolon=1-2\exp(-\pi r^{2}/2)\in[0,1)\;\text{.}

Notice that $E(r)$ is positive for $r>r_{0}$ , is strictly increasing, and rapidly approaches $1$ as $r$ increases. Next, for real $s, q$ such that $s\in[r_{0},q/r_{0}]$ , define

E_{q}(s)\vcentcolon=\sqrt{E(q/s)\cdot E(s)}\in[0,1)\;\text{.}

Similarly, $E_{q}(s)$ is positive for $s\in(r_{0},q/r_{0})$ , and rapidly approaches $1$ as both $s,q/s$ increase.

We next state some bounds on $f_{s}(\mathcal{L}_{q})$ and $E_{q}(s)$ ; Lemma 5.1 is the main one we use. Due to space constraints, the proofs are given in the full version.

Lemma 5.1.

For any real $s$ and positive integer $q$ such that $s\in(r_{0},q/r_{0})$ ,

\frac{1}{f_{s}(\mathcal{L}_{q})}>\frac{\sqrt{2}}{s}\cdot E_{q}(s)^{2}\;\text{.}

This follows directly from Lemmas 5.2 and 5.4 below.

Lemma 5.2.

For any real $s>0$ and positive integer $q$ , let $\varepsilon^{\prime}=\varepsilon_{\mathbb{Z},q/(s\sqrt{2})}$ and $\tilde{\varepsilon}=\varepsilon_{\mathcal{L}_{q},s}(H)$ where $H=\operatorname{span}(1,1)$ . Then

f_{s}(\mathcal{L}_{q})=\frac{s}{\sqrt{2}}\cdot(1+\varepsilon^{\prime})\cdot(1+% \tilde{\varepsilon})\;\text{.}

Next we bound the roughness quantities $\varepsilon^{\prime},\tilde{\varepsilon}$ from Lemmas 5.1 and 5.2, using the following classic tail inequality.

Lemma 5.3 (adapted from [2, Lemma 2.4]).

For any lattice $\mathcal{L}$ , unit vector $\mathbf{u}$ , and $s,t>0$ , let $T_{\mathbf{u},t}=\{\mathbf{x}:\lvert\langle\mathbf{x},\mathbf{u}\rangle\rvert% \geq t\}$ . Then

f_{s}(\mathcal{L}\cap T_{\mathbf{u},t})<2\exp(-\pi t^{2}/s^{2})\cdot f_{s}(% \mathcal{L})\;\text{.}

Lemma 5.4.

Let $r>r_{0}$ and $H=\operatorname{span}(1,1)$ . Then

\frac{1}{1+\varepsilon_{\mathbb{Z},r/\sqrt{2}}}\;,\;\frac{1}{1+\varepsilon_{% \mathcal{L}_{q},r}(H)}>E(r)=1-2\exp(-\pi r^{2}/2)\;\text{.}

5.2 Worst-Case Decoding

We now address list-decoding in the $\ell_{2}$ metric, under worst-case error of bounded distance, by specializing the material of Section 4.1 to $p=2$ and using our bounds on $f_{s}(\mathcal{L}_{q})$ from Section 5.1. So, we consider decoding distance $d=\delta\sqrt{n}$ , where $n$ is the code length and $\delta$ is the relative decoding distance. Then by Equations 4.2 and 4.3, we can list-decode for any $R^{*}$ less than

R^{*,(2)}_{\text{wc},q}(\delta)=\sup_{s>0}W^{(2)}_{q,\delta}(s)^{2}>\sup_{s\in% (r_{0},q/r_{0})}\frac{\sqrt{2}\cdot\exp(-2\pi\delta^{2}/s^{2})}{s}\cdot E_{q}(% s)^{2}\;\text{,}

(5.1)

where the inequality follows by Lemma 5.1.

Corollary 5.5 below is obtained by nearly maximizing the right-hand side of (5.1). More specifically, a standard calculation shows that taking $s=\delta\sqrt{4\pi}$ maximizes the “main term” $\sqrt{2}\cdot\exp(-2\pi\delta^{2}/s^{2})/s$ , to have value $1/(\delta\sqrt{2\pi e})$ . For moderate or larger values of $\delta$ (and hence $s$ ), this very nearly maximizes the entire expression, because $E_{q}(s)\geq E(s)$ since $q/s\geq s$ , and $E(s)$ rapidly approaches $1$ as $s$ grows. For example, $E(s)^{2}\geq 1-10^{-8}$ for $\delta\geq 1$ . So, as $\delta$ grows, the $R^{*}$ for which we can list-decode rapidly approaches $1/(\delta\sqrt{2\pi e})$ .

Corollary 5.5.

For any $\delta>\sqrt{\ln(4)}/(2\pi)\approx 0.1874$ and prime $q\geq 4\pi\delta^{2}$ , the GS algorithm using weight vector given by $f_{s}$ for $s=\delta\sqrt{4\pi}$ list-decodes, up to $\ell_{2}$ distance $\delta\sqrt{n}$ in time $\operatorname{poly}(n,q,1/(\sqrt{\widetilde{R}^{*,(2)}_{\text{wc},q}(\delta)}-% \sqrt{R^{*}}))$ , any GRS code with adjusted rate

R^{*}<\widetilde{R}^{*,(2)}_{\text{wc},q}(\delta)\vcentcolon=\frac{1}{\delta% \sqrt{2\pi e}}\cdot E_{q}(\delta\sqrt{4\pi})^{2}\;\text{.}

Proof.

For $s=\delta\sqrt{4\pi}$ , the lower bounds on $\delta$ and $q$ imply that $s=\delta\sqrt{4\pi}\in(r_{0},q/r_{0})$ . Then by hypothesis and Lemmas 5.1 and 4.2,

R^{*}<\widetilde{R}^{*,(2)}_{\text{wc},q}(\delta)=\frac{1}{\delta\sqrt{2\pi e}% }\cdot E_{q}(\delta\sqrt{4\pi})^{2}<\frac{\exp(-2\pi\delta^{2}/s^{2})}{f_{s}(% \mathcal{L}_{q})}=W^{(2)}_{q,\delta}(s)^{2}\;\text{.}

The claim then follows directly by Theorem 4.2. $\hfill\blacktriangleleft$

Comparison to [10]

The previous best result for list-decoding (Generalized) Reed–Solomon codes in the $\ell_{2}$ metric was given by Mook and Peikert [10].⁹⁹9By a standard reduction, the result from [10] also applies to GRS codes, not just RS codes as was originally stated.

Proposition 5.6 ([10, Theorem 3.4]).

For any GRS code $\mathcal{C}\subseteq\mathbb{F}_{q}^{n}$ with any adjusted rate $R^{*}<1$ and any $\varepsilon>0$ , there is a $\operatorname{poly}(n,q,1/\varepsilon)$ -time algorithm that list-decodes $\mathcal{C}$ up to $\ell_{2}$ distance $d=\sqrt{n(1-R^{*})(1-\varepsilon)/2}$ .

Equivalently, for a relative decoding distance $\delta=d/\sqrt{n}>0$ , the result from [10] works for adjusted rates $R^{*}$ approaching $1-2\delta^{2}$ , so it applies only for

\delta\leq\sqrt{(1-R^{*})/2}\leq 1/\sqrt{2}\;\text{.}

By contrast, our Theorem 4.2 works for any (arbitrarily large) $\delta>0$ (and Corollary 5.5 gives a simpler and more explicit rate bound for any $\delta>0.1875$ ). Moreover, for those $\delta$ for which both Theorems 4.2 and 5.6 apply, our result works for a larger $R^{*}$ as long as $R^{*,(2)}_{\text{wc},q}(\delta)>1-2\delta^{2}$ (see (4.3)). For typical (moderate or larger) $q$ , this holds for all $\delta\gtrapprox 0.51797$ , which corresponds to $R^{*}\lessapprox 0.46342$ . (For tiny $\delta\approx 0$ , Theorem 4.2 works for $R^{*}\approx 0.93700$ , whereas [10] works for $R^{*}\approx 1$ , so the latter is better for very small distances.)

We also point out that [10] proves that for any $\delta\leq 1/2$ , which corresponds to $R^{*}\geq 1/2$ , its (very simple) choice of weight vector gives an optimal tradeoff between $\delta$ and $R^{*}$ for the GS/KV soft-decision algorithm and analysis. However, the optimality argument breaks down for $\delta>1/2$ (equivalently, for $R^{*}<1/2$ ). And indeed, as we have just seen, we obtain a better distance-rate tradeoff than [10] for almost all such $\delta$ . This highlights the interesting question of determining an optimal choice of weights for the GS soft-decision algorithm for $\delta>1/2$ (especially at the low end of this range).

5.3 Unique Decoding for a Subclass of GRS Codes

For a certain natural subclass of GRS codes, and certain rates and decoding distances covered by our list-decoding algorithm, decoding is in fact unique (i.e., the list size is at most one). We show this by giving a lower bound on the $\ell_{2}$ minimum distance of such codes, and then observing that our list-decoding algorithm can decode to beyond half this distance for all small enough rates.

Lemma 5.7 (adapted from [12, Theorem 4]).

Any prime-field GRS code $\text{GRS}_{q,k}(\boldsymbol{\alpha},\boldsymbol{\alpha})\subseteq\mathbb{F}_{% q}^{n}$ (whose twist factors $\mathbf{t}$ equal the nonzero evaluation points $\boldsymbol{\alpha}$ ) of rate $R=k/n$ has squared $\ell_{2}$ minimum distance at least

\frac{(n+1)^{2}-k^{2}}{12k^{2}}\cdot(n+1)>\frac{1-R^{2}}{12R^{2}}\cdot n\;% \text{.}

Due to space constraints, the proof is left to the full version.

Lemma 5.7 gives a relationship between the code rate $R$ and (a lower bound on) half the $\ell_{2}$ minimum distance, for which decoding to that distance yields a unique solution. By taking the functional inverse of half this minimum-distance bound, we see that decoding to relative distance $\delta$ yields a unique solution as long as

R<R^{(2)}_{\text{uniq}}(\delta)\vcentcolon=\frac{1}{\sqrt{48\delta^{2}+1}}\;% \text{,}

which approaches $1/(4\sqrt{3}\delta)$ as $\delta$ grows. This curve is shown in Figure 1. Observe that for any $\delta$ for which our list-decoding algorithm outperforms the one of [10], we have that $R^{*,(2)}_{\text{wc}}(\delta)>R^{(2)}_{\text{uniq}}(\delta)$ . In other words, we can efficiently list decode to relative distance $\delta$ for all rates up to $R^{(2)}_{\text{uniq}}(\delta)$ (and beyond), thus yielding a unique decoder for these parameters. Alternatively, as the rate $R$ approaches zero, we can efficiently list decode to a multiple of the unique-decoding distance bound that approaches $4\sqrt{3}/\sqrt{2\pi e}\approx 1.6764$ .

5.4 Average-Case Decoding

We now consider average-case decoding under a memoryless additive (continuous or discrete) Gaussian channel, by specializing the material of Section 4.2 to $p=2$ and using our bounds on $f_{s}(\mathcal{L}_{q})$ from Section 5.1. Consider a Gaussian channel of parameter $r>0$ . Then by Equations 4.4 and 4.5, we can list-decode for any $R^{*}$ less than

R^{*,(2)}_{\text{ac},q}(r)=\sup_{s>0}A^{(2)}_{q,r}(s)^{2}>\sup_{s\in(r_{0},q/r% _{0})}\frac{s\sqrt{2}}{r^{2}+s^{2}}\cdot E_{q}(s)^{2}\;\text{,}

(5.2)

where the inequality is by Lemma 5.1.

Corollary 5.8 below is obtained by nearly maximizing the right-hand side of (5.2). More specifically, setting $s=r$ maximizes the “main term” $s\sqrt{2}/(r^{2}+s^{2})$ , to have value $1/(r\sqrt{2})$ . As above, for moderate or larger values of $r$ (and hence $s$ ), this very nearly maximizes the entire expression, because $E_{q}(s)$ rapidly approaches $1$ as $s$ grows.¹⁰¹⁰10By contrast, $E_{q}(s)\ll 1$ for values of $s$ very close to $r_{0}$ , in which case the bound is maximized by taking $s$ somewhat larger than $r$ . So, as $r$ grows, the rate $R^{*}$ for which we can list-decode rapidly approaches $1/(r\sqrt{2})$ .

Corollary 5.8.

For any $r\in(r_{0},q/r_{0})$ , $\alpha\in(0,1)$ , and prime $q$ , the GS algorithm using weight vector given by $f_{r}$ list-decodes, in time $\operatorname{poly}(n,q,1/(\sqrt{\widetilde{R}^{*,(2)}_{\text{ac},q}(r)}-\sqrt% {R^{*}}))$ , any GRS code with adjusted rate

R^{*}<\widetilde{R}^{*,(2)}_{\text{ac},q}(r)\vcentcolon=\frac{1}{r\sqrt{2}}% \cdot E_{q}(r)^{2}\;\text{,}

except with probability less than $\exp\big(-\sqrt{2}n\cdot\alpha^{2}\cdot r\cdot\big(\sqrt{\widetilde{R}^{*,(2)}% _{\text{ac},q}(r)}-\sqrt{R^{*}}\big)^{2}\big)$ .

Proof.

By hypothesis, Lemmas 4.4, 5.1, and 4.4,

R^{*}<\frac{1}{r\sqrt{2}}\cdot E_{q}(r)^{2}<\frac{\mu_{r,r}^{2}}{f_{r}(% \mathcal{L}_{q})}=A^{(2)}_{q,r}(r)^{2}\;\text{.}

The claim then follows directly by Theorem 4.5, and the fact that $f_{r}(\mathcal{L}_{q})>r/\sqrt{2}$ by Lemma 5.2. $\hfill\blacktriangleleft$

6 The $\ell_{1}$ Metric and Laplacian Error

In this section, we consider the $\ell_{1}$ metric and Laplacian channels. We specialize Equation 4.1 to $p=1$ , i.e., the Laplacian function

f(x)\vcentcolon=f^{(1)}(x)=\exp(-2\lvert x\rvert)\;\text{.}

(The Fourier transform of this function is given by $\widehat{f}(w)=1/(1+(\pi w)^{2})$ , but we will not use this; as already noted earlier, $f^{(1)}$ satisfies ˜2.12.)

Throughout this section we use the hyperbolic tangent function

\tanh(x)\vcentcolon=\frac{e^{x}-e^{-x}}{e^{x}+e^{-x}}=\frac{1-e^{-2x}}{1+e^{-2% x}}=\frac{e^{2x}-1}{e^{2x}+1}<1

and its reciprocal $\coth(x)=1/\tanh(x)>1$ . Observe that $\tanh(x)$ approaches $1$ as $x$ grows; it also satisfies $\tanh(x)<x$ for all $x>0$ , and approaches $x$ as $x$ approaches zero.¹¹¹¹11Both facts can be seen from the Taylor series $\tanh(x)=x-x^{3}/3+\cdots$ , valid for $\lvert x\rvert<\pi/2$ .

6.1 Bounds

In this subsection, we analyze the exact value of $f_{s}(\mathcal{L}_{q})$ and derive an asymptotic bound. This appears in the quantities that govern the adjusted rates under which we can decode in the worst and average cases (Equations 4.2 and 4.4, respectively). For this purpose, we define a suitable “fudge factor”. For any real $x>0$ , define

E(x)\vcentcolon=\Big(\coth(x)+\frac{4x\cdot e^{2x}}{(e^{2x}-1)^{2}}\Big)^{-1}% \in(0,1)\;\text{,}

(6.1)

where the upper bound comes from the fact that $\coth(x)>1$ . Note that, as $x$ grows, the first term in the sum rapidly approaches one, and the second term rapidly approaches zero. More precisely, a brief calculation reveals that

E(x)=1-O(x\cdot e^{-2x})\;\text{.}

(6.2)

Lemma 6.1.

For any $s>0$ and positive integer $q$ ,

\frac{1}{f_{s}(\mathcal{L}_{q})}>\tanh(2/s)\cdot E(q/s)\;\text{.}

Note that by Equation 6.2, for any fixed $s>0$ , as $q$ (or equivalently, $q/s$ ) grows, $1/f_{s}(\mathcal{L}_{q})$ rapidly approaches $\tanh(2/s)$ . In turn, this approaches $2/s$ as $s$ grows.

The proof of Lemma 6.1 follows directly from Lemma 6.2 below and Equation 6.1. Due to space constraints, the details are left to the full version.

Lemma 6.2.

For any $s>0$ and positive integer $q$ ,

f_{s}(\mathcal{L}_{q})=\coth(2/s)\cdot\coth(q/s)+\frac{2q\cdot e^{2q/s}}{(e^{2% q/s}-1)^{2}}\;\text{.}

Due to space constraints, the proof is left to the full version.

6.2 Worst-Case Decoding

Now we address list-decoding in the $\ell_{1}$ metric, under worst-case error of bounded distance, by specializing the material of Section 4.1 to $p=1$ and using our bound on $f_{s}(\mathcal{L}_{q})$ from Lemma 6.1. We consider decoding distance $d=\delta n$ , where $n$ is the code length and $\delta$ is the relative decoding distance. Then by Equations 4.2, 4.3, and 6.1, we can list-decode for any $R^{*}$ less than

R^{*,(1)}_{\text{wc},q}(\delta)=\sup_{s>0}W^{(1)}_{q,\delta}(s)^{2}>\sup_{s>0}% \exp(-4\delta/s)\cdot\tanh(2/s)\cdot E(q/s)\;\text{.}

(6.3)

Corollary 6.3 below is obtained by maximizing the “main term” $\exp(-4\delta/s)\cdot\tanh(2/s)$ of the right-hand side of (6.3). By calculus, this is done by taking $s=4/\ln(D(\delta))>0$ , where

D(\delta)\vcentcolon=\sqrt{1+\frac{1}{\delta^{2}}}+\frac{1}{\delta}>1\;\text{.}

Substituting, this means we can list-decode for any $R^{*}$ less than

\widetilde{R}^{*,(1)}_{\text{wc},q}(\delta)\vcentcolon=\frac{\tanh(\ln\sqrt{D(% \delta)})}{D(\delta)^{\delta}}\cdot E(q\ln(D(\delta))/4)=\frac{D(\delta)-1}{D(% \delta)+1}\cdot\frac{E(q\ln(D(\delta))/4)}{D(\delta)^{\delta}}\;\text{.}

(6.4)

We consider this quantity’s asymptotic behavior for large and small $\delta$ :

$\blacksquare$

As $\delta$ grows, $D(\delta)=1+1/\delta+O(1/\delta^{2})$ and $D(\delta)^{\delta}$ approaches $e$ , hence $\widetilde{R}^{*,(1)}_{\text{wc},q}(\delta)$ approaches $1/(2e\delta)$ as $q/\delta$ also grows. This is consistent with Remark 4.3.
$\blacksquare$

As $\delta$ approaches zero, $D(\delta)$ approaches $2/\delta$ and $D(\delta)^{\delta}$ approaches $1$ , hence $\widetilde{R}^{*,(1)}_{\text{wc},q}(\delta)$ approaches $1$ as $q/\delta$ also grows.

Corollary 6.3.

For any $\delta>0$ and prime $q$ , the GS algorithm using weight vector $f_{s}$ for $s=4/\ln(D(\delta))$ list-decodes, up to $\ell_{1}$ distance $\delta n$ in time $\operatorname{poly}(n,q,1/(\sqrt{\widetilde{R}^{*,(1)}_{\text{wc},q}(\delta)}-% \sqrt{R^{*}}))$ , any GRS code with adjusted rate $R^{*}<\widetilde{R}^{*,(1)}_{\text{wc},q}(\delta)$ (see Equation 6.4).

Proof.

By hypothesis and Lemmas 6.1 and 4.2,

R^{*}<\widetilde{R}^{*,(1)}_{\text{wc},q}(\delta)=\frac{\tanh(\ln\sqrt{D(% \delta)})}{D(\delta)^{\delta}}\cdot E(q/s)<\frac{\exp(-4\delta/s)}{f_{s}(% \mathcal{L}_{q})}=W^{(1)}_{q,\delta}(s)^{2}\;\text{.}

The claim then follows directly by Theorem 4.2. $\hfill\blacktriangleleft$

Comparison to [12, 16]

To our knowledge, the only prior algorithms for (unique or list) decoding Reed–Solomon codes in the $\ell_{1}$ (Lee) metric are [12, Section 5] and [16]. We note that both of these require discrete (integer) error, whereas our algorithm works for continuous error.

For a certain subclass of GRS codes (and BCH codes more generally), [12] gives a unique decoding algorithm for up to half (a lower bound on) the $\ell_{1}$ minimum distance, using Euclid’s algorithm for polynomials. This algorithm decodes up to any relative distance $\delta<1-R<1-R^{*}$ . For any prime-field GRS code, [16] gives a list-decoding algorithm that uses GS as a subroutine, and has a piecewise distance-rate tradeoff due to its optimization over an integer parameter. (The algorithm works by putting equal weight on a range of alphabet symbols centered at the received symbol, optimizing over the range size for a given rate.)

By contrast with [12], and like [16], our Corollary 6.3 works for any GRS code, and for any (arbitrarily large) relative decoding distance $\delta>0$ , for sufficiently small $R^{*}>0$ . Our rate-distance trade-off surpasses that of both [12, 16] for all $\delta\gtrapprox 0.78988$ , which corresponds to rates $R^{*}\lessapprox 0.21012$ ; see Figure 1.

6.3 Unique Decoding for a Subclass of GRS Codes

As in Section 5.3, for the same subclass of GRS codes and certain parameters covered by our list-decoding algorithm, the decoding output is in fact unique. To show this, we give a lower bound on the $\ell_{1}$ minimum distance of such codes, and then observe that our list-decoding algorithm can decode to beyond half this distance for all small enough rates.

Lemma 6.4 (adapted from [12, Theorem 4]).

Any prime-field GRS code $\text{GRS}_{q,k}(\boldsymbol{\alpha},\boldsymbol{\alpha})\subseteq\mathbb{F}_{% q}^{n}$ (whose twist factors $\mathbf{t}$ equal the nonzero evaluation points $\boldsymbol{\alpha}$ ) of rate $R=k/n$ has $\ell_{1}$ minimum distance at least

\frac{(n+1)^{2}-k^{2}}{4k}>\frac{1-R^{2}}{4R}\cdot n\;\text{.}

Due to space constraints, the proof is left to the full version.

Lemma 6.4 gives a relationship between the code rate $R$ and (a lower bound on) half the $\ell_{1}$ minimum distance, for which decoding to that distance yields a unique solution. By taking the functional inverse of half this minimum-distance bound, we see that decoding to relative distance $\delta$ yields a unique solution as long as

R<R^{(1)}_{\text{uniq}}(\delta)\vcentcolon=-4\delta+\sqrt{(4\delta)^{2}+1}\;% \text{,}

which approaches $1/(8\delta)$ as $\delta$ grows. This curve is shown in Figure 1. Observe that for any $\delta$ for which our list-decoding algorithm outperforms the unique decoder of [12] (or for which [12] does not apply), we have that $R^{*,(1)}_{\text{wc}}(\delta)>R^{(1)}_{\text{uniq}}(\delta)$ . In other words, we can efficiently list decode to relative distance $\delta$ for all rates up to $R^{(1)}_{\text{uniq}}(\delta)$ (and beyond), thus yielding a unique decoder for these parameters. Alternatively, as the rate $R$ approaches zero, we can efficiently list decode to a multiple of the unique-decoding distance bound that approaches $8/(2e)\approx 1.4715$ .

6.4 Average-Case Decoding

We now consider average-case decoding under a memoryless additive (continuous or discrete) Laplacian channel, by specializing the material of Section 4.2 to $p=1$ and using our bound on $f_{s}(\mathcal{L}_{q})$ from Lemma 6.1. Consider a Laplacian channel of parameter $r>0$ . Then by Equations 4.4 and 4.5, we can list-decode for any $R^{*}$ less than

R^{*,(1)}_{\text{ac},q}(r)=\sup_{s>0}A^{(1)}_{q,r}(s)^{2}>\sup_{s>0}\frac{s^{2% }\cdot\tanh(2/s)}{(r+s)^{2}}\cdot E(q/s)\;\text{,}

(6.5)

where the inequality is by Lemma 6.1.

Corollary 6.5 below is obtained by nearly maximizing the right-hand side of (6.5), at least for moderate or large values of $r$ . Specifically, we use the bound $\tanh(2/s)<2/s$ to approximate the “main term” of (6.5) by $2s/(r+s)^{2}$ . This is maximized at $s=r$ , which makes the original main term equal to $\tanh(2/r)/4$ . Note that $R^{*,(1)}_{\text{ac},q}(r)$ does indeed approach this value as $r$ and $q/r$ grow, because $\tanh(2/r)$ approaches $2/r$ , and $E(q/r)$ rapidly approaches $1$ (see Equation 6.2).

However, for small values of $r$ , the expression in (6.5) is maximized for $s$ significantly larger than $r$ , to have value much larger than $\tanh(2/r)/4<1/4$ . This maximization can be computed numerically, and indeed, $R^{*,(1)}_{\text{ac},q}(r)$ approaches $1$ as $r$ approaches $0$ ; see Figure 1.

Corollary 6.5.

For any $r>0$ , $\alpha\in(0,1)$ , and prime $q$ , the GS algorithm using weight vector given by $f_{r}$ list-decodes, in time $\operatorname{poly}(n,q,1/(\sqrt{R^{*,(1)}_{\text{ac},q}}-\sqrt{R^{*}}))$ , any GRS code with adjusted rate

R^{*}<\widetilde{R}^{*,(1)}_{\text{ac},q}(r)\vcentcolon=\frac{\tanh(2/r)}{4}% \cdot E(q/r)\;\text{,}

except with probability less than $\exp\big(-n\cdot\alpha^{2}\cdot r\cdot\big(\sqrt{\widetilde{R}^{*,(1)}_{\text{% ac},q}(r)}-\sqrt{R^{*}}\big)^{2}\big)$ .

Proof.

By hypothesis, Lemmas 4.4, 6.1, and 4.4,

R^{*}<\widetilde{R}^{*,(1)}_{\text{ac},q}(r)=\frac{\tanh(2/r)}{4}\cdot E(q/r)<% \frac{\mu_{r,r}^{2}}{f_{r}(\mathcal{L}_{q})}=A^{(1)}_{q,r}(r)^{2}\;\text{.}

The claim then follows directly by Theorem 4.5, and (for the probability bound) the fact that $f_{r}(\mathcal{L}_{q})>\coth(2/r)>r/2$ by Lemma 6.2. $\hfill\blacktriangleleft$

References

[1] Wojciech Banaszczyk. New bounds in some transference theorems in the geometry of numbers. Mathematische Annalen, 296(4):625–635, 1993.
[2] Wojciech Banaszczyk. Inequalites for convex bodies and polar reciprocal lattices in $\mathbb{R}^{n}$ . Discrete & Computational Geometry, 13:217–231, 1995.
[3] Peter Elias. Zero error capacity under list decoding. IEEE Transactions on Information Theory, 34(5):1070–1074, September 1988. Originally appeared as Quarterly Progress Report, vol. 48, pp. 88-90, Research Laboratory of Electronics, MIT, January 1958. doi:10.1109/18.21233.
[4] N. D. Elkies, A. M. Odlyzko, and J. A. Rush. On the packing densities of superballs and other bodies. Inventiones mathematicae, 105:613–639, December 1991.
[5] Venkatesan Guruswami. List decoding of error correcting codes. PhD thesis, Massachusetts Institute of Technology, 2001. URL: http://dspace.mit.edu/handle/1721.1/8700.
[6] Venkatesan Guruswami, Atri Rudra, and Madhu Sudan. Essential coding theory, March 2019. URL: https://cse.buffalo.edu/faculty/atri/courses/coding-theory/book/web-coding-book.pdf.
[7] Venkatesan Guruswami and Madhu Sudan. Improved decoding of Reed-Solomon and algebraic-geometry codes. IEEE Trans. Inf. Theory, 45(6):1757–1767, 1999. Preliminary version in FOCS 1998. doi:10.1109/18.782097.
[8] Ralf Koetter and Alexander Vardy. Algebraic soft-decision decoding of Reed-Solomon codes. IEEE Trans. Inf. Theory, 49(11):2809–2825, 2003. doi:10.1109/TIT.2003.819332.
[9] Daniele Micciancio and Oded Regev. Worst-case to average-case reductions based on Gaussian measures. SIAM J. Comput., 37(1):267–302, 2007. Preliminary version in FOCS 2004. doi:10.1137/S0097539705447360.
[10] Ethan Mook and Chris Peikert. Lattice (list) decoding near Minkowski’s inequality. IEEE Trans. Inf. Theory, 68(2):863–870, 2022. doi:10.1109/TIT.2021.3126540.
[11] Irving S. Reed and Gustave Solomon. Polynomial codes over certain finite fields. Journal of the Society for Industrial and Applied Mathematics, 8(2):300–304, 1960.
[12] Ron M. Roth and Paul H. Siegel. Lee-metric BCH codes and their application to constrained and partial-response channels. IEEE Trans. Inf. Theory, 40(4):1083–1096, 1994. doi:10.1109/18.335966.
[13] Jean-Pierre Serre. A Course in Arithmetic. Springer New York, NY, 1973. doi:10.1007/978-1-4684-9884-4.
[14] Madhu Sudan. Decoding of Reed Solomon codes beyond the error-correction bound. J. Complex., 13(1):180–193, 1997. doi:10.1006/JCOM.1997.0439.
[15] John M. Wozencraft. List decoding. Quarterly Progress Report, Research Laboratory of Electronics, MIT, 48:90–95, 1958.
[16] Xin-Wen Wu, Margreta Kuijper, and Parampalli Udaya. Lee-metric decoding of BCH and Reed–Solomon codes. Electronics Letters, 39(21):1522–1524, October 2003.

[bib.bib1] [1] Wojciech Banaszczyk. New bounds in some transference theorems in the geometry of numbers. Mathematische Annalen, 296(4):625–635, 1993.

[bib.bib2] [2] Wojciech Banaszczyk. Inequalites for convex bodies and polar reciprocal lattices in $\mathbb{R}^{n}$ . Discrete & Computational Geometry, 13:217–231, 1995.

[bib.bib3] [3] Peter Elias. Zero error capacity under list decoding. IEEE Transactions on Information Theory, 34(5):1070–1074, September 1988. Originally appeared as Quarterly Progress Report, vol. 48, pp. 88-90, Research Laboratory of Electronics, MIT, January 1958. doi:10.1109/18.21233.

[bib.bib4] [4] N. D. Elkies, A. M. Odlyzko, and J. A. Rush. On the packing densities of superballs and other bodies. Inventiones mathematicae, 105:613–639, December 1991.

[bib.bib5] [5] Venkatesan Guruswami. List decoding of error correcting codes. PhD thesis, Massachusetts Institute of Technology, 2001. URL: http://dspace.mit.edu/handle/1721.1/8700.

[bib.bib6] [6] Venkatesan Guruswami, Atri Rudra, and Madhu Sudan. Essential coding theory, March 2019. URL: https://cse.buffalo.edu/faculty/atri/courses/coding-theory/book/web-coding-book.pdf.

[bib.bib7] [7] Venkatesan Guruswami and Madhu Sudan. Improved decoding of Reed-Solomon and algebraic-geometry codes. IEEE Trans. Inf. Theory, 45(6):1757–1767, 1999. Preliminary version in FOCS 1998. doi:10.1109/18.782097.

[bib.bib8] [8] Ralf Koetter and Alexander Vardy. Algebraic soft-decision decoding of Reed-Solomon codes. IEEE Trans. Inf. Theory, 49(11):2809–2825, 2003. doi:10.1109/TIT.2003.819332.

[bib.bib9] [9] Daniele Micciancio and Oded Regev. Worst-case to average-case reductions based on Gaussian measures. SIAM J. Comput., 37(1):267–302, 2007. Preliminary version in FOCS 2004. doi:10.1137/S0097539705447360.

[bib.bib10] [10] Ethan Mook and Chris Peikert. Lattice (list) decoding near Minkowski’s inequality. IEEE Trans. Inf. Theory, 68(2):863–870, 2022. doi:10.1109/TIT.2021.3126540.

[bib.bib11] [11] Irving S. Reed and Gustave Solomon. Polynomial codes over certain finite fields. Journal of the Society for Industrial and Applied Mathematics, 8(2):300–304, 1960.

[bib.bib12] [12] Ron M. Roth and Paul H. Siegel. Lee-metric BCH codes and their application to constrained and partial-response channels. IEEE Trans. Inf. Theory, 40(4):1083–1096, 1994. doi:10.1109/18.335966.

[bib.bib13] [13] Jean-Pierre Serre. A Course in Arithmetic. Springer New York, NY, 1973. doi:10.1007/978-1-4684-9884-4.

[bib.bib14] [14] Madhu Sudan. Decoding of Reed Solomon codes beyond the error-correction bound. J. Complex., 13(1):180–193, 1997. doi:10.1006/JCOM.1997.0439.

[bib.bib15] [15] John M. Wozencraft. List decoding. Quarterly Progress Report, Research Laboratory of Electronics, MIT, 48:90–95, 1958.

[bib.bib16] [16] Xin-Wen Wu, Margreta Kuijper, and Parampalli Udaya. Lee-metric decoding of BCH and Reed–Solomon codes. Electronics Letters, 39(21):1522–1524, October 2003.

List Decoding Reed–Solomon Codes in the Lee, Euclidean, and Other Metrics

Abstract

Keywords and phrases:

Copyright and License:

2012 ACM Subject Classification:

Related Version:

DOI:

Event:

Editor:

Series and Publisher:

1 Introduction

1.1 Contributions

1.2 Technical Overview

2 Preliminaries

Lemma 2.1 (Hoeffding’s Inequality).

Operations on functions

2.1 Linear Codes

Definition 2.2 ((Generalized) Reed–Solomon code).

2.2 Lattices

Definition 2.3 (Lattice, Basis).

Definition 2.4 (Determinant).

Definition 2.5 (Dual lattice).

Lemma 2.6.

Proof.

2.3 Fourier Analysis

Lemma 2.7 (Multiplicativity).

Lemma 2.8 (Time-scaling property).

Lemma 2.9 (Time-shift property).

Lemma 2.10 (Poisson Summation Formula (PSF)).

Lemma 2.11 (Generalized PSF).

2.4 Lattice Roughness

Assumption 2.12.

Definition 2.13.

Lemma 2.14 (adapted from [9, Lemmas 2.9 and 4.1]).

3 List-Decoding Reed–Solomon Codes

3.1 Soft-Decision Decoding

Definition 3.1.

Definition 3.2.

Theorem 3.3 (adapted from [7, Theorem 18] and [5, Theorem 6.21]).

3.2 From Received Words to Weight Vectors

Definition 3.4.

3.3 Main Theorem

Theorem 3.5.

Proof.

Lemma 3.6.

Proof.

3.4 Average-Case Decoding

Lemma 3.7.

4 General ℓ𝒑 (Semi)Metrics

Lemma 4.1.

4.1 Worst-Case Decoding

Theorem 4.2.

Proof.

▶ Remark 4.3.

4.2 Average-Case Decoding

Lemma 4.4.

Theorem 4.5.

▶ Remark 4.6.

5 The ℓ𝟐 Metric and Gaussian Error

5.1 Bounds

Lemma 5.1.

Lemma 5.2.

Lemma 5.3 (adapted from [2, Lemma 2.4]).

Lemma 5.4.

5.2 Worst-Case Decoding

Corollary 5.5.

Proof.

Comparison to [10]

Proposition 5.6 ([10, Theorem 3.4]).

5.3 Unique Decoding for a Subclass of GRS Codes

Lemma 5.7 (adapted from [12, Theorem 4]).

5.4 Average-Case Decoding

Corollary 5.8.

Proof.

6 The ℓ𝟏 Metric and Laplacian Error

6.1 Bounds

Lemma 6.1.

Lemma 6.2.

6.2 Worst-Case Decoding

Corollary 6.3.

List Decoding Reed–Solomon Codes in the
Lee, Euclidean, and Other Metrics

4 General $\ell_{p}$ (Semi)Metrics

$\blacktriangleright$ Remark 4.3.

$\blacktriangleright$ Remark 4.6.

5 The $\ell_{2}$ Metric and Gaussian Error

6 The $\ell_{1}$ Metric and Laplacian Error