List Decoding Reed–Solomon Codes in the
Lee, Euclidean, and Other Metrics
Abstract
Reed–Solomon error-correcting codes are ubiquitous across computer science and information theory, with applications in cryptography, computational complexity, communication and storage systems, and more. Most works on efficient error correction for these codes, like the celebrated Berlekamp–Welch unique decoder and the (Guruswami–)Sudan list decoders, are focused on measuring error in the Hamming metric, which simply counts the number of corrupted codeword symbols. However, for some applications, other metrics that depend on the specific values of the errors may be more appropriate. This work gives a polynomial-time algorithm that list decodes (generalized) Reed–Solomon codes over prime fields in (semi)metrics, for any . Compared to prior algorithms for the Lee () and Euclidean () metrics, ours decodes to arbitrarily large distances (for correspondingly small rates), and has better distance-rate tradeoffs for all decoding distances above some moderate thresholds. We also prove lower bounds on the and minimum distances of a certain natural subclass of GRS codes, which establishes that our list decoder is actually a unique decoder for many parameters of interest. Finally, we analyze our algorithm’s performance under random Laplacian and Gaussian errors, and show that it supports even larger rates than for corresponding amounts of worst-case error in and (respectively).
Keywords and phrases:
Reed–Solomon codes, list decoding, unique decoding, Lee metric, Euclidean metric, Guruswami–Sudan algorithmCopyright and License:
2012 ACM Subject Classification:
Theory of computation Error-correcting codesEditor:
Shubhangi SarafSeries and Publisher:
Leibniz International Proceedings in Informatics, Schloss Dagstuhl – Leibniz-Zentrum für Informatik
1 Introduction
Reed–Solomon codes [11] are among the most widely used families of error-correcting codes, with applications across computer and communication sciences. Their many virtues include: a very simple definition; the largest possible minimum distance as a function of rate; and efficient decodability from errors, via either unique decoding up to half the minimum distance (see, e.g., [6, Section 12.1]), or list decoding up to the larger Johnson bound, via the celebrated works of Sudan [14] and Gururswami–Sudan [7] (see also [6, Section 12.2]).
List decoding [3, 15] is the task of finding all codewords that are within some desired distance of a (potentially corrupted) received word. When this radius is more than half the code’s minimum distance, there can potentially be more than one codeword within range (hence the name “list decoding”). Despite this non-uniqueness, list decoding can suffice for many purposes (e.g., finding a nearest codeword within range), and indeed, it has found numerous applications.
Most work on decoding Reed–Solomon codes has measured errors in the Hamming metric, which simply counts the number of corrupted codeword symbols (regardless of how they are corrupted). However, there are many other natural metrics that depend on the specific values of the errors. Such metrics can be more appropriate for settings where introducing a “large” error at a coordinate is more costly than a “small” error, or where the communication channel might add some nonzero error to every coordinate. An example is a channel that adds error according to a Gaussian or other fairly concentrated distribution. When the code alphabet is (the integers modulo ) – in particular, a prime field – one metric of frequent study is the Lee metric, which is merely the norm after lifting to its distinguished representatives in . Other natural, analogously defined choices include the Euclidean () or other metrics.
We know of only a few prior works on efficiently decoding Reed–Solomon codes in metrics others than Hamming. For the Lee () metric, Roth and Siegel [12] gave an algorithm that uniquely decodes up to half of (a lower bound on) the minimum distance; their algorithm works for certain subclasses of (generalized) Reed-Solomon and BCH codes. In addition, Wu, Kuijper, and Udaya [16] gave a list-decoding algorithm for , built around Guruswami–Sudan [7], that decodes to larger distances than in [12] for all small enough rates. Finally, for the Euclidean () metric, Mook and Peikert [10] recently gave a list-decoding algorithm that also uses [7] as a black box.
1.1 Contributions
This work gives a polynomial-time algorithm that list decodes any generalized Reed–Solomon (GRS) code over a prime field in the (semi)metric for any ; in particular, this includes the Lee () and Euclidean () metrics.111A semimetric is just a metric that does not necessarily satisfy the triangle inequality (which we will not need). Our algorithm works for a broader range of parameters, and has a better distance-rate tradeoff for all decoding distances above some moderate thresholds, than the prior algorithms for and [12, 16, 10]; see below for elaboration and Figure 1 for a visual depiction. For ease of comparison across the various works and (semi)metrics, we use a suitably normalized version of distance: for code length , distance corresponds to relative distance .
For , our algorithm can handle an arbitrarily large decoding distance, for a correspondingly small enough rate: specifically, as and the alphabet size grow, we can decode for rates rapidly approaching . By contrast, the prior work [10] applies only for relative distance (i.e., distance less than ). In addition, our algorithm works for larger rates than the one in [10] whenever exceeds about . (See Section 5.2 for a detailed comparison.) This is particularly interesting since the rates obtained in [10] were shown to be optimal (in a certain sense) for , but not for larger values.
For , again our algorithm (like the one from [16]) can handle an arbitrarily large decoding distance, whereas [12] is limited to relative distance (i.e., distance less than ). In addition, our algorithm works for larger rates than those of [12, 16] whenever the relative decoding distance exceeds about , and in general, as and the alphabet size grow, we can decode for rates rapidly approaching . (See Section 6.2 for details.) Our algorithm is also qualitatively broader: it decodes from continuous (real-valued) error, whereas the ones from [12, 16] require discrete (integer) error. While continuous error can be discretized by rounding, this can increase the relative distance from the codeword by up to in , which significantly degrades the distance-rate tradeoffs of the prior works, making them worse than ours for all distances.
We also give several useful supplementary results. By adapting an argument of [12], we prove lower bounds on the and minimum distances for a certain natural subclass of GRS codes. These imply that for many parameters of interest, our list-decoding algorithm outputs at most one codeword, i.e., it is actually a unique decoder. (See Lemmas 5.7 and 6.4 and the discussions thereafter.) And in addition to worst-case errors added by an adversarial channel, we also consider our algorithm’s performance under average-case errors produced by “memoryless additive” channels. Such channels add independent identically distributed error, drawn from some specified distribution, to each coordinate of the transmitted codeword. For Laplacian and Gaussian errors (which roughly correspond to and , respectively), we show that our algorithm supports even larger rates than what we would get by merely applying concentration bounds on the error vector and invoking our worst-case results.
1.2 Technical Overview
At the highest level, our algorithm for list decoding prime-field GRS codes in follows the basic approach of [10] for list decoding (G)RS codes in : we first translate the received word into a suitable weight (or reliability) vector, then invoke a soft-decision list-decoding algorithm [7, 5, 8] for GRS codes. Informally, a weight vector specifies, for each coordinate of the received word and each symbol in the code alphabet, a “confidence level” that the transmitted codeword had that symbol at that coordinate. Given such a weight vector, a soft-decision decoding algorithm then finds all codewords that are sufficiently correlated with it, as determined by the code rate. (For the formal definitions and theorem statement, see Definitions 3.1, 3.2, and 3.3.)
For our purposes, the principal challenge is in mapping a received word to an appropriate weight vector so that any codeword that is close enough to the received word (in the metric) has sufficient correlation with the weight vector. The prior work [10] uses very simple weights: given a real received value , only its floor and ceiling receive positive weights, of and , respectively. It was shown that with these weights, the cited soft-decision algorithms decode up to any relative distance for any code rate up to . Moreover, for it was shown that this rate is optimal for those algorithms, i.e., no other weight assignment can work for a larger rate. However, [10] did not consider larger decoding distances than these, nor other metrics.
In this work, to handle large decoding distances and other metrics, we use “smoother” weights, which typically assign a positive weight to every alphabet symbol. Our overall approach (see Section 3) is quite general, and is parameterized by a function satisfying mild hypotheses, primarily that its Fourier transform is non-negative (see ˜2.12). This function can be seen as defining a weight – or a relative “likelihood,” in the case of a random channel – for every potential real-valued error.222For example, we take to be a Gaussian function for decoding in the metric, or under a Gaussian channel. For a prime-order code alphabet , and a received real value , we assign weight to each alphabet symbol . Since the elements of the coset are exactly those errors that would convert a transmitted symbol to the received value , this assignment captures the total weight of all such errors.
Our first main result, given in Theorem 3.5, lower bounds the correlation between the weight vector for a received word over and any (code)word over , by the ratio of two quantities determined by : its (arithmetic or geometric) mean over the coordinates of the error vector , and (the square root of) its sum over a certain two-dimensional integer lattice . So, for a particular decoding distance in a metric of interest (or channel distribution, in the average case), the goal becomes to choose a suitable function that nearly maximizes this ratio. The proof of the theorem uses a mild generalization of Fourier-analytic results on lattices from [1, 9], and is the source of our requirement that is non-negative, and ultimately the restriction that for (semi)metrics.
The bulk of the remaining work is then devoted to making a suitable choice of function for the (semi)metric (and corresponding channel distributions), and analyzing its summation over . In Section 4 we consider scalings of the function , which is known to have non-negative Fourier coefficients for (but not for any other ). Then in Sections 5 and 6 we specialize to and , respectively, and give fairly tight upper bounds on using Fourier-analytic techniques or direct analysis. Finally, we use these bounds to optimize the distance-rate tradeoffs for which we can list decode GRS codes in these metrics, and for Gaussian and Laplacian random channels as well.
2 Preliminaries
For a positive integer , let . For a positive integer , define the quotient ring and the additive quotient group . For a prime power , let denote the finite field of size . When is prime, we identify with the finite field in the natural way.
For any (which is a coset of ), define its “lift” to be the unique real number such that , i.e., the “zero-centered” distinguished representative of . We also apply this notation entry-wise to vectors over .
For any , define the (quasi)norm on as . It is well known that this is a norm if and only if , and is a quasinorm for any .333A quasinorm relaxes the triangle inequality axiom to require only that for some fixed . We do not use the triangle inequality, or even this relaxation, so we can consider . Similarly, we define the (semi)metric on by lifting, i.e., via .444Formally, this is not a norm because it is not defined on a vector space (since is not a field), and it does not satisfy homogeneity due to the mod- reduction. However, it does define a (semi)metric (where “semi” does not require the triangle inequality), with distance function . For , this generalizes the Lee metric over to .
For two groups , their direct sum group is their Cartesian product with the group operation defined component-wise. This notation extends to the direct sum of group cosets, which is a coset of the direct sum of the groups.
For any two vectors and of the same dimension, their coordinate-wise (or Hadamard) product is denoted by .
For a finite sequence of real values, we denote their average by . We use the following special case of the well known Hoeffding (lower-)tail bound.
Lemma 2.1 (Hoeffding’s Inequality).
Let be independent identically distributed random variables in with common expectation . Then for any ,
Operations on functions
For any function and countable subset , we define . We extend the domain to multiplicatively, as
| (2.1) |
often omitting the superscript when it is clear from context. When , for any real we define .
2.1 Linear Codes
A linear (error-correcting) code of (block) length over the alphabet is a linear subspace of . As a subspace, it has a dimension. In this paper, we consider the following family of codes.
Definition 2.2 ((Generalized) Reed–Solomon code).
Let be positive integers, with a prime power. For a non-negative integer , a vector with distinct entries, and a vector with (not necessarily distinct) non-zero entries, the Generalized Reed–Solomon (GRS) code of dimension with evaluation points and twist factors is defined as
A special case is a Reed–Solomon (RS) code, which is obtained by using trivial twist factors .
2.2 Lattices
Definition 2.3 (Lattice, Basis).
An (-dimensional, full-rank) lattice is the set of all integer linear combinations of some linearly independent basis vectors :
Equivalently, it is a discrete additive subgroup of whose -span is ; as such, it defines the quotient group of lattice cosets for . A sublattice of is called an integer lattice.
In this work, all lattices are implicitly full rank. A lattice basis can equivalently be seen as an invertible matrix whose columns are the vectors . Note that a given lattice has multiple different bases, which are all related by right-multiplication by unimodular matrices in .
Definition 2.4 (Determinant).
The determinant of a lattice generated by basis is .
Note that the determinant of a lattice is invariant under the choice of basis, by the above-mentioned relationship between the bases of a lattice.
Definition 2.5 (Dual lattice).
The dual lattice of a lattice is
If is a basis of , then its dual basis is a basis of , and hence .
Lemma 2.6.
Let and be countable subsets of its domain (e.g., lattice cosets). Then .
Proof.
This follows directly from the definition of direct sum and Equation 2.1.
2.3 Fourier Analysis
Let be a (Borel) measurable function that satisfies . Its Fourier transform is defined as
It satisfies the following standard properties, which follow by routine calculations.
Lemma 2.7 (Multiplicativity).
For any function as above, (where the exponent notation is as defined in Equation 2.1).
Lemma 2.8 (Time-scaling property).
For any function as above and real , .
Lemma 2.9 (Time-shift property).
For any function as above and , let . Then .
We say that is nice if it satisfies conditions that are sufficient for the following formula to hold, e.g., those given in [13, pages 106–107]. All of the specific functions we use in this work are easily seen to be nice.
Lemma 2.10 (Poisson Summation Formula (PSF)).
For any lattice and nice function ,
We will use a more general version of the PSF for lattice cosets.
Lemma 2.11 (Generalized PSF).
For any lattice , nice function , and ,
2.4 Lattice Roughness
Continuing from Section 2.3, for the rest of this work we require the following properties of .
Assumption 2.12.
The function has range and is nice, and is non-negative real with .
Because is real, its Fourier transform is conjugate symmetric, i.e., for all , where the star denotes complex conjugation. Since is also real, this implies that it is symmetric, i.e., . Finally, note that if satisfies this assumption, then so does its multiplicative extension .
We now define an important Fourier-analytic quantity that plays an important role in our analysis. We adopt the name “roughness” because it is the functional inverse of the “smoothing parameter” from [9], which is the smallest that makes the function sufficiently “smooth” as a function of .
Definition 2.13.
For a function , lattice , and real , the roughness is defined as
More generally, for a (linear) subspace of , the -roughness is defined as
(Both inequalities follow from the non-negativity of .)
Lemma 2.14 (adapted from [9, Lemmas 2.9 and 4.1]).
For any satisfying ˜2.12, lattice , real , subspace of defining roughness , and ,
with equality against the upper bound when . In particular, .
Due to space constraints, the proof is left to the full version.
3 List-Decoding Reed–Solomon Codes
3.1 Soft-Decision Decoding
To list-decode Reed–Solomon codes under various norms and probabilistic channel models, we use the “weighted,” or soft-decision, list decoder of Guruswami and Sudan (hereafter GS) [7], as elaborated upon in Guruswami’s thesis [5, Section 6.2.10] and the work of Koetter and Vardy [8]. A soft-decision decoder takes a “weight vector” as input, and outputs a set of codewords.
Definition 3.1.
A weight vector for a length- code over is some where each block is indexed by ; equivalently, each block is a function .
Conceptually, each block of a weight vector may be thought of as specifying a (posterior) probability distribution over , where is proportional to the probability that the th transmitted symbol was , given what was received from the channel (which need not be an element of ). At a formal level, this interpretation makes sense only when the channel is probabilistic (for average-case decoding), but it still serves as useful intuition when the channel is adversarial (for worst-case decoding). We consider both types of channels in our results below.
For , define to be the binary indicator vector indexed by that has a in coordinate and s elsewhere. Similarly, for any vector , define the weight vector . Observe that its Euclidean norm is .
Definition 3.2.
The correlation between a weight vector and a word is defined as their length-normalized inner product (or the cosine of the angle between them):
Theorem 3.3 (adapted from [7, Theorem 18] and [5, Theorem 6.21]).
For a prime power , let be a Generalized Reed–Solomon code of dimension and adjusted rate . There is a deterministic algorithm that, given a weight vector and a “tolerance” , outputs in time the set of all codewords that satisfy
| (3.1) |
We remark that the above theorem is originally stated for rational weights, but the supporting argument (from [5, Lemma 6.20]) easily adapts to handle real-valued weights that can be lower bounded to any needed precision in polynomial time, as all of ours can be.
3.2 From Received Words to Weight Vectors
Here we describe a general approach for translating a received word to a weight vector. This translation is parameterized by a function that, conceptually, can be viewed as (proportional to) the channel’s probability density function, even if the channel is not actually probabilistic.
Let be a function that satisfies ˜2.12, extended multiplicatively to as in Equation 2.1, and recall that for any constant . Next let be a positive integer, and recall that we identify with in the natural way when is prime. Let the set of possible received values be , and for any such value , define the weight function by
Notice that here is applied to a coset of , which represents an infinite series; for all our concrete choices, these series converge and so the function is well defined. This function can also be seen as the vector , indexed by .
In line with the probabilistic conception of weight vectors from Section 3.1 above, the function can be seen as follows. Suppose that a uniformly random symbol in is sent over a channel, which adds (modulo ) noise drawn from a distribution over whose probability density function is proportional to . Then the probability that the sent symbol was , conditioned on receiving , is proportional to . This is because the coset is the set of all noise values that yield if is sent. Note that in the definition of we do not normalize by the total weight (which may vary based on the received value ); this turns out to yield simpler analyses and tighter results.
Definition 3.4.
For a function as above and any received vector , define the corresponding weight vector as
In order to use the soft-decision algorithm (Theorem 3.3) for decoding under an adversarial channel, it suffices to show that we can choose a suitable so that for any received word and any sufficiently close codeword (in the norm of interest), the correlation satisfies (3.1). Similarly, for decoding under a probabilistic channel, it suffices to show that with high probability over the channel noise , the transmitted codeword has large enough correlation with the weight vector of the received word (again, for some suitably chosen ). To this end, in what follows we give a lower bound on and an upper bound on , in terms of and the difference between the received word and the codeword of interest.
3.3 Main Theorem
Here we state and prove the main result of this section. For this we define the two-dimensional integer lattice that consists of all shifts of the lattice by for an integer , i.e.,
We have that , and so . We sometimes omit the subscript when it is clear from context or its value is unimportant.
Theorem 3.5.
Suppose that satisfies ˜2.12. For any and defining , and any ,
Proof.
This follows immediately from the following lower and upper bounds on the numerator and denominator of . For the numerator, by the definitions of and ,
where the last step follows by the inequality of arithmetic and geometric means, and the non-negativity and multiplicativity of over direct sums of cosets (Lemma 2.6). For the denominator, the upper bound is proved in Lemma 3.6 below.
Lemma 3.6.
Adopting the setup from Theorem 3.5, and letting where ,
Proof.
By definition of ,
To bound this, let be arbitrary. By Lemma 2.6,
where the last step follows by the latter part of Lemma 2.14 on the lattice with subspace , and noting that . The claim follows by averaging over .
3.4 Average-Case Decoding
Here we consider list-decoding in the average case, where the channel is probabilistic (not worst case) and the goal is to output a list of codewords that includes the transmitted one. We consider channels that add independent, identically distributed random error (drawn from some specified distribution) to each coordinate of the transmitted codeword; this is often known as a memoryless additive channel. Specifically, we assume that the channel’s error distribution (for each coordinate) is proportional to for some , i.e., it has probability density function
For example, if is a Gaussian function, this is known as the additive white Gaussian noise (AWGN) channel model. In some settings one may also consider a discrete channel distribution, e.g., over , in which case its probability mass function is . For any (which may differ from ), define
In Section 4 we will use the following bound for a specific family of functions to show that the transmitted codeword is recovered with high probability over the channel error.
Lemma 3.7.
For any and defining , and any ,
4 General (Semi)Metrics
In this section we define weight vectors via Definition 3.4 using the function defined as
| (4.1) | ||||
where the gamma function for , and satisfies and . As two important examples, and .
Note that by multiplicativity (Equation 2.1),
Regarding the Fourier transform of , the “normalizing constant” has been defined to make .
It is also known that is non-negative for ; this follows immediately from an elegant lemma and proof due to Logan, given in [4, Lemma 5].666For , by contrast, can have negative values, which prevents our framework from supporting metrics for such . So, satisfies ˜2.12 for such . Another immediate consequence of Logan’s lemma is that as grows, strictly decreases and approaches zero for every .
We will need the following simple lemma.
Lemma 4.1.
For any and defining , we have that .
Due to space constraints, the proof is left to the full version.
4.1 Worst-Case Decoding
We now address list-decoding in the (semi)metric for , under worst-case error. Consider decoding distance , where is the code length, and can be seen as the relative decoding distance (relative to , which is the most natural normalization factor for ). For , relative distance , and positive integer modulus , define
| (4.2) |
By Theorems 3.5 and 3.3, to decode a GRS code of adjusted rate over a prime field to within distance using the GS algorithm, it suffices to set so that . In other words, we can decode under relative distance for any less than
| (4.3) |
The following makes this formal.
Theorem 4.2.
For any , , and prime , the GS soft-decision algorithm using weight vector given by for any list-decodes, up to distance , any GRS code with adjusted rate , in time polynomial in , , and .777We remark that in many cases, the bound on the polynomial running time can be improved using a better lower bound for , such as the one given by Lemma 3.6.
Proof.
We invoke the GS algorithm on the weight vector given by the choice of and the received word , and tolerance .888To be more precise, we can invoke GS on any approximation of in , say. This can be computed by approximating from above to the needed precision, by enumerating sufficiently many points of near the origin, and upper-bounding the contribution of the remaining points in the “tails” using, e.g., Lemma 5.3. The running time is polynomial in , , and , by Lemma 4.1.
Now let be a codeword within distance of , i.e., . By Theorems 3.5, 2.12, and 4.2,
So, by Theorem 3.3, the output of the GS algorithm includes , as needed.
Remark 4.3.
Interestingly, as , , and grow (and the other parameters remain fixed), the product of the relative distance and the adjusted rate for which we can decode approaches the relative radius of a unit-volume ball. Due to space constraints, we defer this derivation to the full version.
4.2 Average-Case Decoding
We now consider average-case decoding under a memoryless additive (continuous or discrete) channel whose density function is proportional to a scaling of . Specifically, we consider the continuous distribution with probability density function , and the discrete distribution over with probability mass function . Following Section 3.4, for any define
For these channel distributions we derive suitable bounds on , then reach the conclusion via Lemmas 3.7 and 3.3.
Lemma 4.4.
For any , any defining a continuous or discrete distribution , and ,
with equality in the continuous case and strict inequality in the discrete case.
Due to space constraints, the proof is left to the full version.
Now, for any channel parameter and for , define
| (4.4) |
where the inequality is by Lemma 4.4. By Theorems 3.5 and 3.3, to decode (with high probability) a GRS code of adjusted rate over a prime field under a channel with parameter , it suffices to set so that . In other words, we can decode under channel parameter for any less than
| (4.5) |
The following makes this formal.
Theorem 4.5.
Let , , , and be prime. Under a memoryless additive (continuous or discrete) channel with distribution , the GS soft-decision algorithm, using weight vector given by for any , list-decodes any GRS code with adjusted rate , in time polynomial in , , and , except with probability less than
The proof is similar to that of Theorem 4.2, but using Lemma 3.7 to bound the probability that . Due to space constraints, the details are left to the full version.
Remark 4.6.
Theorem 4.5 outperforms Theorem 4.2 (for worst-case decoding) by a factor that approaches in the adjusted rate it can handle, as and grow. Specifically, consider a channel with parameter . A calculation reveals that its relative error (in , relative to ) is tightly concentrated around , so following the analysis in Remark 4.3, Theorem 4.2 applies for that approaches . By comparison, Theorem 4.5 applies for that approaches .
5 The Metric and Gaussian Error
In the remainder of the paper we instantiate our general list-decoding results for (semi)metrics (Theorems 4.2 and 4.5) to specific metrics of interest and memoryless additive channels. In this section, we consider the metric and Gaussian channels.
We specialize Equation 4.1 to , i.e., the Gaussian function
By a straightforward calculation it can be seen that this function is its own Fourier transform: . Note that by the time-scaling property of the Fourier transform (Lemma 2.8). Finally, recalling that , we get that is invariant under rotations.
5.1 Bounds
In this subsection we derive fairly tight bounds on the factor that appears in the quantities that govern the adjusted rates under which we can decode in the worst and average cases (Equations 4.2 and 4.4, respectively). For this purpose we need to define a suitable “fudge factor.” For , define
Notice that is positive for , is strictly increasing, and rapidly approaches as increases. Next, for real such that , define
Similarly, is positive for , and rapidly approaches as both increase.
We next state some bounds on and ; Lemma 5.1 is the main one we use. Due to space constraints, the proofs are given in the full version.
Lemma 5.1.
For any real and positive integer such that ,
This follows directly from Lemmas 5.2 and 5.4 below.
Lemma 5.2.
For any real and positive integer , let and where . Then
Next we bound the roughness quantities from Lemmas 5.1 and 5.2, using the following classic tail inequality.
Lemma 5.3 (adapted from [2, Lemma 2.4]).
For any lattice , unit vector , and , let . Then
Lemma 5.4.
Let and . Then
5.2 Worst-Case Decoding
We now address list-decoding in the metric, under worst-case error of bounded distance, by specializing the material of Section 4.1 to and using our bounds on from Section 5.1. So, we consider decoding distance , where is the code length and is the relative decoding distance. Then by Equations 4.2 and 4.3, we can list-decode for any less than
| (5.1) |
where the inequality follows by Lemma 5.1.
Corollary 5.5 below is obtained by nearly maximizing the right-hand side of (5.1). More specifically, a standard calculation shows that taking maximizes the “main term” , to have value . For moderate or larger values of (and hence ), this very nearly maximizes the entire expression, because since , and rapidly approaches as grows. For example, for . So, as grows, the for which we can list-decode rapidly approaches .
Corollary 5.5.
For any and prime , the GS algorithm using weight vector given by for list-decodes, up to distance in time , any GRS code with adjusted rate
Proof.
For , the lower bounds on and imply that . Then by hypothesis and Lemmas 5.1 and 4.2,
The claim then follows directly by Theorem 4.2.
Comparison to [10]
The previous best result for list-decoding (Generalized) Reed–Solomon codes in the metric was given by Mook and Peikert [10].999By a standard reduction, the result from [10] also applies to GRS codes, not just RS codes as was originally stated.
Proposition 5.6 ([10, Theorem 3.4]).
For any GRS code with any adjusted rate and any , there is a -time algorithm that list-decodes up to distance .
Equivalently, for a relative decoding distance , the result from [10] works for adjusted rates approaching , so it applies only for
By contrast, our Theorem 4.2 works for any (arbitrarily large) (and Corollary 5.5 gives a simpler and more explicit rate bound for any ). Moreover, for those for which both Theorems 4.2 and 5.6 apply, our result works for a larger as long as (see (4.3)). For typical (moderate or larger) , this holds for all , which corresponds to . (For tiny , Theorem 4.2 works for , whereas [10] works for , so the latter is better for very small distances.)
We also point out that [10] proves that for any , which corresponds to , its (very simple) choice of weight vector gives an optimal tradeoff between and for the GS/KV soft-decision algorithm and analysis. However, the optimality argument breaks down for (equivalently, for ). And indeed, as we have just seen, we obtain a better distance-rate tradeoff than [10] for almost all such . This highlights the interesting question of determining an optimal choice of weights for the GS soft-decision algorithm for (especially at the low end of this range).
5.3 Unique Decoding for a Subclass of GRS Codes
For a certain natural subclass of GRS codes, and certain rates and decoding distances covered by our list-decoding algorithm, decoding is in fact unique (i.e., the list size is at most one). We show this by giving a lower bound on the minimum distance of such codes, and then observing that our list-decoding algorithm can decode to beyond half this distance for all small enough rates.
Lemma 5.7 (adapted from [12, Theorem 4]).
Any prime-field GRS code (whose twist factors equal the nonzero evaluation points ) of rate has squared minimum distance at least
Due to space constraints, the proof is left to the full version.
Lemma 5.7 gives a relationship between the code rate and (a lower bound on) half the minimum distance, for which decoding to that distance yields a unique solution. By taking the functional inverse of half this minimum-distance bound, we see that decoding to relative distance yields a unique solution as long as
which approaches as grows. This curve is shown in Figure 1. Observe that for any for which our list-decoding algorithm outperforms the one of [10], we have that . In other words, we can efficiently list decode to relative distance for all rates up to (and beyond), thus yielding a unique decoder for these parameters. Alternatively, as the rate approaches zero, we can efficiently list decode to a multiple of the unique-decoding distance bound that approaches .
5.4 Average-Case Decoding
We now consider average-case decoding under a memoryless additive (continuous or discrete) Gaussian channel, by specializing the material of Section 4.2 to and using our bounds on from Section 5.1. Consider a Gaussian channel of parameter . Then by Equations 4.4 and 4.5, we can list-decode for any less than
| (5.2) |
where the inequality is by Lemma 5.1.
Corollary 5.8 below is obtained by nearly maximizing the right-hand side of (5.2). More specifically, setting maximizes the “main term” , to have value . As above, for moderate or larger values of (and hence ), this very nearly maximizes the entire expression, because rapidly approaches as grows.101010By contrast, for values of very close to , in which case the bound is maximized by taking somewhat larger than . So, as grows, the rate for which we can list-decode rapidly approaches .
Corollary 5.8.
For any , , and prime , the GS algorithm using weight vector given by list-decodes, in time , any GRS code with adjusted rate
except with probability less than .
Proof.
By hypothesis, Lemmas 4.4, 5.1, and 4.4,
The claim then follows directly by Theorem 4.5, and the fact that by Lemma 5.2.
6 The Metric and Laplacian Error
In this section, we consider the metric and Laplacian channels. We specialize Equation 4.1 to , i.e., the Laplacian function
(The Fourier transform of this function is given by , but we will not use this; as already noted earlier, satisfies ˜2.12.)
Throughout this section we use the hyperbolic tangent function
and its reciprocal . Observe that approaches as grows; it also satisfies for all , and approaches as approaches zero.111111Both facts can be seen from the Taylor series , valid for .
6.1 Bounds
In this subsection, we analyze the exact value of and derive an asymptotic bound. This appears in the quantities that govern the adjusted rates under which we can decode in the worst and average cases (Equations 4.2 and 4.4, respectively). For this purpose, we define a suitable “fudge factor”. For any real , define
| (6.1) |
where the upper bound comes from the fact that . Note that, as grows, the first term in the sum rapidly approaches one, and the second term rapidly approaches zero. More precisely, a brief calculation reveals that
| (6.2) |
Lemma 6.1.
For any and positive integer ,
Note that by Equation 6.2, for any fixed , as (or equivalently, ) grows, rapidly approaches . In turn, this approaches as grows.
The proof of Lemma 6.1 follows directly from Lemma 6.2 below and Equation 6.1. Due to space constraints, the details are left to the full version.
Lemma 6.2.
For any and positive integer ,
Due to space constraints, the proof is left to the full version.
6.2 Worst-Case Decoding
Now we address list-decoding in the metric, under worst-case error of bounded distance, by specializing the material of Section 4.1 to and using our bound on from Lemma 6.1. We consider decoding distance , where is the code length and is the relative decoding distance. Then by Equations 4.2, 4.3, and 6.1, we can list-decode for any less than
| (6.3) |
Corollary 6.3 below is obtained by maximizing the “main term” of the right-hand side of (6.3). By calculus, this is done by taking , where
Substituting, this means we can list-decode for any less than
| (6.4) |
We consider this quantity’s asymptotic behavior for large and small :
-
As grows, and approaches , hence approaches as also grows. This is consistent with Remark 4.3.
-
As approaches zero, approaches and approaches , hence approaches as also grows.
Corollary 6.3.
For any and prime , the GS algorithm using weight vector for list-decodes, up to distance in time , any GRS code with adjusted rate (see Equation 6.4).
Proof.
Comparison to [12, 16]
To our knowledge, the only prior algorithms for (unique or list) decoding Reed–Solomon codes in the (Lee) metric are [12, Section 5] and [16]. We note that both of these require discrete (integer) error, whereas our algorithm works for continuous error.
For a certain subclass of GRS codes (and BCH codes more generally), [12] gives a unique decoding algorithm for up to half (a lower bound on) the minimum distance, using Euclid’s algorithm for polynomials. This algorithm decodes up to any relative distance . For any prime-field GRS code, [16] gives a list-decoding algorithm that uses GS as a subroutine, and has a piecewise distance-rate tradeoff due to its optimization over an integer parameter. (The algorithm works by putting equal weight on a range of alphabet symbols centered at the received symbol, optimizing over the range size for a given rate.)
6.3 Unique Decoding for a Subclass of GRS Codes
As in Section 5.3, for the same subclass of GRS codes and certain parameters covered by our list-decoding algorithm, the decoding output is in fact unique. To show this, we give a lower bound on the minimum distance of such codes, and then observe that our list-decoding algorithm can decode to beyond half this distance for all small enough rates.
Lemma 6.4 (adapted from [12, Theorem 4]).
Any prime-field GRS code (whose twist factors equal the nonzero evaluation points ) of rate has minimum distance at least
Due to space constraints, the proof is left to the full version.
Lemma 6.4 gives a relationship between the code rate and (a lower bound on) half the minimum distance, for which decoding to that distance yields a unique solution. By taking the functional inverse of half this minimum-distance bound, we see that decoding to relative distance yields a unique solution as long as
which approaches as grows. This curve is shown in Figure 1. Observe that for any for which our list-decoding algorithm outperforms the unique decoder of [12] (or for which [12] does not apply), we have that . In other words, we can efficiently list decode to relative distance for all rates up to (and beyond), thus yielding a unique decoder for these parameters. Alternatively, as the rate approaches zero, we can efficiently list decode to a multiple of the unique-decoding distance bound that approaches .
6.4 Average-Case Decoding
We now consider average-case decoding under a memoryless additive (continuous or discrete) Laplacian channel, by specializing the material of Section 4.2 to and using our bound on from Lemma 6.1. Consider a Laplacian channel of parameter . Then by Equations 4.4 and 4.5, we can list-decode for any less than
| (6.5) |
where the inequality is by Lemma 6.1.
Corollary 6.5 below is obtained by nearly maximizing the right-hand side of (6.5), at least for moderate or large values of . Specifically, we use the bound to approximate the “main term” of (6.5) by . This is maximized at , which makes the original main term equal to . Note that does indeed approach this value as and grow, because approaches , and rapidly approaches (see Equation 6.2).
However, for small values of , the expression in (6.5) is maximized for significantly larger than , to have value much larger than . This maximization can be computed numerically, and indeed, approaches as approaches ; see Figure 1.
Corollary 6.5.
For any , , and prime , the GS algorithm using weight vector given by list-decodes, in time , any GRS code with adjusted rate
except with probability less than .
Proof.
By hypothesis, Lemmas 4.4, 6.1, and 4.4,
The claim then follows directly by Theorem 4.5, and (for the probability bound) the fact that by Lemma 6.2.
References
- [1] Wojciech Banaszczyk. New bounds in some transference theorems in the geometry of numbers. Mathematische Annalen, 296(4):625–635, 1993.
- [2] Wojciech Banaszczyk. Inequalites for convex bodies and polar reciprocal lattices in . Discrete & Computational Geometry, 13:217–231, 1995.
- [3] Peter Elias. Zero error capacity under list decoding. IEEE Transactions on Information Theory, 34(5):1070–1074, September 1988. Originally appeared as Quarterly Progress Report, vol. 48, pp. 88-90, Research Laboratory of Electronics, MIT, January 1958. doi:10.1109/18.21233.
- [4] N. D. Elkies, A. M. Odlyzko, and J. A. Rush. On the packing densities of superballs and other bodies. Inventiones mathematicae, 105:613–639, December 1991.
- [5] Venkatesan Guruswami. List decoding of error correcting codes. PhD thesis, Massachusetts Institute of Technology, 2001. URL: http://dspace.mit.edu/handle/1721.1/8700.
- [6] Venkatesan Guruswami, Atri Rudra, and Madhu Sudan. Essential coding theory, March 2019. URL: https://cse.buffalo.edu/faculty/atri/courses/coding-theory/book/web-coding-book.pdf.
- [7] Venkatesan Guruswami and Madhu Sudan. Improved decoding of Reed-Solomon and algebraic-geometry codes. IEEE Trans. Inf. Theory, 45(6):1757–1767, 1999. Preliminary version in FOCS 1998. doi:10.1109/18.782097.
- [8] Ralf Koetter and Alexander Vardy. Algebraic soft-decision decoding of Reed-Solomon codes. IEEE Trans. Inf. Theory, 49(11):2809–2825, 2003. doi:10.1109/TIT.2003.819332.
- [9] Daniele Micciancio and Oded Regev. Worst-case to average-case reductions based on Gaussian measures. SIAM J. Comput., 37(1):267–302, 2007. Preliminary version in FOCS 2004. doi:10.1137/S0097539705447360.
- [10] Ethan Mook and Chris Peikert. Lattice (list) decoding near Minkowski’s inequality. IEEE Trans. Inf. Theory, 68(2):863–870, 2022. doi:10.1109/TIT.2021.3126540.
- [11] Irving S. Reed and Gustave Solomon. Polynomial codes over certain finite fields. Journal of the Society for Industrial and Applied Mathematics, 8(2):300–304, 1960.
- [12] Ron M. Roth and Paul H. Siegel. Lee-metric BCH codes and their application to constrained and partial-response channels. IEEE Trans. Inf. Theory, 40(4):1083–1096, 1994. doi:10.1109/18.335966.
- [13] Jean-Pierre Serre. A Course in Arithmetic. Springer New York, NY, 1973. doi:10.1007/978-1-4684-9884-4.
- [14] Madhu Sudan. Decoding of Reed Solomon codes beyond the error-correction bound. J. Complex., 13(1):180–193, 1997. doi:10.1006/JCOM.1997.0439.
- [15] John M. Wozencraft. List decoding. Quarterly Progress Report, Research Laboratory of Electronics, MIT, 48:90–95, 1958.
- [16] Xin-Wen Wu, Margreta Kuijper, and Parampalli Udaya. Lee-metric decoding of BCH and Reed–Solomon codes. Electronics Letters, 39(21):1522–1524, October 2003.
