Abstract 1 Introduction 2 Preliminaries 3 Average Baby PIH from Baby PIH References Appendix A From Average Baby PIH to Inapproximability of 𝒌-ExactCover

On Average Baby PIH and Its Applications

Yuwei Liu ORCID Shanghai Jiao Tong University, China Yijia Chen ORCID Shanghai Jiao Tong University, China Shuangle Li ORCID Nanjing University, China Bingkai Lin ORCID Nanjing University, China Xin Zheng ORCID Nanjing University, China
Abstract

The Parameterized Inapproximability Hypothesis (PIH) asserts that no FPT algorithm can decide whether a given 2CSP instance parameterized by the number of variables is satisfiable, or at most a constant fraction of its constraints can be satisfied simultaneously. In a recent breakthrough, Guruswami, Lin, Ren, Sun, and Wu (STOC 2024) proved the PIH under the Exponential Time Hypothesis (ETH). However, it remains a major open problem whether the PIH can be established assuming only 𝖶[𝟣]FPT. Towards this goal, Guruswami, Ren, and Sandeep (CCC 2024) showed a weaker version of the PIH called the Baby PIH under 𝖶[𝟣]FPT. In addition, they proposed one more intermediate assumption known as the Average Baby PIH, which might lead to further progress on the PIH. As the main contribution of this paper, we prove that the Average Baby PIH holds assuming 𝖶[𝟣]FPT.

Given a 2CSP instance where the number of its variables is the parameter, the Average Baby PIH states that no FPT algorithm can decide whether (i) it is satisfiable or (ii) any multi-assignment that satisfies all constraints must assign each variable more than r values on average for any fixed constant r>1. So there is a gap between (i) and (ii) on the average number of values that are assigned to a variable, i.e., 1 vs. r. If this gap occurs in each variable instead of on average, we get the original Baby PIH. So central to our paper is an FPT self-reduction for 2CSP instances that turns the above gap for each variable into a gap on average. By the known 𝖶[𝟣]-hardness for the Baby PIH, this proves that the Average Baby PIH holds under 𝖶[𝟣]FPT.

As applications, we obtain (i) for the first time, the 𝖶[𝟣]-hardness of constant approximating k-ExactCover, and (ii) a tight relationship between running time lower bounds in the Average Baby PIH and approximating the parameterized Nearest Codeword Problem (k-NCP).

Keywords and phrases:
Average Baby PIH, Parameterized Inapproximability, Constraint Satisfaction Problem, Exact Set Cover, W[1]-hardness
Copyright and License:
[Uncaptioned image] © Yuwei Liu, Yijia Chen, Shuangle Li, Bingkai Lin, and Xin Zheng; licensed under Creative Commons License CC-BY 4.0
2012 ACM Subject Classification:
Theory of computation Problems, reductions and completeness
; Theory of computation Parameterized complexity and exact algorithms
Acknowledgements:
The authors want to thank Guohang Liu, Mingjun Liu, Yangluo Zheng for discussions in the early stage of this work. The comments and suggestions from anonymous reviewers also help to improve the paper significantly. In particular, Theorem 2 is due to one of them.
Funding:
Yuwei Liu and Yijia Chen are supported by the National Natural Science Foundation of China (Project 62372291).
Editors:
Olaf Beyersdorff, Michał Pilipczuk, Elaine Pimentel, and Nguyễn Kim Thắng

1 Introduction

In classical complexity theory, the PCP theorem [3, 2, 8] serves as an essential tool for proving most of the existing results in the hardness of approximation. As a variant, the Multi-Assignment PCP theorem [1, Lemma 11] states that, for any constant r>1 and 0<ε<1, it is even NP-hard to decide whether a CSP instance is satisfiable, or any multi-assignment (see Definition 7.) that assigns each variable no more than r values cannot satisfy a (1ε)-fraction of constraints. Among others, the Multi-Assignment PCP theorem was used to show the NP-hardness of approximating SetCover [1]. It turns out that the Multi-Assignment PCP theorem is a simple consequence of the PCP theorem by a straightforward probabilistic argument. Nevertheless, Barto and Kozik [4] gave a direct and purely combinatorial proof for the simple case of ε=0. Observe that it means that the CSP instances under consideration are either satisfiable or cannot be satisfied by a desired multi-assignment. This restricted version of the Multi-Assignment PCP theorem is termed the Baby PCP Theorem in [4].

As an analog of the PCP theorem in parameterized complexity theory, the Parameterized Inapproximability Hypothesis [21], PIH for short, is an important assumption from which we can prove many FPT inapproximability results, including the inapproximability of k-Clique, k-ExactCover [13], and Direct Odd Cycle Transversal [21], etc. It claims that for some constant 0<ε<1, no f(k)nO(1)-time (i.e., FPT) algorithm can distinguish a satisfiable 2CSP instance with k variables from one where less than (1ε)-fraction of constraints can be satisfied simultaneously [21]. Unlike the PCP theorem, the PIH is still a major open problem in parameterized complexity. The current state of the art is that the PIH holds under the Exponential Time Hypothesis (ETH) [12, 11], and a proof of the PIH under the minimum assumption 𝖶[𝟣]FPT remains elusive. Toward this goal, studying some consequences of the PIH and proving them under 𝖶[𝟣]FPT might provide new insights and valuable lessons.

Recently, Guruswami, Ren, and Sandeep [13] proved a parameterized version of the Baby PCP Theorem, appropriately coined as the Baby PIH, under 𝖶[𝟣]FPT. The Baby PIH states that for any constant r>1, no FPT algorithm can distinguish a satisfiable 2CSP instance from one with no satisfying multi-assignment which assigns each variable no more than r values. Just like the relationship between the PCP theorem and the Baby PCP theorem, the Baby PIH is a direct consequence of the PIH. As a next step, a further complexity assumption is suggested, i.e., the Average Baby PIH [13, Conjecture 3], which seems to be sandwiched between the PIH and the Baby PIH. It postulates the 𝖶[𝟣]-hardness of the problem known as Avg-r-Gap-2CSP (see Definition 8) which asks for distinguishing a satisfiable 2CSP instance from one without satisfying multi-assignment which assigns each variable no more than r values on average. The authors of [13] also demonstrated the difference between the Baby PIH and the Average Baby PIH. In fact, for all r>1 and δ>0, they constructed 2CSP instances with variable set X that cannot be satisfied by any multi-assignment assigning each variable in X no more than r values, but can be satisfied by a multi-assignment that assigns in total (1+δ)|X| values to all the variables in X, that is, every variable is assigned 1+δ values on average. Compared to proving 𝖶[𝟣]-hardness111Strictly speaking, the PIH is not a computational problem and we cannot directly define its hardness. The formal statement should be “proving 𝖶[𝟣]-hardness of the problem described in the PIH”, and we use “𝖶[𝟣]-hardness for the PIH” for short in the introduction. of the PIH, it is apparently easier to show the 𝖶[𝟣]-hardness of the Average Baby PIH, and studying the Average Baby PIH might bring us further closer to a proof of the 𝖶[𝟣]-hardness for the PIH. Moreover, the Average Baby PIH is already sufficient for proving some non-trivial inapproximability results such as k-ExactCover [13].

1.1 Main Results

Let Π=(X,Σ,Φ) be a 2CSP instance with a set X of variables, an alphabet Σ, and a set Φ of constraints. A multi-assignment σ^:X2Σ relaxes the standard notion of assignments by assigning each variable xX a set of values in Σ, i.e., σ^(x)Σ. Thereby, Π is said to be satisfied by σ^ if for every constraint φΦ, one can pick for each variable x of φ a value from the set σ^(x) assigned to this variable to satisfy φ. We say that σ^ assigns xX|σ^(x)| values to X in total, or equivalently, each variable in X is assigned xX|σ^(x)||X| values on average (see Definition 7). Our main result is the following theorem stating that the Average Baby PIH holds under 𝖶[𝟣]FPT.

Theorem 1 (Informal, see Theorem 16).

Assume 𝖶[𝟣]FPT. Then for any constant r>1, given a 2CSP instance Π=(X,Σ,Φ) parameterized by |X|, no FPT time algorithm can distinguish between:

  • Π is satisfiable.

  • Any multi-assignment assigning no more than r|X| values to X does not satisfy Π.

Clearly any standard assignment σ:XΣ can be identified with a multi-assignment σ^ that assigns each variable xX a set of a single value, i.e., σ^(x)={σ(x)}. Hence, there is a constant r gap in Theorem 1 between YES and NO instances on the average number of values assigned to each variable, which gives us the aforementioned Avg-r-Gap-2CSP problem. On the other hand, the constant gap for the PIH is on the fraction of constraints that can be satisfied by an assignment. That is, a YES instance is a 2CSP instance whose all constraints can be satisfied by an assignment, while any assignment can only satisfy at most a constant fraction of constraints in a NO instance. So, perhaps surprisingly, the difference between the Average Baby PIH and the PIH can be pinpointed within the Avg-r-Gap-2CSP problem precisely in terms of whether a given instance contains a “dense” or “sparse” set of constraints.222This is pointed out by an anonymous reviewer. More precisely:

Theorem 2.
  • Under 𝖶[𝟣]FPT, the Average Baby PIH holds for Avg-r-Gap-2CSP instances Π=(X,Σ,Φ) with |Φ|=ω(|X|).

  • If the Average Baby PIH holds for Avg-r-Gap-2CSP instances Π=(X,Σ,Φ) with |Φ|=O(|X|), then the PIH holds as well.

As a first application of the Average Baby PIH under 𝖶[𝟣]FPT, using a reduction in [13], we obtain the 𝖶[𝟣]-hardness of constant approximating the k-ExactCover problem (see Definition 9), improving its previous approxmation lower bound under a stronger assumption, i.e., the Gap-ETH [23].

Theorem 3 (Theorem 27 restated).

For any constant r>1, r-approximating k-ExactCover is 𝖶[𝟣]-hard.

We remark that the 𝖶[𝟣]-hardness of approximating k-ExactCover has been a long-standing open problem in parameterized complexity. Although the 𝖶[𝟣], 𝖶[𝟤], ETH-hardness of approximating the k-SetCover problem has been established in [6, 15, 18, 20], as a special case of k-SetCover, the hardness of approximating k-ExactCover was only known under the PIH [23] prior to our work.

The second application is a close relationship between running time lower bounds for constant approximating the parameterized Nearest Codeword Problem γ-Gap-k-NCPp (see Definition 10) and Avg-r-Gap-2CSP. Its proof is a straightforward composition of two known reductions in [23, 13] .

Theorem 4.

For any prime p, computable function g, and constant r, if no f(k)no(g(k))-time algorithm can decide Avg-r-Gap-2CSP with k variables for any computable function f, then r-Gap-k-NCPp cannot be solved in time f(k)no(g(k)) for any computable function f.

Proof Sketch.

Theorem 4 follows from the gap-preserving reduction [13] from Avg-r-Gap-2CSP to k-ExactCover (see also Appendix A), and the gap-preserving reduction from k-ExactCover to r-Gap-k-NCPp [23, Theorem 28]. Note that in both reductions the parameter k is preserved.

1.2 Technical Overview: Local-to-Global Reduction For 2CSP

To prove Theorem 1, we show that the 𝖶[𝟣]-hardness for the Baby PIH implies the 𝖶[𝟣]-hardness for the Average Baby PIH. Here, the “𝖶[𝟣]-hardness for the Baby PIH” means that, for all r>1, it’s 𝖶[𝟣]-hard to decide whether (i) a 2CSP instance is satisfiable, or (ii) it cannot be satisfied by any multi-assignment assigning each variable no more than r values. Thus there is a constant r gap between (i) and (ii) in the number of values assigned to each variable. Thereby, the gap is “local.” As already mentioned, for the Average Baby PIH, the gap is on the average number of values, or equivalently, the total number of values assigned to all the variables. Hence, the gap is “global.” Our reduction from the Baby PIH to the Average Baby PIH is thus said to be “local-to-global.”

Technically, our reduction relies on a simple but crucial property of high-distance error-correcting codes (ECC) shown in [16, 20]. In particular, we need an ECC 𝒞𝔽pm with relative distance 1δ such that any two distinct codewords x,y𝒞 can agree on at most δm entries. So if we have a set S of codewords and an ε-fraction of entries (denoted by I[m]) with εδ such that for each iI, we can find distinct x,yS that agree on their i-th entry, then the size of S must be large. The lower bound of |S| is called the collision number of 𝒞, denote by Colε(𝒞). A simple counting argument in [20] shows that Colε(𝒞)2ε/δ.

Now given a 2CSP instance Π0=(X0,Σ0,Φ0), let n=|Π0|>|Σ0|, parameter k=|X0|, and k=|Φ0|. We fix some ECC with very large relative distance (e.g., a Reed-Solomon code) 𝒞:𝔽pk𝔽pk′′ for prime n1/kp<2n1/k and k′′=Θ(k5). Then we have Colε(𝒞)>2rk. We construct a new 2CSP instance Π=(X1˙X2,Σ,Φ) as:

  • X1={u1,,uk}, X2={v1,,vk′′}.

  • Each uj takes value from the (encoding of) satisfying assignments of φj=(xi1xi2,Cj)Φ0, i.e., {(𝒞(a1),𝒞(a2)):(a1,a2)Cj}. Each v takes value from 𝔽pk.

  • For each ujX1 and vX2, there is a constraint that checks whether uj’s assigned value (w1,w2) and v’s assigned value s satisfy w1[]=s[i1] and w2[]=s[i2].

See Figure 1 for an illustration. Finally, we duplicate X1 and X2 each an appropriate number of times to make them equal in size, finishing our reduction.

Figure 1: An illustration of our construction for an input instance Π0=(X0,Σ0,Φ0) with |X0|=|Φ0|=3.

In general, an assignment to X1 should correspond to a satisfying multi-assignment to the original instance Π0, and the value assigned to each vX2 is a “guess” of the -th entry of the encoding of every xi. It is easy to see that if the input instance Π0 is satisfiable, then so does the new 2CSP instance Π, since for each ujX1 corresponding to φj=(xi1xi2,Cj), we can simply assign it the value (𝒞(σ(xi1)),𝒞(σ(xi2))), where σ is a satisfying assignment for Π0. At the same time, the value assigned to each vX2 is (𝒞(σ(x1))[],,𝒞(σ(xk))[])𝔽pk.

For soundness, suppose Π0 has no satisfying multi-assignment assigning at most r values to each variable, we need to argue that Π has no satisfying multi-assignment that assigns r(1ε)(|X1|+|X2|)/2 values in total. To that end, we exploit the collision number of the code 𝒞. Fix any satisfying multi-assignment σ^ to Π. Recall that each variable ujX1 is assigned some value from a satisfying partial assignment to φjΦ0. Then, σ^(X1) naturally gives a satisfying multi-assignment to Π0, which implies that there exists a variable xiX0 such that more than r different values are assigned by σ^ to uj, for which the corresponding constraint φj contains xi.

Now we have two possible cases for σ^. In the first case, for (1ε)-fraction of variables in X2, the multi-assignment σ^ assigns each of them more than r values. We are done, since this implies that the total number of assigned values by σ^ is more than (1ε)r|X2|. For the second case, there exists an ε-fraction of variables in X2, each of which is assigned by σ^ at most r values. Given such a variable v, we have more than r different codewords assigned to X1 in xi’s position which has at most r possible values in the -th entry. This entails the existence of two different codewords with the same -th entry. Since there are εm such entries, the assignment to X1 must contain at least Colε(𝒞) different codewords. Each assignment to ujX1 contributes two codewords, so the total number of assigned values by σ^ is at least Colε(𝒞)/2>rk. In summary, both cases guarantee a constant gap on either X1 or X2, showing that any satisfying multi-assignment to Π must assign r(1ε)(|X1|+|X2|)/2 values to X1X2 in total.

More details are referred to Section 3.

1.3 Discussions

For minimization problems, the technique of constructing two parts of variables and arguing that at least one part has a large gap seems quite general, as exhibited by the previous works showing the 𝖶[𝟣]-hardness of approximating k-SetCover [6, 20] and k-NCP [17]. The use of the collision number of error-correcting codes in the context of parameterized inapproximability was first introduced in [16] and further developed in [20, 17]. We ask whether these techniques can be unified.

Question 1.

Is there a general framework for proving parameterized inapproximability of minimization problems?

We also suggest two new variants of the Average Baby PIH, which might serve as a next step towards proving the PIH under 𝖶[𝟣]FPT. On closer inspection of our construction, particularly Case 1 in the proof of Lemma 20, it only guarantees the total number of values assigned to X1 (i.e., ΣxX1|σ^(x)|) is large. This could happen if an o(1)-fraction of x’s in X1 is assigned super-constant number of values. We ask if a larger gap can be achieved. So the first one asks whether the Average Baby PIH can be strengthened by requiring a constant fraction of variables to be assigned multiple values.

Question 2.

Under 𝖶[𝟣]FPT, can we prove that for any constant r>2, there exists a constant c>0 such that no FPT algorithm can decide whether a 2CSP instance is satisfiable, or any satisfying multi-assignment must have at least a c-fraction of variables assigned r values?

We remark that the case of r=2 follows from the inapproximability of k-Clique [19, 14, 5].

The second variant is already contained in Theorem 2, thus equivalent to PIH.

Question 3.

Under 𝖶[𝟣]FPT, can we prove that the Average PIH holds even for Avg-r-Gap-2CSP instances with the number of constraints being linear in the number of variables?

It is also interesting to consider whether the inapproximability factor in the Average Baby PIH can be improved to ω(1), since this would directly lead to better lower bounds for approximating k-ExactCover. The current obstacle is that, although the running time of our reduction does not depend on the approximation factor, our reduction relies on the gap created in the Baby PIH [13]. In order to achieve an r-gap in the Baby PIH, the reduction in [13] runs in time Ω(n(2r)r), consequently, the existing FPT reduction cannot create a super-constant r=ω(1) gap .

Question 4.

Under 𝖶[𝟣]FPT, can we prove that the Average PIH (Theorem 1) holds for inapproximability factor r=ω(1), hence giving better inapproximability result for k-ExactCover?

1.4 Organization

In Section 2, we introduce the main computational problems and complexity assumptions studied in this paper. As the central contribution, Section 3 explains our reduction from the Baby PIH to the Average Baby PIH. This, in fact, establishes the Average Baby PIH under 𝖶[𝟣]FPT. In Appendix A, we present a reduction from Avg-r-Gap-2CSP to the constant approximation of k-ExactCover, which slightly differs from the construction in [13].

2 Preliminaries

For a positive integer n, we use [n] to denote the set {1,2,,n}. S1˙˙Sk is the disjoint union of sets S1,,Sk, where we tacitly assume that S1,,Sk are pairwise disjoint. We use log (without subscript) to denote the logarithm number with base 2. For any prime number p, we write 𝔽p for the (unique) finite field of size p. The asymptotic notations, i.e., O,Ω,ω, and Θ, are used following the general convention. The reader is assumed to be familiar with basic notions in parameterized complexity theory, in particular FPT and 𝖶[𝟣]. Otherwise, the standard references are, e.g., [10, 9, 7].

2.1 Problems

Definition 5 (Parameterized 2CSP).

A 2CSP instance is defined as a triple Π=(X,Σ,Φ) where:

  • X is a set of variable.

  • Σ=˙xXΣx, where each Σx contains values that the variable xX can be assigned. Often, we assume that there exists an n such that |Σx|n for all xX.

  • Φ={φ1,,φk}, where each φj=(xi1xi2,Cj) for some xi1,xi2X, and Ci is a subset of Σxi1×Σxi2.

The problem is to decide whether there exists an assignment σ:XΣ that satisfies:

  • For all xX, σ(x)Σx.

  • For all φj=(xi1xi2,Cj)Φ, (σ(xi1),σ(xi2))Cj.

The parameter for this problem is k=|X|, the number of variables. Each pair of variables has at most one constraint, so |Φ|(k2). Without loss of generality, each variable is related to some constraint in Φ. The size of instance Π is defined as |Π|=|Σ|+|Φ|, where the size of each φj is defined as |φj|=|Cj|.

The approximation of parameterized 2CSP refers to the following problem.

Definition 6 (ε-Gap-2CSP).

Given a 2CSP instance Π=(X,Σ,Φ) with parameter k=|X|, we want to distinguish between:

  • Π is satisfiable;

  • any assignment can satisfy at most an ε-fraction of constraints in Φ.

As already mentioned, the notion of multi-assignment extends the usual assignment in such a way that each variable can be assigned multiple values.

Definition 7 (Multi-assignment).

A multi-assignment of a 2CSP instance Π=(X,Σ,Φ) is a function σ^:X2Σ333Here we use 2Σ to denote the power set of Σ. such that for all xX we have σ^(x)Σx. Furthermore, we say that σ^ satisfies Π if:

  • For all φj=(xi1xi2,Cj)Φ, there exist c1σ^(xi1) and c2σ^(xi2) with (c1,c2)Cj.

The individual size of σ^ is defined as maxxX|σ^(x)|, and the total size of σ^ is xX|σ^(x)|.

Let r1. We say that a 2CSP instance Π=(X,Σ,Φ) is r-list satisfiable if there exists a multi-assignment σ^ with individual size no more than r which satisfies Π, and Π is r-average list satisfiable if there exists a multi-assignment σ^ with total size no more than r|X| which satisfies Π.

Definition 8 (Avg-r-Gap-2CSP).

Given a 2CSP instance Π, the goal is to distinguish between the following two cases:

  • Π is satisfiable.

  • Π is not r-average list satisfiable.

We also consider the k-ExactCover problem (aka, the k-UniqueSetCover problem) and the k-NCP problem (aka, k-MLD, for the parameterized Maximum Likelihood Decoding problem) as defined below.

Definition 9 (k-ExactCover).

Given a set U (which we call universe) and a collection of U’s subsets 𝒮, the goal is to distinguish between the following two cases:

  • there exist at most k disjoint sets in 𝒮 that form a partition of U,

  • or U is not the union of any k sets in 𝒮.

Definition 10 (k-NCP).

For prime p, integer d>0, given a (multi-)set V of vectors in 𝔽pd, and a target vector t𝔽pd, the k-NCPp problem asks for distinguishing between:

  • the Hamming distance between t and the vector space spanned by V is at most k,

  • or the Hamming distance between t and the vector space spanned by V is at least k+1.

2.2 Hypotheses

Hypothesis 11 (PIH [21]).

For every constant 0<ε<1, there is no FPT algorithm solving ε-Gap-2CSP.

The Baby PIH, a hypothesis implied by PIH, asserts the hardness of approximating individual size of a satisfying multi-assignment. Formally,

Hypothesis 12 (Baby PIH [13]).

For any constant r>0, no FPT algorithm can on input a 2CSP instance, distinguish whether it is satisfiable, or cannot be satisfied by any multi-assignment with individual size at most r.

We emphasize that the Baby PIH is a hardness hypothesis with a local condition, i.e., the individual size of satisfying assignments. It is shown that the standard assumption 𝖶[𝟣]FPT implies the Baby PIH:

Theorem 13 ([13]).

The Baby PIH holds under 𝖶[𝟣]FPT.

In contrast, the Average Baby PIH is defined on a global condition concerning the total size of satisfying assignments. The precise statement of this complexity assumption contains a technical property on the “shape” of the constraints in a 2CSP instance.

Definition 14 (Rectangular relation).

A 2CSP instance Π=(X,Σ,Φ) is said to have rectangular relations if for each φj=(xi1xi2,Cj)Φ, there exist a set Qj and mappings πj,ρj:ΣQj, such that (a,b)Cj iff πj(a)=ρj(b). We call Qj the underlying set of φj.

Some explanation for “rectangular” might be in order. Recall that a subset SΣ2 is a (combinatorial) rectangle if and only if there exist A,BΣ such that S=A×B. It is easy to verify that RΣ2 is rectangular if and only if R is the union of a set of pairwise disjoint rectangles.

Hypothesis 15 (Average Baby PIH).

For any constant r>0, there exists no FPT algorithm solving the Avg-r-Gap-2CSP problem, even when the instance contains only rectangular relations.

3 Average Baby PIH from Baby PIH

In this section, we show that the Average Baby PIH even for instances with only rectangular relations, is implied by the Baby PIH.

3.1 Proofs of Main Results

We employ a local-to-global reduction developed in [17] to amplify the local gap for one variable (Theorem 13) into a global gap for all variables, thus proving the Average Baby PIH from the Baby PIH.

Theorem 16.

Under 𝖶[𝟣]FPT, for any constant r>0, no FPT algorithm can distinguish a given 2CSP instance with rectangular relation is satisfiable, or cannot be satisfied by any multi-assignment with total size no more than r.

To show Theorem 16, we first introduce some tools from coding theory. The collision number of an error-correcting code characterizes the number of codewords needed to find “collision” on a constant fraction of coordinates. We use the definition in [17]:

Definition 17 (ε-Collision Number).

Let m1 and x,yΣm with xy. For every i[m] we say that x and y collide on position i if x[i]=y[i]. Furthermore, a subset SΣm collides on position i if there exist distinct x,yS with x[i]=y[i]. We define the collision set of S as

ColSet(S)={i[m]|S collides on position i}.

Observe that if |S|1, then ColSet(S)=.

Now for every CΣm and 0<ε<1 the ε-collision number of C, denoted by Colε(C), is the maximum s|C|+1 such that for all S(Cs1) we have

|ColSet(S)|εm.

For Reed-Solomon codes, we have the following lower bounds on their collision number.

Theorem 18 (Theorem 10 in [20], see also [16]).

For any 0<ε<1, any Reed-Solomon code 𝒞RS:𝔽pk𝔽pm with sufficiently large k<mp, Colε(𝒞RS)2εmk.

Our proof of Theorem 16 consists of two reductions. The first one (Lemma 20) reduces 2CSP instances from the Baby PIH, i.e., with a “local” gap as explained in Section 1.2, to a new instance whose constraints are between two disjoint groups of variables. The new instance has a different “global” gap on each group of variables. As the sizes of the two groups might not be balanced, we do not necessarily have a “global” gap on all the variables. But this is easily remedied by the second reduction (Lemma 22) which makes an appropriate number of copies of the two groups.

Definition 19 ((r,s)-Average Multi-Assignment).

Let Π=(X,Σ,Φ) be a bipartite 2CSP instance, in particular X=X1˙X2 and every φ=(x1x2,C)Φ has x1X1 and x2X2. Then for r1,r21 an (r1,r2)-average multi-assignment of Π is a multi-assignment σ^:X2Σ such that

xX1|σ^(x)||X1|r1 and xX2|σ^(x)||X2|r2.

That is, the total size of σ^ restricted to X1 is at most r1|X1|, and the total size of σ^ restricted to X2 is at most r2|X2| (cf. Definition 7). We say Π is (r1,r2)-average list satisfiable if there is an (r1,r2)-average multi-assignment which satisfies Π.

Lemma 20.

There is an algorithm 𝒜 which on input a 2CSP instance Π0=(X0,Σ0,Φ0), ε>0, and r1 computes a bipartite 2CSP instance Π=(X1˙X2,Σ,Φ) with the following properties.

Completeness.

If Π0 is satisfiable, then so is Π,

Soundness.

For every r1 if Π0 is not 2r-list satisfiable, then Π is not (r1,r2)-average list satisfiable for every r1,r2 with

r1+r22(1ε)r.
Rectangularity.

All constraints in Φ are rectangular.

In addition, there exists a computable function f upper bounding the running time of 𝒜 as

f(|X0|+|Φ0|+1/ε+r)|Σ0|O(1). (1)

And the number of variables |X1|+|X2| and the number of constraints |Π| in Π can also be upper bounded by f(|X0|+|Φ0|+1/ε+r).

Proof.

For the given 2CSP instance Π0=(X0,Σ0,Φ0) we let

k=|X0| and k=|Φ0|.

Thereby we fix some enumerations of the variables in X0 and the constraints in Φ0 as

X0={x1,,xk} and Φ0={φ1,,φk}.

Let 𝒞:𝔽pk𝔽pk′′ be a Reed-Solomon code with

2|Σ0|1/k>p|Σ0|1/k and k′′=8(1ε)2r2εk(k)2+1.

Clearly |Σ0|pk, and therefore we can assume without loss of generality

Σ0𝔽pk.

Moreover, we only consider the case that

k′′p(=|𝔽p|)<2|Σ0|1/k,

i.e., Σ0 is sufficiently larger than k and k.444 Otherwise, the original instance Π0 can be solved in time of the form (1), and we can then output some predetermined Π depending on whether Π0 is satisfiable. Hence we can invoke Theorem 18 on Σ𝔽p, kk, mk′′, and εε to obtain

Colε(𝒞(𝔽pk))2εk′′k>4(1ε)rk, (2)

where the second inequality is by our choice of k′′.

Now the algorithm 𝒜 constructs the following bipartite 2CSP instance Π=(X,Σ,Φ).

Variables.

X=X1˙X2 with

X1={u1,,uk} and X2={v1,,vk′′}.
Alphabets.

Σ=uX1ΣuvX2Σv where:

  • For every j[k] the alphabet of the variable ujX1 is

    Σuj={(𝒞(a1),𝒞(a2))|φj=(xi1xi2,C) and (a1,a2)C}(𝒞(𝔽pk))2(𝔽pk′′)2. (3)

    That is, Σuj contains all the (partial) satisfying assignments of φj encoded by 𝒞:𝔽pk𝔽pk′′ as pairs of vectors in 𝔽pk′′. (Recall Σ0𝔽pk.)

  • For every [k′′] we have Σv=𝔽pk. Since p<2|Σ0|1/k, we have |Σv|2k|Σ0|.

Constraints.

Let j[k] and φj=(xi1xi2,C). Then for every [k′′] we have a constraint between the variable ujX1 and vX2 which checks whether uj is assigned to (w1,w2)(𝔽pk′′)2 and v to s𝔽q such that

w1[]=s[i1] and w2[]=s[i2]. (4)

Consequently (4) implies that the constraint is rectangular.555To see this, we take π(uj)=π(w1,w2)=(w1[],w2[]) and ρ(v)=(v[i1],v[i2]). Then equation (4) is precisely π(uj)=ρ(v) as in Definition 14. Moreover, the number of constraints in Π is

kk′′=k2(1ε)2r2εk(k)2+k.

The completeness of our reduction is straightforward. So we turn to the soundness. In particular, we assume that the given 2CSP instance Π0 is not 2r-list satisfiable. Furthermore, let σ^:X2Σ be a satisfying multi-assignment for Π. We need to show that, for any r1,r2 if there is a satisfying (r1,r2)-average multi-assignment σ^, then

r1+r2>2(1ε)r. (5)

To that end, let

𝖶𝗈𝗋𝖽σ^=ujX1(w1,w2)σ^(uj){w1,w2}𝔽pk′′. (6)

That is, 𝖶𝗈𝗋𝖽σ^ is the set of all codewords in 𝔽pk′′ that σ^ uses for the variables in X1.

Claim 21.

Let [k′′] with |σ^(v)|2r. Then 𝖶𝗈𝗋𝖽σ^ collides on position .

Proof of Claim 21.

Let [k′′] be fixed with |σ^(v)|2r.

Consider an arbitrary constraint φj=(xi1xi2,C)Φ0 (i.e., j[k]). Since σ^ is a satisfying multi-assignment for Π, there exist

(w1,w2)σ^(uj)Σuj(𝔽pk′′)2 and sσ^(v)𝔽pk

such that uj=(w1,w2) and v=s satisfy the constraint between uj and v in Π. By (w1,w2)Σuj and (3) there are a1,a2Σ0 with w1=𝒞(a1) and w2=𝒞(a2) such that

xi1=a1 and xi2=a2 satisfy φj. (7)

Then we say that a1 is (σ^,φj)-suitable for xi1 with respect to s, and similarly a2 is (σ^,φj)-suitable for xi2 with respect to s.

In addition, by (4)

𝒞(a1)[]=s[i1] and 𝒞(a2)[]=s[i2]. (8)

Now we define a multi-assignment σ^0:X02Σ0 for the original instance Π0=(X0,Σ0,Φ0) as follows. For every xX0 let

σ^0(x)=sσ^(v){aΣ0|j[k] and a is (σ^,φj)-suitable for x with respect to s}. (9)

(Recall that we have fixed an [k′′] and hence σ^(v).) Since every variable x must appear in at least one constraint φjΦ0 (cf. Definition 5), it is easy to see that σ^0 is a satisfying multi-assignment for Π0 by (7).

As Π0 is not 2r-list satisfiable, there exists an xiX0 (i.e., i[k]) with

|σ^0(xi)|2r+1.

We have assumed that

|σ^(v)|2r,

so by (9) there is an sσ^(v) such that

|{aΣ0|j[k] and a is (σ^,φj)-suitable for xi with respect to s}|2.

Hence there are a1,a2Σ0 with a1a2 and j1,j2[k] such that

  • a1 is (σ^,φj1)-suitable for xi with respect to s,

  • and a2 is (σ^,φj2)-suitable for xi with respect to s.

Then (8) implies that

𝒞(a1)[]=s[i]=𝒞(a2)[].

In other words, 𝒞(a1) and 𝒞(a2) collide on position . Clearly 𝒞(a1),𝒞(a2)𝖶𝗈𝗋𝖽σ^, so this finishes the proof of the claim.

Let

r1=xX1|σ^(x)||X1|=j[k]|σ^(uj)|k and r2=xX2|σ^(x)||X2|=[k′′]|σ^(v)|k′′.

Now we distinguish two cases.

  1. 1.

    There are more than ε fraction of [k′′] such that |σ^(v)|2r, then Claim 21 implies that 𝖶𝗈𝗋𝖽σ^ collides on more than ε fraction of positions [k′′]. Recall (2), i.e.,

    Colε(𝒞(𝔽pk))2εk′′k>4(1ε)rk.

    Hence,

    |𝖶𝗈𝗋𝖽σ^|Colε(𝒞(𝔽pk))>4(1ε)rk.

    By the definition (6) of 𝖶𝗈𝗋𝖽σ^ we deduce

    |𝖶𝗈𝗋𝖽σ^|= |ujX1(w1,w2)σ^(ui){w1,w2}|
    ujX1|(w1,w2)σ^(ui){w1,w2}|ujX12|σ^(ui)|

    It follows that

    r1=j[k]|σ^(uj)|k>4(1ε)rk2k=2(1ε)r.
  2. 2.

    There are at most ε fraction of [k′′] with |σ^(v)|2r. Or equivalently, there are at least (1ε) fraction of [k′′] with |σ^(v)|2r+1. Then

    r2=[k′′]|σ^(v)|k′′(1ε)k′′(2r+1)+εk′′k′′>2(1ε)r.

So both cases lead to (5) as desired.

With some proper replication, the unbalanced (r1,r2)-gap can be turned into a balanced one, and yield the desired r-average list unsatisfiability.

Lemma 22.

For any bipartite 2CSP instance Π=(X1˙X2,Σ,Φ) and r>1 we can compute in polynomial time a 2CSP instance Π=(X,Σ,Φ) with

|X|=2|X1||X2|

such that

Completeness.

If Π is satisfiable, then so is Π,

Soundness.

Let r1. If Π is not (r1,r2)-average list satisfiable for every r1,r21 with r1+r22r, then Π is not r-average list satisfiable. Or equivalently, if Π is r-average list satisfiable, then for some r1,r2 with r1+r22r the bipartite Π is (r1,r2)-average list satisfiable.

Furthermore, if Π is rectangular, then so is Π.

Proof.

Let

k1=|X1| and k2=|X2|.

The desired Π=(X,Σ,Φ) is constructed as below.

Variables.

X consists of k2 copies of X1 and k1 copies of X2, i.e., X=X1˙X2 where

X1={x(i)|xX1 and i[k2]} and X2={x(i)|xX2 and i[k1]}.

Note, |X1|=|X2|=k1k2, therefore Π contains 2k1k2 many variables.

Alphabets.

Σ=xXΣx where:

  • For every xX1 and i[k2], let Σx(i)=Σx. Recall that ΣxΣ is the alphabet for the variable x in the original 2CSP instance Π.

  • Similarly, for every xX2 and i[k1] let Σx(i)=Σx.

Constraints.

For every constraint φ=(x1x2,C)Φ with x1X1 and x2X2, i1[k2], and i2[k1] we have a constraint

φi1,i2=(x1(i1)x2(i2),C)Φ.

That is, φi1,i2 is a copy of φ where the variable x1 is replaced by its i1-th copy x1(i1) and x2 by its i2-th copy x2(i2). It immediately implies that if Π is rectangular, then Π is rectangular too.

Again the completeness is immediate. Towards the soundness, let σ^:X2Σ be a satisfying r-average multi-assignment for Π. In particular,

r=xX|σ^(x)||X|=xX1|σ^(x)|+xX2|σ^(x)||X1|+|X2|=xX1|σ^(x)|+xX2|σ^(x)|2k1k2.

We set

r1=xX1|σ^(x)||X1|=xX1|σ^(x)|k1k2 and r2=xX2|σ^(x)||X2|=xX2|σ^(x)|k1k2 (10)

It follows that

r1+r2=xX1|σ^(x)|+xX2|σ^(x)|k1k2=2r.

Note that

X1=˙i[k2]{x(i)xX1}. (11)

Therefore,

r1k1k2 =r1|X1| (by |X1|=k1k2)
=xX1|σ^(x)| (by (10))
=i[k2]xX1|σ^(x(i))|. (by (11))

Hence, there exists an i1[k2] such that

xX1|σ^(x(i1))|r1k1,or equivalentlyxX1|σ^(x(i1))||X1|r1

by |X1|=k1. Arguing similarly for X2 we get an i2[k1] such that

xX2|σ^(x(i2))||X2|r2

Finally we define a multi-assignment σ^ for the original instance Π by

σ^(x)={σ^(x(i1))if xX1σ^(x(i2))if xX2.

By the above argument, σ^ is (r1,r2)-average. Moreover, it satisfies Π, since σ^ satisfies Π.

Putting all pieces together, we have Theorem 16.

Proof of Theorem 16.

We give an FPT reduction from instances in the Baby PIH (Theorem 13) to Avg-r-Gap-2CSP. Then, since the Baby PIH holds under 𝖶[𝟣]FPT, we deduce that the Average Baby PIH also holds under 𝖶[𝟣]FPT.

For any 2CSP instance Π0=(X0,Σ0,Φ0), we can construct a bipartite 2CSP instance Π1=(X1,Σ1,Φ1) by Lemma 20, and then construct an Avg-r-Gap-2CSP instance Π=(X,Σ,Φ) from Π1 by Lemma 22. Trivially, Π is satisfiable when Π0 is satisfiable. When Π0 is not r-list satisfiable, Π1 is not (r1,r2)-average list satisfiable for all constants r1,r2 with r1+r22(1ε)r, and thus Π is not (1ε)r-average list satisfiable. Furthermore, Π has rectangular relations because Π1 has rectangular relations.

Moreover, the running time of this reduction can be bounded by

f(|X0|+|Φ0|+1/ε+r)|Σ0|O(1)

for a computable function f, and

|X|+|Φ|f(|X0|+|Φ0|+1/ε+r)|Σ0|O(1)

as well, so the reduction is an FPT reduction.

3.2 Average Baby PIH on Dense and Sparse Instances

In this section we prove Theorem 2, which is divided into two separate lemmas to help with readability. For our purposes, a 2CSP instance Π=(X,Σ,Φ) is dense if |Φ|=ω(|X|); or it is sparse, if |Φ|=O(|X|).

Lemma 23.

Under 𝖶[𝟣]FPT, the Average Baby PIH holds for all Avg-r-Gap-2CSP instances that are dense.

Proof.

The reduction in the proof of Lemma 20 yields complete bipartite 2CSP instances Π1=(X1˙X2,Σ1,Φ1), i.e., for each x1X1 and x2X2, there exists a constraint φ=(x1x2,C)Φ. Then the reduction in the proof of Lemma 22 makes |X2| copies of X1 and |X1| copies of |X2|, while keeping the constraints in each pair of copies. So in the final instance Π=(X,Σ,Φ) from the proof of Theorem 16, the number of constraints is

|Φ|=|X1|2|X2|2=|X|24.

Now consider any function hω(1). We produce a new instance Π=(X,Σ,Φ) by simply copying Π for t times, where t is chosen as the minimum number satisfying

h(t|X|)|X|4.

Note that there is no constraint between different copies. Then, the new parameter is |X|=t|X|, and

|Φ|=t|Φ|=t|X|24=|X|4|X|h(|X|)|X|.

It’s clear that this reduction runs in FPT time. Also, if Π is satisfiable, then Π is satisfiable. If any satisfying multi-assignment to Π must have total size more than r|X|, then any satisfying multi-assignment to Π must assign each copy of Π more than r|X| values, so in total more than r|X| values, preserving the gap.

Lemma 24.

Let r>1. If there exists a constant c>0 such that no FPT algorithm can solve Avg-r-Gap-2CSP on instance Π=(X,Σ,Φ) with |Φ|cr|X|, i.e., Π is sparse, then the PIH holds.

Proof Sketch.

Let Π=(X,Σ,Φ) be a NO instance of Avg-r-Gap-2CSP. For any (standard) assignment σ:XΣ, assume that σ violates t constraints, then one can simply add at most 2t values to σ and obtain a satisfying multi-assignment σ^ with total size |X|+2t. Since Π is a NO instance, we have |X|+2t>r|X|. Thus,

t>r12|X|=r12crcr|X|r12cr|Φ|.

In other words, any assignment to Π must violate a constant fraction of the constraints in Π. This gives a reduction from Avg-r-Gap-2CSP to PIH.

Putting Lemma 23 and Lemma 24 together, we obtain Theorem 2. As already mentioned in the introduction, this result indicates that the current barrier to the 𝖶[𝟣]-hardness of the PIH is the lack of reduction for Avg-r-Gap-2CSP on sparse instances, i.e., instances with linearly many constraints.

References

  • [1] Noga Alon, Dana Moshkovitz, and Shmuel Safra. Algorithmic construction of sets for k-restrictions. ACM Trans. Algorithms, 2(2):153–177, 2006. doi:10.1145/1150334.1150336.
  • [2] Sanjeev Arora, Carsten Lund, Rajeev Motwani, Madhu Sudan, and Mario Szegedy. Proof verification and the hardness of approximation problems. J. ACM, 45(3):501–555, 1998. doi:10.1145/278298.278306.
  • [3] Sanjeev Arora and Shmuel Safra. Probabilistic checking of proofs: A new characterization of NP. J. ACM, 45(1):70–122, 1998. doi:10.1145/273865.273901.
  • [4] Libor Barto and Marcin Kozik. Combinatorial gap theorem and reductions between promise CSPs. In Joseph (Seffi) Naor and Niv Buchbinder, editors, Proceedings of the 2022 ACM-SIAM Symposium on Discrete Algorithms, SODA 2022, Virtual Conference / Alexandria, VA, USA, January 9 - 12, 2022, pages 1204–1220. SIAM, 2022. doi:10.1137/1.9781611977073.50.
  • [5] Yijia Chen, Yi Feng, Bundit Laekhanukit, and Yanlin Liu. Simple combinatorial construction of the ko(1)-lower bound for approximating the parameterized k-Clique. In 2025 Symposium on Simplicity in Algorithms (SOSA), pages 263–280. Society for Industrial and Applied Mathematics, 2025. doi:10.1137/1.9781611978315.21.
  • [6] Yijia Chen and Bingkai Lin. The constant inapproximability of the parameterized Dominating Set problem. SIAM J. Comput., 48(2):513–533, 2019. doi:10.1137/17M1127211.
  • [7] Marek Cygan, Fedor V. Fomin, Lukasz Kowalik, Daniel Lokshtanov, Dániel Marx, Marcin Pilipczuk, Michal Pilipczuk, and Saket Saurabh. Parameterized Algorithms. Springer, 2015. doi:10.1007/978-3-319-21275-3.
  • [8] Irit Dinur. The PCP theorem by gap amplification. J. ACM, 54(3):12, 2007. doi:10.1145/1236457.1236459.
  • [9] Rodney G. Downey and Michael R. Fellows. Fundamentals of Parameterized Complexity. Texts in Computer Science. Springer, 2013. doi:10.1007/978-1-4471-5559-1.
  • [10] Jörg Flum and Martin Grohe. Parameterized Complexity Theory. Texts in Theoretical Computer Science. An EATCS Series. Springer, 2006. doi:10.1007/3-540-29953-X.
  • [11] Venkatesan Guruswami, Bingkai Lin, Xuandi Ren, Yican Sun, and Kewen Wu. Almost optimal time lower bound for approximating parameterized Clique, CSP, and more, under ETH. CoRR, abs/2404.08870, 2024. doi:10.48550/arXiv.2404.08870.
  • [12] Venkatesan Guruswami, Bingkai Lin, Xuandi Ren, Yican Sun, and Kewen Wu. Parameterized Inapproximability Hypothesis under Exponential Time Hypothesis. In Bojan Mohar, Igor Shinkar, and Ryan O’Donnell, editors, Proceedings of the 56th Annual ACM Symposium on Theory of Computing, STOC 2024, Vancouver, BC, Canada, June 24-28, 2024, pages 24–35. ACM, 2024. doi:10.1145/3618260.3649771.
  • [13] Venkatesan Guruswami, Xuandi Ren, and Sai Sandeep. Baby PIH: parameterized inapproximability of min CSP. In Rahul Santhanam, editor, 39th Computational Complexity Conference, CCC 2024, July 22-25, 2024, Ann Arbor, MI, USA, volume 300 of LIPIcs, pages 27:1–27:17. Schloss Dagstuhl – Leibniz-Zentrum für Informatik, 2024. doi:10.4230/LIPICS.CCC.2024.27.
  • [14] Karthik C. S. and Subhash Khot. Almost polynomial factor inapproximability for parameterized k-Clique. In Shachar Lovett, editor, 37th Computational Complexity Conference, CCC 2022, July 20-23, 2022, Philadelphia, PA, USA, volume 234 of LIPIcs, pages 6:1–6:21. Schloss Dagstuhl – Leibniz-Zentrum für Informatik, 2022. doi:10.4230/LIPICS.CCC.2022.6.
  • [15] Karthik C. S., Bundit Laekhanukit, and Pasin Manurangsi. On the parameterized complexity of approximating Dominating Set. J. ACM, 66(5):33:1–33:38, 2019. doi:10.1145/3325116.
  • [16] Karthik C. S. and Inbal Livni Navon. On hardness of approximation of parameterized Set Cover and Label Cover: Threshold graphs from error correcting codes. In Hung Viet Le and Valerie King, editors, 4th Symposium on Simplicity in Algorithms, SOSA 2021, Virtual Conference, January 11-12, 2021, pages 210–223. SIAM, 2021. doi:10.1137/1.9781611976496.24.
  • [17] Shuangle Li, Bingkai Lin, and Yuwei Liu. Improved lower bounds for approximating parameterized nearest codeword and related problems under ETH. In Karl Bringmann, Martin Grohe, Gabriele Puppis, and Ola Svensson, editors, 51st International Colloquium on Automata, Languages, and Programming, ICALP 2024, July 8-12, 2024, Tallinn, Estonia, volume 297 of LIPIcs, pages 107:1–107:20. Schloss Dagstuhl – Leibniz-Zentrum für Informatik, 2024. doi:10.4230/LIPICS.ICALP.2024.107.
  • [18] Bingkai Lin. A simple gap-producing reduction for the parameterized Set Cover problem. In Christel Baier, Ioannis Chatzigiannakis, Paola Flocchini, and Stefano Leonardi, editors, 46th International Colloquium on Automata, Languages, and Programming, ICALP 2019, July 9-12, 2019, Patras, Greece, volume 132 of LIPIcs, pages 81:1–81:15. Schloss Dagstuhl – Leibniz-Zentrum für Informatik, 2019. doi:10.4230/LIPICS.ICALP.2019.81.
  • [19] Bingkai Lin. Constant approximating k-Clique is W[1]-hard. In Samir Khuller and Virginia Vassilevska Williams, editors, STOC ’21: 53rd Annual ACM SIGACT Symposium on Theory of Computing, Virtual Event, Italy, June 21-25, 2021, pages 1749–1756. ACM, 2021. doi:10.1145/3406325.3451016.
  • [20] Bingkai Lin, Xuandi Ren, Yican Sun, and Xiuhan Wang. Constant approximating parameterized k-SetCover is W[2]-hard. In Nikhil Bansal and Viswanath Nagarajan, editors, Proceedings of the 2023 ACM-SIAM Symposium on Discrete Algorithms, SODA 2023, Florence, Italy, January 22-25, 2023, pages 3305–3316. SIAM, 2023. doi:10.1137/1.9781611977554.CH126.
  • [21] Daniel Lokshtanov, M. S. Ramanujan, Saket Saurabh, and Meirav Zehavi. Parameterized complexity and approximability of Directed Odd Cycle Transversal. In Shuchi Chawla, editor, Proceedings of the 2020 ACM-SIAM Symposium on Discrete Algorithms, SODA 2020, Salt Lake City, UT, USA, January 5-8, 2020, pages 2181–2200. SIAM, 2020. doi:10.1137/1.9781611975994.134.
  • [22] Carsten Lund and Mihalis Yannakakis. On the hardness of approximating minimization problems. J. ACM, 41(5):960–981, 1994. doi:10.1145/185675.306789.
  • [23] Pasin Manurangsi. Tight running time lower bounds for strong inapproximability of maximum k-Coverage, Unique Set Cover and related problems (via t-wise agreement testing theorem). In Shuchi Chawla, editor, Proceedings of the 2020 ACM-SIAM Symposium on Discrete Algorithms, SODA 2020, Salt Lake City, UT, USA, January 5-8, 2020, pages 62–81. SIAM, 2020. doi:10.1137/1.9781611975994.5.

Appendix A From Average Baby PIH to Inapproximability of 𝒌-ExactCover

We present a proof relying on a construction that slightly differs from the one in [13]. Their proof makes use of the (T,m)-set gadget [22, 1] that was previously used to show the hardness of approximating SetCover problem. On the other hand, our proof develops a novel composition of the collision number of ECCs (recall Definition 17) with the following well-known combinatorial object.

Definition 25 (Hypercube Partition System).

Let A,B be two sets. Then the (A,B)-hypercube partition system is defined by

  • the universe =AB(={z|a function z:BA}), and

  • a collection of subsets {Px,y}xB,yA where each Px,y={z|z(x)=y}.

Theorem 26 (cf. Theorem 21 in [13]).

Assume that the Average Baby PIH holds on all 2CSP instances with rectangular relations. Then k-ExactCover cannot be approximated in FPT time within any constant factor. More precisely, for every constant r>1 no FPT algorithm, on a given k-SetCover instance Π=(S,U) with size n and k1, can distinguish between the following two cases:

  • We can choose k disjoint sets in S whose union is U.

  • U is not the union of any rk sets in S.

Proof.

Let Π=(X,Σ,Φ) be an Avg-r-Gap-2CSP instance with rectangular relations. We set k=|X|. Moreover, for each rectangular constraint φj=(xi1xi2,Cj)Φ we use Qj to denote the underlying set and πj,ρj:ΣQj the associated mappings as in Definition 14. That is, for every a,bΣ, it holds that (a,b)Cj if and only if πj(a)=ρj(b). Then we set

t=maxφjΦ|Qj|. (12)

Clearly, we can assume without loss of generality

t|Π|.

Now we reduce Π to a k-ExactCover instance. To that end, we choose a further alphabet Δ whose size is a prime and satisfies

max{logt,22r2k2}|Δ|2max{logt,22r2k2}.

Moreover,let

d=2r2k2logtlog|Δ|.

This leads to the following code with very large distance (here we simply use Reed-Solomon code again)

Enc:Δlogtlog|Δ|Δd.

Plugging

klogtlog|Δ|,md,p|Δ|,andε1/2

in Theorem 18 we conclude that the 1/2-collision number of Enc is

Col1/2(Enc)dlogt/log|Δ|>rk.

Observe that (12) implies that every tuple in Ci can be identified with a string in Δlogmlog|Δ|, i.e., the domain of Enc.

Then, for each variable xX and every its possible value aΣ, we define a set Sx,a as follows. For each constraint φj=(xi1xi2,Cj)Φ with associated set Tj and mappings πj,ρj:ΣQj, and for each [d], we construct a ([2],Δ)-hypercube partition system

((j,),{Pu,v(j,)}uΔ,v[2]). (13)

Then for each (a,b)Cj we add PEnc(πj(a))[],1(j,) to Sxi1,a and similarly PEnc(ρj(b))[],2(j,) to Sxi2,b. Finally, let the universe be

U=˙φjΦ,[d](j,),and𝒮={Sx,a|xX and aΣ}.

For the completeness, let σ:XΣ be a satisfying assignment of Π, it is routine to check that {Sx,σ(x)}xX is a partition of U.

For the soundness, assume that every satisfying multi-assignment of Π has total size at least rk (cf. Definition 7). Let 𝒮𝒮 be a cover of U. Consider the multi-assignment that maps every variable xX to {aΣ|Sx,a𝒮}. If this multi-assignment satisfies Π, the our assumption implies |𝒮|rk. Otherwise, assume that there exists some constraint φj=(xi1xi2,Cj)Φ which is not satisfied. Note that the above multi-assignment assigns xi1 to E1={aΣ|Sxi1,a𝒮} and xi2 to E2={bΣ|Sxi2,b𝒮}. Since φj is not satisfied, for all (a,b)E1×E2 we have Enc(πj(a))Enc(ρj(b)). However, for each [d], since (j,) is covered by 𝒮, there must exist aE1 and bE2 with Enc(πj(a))[]=Enc(ρj(b))[]. Therefore, the set {πj(a)}aE1{ρj(b)}bE2 collides on all coordinates [d], hence it must have size at least Col1/2(Enc). We deduce

|𝒮||E1|+|E2||{πj(a)}aE1{ρj(b)}bE2|Col1/2(Enc)>rk.

Finally, in each hypercube partition system (13) it holds that

|(j,)|=2|Δ|4logt+422r2k2|Π|2+422r2k2,

and there are at most (k2)dk2r2k2logtr2k4log|Π| such systems. The size of the universe U is thus at most g(r,k)|Π|3 for some appropriate computable function g:2, while the parameter of the k-ExactCover instance remains k=|X|. It follows easily that the running time of this reduction is FPT.

Combining Theorem 26 and Theorem 16, we obtain:

Theorem 27.

For any constant r>1, r-approximating k-ExactCover is 𝖶[𝟣]-hard.