On Average Baby PIH and Its Applications

Liu, Yuwei; Chen, Yijia; Li, Shuangle; Lin, Bingkai; Zheng, Xin

doi:10.4230/LIPIcs.STACS.2025.65


Yuwei Liu, Shanghai Jiao Tong University, China

Yijia Chen, Shanghai Jiao Tong University, China

Shuangle Li, Nanjing University, China

Bingkai Lin, Nanjing University, China

Xin Zheng, Nanjing University, China

Abstract

The Parameterized Inapproximability Hypothesis (PIH) asserts that no FPT algorithm can decide whether a given $2$ CSP instance parameterized by the number of variables is satisfiable, or at most a constant fraction of its constraints can be satisfied simultaneously. In a recent breakthrough, Guruswami, Lin, Ren, Sun, and Wu (STOC 2024) proved the PIH under the Exponential Time Hypothesis (ETH). However, it remains a major open problem whether the PIH can be established assuming only $\mathsf{W[1]}\neq\textup{{FPT}}$ . Towards this goal, Guruswami, Ren, and Sandeep (CCC 2024) showed a weaker version of the PIH called the Baby PIH under $\mathsf{W[1]}\neq\textup{{FPT}}$ . In addition, they proposed one more intermediate assumption known as the Average Baby PIH, which might lead to further progress on the PIH. As the main contribution of this paper, we prove that the Average Baby PIH holds assuming $\mathsf{W[1]}\neq\textup{{FPT}}$ .

Given a $2$ CSP instance where the number of its variables is the parameter, the Average Baby PIH states that no FPT algorithm can decide whether (i) it is satisfiable or (ii) any multi-assignment that satisfies all constraints must assign each variable more than $r$ values on average for any fixed constant $r>1$ . So there is a gap between (i) and (ii) on the average number of values that are assigned to a variable, i.e., $1$ vs. $r$ . If this gap occurs in each variable instead of on average, we get the original Baby PIH. So central to our paper is an FPT self-reduction for $2$ CSP instances that turns the above gap for each variable into a gap on average. By the known $\mathsf{W[1]}$ -hardness for the Baby PIH, this proves that the Average Baby PIH holds under $\mathsf{W[1]}\neq\textup{{FPT}}$ .

As applications, we obtain (i) for the first time, the $\mathsf{W[1]}$ -hardness of constant approximating $k$ -ExactCover, and (ii) a tight relationship between running time lower bounds in the Average Baby PIH and approximating the parameterized Nearest Codeword Problem ( $k$ -NCP).

Keywords and phrases:

Average Baby PIH, Parameterized Inapproximability, Constraint Satisfaction Problem, Exact Set Cover, W[1]-hardness


2012 ACM Subject Classification:

Theory of computation $\rightarrow$ Problems, reductions and completeness; Theory of computation $\rightarrow$ Parameterized complexity and exact algorithms

Acknowledgements:

The authors thank Guohang Liu, Mingjun Liu, and Yangluo Zheng for discussions in the early stage of this work. The comments and suggestions from the anonymous reviewers also helped to improve the paper significantly. In particular, Theorem 2 is due to one of them.

Funding:

Yuwei Liu and Yijia Chen are supported by the National Natural Science Foundation of China (Project 62372291).

DOI:

10.4230/LIPIcs.STACS.2025.65

Event:

42nd International Symposium on Theoretical Aspects of Computer Science (STACS 2025)

Editors:

Olaf Beyersdorff, Michał Pilipczuk, Elaine Pimentel, and Nguyễn Kim Thắng

Series and Publisher:

Leibniz International Proceedings in Informatics, Schloss Dagstuhl – Leibniz-Zentrum für Informatik

1 Introduction

In classical complexity theory, the PCP theorem [3, 2, 8] serves as an essential tool for proving most of the existing results in the hardness of approximation. As a variant, the Multi-Assignment PCP theorem [1, Lemma 11] states that, for any constants $r>1$ and $0<\varepsilon<1$, it is even NP-hard to decide whether a CSP instance is satisfiable, or any multi-assignment (see Definition 7) that assigns each variable no more than $r$ values cannot satisfy a $(1-\varepsilon)$-fraction of constraints. Among others, the Multi-Assignment PCP theorem was used to show the NP-hardness of approximating SetCover [1]. It turns out that the Multi-Assignment PCP theorem is a simple consequence of the PCP theorem by a straightforward probabilistic argument. Nevertheless, Barto and Kozik [4] gave a direct and purely combinatorial proof for the simple case of $\varepsilon=0$. Observe that in this case the CSP instances under consideration are either satisfiable or cannot be satisfied by any multi-assignment of the desired form. This restricted version of the Multi-Assignment PCP theorem is termed the Baby PCP Theorem in [4].

As an analog of the PCP theorem in parameterized complexity theory, the Parameterized Inapproximability Hypothesis [21], PIH for short, is an important assumption from which we can prove many FPT inapproximability results, including the inapproximability of $k$-Clique, $k$-ExactCover [13], and Directed Odd Cycle Transversal [21]. It claims that for some constant $0<\varepsilon<1$, no $f(k)\cdot n^{O(1)}$-time (i.e., FPT) algorithm can distinguish a satisfiable 2CSP instance with $k$ variables from one where less than a $(1-\varepsilon)$-fraction of constraints can be satisfied simultaneously [21]. Unlike the PCP theorem, the PIH is still a major open problem in parameterized complexity. The current state of the art is that the PIH holds under the Exponential Time Hypothesis (ETH) [12, 11], and a proof of the PIH under the minimal assumption $\mathsf{W[1]}\neq\textup{{FPT}}$ remains elusive. Toward this goal, studying some consequences of the PIH and proving them under $\mathsf{W[1]}\neq\textup{{FPT}}$ might provide new insights and valuable lessons.

Recently, Guruswami, Ren, and Sandeep [13] proved a parameterized version of the Baby PCP Theorem, appropriately coined as the Baby PIH, under $\mathsf{W[1]}\neq\textup{{FPT}}$. The Baby PIH states that for any constant $r>1$, no FPT algorithm can distinguish a satisfiable 2CSP instance from one with no satisfying multi-assignment which assigns each variable no more than $r$ values. Just like the relationship between the PCP theorem and the Baby PCP theorem, the Baby PIH is a direct consequence of the PIH. As a next step, a further complexity assumption is suggested, i.e., the Average Baby PIH [13, Conjecture 3], which seems to be sandwiched between the PIH and the Baby PIH. It postulates the $\mathsf{W[1]}$-hardness of the problem known as Avg-$r$-Gap-$\mathrm{2CSP}$ (see Definition 8), which asks for distinguishing a satisfiable 2CSP instance from one with no satisfying multi-assignment which assigns each variable no more than $r$ values on average. The authors of [13] also demonstrated the difference between the Baby PIH and the Average Baby PIH. In fact, for all $r>1$ and $\delta>0$, they constructed 2CSP instances with variable set $X$ that cannot be satisfied by any multi-assignment assigning each variable in $X$ no more than $r$ values, but can be satisfied by a multi-assignment that assigns in total $(1+\delta)|X|$ values to all the variables in $X$, that is, every variable is assigned $1+\delta$ values on average. Compared to proving the $\mathsf{W[1]}$-hardness¹ of the PIH, it is apparently easier to show the $\mathsf{W[1]}$-hardness of the Average Baby PIH, and studying the Average Baby PIH might bring us closer to a proof of the $\mathsf{W[1]}$-hardness for the PIH. Moreover, the Average Baby PIH is already sufficient for proving some non-trivial inapproximability results such as $k$-ExactCover [13].

¹ Strictly speaking, the PIH is not a computational problem and we cannot directly define its hardness. The formal statement should be “proving $\mathsf{W[1]}$-hardness of the problem described in the PIH”, and we use “$\mathsf{W[1]}$-hardness for the PIH” for short in the introduction.

1.1 Main Results

Let $\Pi=(X,\Sigma,\Phi)$ be a 2CSP instance with a set $X$ of variables, an alphabet $\Sigma$, and a set $\Phi$ of constraints. A multi-assignment $\hat{\sigma}:X\rightarrow 2^{\Sigma}$ relaxes the standard notion of assignments by assigning each variable $x\in X$ a set of values in $\Sigma$, i.e., $\hat{\sigma}(x)\subseteq\Sigma$. Thereby, $\Pi$ is said to be satisfied by $\hat{\sigma}$ if for every constraint $\varphi\in\Phi$, one can pick for each variable $x$ of $\varphi$ a value from the set $\hat{\sigma}(x)$ assigned to this variable to satisfy $\varphi$. We say that $\hat{\sigma}$ assigns $\sum_{x\in X}|\hat{\sigma}(x)|$ values to $X$ in total, or equivalently, each variable in $X$ is assigned $\frac{\sum_{x\in X}|\hat{\sigma}(x)|}{|X|}$ values on average (see Definition 7). Our main result is the following theorem stating that the Average Baby PIH holds under $\mathsf{W[1]}\neq\textup{{FPT}}$.

Theorem 1 (Informal, see Theorem 16).

Assume $\mathsf{W[1]}\neq\textup{{FPT}}$ . Then for any constant $r>1$ , given a 2CSP instance $\Pi=(X,\Sigma,\Phi)$ parameterized by $|X|$ , no FPT time algorithm can distinguish between:

$\blacksquare$

$\Pi$ is satisfiable.
$\blacksquare$

Any multi-assignment assigning no more than $r|X|$ values to $X$ does not satisfy $\Pi$ .

Clearly any standard assignment $\sigma:X\to\Sigma$ can be identified with a multi-assignment $\hat{\sigma}$ that assigns each variable $x\in X$ a set consisting of a single value, i.e., $\hat{\sigma}(x)=\{\sigma(x)\}$. Hence, there is a constant $r$ gap in Theorem 1 between YES and NO instances on the average number of values assigned to each variable, which gives us the aforementioned Avg-$r$-Gap-$\mathrm{2CSP}$ problem. On the other hand, the constant gap for the PIH is on the fraction of constraints that can be satisfied by an assignment. That is, a YES instance is a 2CSP instance all of whose constraints can be satisfied by an assignment, while any assignment can only satisfy at most a constant fraction of constraints in a NO instance. So, perhaps surprisingly, the difference between the Average Baby PIH and the PIH can be pinpointed within the Avg-$r$-Gap-$\mathrm{2CSP}$ problem precisely in terms of whether a given instance contains a “dense” or “sparse” set of constraints.² More precisely:

² This is pointed out by an anonymous reviewer.

Theorem 2.

$\blacksquare$

Under $\mathsf{W[1]}\neq\textup{{FPT}}$ , the Average Baby PIH holds for Avg- $r$ -Gap- $\mathrm{2CSP}$ instances $\Pi=(X,\Sigma,\Phi)$ with $|\Phi|=\omega(|X|)$ .
$\blacksquare$

If the Average Baby PIH holds for Avg- $r$ -Gap- $\mathrm{2CSP}$ instances $\Pi=(X,\Sigma,\Phi)$ with $|\Phi|=O(|X|)$ , then the PIH holds as well.

As a first application of the Average Baby PIH under $\mathsf{W[1]}\neq\textup{{FPT}}$, using a reduction in [13], we obtain the $\mathsf{W[1]}$-hardness of constant approximating the $k$-ExactCover problem (see Definition 9), improving its previous approximation lower bound under a stronger assumption, i.e., the Gap-ETH [23].

Theorem 3 (Theorem 27 restated).

For any constant $r>1$ , $r$ -approximating $k$ -ExactCover is $\mathsf{W[1]}$ -hard.

We remark that the $\mathsf{W[1]}$ -hardness of approximating $k$ -ExactCover has been a long-standing open problem in parameterized complexity. Although the $\mathsf{W[1]}$ , $\mathsf{W[2]}$ , ETH-hardness of approximating the $k$ -SetCover problem has been established in [6, 15, 18, 20], as a special case of $k$ -SetCover, the hardness of approximating $k$ -ExactCover was only known under the PIH [23] prior to our work.

The second application is a close relationship between running time lower bounds for constant approximating the parameterized Nearest Codeword Problem $\gamma$-Gap-$k$-NCP$_{p}$ (see Definition 10) and Avg-$r$-Gap-$\mathrm{2CSP}$. Its proof is a straightforward composition of two known reductions in [23, 13].

Theorem 4.

For any prime $p$, computable function $g$, and constant $r$, if no $f(k)\cdot n^{o(g(k))}$-time algorithm can decide Avg-$r$-Gap-2CSP with $k$ variables for any computable function $f$, then $r$-Gap-$k$-NCP$_{p}$ cannot be solved in time $f(k)\cdot n^{o(g(k))}$ for any computable function $f$.

Proof Sketch.

Theorem 4 follows from the gap-preserving reduction [13] from Avg-$r$-Gap-2CSP to $k$-ExactCover (see also Appendix A), and the gap-preserving reduction from $k$-ExactCover to $r$-Gap-$k$-NCP$_{p}$ [23, Theorem 28]. Note that in both reductions the parameter $k$ is preserved. $\hfill\blacktriangleleft$

1.2 Technical Overview: Local-to-Global Reduction For 2CSP

To prove Theorem 1, we show that the $\mathsf{W[1]}$-hardness for the Baby PIH implies the $\mathsf{W[1]}$-hardness for the Average Baby PIH. Here, the “$\mathsf{W[1]}$-hardness for the Baby PIH” means that, for all $r>1$, it is $\mathsf{W[1]}$-hard to decide whether (i) a 2CSP instance is satisfiable, or (ii) it cannot be satisfied by any multi-assignment assigning each variable no more than $r$ values. Thus there is a constant $r$ gap between (i) and (ii) in the number of values assigned to each variable. In this sense, the gap is “local.” As already mentioned, for the Average Baby PIH, the gap is on the average number of values, or equivalently, the total number of values assigned to all the variables. Hence, the gap is “global.” Our reduction from the Baby PIH to the Average Baby PIH is thus said to be “local-to-global.”

Technically, our reduction relies on a simple but crucial property of high-distance error-correcting codes (ECC) shown in [16, 20]. In particular, we need an ECC $\mathcal{C}\subseteq\mathbb{F}_{p}^{m}$ with relative distance $1-\delta$, so that any two distinct codewords $x,y\in\mathcal{C}$ can agree on at most $\delta m$ entries. So if we have a set $S$ of codewords and an $\varepsilon$-fraction of entries (denoted by $I\subseteq[m]$) with $\varepsilon\gg\delta$ such that for each $i\in I$, we can find distinct $x,y\in S$ that agree on their $i$-th entry, then the size of $S$ must be large. The lower bound on $|S|$ is called the collision number of $\mathcal{C}$, denoted by $\mathrm{Col}_{\varepsilon}(\mathcal{C})$. A simple counting argument in [20] shows that $\mathrm{Col}_{\varepsilon}(\mathcal{C})\geq\sqrt{2\varepsilon/\delta}$.
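The counting argument can be spelled out as follows. This is a reconstruction from the stated distance property only, not a quotation of the proof in [20]; the symbol $s=|S|$ is introduced here for illustration.

```latex
% Each pair of distinct codewords agrees on at most \delta m positions,
% and every position in ColSet(S) is witnessed by at least one such pair.
\[
  |\mathrm{ColSet}(S)|
  \;\le\; \binom{s}{2}\,\delta m
  \;=\; \frac{s(s-1)}{2}\,\delta m
  \;<\; \frac{s^{2}}{2}\,\delta m .
\]
% Hence, if S collides on at least \varepsilon m positions, then
\[
  \varepsilon m \;<\; \frac{s^{2}}{2}\,\delta m
  \quad\Longrightarrow\quad
  s \;>\; \sqrt{2\varepsilon/\delta}.
\]
```

In words, a set of codewords that collides on an $\varepsilon$-fraction of the positions needs more than $\sqrt{2\varepsilon/\delta}$ members, which is the form of the bound quoted above.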

Now given a 2CSP instance $\Pi_{0}=(X_{0},\Sigma_{0},\Phi_{0})$ , let $n=|\Pi_{0}|>|\Sigma_{0}|$ , parameter $k=|X_{0}|$ , and $k^{\prime}=|\Phi_{0}|$ . We fix some ECC with very large relative distance (e.g., a Reed-Solomon code) $\mathcal{C}:\mathbb{F}_{p}^{k}\rightarrow\mathbb{F}_{p}^{k^{\prime\prime}}$ for prime $n^{1/k}\leq p<2n^{1/k}$ and $k^{\prime\prime}=\Theta(k^{5})$ . Then we have $\mathrm{Col}_{\varepsilon}(\mathcal{C})>2rk$ . We construct a new 2CSP instance $\Pi=(X_{1}\mathbin{\dot{\cup}}X_{2},\Sigma,\Phi)$ as:

$\blacksquare$

$X_{1}=\{u_{1},\cdots,u_{k^{\prime}}\}$ , $X_{2}=\{v_{1},\cdots,v_{k^{\prime\prime}}\}$ .
$\blacksquare$

Each $u_{j}$ takes value from the (encoding of) satisfying assignments of $\varphi_{j}=(x_{i_{1}}x_{i_{2}},C_{j})\in\Phi_{0}$ , i.e., $\{(\mathcal{C}(a_{1}),\mathcal{C}(a_{2})):(a_{1},a_{2})\in C_{j}\}$ . Each $v_{\ell}$ takes value from $\mathbb{F}_{p}^{k}$ .
$\blacksquare$

For each $u_{j}\in X_{1}$ and $v_{\ell}\in X_{2}$ , there is a constraint that checks whether $u_{j}$ ’s assigned value $(w_{1},w_{2})$ and $v_{\ell}$ ’s assigned value $s$ satisfy $w_{1}[\ell]=s[i_{1}]$ and $w_{2}[\ell]=s[i_{2}]$ .

See Figure 1 for an illustration. Finally, we duplicate $X_{1}$ and $X_{2}$ each an appropriate number of times to make them equal in size, finishing our reduction.

Figure 1: An illustration of our construction for an input instance $\Pi_{0}=(X_{0},\Sigma_{0},\Phi_{0})$ with $|X_{0}|=|\Phi_{0}|=3$.

In general, an assignment to $X_{1}$ should correspond to a satisfying multi-assignment to the original instance $\Pi_{0}$, and the value assigned to each $v_{\ell}\in X_{2}$ is a “guess” of the $\ell$-th entry of the encoding of every $x_{i}$. It is easy to see that if the input instance $\Pi_{0}$ is satisfiable, then so is the new 2CSP instance $\Pi$, since for each $u_{j}\in X_{1}$ corresponding to $\varphi_{j}=(x_{i_{1}}x_{i_{2}},C_{j})$, we can simply assign it the value $(\mathcal{C}(\sigma(x_{i_{1}})),\mathcal{C}(\sigma(x_{i_{2}})))$, where $\sigma$ is a satisfying assignment for $\Pi_{0}$. At the same time, the value assigned to each $v_{\ell}\in X_{2}$ is $(\mathcal{C}(\sigma(x_{1}))[\ell],\cdots,\mathcal{C}(\sigma(x_{k}))[\ell])\in\mathbb{F}_{p}^{k}$.
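The completeness direction can be checked mechanically on a toy instance. The following is a minimal sketch, not the paper's implementation: the parameters `p, k, m = 5, 3, 5` are hypothetical toy values (not the $k^{\prime\prime}=\Theta(k^{5})$ of the construction), values of $\Sigma_{0}$ are identified with messages in $\mathbb{F}_{p}^{k}$ via base-$p$ digits (an assumption made here for concreteness), and all function names are illustrative.

```python
from itertools import product

def rs_encode(msg, p, m):
    """Reed-Solomon style encoding: evaluate the polynomial with
    coefficient vector msg over F_p at the points 0..m-1 (needs m <= p)."""
    return tuple(sum(c * pow(a, e, p) for e, c in enumerate(msg)) % p
                 for a in range(m))

def encode_value(a, p, k, m):
    """Assumption: a value a < p**k is written in base p to get a message."""
    msg = [(a // p**e) % p for e in range(k)]
    return rs_encode(msg, p, m)

def build_instance(constraints0, p, k, m):
    """Return the domains of the u_j and a predicate for the (u_j, v_l) constraints."""
    enc = lambda a: encode_value(a, p, k, m)
    domains_u = [{(enc(a1), enc(a2)) for (a1, a2) in allowed}
                 for _, allowed in constraints0]

    def holds(j, w, ell, s):
        # Checks w1[l] = s[i1] and w2[l] = s[i2] for constraint phi_j on (x_i1, x_i2).
        (i1, i2), _ = constraints0[j]
        w1, w2 = w
        return w1[ell] == s[i1] and w2[ell] == s[i2]

    return domains_u, holds

def lift_assignment(sigma, constraints0, p, k, m):
    """Turn an assignment sigma of the original instance into one for X_1 and X_2."""
    enc = lambda a: encode_value(a, p, k, m)
    u = [(enc(sigma[i1]), enc(sigma[i2])) for (i1, i2), _ in constraints0]
    v = [tuple(enc(sigma[i])[ell] for i in range(k)) for ell in range(m)]
    return u, v

# Toy input: 3-colouring of a triangle (variables 0, 1, 2; values 0, 1, 2).
p, k, m = 5, 3, 5
neq = {(a, b) for a, b in product(range(3), repeat=2) if a != b}
constraints0 = [((0, 1), neq), ((1, 2), neq), ((0, 2), neq)]
sigma = [0, 1, 2]  # a satisfying assignment of the toy input

domains_u, holds = build_instance(constraints0, p, k, m)
u, v = lift_assignment(sigma, constraints0, p, k, m)
ok = (all(u[j] in domains_u[j] for j in range(len(constraints0)))
      and all(holds(j, u[j], ell, v[ell])
              for j in range(len(constraints0)) for ell in range(m)))
print(ok)
```

Every constraint between some $u_{j}$ and some $v_{\ell}$ holds by construction, because $v_{\ell}$ stores exactly the $\ell$-th symbols of the codewords that the $u_{j}$ carry. The duplication step that balances $|X_{1}|$ and $|X_{2}|$ is omitted from this sketch.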

For soundness, suppose $\Pi_{0}$ has no satisfying multi-assignment assigning at most $r$ values to each variable. We need to argue that $\Pi$ has no satisfying multi-assignment that assigns at most $r(1-\varepsilon)(|X_{1}|+|X_{2}|)/2$ values in total. To that end, we exploit the collision number of the code $\mathcal{C}$. Fix any satisfying multi-assignment $\hat{\sigma}$ to $\Pi$. Recall that each variable $u_{j}\in X_{1}$ is assigned values from the satisfying partial assignments to $\varphi_{j}\in\Phi_{0}$. Then $\hat{\sigma}(X_{1})$ naturally gives a satisfying multi-assignment to $\Pi_{0}$, which implies that there exists a variable $x_{i}\in X_{0}$ that receives more than $r$ different values from the values assigned by $\hat{\sigma}$ to those $u_{j}$ whose corresponding constraint $\varphi_{j}$ contains $x_{i}$.

Now we have two possible cases for $\hat{\sigma}$. In the first case, for a $(1-\varepsilon)$-fraction of variables in $X_{2}$, the multi-assignment $\hat{\sigma}$ assigns each of them more than $r$ values. We are done, since this implies that the total number of values assigned by $\hat{\sigma}$ is more than $(1-\varepsilon)r|X_{2}|$. In the second case, there exists an $\varepsilon$-fraction of variables in $X_{2}$, each of which is assigned at most $r$ values by $\hat{\sigma}$. Given such a variable $v_{\ell}$, we have more than $r$ different codewords assigned to $X_{1}$ in $x_{i}$’s position, which have at most $r$ possible values in the $\ell$-th entry. This entails the existence of two different codewords with the same $\ell$-th entry. Since there are at least $\varepsilon m$ such entries, where $m=k^{\prime\prime}$ is the length of the code, the assignment to $X_{1}$ must contain at least $\mathrm{Col}_{\varepsilon}(\mathcal{C})$ different codewords. Each value assigned to $u_{j}\in X_{1}$ contributes two codewords, so the total number of values assigned by $\hat{\sigma}$ is at least $\mathrm{Col}_{\varepsilon}(\mathcal{C})/2>rk$. In summary, both cases guarantee a constant gap on either $X_{1}$ or $X_{2}$, showing that any satisfying multi-assignment to $\Pi$ must assign more than $r(1-\varepsilon)(|X_{1}|+|X_{2}|)/2$ values to $X_{1}\cup X_{2}$ in total.

We refer the reader to Section 3 for more details.

1.3 Discussions

For minimization problems, the technique of constructing two parts of variables and arguing that at least one part has a large gap seems quite general, as exhibited by the previous works showing the $\mathsf{W[1]}$ -hardness of approximating $k$ -SetCover [6, 20] and $k$ -NCP [17]. The use of the collision number of error-correcting codes in the context of parameterized inapproximability was first introduced in [16] and further developed in [20, 17]. We ask whether these techniques can be unified.

Question 1.

Is there a general framework for proving parameterized inapproximability of minimization problems?

We also suggest two new variants of the Average Baby PIH, which might serve as a next step towards proving the PIH under $\mathsf{W[1]}\neq\textup{{FPT}}$. On closer inspection of our construction, particularly Case 1 in the proof of Lemma 20, it only guarantees that the total number of values assigned to $X_{1}$ (i.e., $\sum_{x\in X_{1}}|\hat{\sigma}(x)|$) is large. This could happen if an $o(1)$-fraction of the variables in $X_{1}$ are assigned a super-constant number of values. We ask if a larger gap can be achieved. So the first variant asks whether the Average Baby PIH can be strengthened by requiring a constant fraction of variables to be assigned multiple values.

Question 2.

Under $\mathsf{W[1]}\neq\textup{{FPT}}$ , can we prove that for any constant $r>2$ , there exists a constant $c>0$ such that no FPT algorithm can decide whether a 2CSP instance is satisfiable, or any satisfying multi-assignment must have at least a $c$ -fraction of variables assigned $r$ values?

We remark that the case of $r=2$ follows from the inapproximability of $k$ -Clique [19, 14, 5].

The second variant is already contained in Theorem 2 and is thus equivalent to the PIH.

Question 3.

Under $\mathsf{W[1]}\neq\textup{{FPT}}$, can we prove that the Average Baby PIH holds even for Avg-$r$-Gap-$\mathrm{2CSP}$ instances with the number of constraints being linear in the number of variables?

It is also interesting to consider whether the inapproximability factor in the Average Baby PIH can be improved to $\omega(1)$, since this would directly lead to better lower bounds for approximating $k$-ExactCover. The current obstacle is that, although the running time of our reduction does not depend on the approximation factor, our reduction relies on the gap created in the Baby PIH [13]. In order to achieve an $r$-gap in the Baby PIH, the reduction in [13] runs in time $\Omega(n^{(2r)^{r}})$. Consequently, the existing FPT reduction cannot create a super-constant gap $r=\omega(1)$.

Question 4.

Under $\mathsf{W[1]}\neq\textup{{FPT}}$, can we prove that the Average Baby PIH (Theorem 1) holds for inapproximability factor $r=\omega(1)$, hence giving a better inapproximability result for $k$-ExactCover?

1.4 Organization

In Section 2, we introduce the main computational problems and complexity assumptions studied in this paper. As the central contribution, Section 3 explains our reduction from the Baby PIH to the Average Baby PIH. This, in fact, establishes the Average Baby PIH under $\mathsf{W[1]}\neq\textup{{FPT}}$ . In Appendix A, we present a reduction from Avg- $r$ -Gap- $\mathrm{2CSP}$ to the constant approximation of $k$ -ExactCover, which slightly differs from the construction in [13].

2 Preliminaries

For a positive integer $n$, we use $[n]$ to denote the set $\{1,2,\cdots,n\}$. $S_{1}\mathbin{\dot{\cup}}\cdots\mathbin{\dot{\cup}}S_{k}$ is the disjoint union of sets $S_{1},\ldots,S_{k}$, where we tacitly assume that $S_{1},\ldots,S_{k}$ are pairwise disjoint. We use $\log$ (without subscript) to denote the logarithm with base $2$. For any prime number $p$, we write $\mathbb{F}_{p}$ for the (unique) finite field of size $p$. The asymptotic notations, i.e., $O,\Omega,\omega$, and $\Theta$, are used following the general convention. The reader is assumed to be familiar with basic notions in parameterized complexity theory, in particular FPT and $\mathsf{W[1]}$. Otherwise, the standard references are, e.g., [10, 9, 7].

2.1 Problems

Definition 5 (Parameterized 2CSP).

A $\mathrm{2CSP}$ instance is defined as a triple $\Pi=(X,\Sigma,\Phi)$ where:

$\blacksquare$

$X$ is a set of variables.
$\blacksquare$

$\Sigma=\dot{\bigcup}_{x\in X}\Sigma_{x}$ , where each $\Sigma_{x}$ contains values that the variable $x\in X$ can be assigned. Often, we assume that there exists an $n\in\mathbb{N}$ such that $|\Sigma_{x}|\leq n$ for all $x\in X$ .
$\blacksquare$

$\Phi=\{\varphi_{1},\cdots,\varphi_{k^{\prime}}\}$, where each $\varphi_{j}=(x_{i_{1}}x_{i_{2}},C_{j})$ for some $x_{i_{1}},x_{i_{2}}\in X$, and $C_{j}$ is a subset of $\Sigma_{x_{i_{1}}}\times\Sigma_{x_{i_{2}}}$.

The problem is to decide whether there exists an assignment $\sigma:X\rightarrow\Sigma$ that satisfies:

$\blacksquare$

For all $x\in X$ , $\sigma(x)\in\Sigma_{x}$ .
$\blacksquare$

For all $\varphi_{j}=(x_{i_{1}}x_{i_{2}},C_{j})\in\Phi$ , $(\sigma(x_{i_{1}}),\sigma(x_{i_{2}}))\in C_{j}$ .

The parameter for this problem is $k=|X|$ , the number of variables. Each pair of variables has at most one constraint, so $|\Phi|\leq\binom{k}{2}$ . Without loss of generality, each variable is related to some constraint in $\Phi$ . The size of instance $\Pi$ is defined as $|\Pi|=|\Sigma|+|\Phi|$ , where the size of each $\varphi_{j}$ is defined as $|\varphi_{j}|=|C_{j}|$ .

The approximation of parameterized 2CSP refers to the following problem.

Definition 6 ( $\varepsilon$ -Gap-2CSP).

Given a 2CSP instance $\Pi=(X,\Sigma,\Phi)$ with parameter $k=|X|$ , we want to distinguish between:

$\blacksquare$

$\Pi$ is satisfiable;
$\blacksquare$

any assignment can satisfy at most an $\varepsilon$ -fraction of constraints in $\Phi$ .

As already mentioned, the notion of multi-assignment extends the usual assignment in such a way that each variable can be assigned multiple values.

Definition 7 (Multi-assignment).

A multi-assignment of a 2CSP instance $\Pi=(X,\Sigma,\Phi)$ is a function $\hat{\sigma}:X\rightarrow 2^{\Sigma}$,³ such that for all $x\in X$ we have $\hat{\sigma}(x)\subseteq\Sigma_{x}$. Furthermore, we say that $\hat{\sigma}$ satisfies $\Pi$ if:

³ Here we use $2^{\Sigma}$ to denote the power set of $\Sigma$.

$\blacksquare$

For all $\varphi_{j}=(x_{i_{1}}x_{i_{2}},C_{j})\in\Phi$ , there exist $c_{1}\in\hat{\sigma}(x_{i_{1}})$ and $c_{2}\in\hat{\sigma}(x_{i_{2}})$ with $(c_{1},c_{2})\in C_{j}$ .

The individual size of $\hat{\sigma}$ is defined as $\max_{x\in X}|\hat{\sigma}(x)|$ , and the total size of $\hat{\sigma}$ is $\sum_{x\in X}|\hat{\sigma}(x)|$ .

Let $r\geq 1$ . We say that a 2CSP instance $\Pi=(X,\Sigma,\Phi)$ is $r$ -list satisfiable if there exists a multi-assignment $\hat{\sigma}$ with individual size no more than $r$ which satisfies $\Pi$ , and $\Pi$ is $r$ -average list satisfiable if there exists a multi-assignment $\hat{\sigma}$ with total size no more than $r|X|$ which satisfies $\Pi$ .
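To make the notions of individual size and total size concrete, here is a minimal sketch on a toy instance. The instance (2-colouring a triangle), the dictionary-based representation, and all function names are assumptions made for illustration; for brevity the three variables share the value names 0 and 1 instead of using disjoint alphabets $\Sigma_{x}$.

```python
def satisfies(multi, constraints):
    """multi maps each variable to a set of values; a constraint ((x1, x2), allowed)
    is satisfied if some pair of chosen values lies in allowed."""
    return all(
        any((a, b) in allowed for a in multi[x1] for b in multi[x2])
        for (x1, x2), allowed in constraints
    )

def individual_size(multi):
    return max(len(vals) for vals in multi.values())

def total_size(multi):
    return sum(len(vals) for vals in multi.values())

# Toy instance: three variables over {0, 1}, every pair must differ.
neq = {(0, 1), (1, 0)}
constraints = [(("x", "y"), neq), (("y", "z"), neq), (("x", "z"), neq)]

single = {"x": {0}, "y": {1}, "z": {0}}    # an ordinary assignment
multi = {"x": {0, 1}, "y": {1}, "z": {0}}  # x receives two values

print(satisfies(single, constraints))  # the constraint on (x, z) fails
print(satisfies(multi, constraints), individual_size(multi), total_size(multi))
```

The toy instance is not satisfiable by any ordinary assignment, since a triangle is not 2-colourable. The multi-assignment `multi` satisfies it with individual size 2 and total size 4, so the instance is $2$-list satisfiable and $\frac{4}{3}$-average list satisfiable.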

Definition 8 (Avg- $r$ -Gap- $\mathrm{2CSP}$ ).

Given a $\mathrm{2CSP}$ instance $\Pi$ , the goal is to distinguish between the following two cases:

$\blacksquare$

$\Pi$ is satisfiable.
$\blacksquare$

$\Pi$ is not $r$ -average list satisfiable.

We also consider the $k$ -ExactCover problem (aka, the $k$ -UniqueSetCover problem) and the $k$ -NCP problem (aka, $k$ -MLD, for the parameterized Maximum Likelihood Decoding problem) as defined below.

Definition 9 ( $k$ -ExactCover).

Given a set $U$ (which we call universe) and a collection of $U$ ’s subsets $\mathcal{S}$ , the goal is to distinguish between the following two cases:

$\blacksquare$

there exist at most $k$ disjoint sets in $\mathcal{S}$ that form a partition of $U$ ,
$\blacksquare$

or $U$ is not the union of any $k$ sets in $\mathcal{S}$ .

Definition 10 ( $k$ -NCP).

For prime $p$, integer $d>0$, given a (multi-)set $V$ of vectors in $\mathbb{F}_{p}^{d}$, and a target vector $\vec{t}\in\mathbb{F}_{p}^{d}$, the $k$-NCP$_{p}$ problem asks for distinguishing between:

$\blacksquare$

the Hamming distance between $\vec{t}$ and the vector space spanned by $V$ is at most $k$ ,
$\blacksquare$

or the Hamming distance between $\vec{t}$ and the vector space spanned by $V$ is at least $k+1$ .

2.2 Hypotheses

Hypothesis 11 (PIH [21]).

For every constant $0<\varepsilon<1$ , there is no FPT algorithm solving $\varepsilon$ -Gap-2CSP.

The Baby PIH, a hypothesis implied by the PIH, asserts the hardness of approximating the individual size of a satisfying multi-assignment. Formally,

Hypothesis 12 (Baby PIH [13]).

For any constant $r>0$, no FPT algorithm can, given a 2CSP instance as input, distinguish whether it is satisfiable or cannot be satisfied by any multi-assignment with individual size at most $r$.

We emphasize that the Baby PIH is a hardness hypothesis with a local condition, i.e., the individual size of satisfying multi-assignments. It is shown that the standard assumption $\mathsf{W[1]}\neq\textup{{FPT}}$ implies the Baby PIH:

Theorem 13 ([13]).

The Baby PIH holds under $\mathsf{W[1]}\neq\textup{{FPT}}$ .

In contrast, the Average Baby PIH is defined on a global condition concerning the total size of satisfying multi-assignments. The precise statement of this complexity assumption contains a technical property on the “shape” of the constraints in a 2CSP instance.

Definition 14 (Rectangular relation).

A 2CSP instance $\Pi=(X,\Sigma,\Phi)$ is said to have rectangular relations if for each $\varphi_{j}=(x_{i_{1}}x_{i_{2}},C_{j})\in\Phi$ , there exist a set $Q_{j}$ and mappings $\pi_{j},\rho_{j}:\Sigma\rightarrow Q_{j}$ , such that $(a,b)\in C_{j}$ iff $\pi_{j}(a)=\rho_{j}(b)$ . We call $Q_{j}$ the underlying set of $\varphi_{j}$ .

Some explanation for “rectangular” might be in order. Recall that a subset $S\subseteq\Sigma^{2}$ is a (combinatorial) rectangle if and only if there exist $A,B\subseteq\Sigma$ such that $S=A\times B$ . It is easy to verify that $R\subseteq\Sigma^{2}$ is rectangular if and only if $R$ is the union of a set of pairwise disjoint rectangles.
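Rectangularity of a single relation can be tested directly. The sketch below uses an equivalent neighbourhood criterion, which is a reformulation introduced here rather than taken from the paper: mappings $\pi,\rho$ as in Definition 14 exist if and only if any two left values have neighbourhoods that are either identical or disjoint. The function name and the example relations are illustrative.

```python
from itertools import combinations

def is_rectangular(relation, left, right):
    """relation is a set of pairs (a, b) with a in left and b in right.
    Returns True iff there are maps pi, rho with (a, b) in relation <=> pi(a) == rho(b)."""
    # Neighbourhood of a: all b related to a.
    nbr = {a: frozenset(b for b in right if (a, b) in relation) for a in left}
    # Distinct non-empty neighbourhoods must be pairwise disjoint.
    distinct = [n for n in set(nbr.values()) if n]
    return all(n1.isdisjoint(n2) for n1, n2 in combinations(distinct, 2))

eq3 = {(a, a) for a in range(3)}                                # equality on {0,1,2}
neq2 = {(0, 1), (1, 0)}                                         # inequality on {0,1}
neq3 = {(a, b) for a in range(3) for b in range(3) if a != b}   # inequality on {0,1,2}

print(is_rectangular(eq3, range(3), range(3)))
print(is_rectangular(neq2, range(2), range(2)))
print(is_rectangular(neq3, range(3), range(3)))
```

Equality is rectangular with $\pi=\rho=\mathrm{id}$, and inequality on two values is rectangular with $\rho(b)=1-b$. Inequality on three values is not, because the neighbourhoods $\{1,2\}$ and $\{0,2\}$ overlap without being equal.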

Hypothesis 15 (Average Baby PIH).

For any constant $r>0$ , there exists no FPT algorithm solving the Avg- $r$ -Gap-2CSP problem, even when the instance contains only rectangular relations.

3 Average Baby PIH from Baby PIH

In this section, we show that the Average Baby PIH, even for instances with only rectangular relations, is implied by the Baby PIH.

3.1 Proofs of Main Results

We employ a local-to-global reduction developed in [17] to amplify the local gap for one variable (Theorem 13) into a global gap for all variables, thus proving the Average Baby PIH from the Baby PIH.

Theorem 16.

Under $\mathsf{W[1]}\neq\textup{{FPT}}$, for any constant $r>0$, no FPT algorithm can distinguish whether a given 2CSP instance $\Pi=(X,\Sigma,\Phi)$ with rectangular relations is satisfiable, or cannot be satisfied by any multi-assignment with total size no more than $r|X|$.

To show Theorem 16, we first introduce some tools from coding theory. The collision number of an error-correcting code characterizes the number of codewords needed to find “collision” on a constant fraction of coordinates. We use the definition in [17]:

Definition 17 ( $\varepsilon$ -Collision Number).

Let $m\geq 1$ and $x,y\in\Sigma^{m}$ with $x\neq y$ . For every $i\in[m]$ we say that $x$ and $y$ collide on position $i$ if $x[i]=y[i]$ . Furthermore, a subset $S\subseteq\Sigma^{m}$ collides on position $i$ if there exist distinct $x,y\in S$ with $x[i]=y[i]$ . We define the collision set of $S$ as

$$\mathrm{ColSet}(S)=\big\{i\in[m]\;\big|\;\text{$S$ collides on position $i$}\big\}.$$

Observe that if $|S|\leq 1$ , then $\mathrm{ColSet}(S)=\varnothing$ .

Now for every $C\subseteq\Sigma^{m}$ and $0<\varepsilon<1$ the $\varepsilon$ -collision number of $C$ , denoted by $\mathrm{Col}_{\varepsilon}(C)$ , is the maximum $s\leq|C|+1$ such that for all $S\in\binom{C}{s-1}$ we have

$$\big|\mathrm{ColSet}(S)\big|\leq\varepsilon m.$$

For Reed-Solomon codes, we have the following lower bounds on their collision number.

Theorem 18 (Theorem 10 in [20], see also [16]).

For any $0<\varepsilon<1$ and any Reed-Solomon code $\mathcal{C}^{RS}:\mathbb{F}_{p}^{k}\rightarrow\mathbb{F}_{p}^{m}$ with sufficiently large $k<m\leq p$, we have $\mathrm{Col}_{\varepsilon}(\mathcal{C}^{RS})\geq\sqrt{\frac{2\varepsilon m}{k}}$.
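The collision set of Definition 17 is easy to compute for a small code. The following sketch uses a toy Reed-Solomon style code with hypothetical parameters `p, k, m = 7, 2, 7` (evaluation points $0,\ldots,m-1$ are an assumption made here); it illustrates the definition and the pairwise agreement bound, and does not reproduce the proof of Theorem 18.

```python
from itertools import combinations, product

def rs_encode(msg, p, m):
    """Evaluate the polynomial with coefficient vector msg over F_p
    at the points 0..m-1 (requires m <= p)."""
    return tuple(sum(c * pow(a, e, p) for e, c in enumerate(msg)) % p
                 for a in range(m))

def col_set(words):
    """Positions i on which two distinct words of the list agree."""
    m = len(words[0])
    return {i for i in range(m)
            if any(x[i] == y[i] for x, y in combinations(words, 2))}

p, k, m = 7, 2, 7
code = [rs_encode(msg, p, m) for msg in product(range(p), repeat=k)]

# Two distinct polynomials of degree < k agree on at most k - 1 points.
max_agreement = max(sum(a == b for a, b in zip(x, y))
                    for x, y in combinations(code, 2))

x = rs_encode((0, 1), p, m)  # the polynomial t
y = rs_encode((1, 0), p, m)  # the constant polynomial 1
z = rs_encode((0, 2), p, m)  # the polynomial 2t
print(max_agreement, sorted(col_set([x, y, z])))  # prints: 1 [0, 1, 4]
```

Three codewords can collide on at most $\binom{3}{2}(k-1)=3$ of the $7$ positions here, so covering a constant fraction of the positions requires a number of codewords that grows with $\sqrt{m/k}$.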

Our proof of Theorem 16 consists of two reductions. The first one (Lemma 20) reduces $2$ CSP instances from the Baby PIH, i.e., with a “local” gap as explained in Section 1.2, to a new instance whose constraints are between two disjoint groups of variables. The new instance has a different “global” gap on each group of variables. As the sizes of the two groups might not be balanced, we do not necessarily have a “global” gap on all the variables. But this is easily remedied by the second reduction (Lemma 22) which makes an appropriate number of copies of the two groups.

Definition 19 ( $(r_{1},r_{2})$ -Average Multi-Assignment).

Let $\Pi=(X,\Sigma,\Phi)$ be a bipartite $2$ CSP instance, in particular $X=X_{1}\mathbin{\dot{\cup}}X_{2}$ and every $\varphi=(x_{1}x_{2},C)\in\Phi$ has $x_{1}\in X_{1}$ and $x_{2}\in X_{2}$ . Then for $r_{1},r_{2}\geq 1$ an $(r_{1},r_{2})$ -average multi-assignment of $\Pi$ is a multi-assignment $\hat{\sigma}:X\to 2^{\Sigma}$ such that

$$\frac{\sum_{x\in X_{1}}|\hat{\sigma}(x)|}{|X_{1}|}\leq r_{1}\quad\text{and}\quad\frac{\sum_{x\in X_{2}}|\hat{\sigma}(x)|}{|X_{2}|}\leq r_{2}.$$

That is, the total size of $\hat{\sigma}$ restricted to $X_{1}$ is at most $r_{1}|X_{1}|$ , and the total size of $\hat{\sigma}$ restricted to $X_{2}$ is at most $r_{2}|X_{2}|$ (cf. Definition 7). We say $\Pi$ is $(r_{1},r_{2})$ -average list satisfiable if there is an $(r_{1},r_{2})$ -average multi-assignment which satisfies $\Pi$ .
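Definition 19 can be checked mechanically. The sketch below is only illustrative: the representation (a dictionary from variables to sets of values, and a dictionary from variable pairs to relations) and the names `is_avg_multi_assignment` and `satisfies` are our assumptions, and `satisfies` uses the usual notion that a multi-assignment satisfies a constraint if some pair of assigned values lies in the relation.

```python
from fractions import Fraction

def is_avg_multi_assignment(sigma_hat, X1, X2, r1, r2):
    """Check the two average bounds of an (r1, r2)-average multi-assignment."""
    avg1 = Fraction(sum(len(sigma_hat[x]) for x in X1), len(X1))
    avg2 = Fraction(sum(len(sigma_hat[x]) for x in X2), len(X2))
    return avg1 <= r1 and avg2 <= r2

def satisfies(sigma_hat, constraints):
    """constraints maps a pair (x1, x2) to its relation, a set of pairs (a, b).
    Every constraint needs some a in sigma_hat[x1], b in sigma_hat[x2], (a, b) in C."""
    return all(
        any((a, b) in C for a in sigma_hat[x1] for b in sigma_hat[x2])
        for (x1, x2), C in constraints.items()
    )
```

For example, with $X_{1}=\{u\}$ , $X_{2}=\{v,w\}$ and $\hat{\sigma}(u)=\{0,1\}$ , $\hat{\sigma}(v)=\{0\}$ , $\hat{\sigma}(w)=\{1\}$ the averages are $2$ and $1$ , so $\hat{\sigma}$ is $(2,1)$ -average but not $(1,1)$ -average.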

Lemma 20.

There is an algorithm $\mathcal{A}$ which on input a $2$ CSP instance $\Pi_{0}=(X_{0},\Sigma_{0},\Phi_{0})$ , $\varepsilon>0$ , and $r\geq 1$ computes a bipartite $2$ CSP instance $\Pi=(X_{1}\mathbin{\dot{\cup}}X_{2},\Sigma,\Phi)$ with the following properties.

Completeness.

If $\Pi_{0}$ is satisfiable, then so is $\Pi$ ,

Soundness.

For every $r\geq 1$ if $\Pi_{0}$ is not $2r$ -list satisfiable, then $\Pi$ is not $(r_{1},r_{2})$ -average list satisfiable for every $r_{1},r_{2}\in\mathbb{N}$ with

r_{1}+r_{2}\leq 2(1-\varepsilon)r.

Rectangularity.

All constraints in $\Phi$ are rectangular.

In addition, there exists a computable function $f$ upper bounding the running time of $\mathcal{A}$ as

f(|X_{0}|+|\Phi_{0}|+1/\varepsilon+r)|\Sigma_{0}|^{O(1)}.

(1)

And the number of variables $|X_{1}|+|X_{2}|$ and the number of constraints $|\Pi|$ in $\Pi$ can also be upper bounded by $f(|X_{0}|+|\Phi_{0}|+1/\varepsilon+r)$ .

Proof.

For the given $2$ CSP instance $\Pi_{0}=(X_{0},\Sigma_{0},\Phi_{0})$ we let

\displaystyle k=|X_{0}|

and

\displaystyle k^{\prime}=|\Phi_{0}|.

Thereby we fix some enumerations of the variables in $X_{0}$ and the constraints in $\Phi_{0}$ as

\displaystyle X_{0}=\{x_{1},\ldots,x_{k}\}

and

\displaystyle\Phi_{0}=\big{\{}\varphi_{1},\ldots,\varphi_{k^{\prime}}\big{\}}.

Let $\mathcal{C}:\mathbb{F}_{p}^{k}\to\mathbb{F}_{p}^{k^{\prime\prime}}$ be a Reed-Solomon code with

\displaystyle 2|\Sigma_{0}|^{1/k}>p\geq|\Sigma_{0}|^{1/k}

and

\displaystyle k^{\prime\prime}=\left\lfloor\frac{8(1-\varepsilon)^{2}r^{2}}{\varepsilon}k(k^{\prime})^{2}\right\rfloor+1.

Clearly $|\Sigma_{0}|\leq p^{k}$ , and therefore we can assume without loss of generality

\Sigma_{0}\subseteq\mathbb{F}_{p}^{k}.

Moreover, we only consider the case that

k^{\prime\prime}\leq p\ \Big(=\big|\mathbb{F}_{p}\big|\Big)<2|\Sigma_{0}|^{1/k},

i.e., $\Sigma_{0}$ is sufficiently larger than $k$ and $k^{\prime}$ . (Footnote 4: Otherwise, the original instance $\Pi_{0}$ can be solved in time of the form (1), and we can then output some predetermined $\Pi$ depending on whether $\Pi_{0}$ is satisfiable.) Hence we can invoke Theorem 18 on $\Sigma\leftarrow\mathbb{F}_{p}$ , $k\leftarrow k$ , $m\leftarrow k^{\prime\prime}$ , and $\varepsilon\leftarrow\varepsilon$ to obtain

\mathrm{Col}_{\varepsilon}(\mathcal{C}(\mathbb{F}_{p}^{k}))\geq\sqrt{\frac{2\varepsilon k^{\prime\prime}}{k}}>4(1-\varepsilon)rk^{\prime},

(2)

where the second inequality is by our choice of $k^{\prime\prime}$ .
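The second inequality in (2) can be verified in exact arithmetic. The following sketch, with the hypothetical helper names `choose_k2` and `inequality_2_holds` (and `k1` standing for $k^{\prime}$ ), squares both sides to avoid floating-point square roots.

```python
from fractions import Fraction
from math import floor

def choose_k2(k, k1, r, eps):
    """k'' = floor(8 (1 - eps)^2 r^2 k (k')^2 / eps) + 1, with k1 standing for k'."""
    eps, r = Fraction(eps), Fraction(r)
    return floor(8 * (1 - eps) ** 2 * r ** 2 * k * k1 ** 2 / eps) + 1

def inequality_2_holds(k, k1, r, eps):
    """sqrt(2 eps k'' / k) > 4 (1 - eps) r k', checked after squaring both sides."""
    eps, r = Fraction(eps), Fraction(r)
    k2 = choose_k2(k, k1, r, eps)
    return 2 * eps * k2 / k > (4 * (1 - eps) * r * k1) ** 2
```

For $k=3$ , $k^{\prime}=4$ , $r=2$ , and $\varepsilon=1/4$ this gives $k^{\prime\prime}=3457$ . The additive $1$ matters here: with $3456$ in place of $3457$ the two sides would be equal, so the inequality would not be strict.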

Now the algorithm $\mathcal{A}$ constructs the following bipartite $2$ CSP instance $\Pi=(X,\Sigma,\Phi)$ .

Variables.

$X=X_{1}\mathbin{\dot{\cup}}X_{2}$ with

\displaystyle X_{1}=\{u_{1},\ldots,u_{k^{\prime}}\}

and

\displaystyle X_{2}=\{v_{1},\ldots,v_{k^{\prime\prime}}\}.

Alphabets.

$\Sigma=\bigcup_{u\in X_{1}}\Sigma_{u}\cup\bigcup_{v\in X_{2}}\Sigma_{v}$ where:

$\blacksquare$

For every $j\in[k^{\prime}]$ the alphabet of the variable $u_{j}\in X_{1}$ is

$\displaystyle\Sigma_{u_{j}}=\Big\{\big(\mathcal{C}(a_{1}),\mathcal{C}(a_{2})\big)\;\Big|\;\text{$\varphi_{j}=(x_{i_{1}}x_{i_{2}},C)$ and $(a_{1},a_{2})\in C$}\Big\}\subseteq\big(\mathcal{C}(\mathbb{F}_{p}^{k})\big)^{2}\subseteq\big(\mathbb{F}_{p}^{k^{\prime\prime}}\big)^{2}.$ (3)

That is, $\Sigma_{u_{j}}$ contains all the (partial) satisfying assignments of $\varphi_{j}$ encoded by $\mathcal{C}:\mathbb{F}_{p}^{k}\to\mathbb{F}_{p}^{k^{\prime\prime}}$ as pairs of vectors in $\mathbb{F}_{p}^{k^{\prime\prime}}$ . (Recall $\Sigma_{0}\subseteq\mathbb{F}_{p}^{k}$ .)
$\blacksquare$

For every $\ell\in[k^{\prime\prime}]$ we have $\Sigma_{v_{\ell}}=\mathbb{F}_{p}^{k}$ . Since $p<2|\Sigma_{0}|^{1/k}$ , we have $|\Sigma_{v_{\ell}}|\leq 2^{k}|\Sigma_{0}|$ .

Constraints.

Let $j\in[k^{\prime}]$ and $\varphi_{j}=(x_{i_{1}}x_{i_{2}},C)$ . Then for every $\ell\in[k^{\prime\prime}]$ we have a constraint between the variable $u_{j}\in X_{1}$ and $v_{\ell}\in X_{2}$ which checks whether $u_{j}$ is assigned to $(w_{1},w_{2})\in\big(\mathbb{F}_{p}^{k^{\prime\prime}}\big)^{2}$ and $v_{\ell}$ to $s\in\mathbb{F}_{p}^{k}$ such that

\displaystyle w_{1}[\ell]=s[i_{1}]

and

\displaystyle w_{2}[\ell]=s[i_{2}].

(4)

Consequently (4) implies that the constraint is rectangular. (Footnote 5: To see this, we take $\pi(w_{1},w_{2})=(w_{1}[\ell],w_{2}[\ell])$ for the value $(w_{1},w_{2})$ of $u_{j}$ and $\rho(s)=(s[i_{1}],s[i_{2}])$ for the value $s$ of $v_{\ell}$ . Then equation (4) is precisely $\pi(w_{1},w_{2})=\rho(s)$ as in Definition 14.) Moreover, the number of constraints in $\Pi$ is

k^{\prime}k^{\prime\prime}=k^{\prime}\left\lfloor\frac{8(1-\varepsilon)^{2}r^{2}}{\varepsilon}k(k^{\prime})^{2}\right\rfloor+k^{\prime}.
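The constraint (4) and its rectangular form can be sketched as follows. This is a toy illustration under our own assumptions: positions and variable indices are 0-based, the code evaluates polynomials at the points $0,\ldots,m-1$ , the block length $m$ is chosen small rather than as $k^{\prime\prime}$ , and `honest_assignment` is our reading of the natural assignment for $\Pi$ derived from a satisfying assignment of $\Pi_{0}$ .

```python
def rs_encode(msg, p, m):
    """Evaluate the polynomial with coefficients msg at 0..m-1 over F_p."""
    assert len(msg) < m <= p
    return tuple(sum(c * pow(a, e, p) for e, c in enumerate(msg)) % p
                 for a in range(m))

def pi(value_u, l):
    """Projection of a value (w1, w2) of u_j for the constraint with v_l."""
    w1, w2 = value_u
    return (w1[l], w2[l])

def rho(s, i1, i2):
    """Projection of a value s of v_l for a constraint on x_{i1}, x_{i2}."""
    return (s[i1], s[i2])

def constraint_holds(value_u, s, l, i1, i2):
    """Equation (4): w1[l] = s[i1] and w2[l] = s[i2], i.e. pi(...) = rho(...)."""
    return pi(value_u, l) == rho(s, i1, i2)

def honest_assignment(sigma0, constraints0, p, m):
    """sigma0: list of k values in F_p^k, one per variable of Pi_0.
    constraints0: list of index pairs (i1, i2), one per constraint of Pi_0."""
    code = [rs_encode(a, p, m) for a in sigma0]
    u = [(code[i1], code[i2]) for (i1, i2) in constraints0]
    v = [tuple(c[l] for c in code) for l in range(m)]
    return u, v
```

Here $u_{j}$ receives the encoded pair of values of the two variables in $\varphi_{j}$ , and $v_{\ell}$ receives the $\ell$ -th symbols of all $k$ encoded values, so every constraint between $u_{j}$ and $v_{\ell}$ holds by construction.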

The completeness of our reduction is straightforward. So we turn to the soundness. In particular, we assume that the given $2$ CSP instance $\Pi_{0}$ is not $2r$ -list satisfiable. Furthermore, let $\hat{\sigma}:X\to 2^{\Sigma}$ be a satisfying multi-assignment for $\Pi$ . We need to show that, for any $r_{1},r_{2}\in\mathbb{N}$ if there is a satisfying $(r_{1},r_{2})$ -average multi-assignment $\hat{\sigma}$ , then

r_{1}+r_{2}>2(1-\varepsilon)r.

(5)

To that end, let

\mathsf{Word}_{\hat{\sigma}}=\bigcup_{u_{j}\in X_{1}}\bigcup_{(w_{1},w_{2})\in\hat{\sigma}(u_{j})}\{w_{1},w_{2}\}\subseteq\mathbb{F}_{p}^{k^{\prime\prime}}.

(6)

That is, $\mathsf{Word}_{\hat{\sigma}}$ is the set of all codewords in $\mathbb{F}_{p}^{k^{\prime\prime}}$ that $\hat{\sigma}$ uses for the variables in $X_{1}$ .

Claim 21.

Let $\ell\in[k^{\prime\prime}]$ with $|\hat{\sigma}(v_{\ell})|\leq 2r$ . Then $\mathsf{Word}_{\hat{\sigma}}$ collides on position $\ell$ .

Proof of Claim 21.

Let $\ell\in[k^{\prime\prime}]$ be fixed with $|\hat{\sigma}(v_{\ell})|\leq 2r$ .

Consider an arbitrary constraint $\varphi_{j}=(x_{i_{1}}x_{i_{2}},C)\in\Phi_{0}$ (i.e., $j\in[k^{\prime}]$ ). Since $\hat{\sigma}$ is a satisfying multi-assignment for $\Pi$ , there exist

\displaystyle(w_{1},w_{2})\in\hat{\sigma}(u_{j})\subseteq\Sigma_{u_{j}}\subseteq\big(\mathbb{F}_{p}^{k^{\prime\prime}}\big)^{2}

and

\displaystyle s\in\hat{\sigma}(v_{\ell})\subseteq\mathbb{F}_{p}^{k}

such that $u_{j}=(w_{1},w_{2})$ and $v_{\ell}=s$ satisfy the constraint between $u_{j}$ and $v_{\ell}$ in $\Pi$ . By $(w_{1},w_{2})\in\Sigma_{u_{j}}$ and (3) there are $a_{1},a_{2}\in\Sigma_{0}$ with $w_{1}=\mathcal{C}(a_{1})$ and $w_{2}=\mathcal{C}(a_{2})$ such that

\text{$x_{i_{1}}=a_{1}$ and $x_{i_{2}}=a_{2}$ satisfy $\varphi_{j}$}.

(7)

Then we say that $a_{1}$ is $(\hat{\sigma},\varphi_{j})$ -suitable for $x_{i_{1}}$ with respect to $s$ , and similarly $a_{2}$ is $(\hat{\sigma},\varphi_{j})$ -suitable for $x_{i_{2}}$ with respect to $s$ .

In addition, by (4)

\displaystyle\mathcal{C}(a_{1})[\ell]=s[i_{1}]

and

\displaystyle\mathcal{C}(a_{2})[\ell]=s[i_{2}].

(8)

Now we define a multi-assignment $\hat{\sigma}_{0}:X_{0}\to 2^{\Sigma_{0}}$ for the original instance $\Pi_{0}=(X_{0},\Sigma_{0},\Phi_{0})$ as follows. For every $x\in X_{0}$ let

\displaystyle\hat{\sigma}_{0}(x)=\bigcup_{s\in\hat{\sigma}(v_{\ell})}\big\{a\in\Sigma_{0}\;\big|\;\text{$j\in[k^{\prime}]$ and $a$ is $(\hat{\sigma},\varphi_{j})$-suitable for $x$ with respect to $s$}\big\}.

(9)

(Recall that we have fixed an $\ell\in[k^{\prime\prime}]$ and hence $\hat{\sigma}(v_{\ell})$ .) Since every variable $x$ must appear in at least one constraint $\varphi_{j}\in\Phi_{0}$ (cf. Definition 5), it is easy to see that $\hat{\sigma}_{0}$ is a satisfying multi-assignment for $\Pi_{0}$ by (7).

As $\Pi_{0}$ is not $2r$ -list satisfiable, there exists an $x_{i^{*}}\in X_{0}$ (i.e., $i^{*}\in[k]$ ) with

\big{|}\hat{\sigma}_{0}(x_{i^{*}})\big{|}\geq 2r+1.

We have assumed that

|\hat{\sigma}(v_{\ell})|\leq 2r,

so by (9) there is an $s\in\hat{\sigma}(v_{\ell})$ such that

\Big|\big\{a\in\Sigma_{0}\;\big|\;\text{$j\in[k^{\prime}]$ and $a$ is $(\hat{\sigma},\varphi_{j})$-suitable for $x_{i^{*}}$ with respect to $s$}\big\}\Big|\geq 2.

Hence there are $a_{1},a_{2}\in\Sigma_{0}$ with $a_{1}\neq a_{2}$ and $j_{1},j_{2}\in[k^{\prime}]$ such that

$\blacksquare$

$a_{1}$ is $(\hat{\sigma},\varphi_{j_{1}})$ -suitable for $x_{i^{*}}$ with respect to $s$ ,
$\blacksquare$

and $a_{2}$ is $(\hat{\sigma},\varphi_{j_{2}})$ -suitable for $x_{i^{*}}$ with respect to $s$ .

Then (8) implies that

\mathcal{C}(a_{1})[\ell]=s[i^{*}]=\mathcal{C}(a_{2})[\ell].

In other words, $\mathcal{C}(a_{1})$ and $\mathcal{C}(a_{2})$ collide on position $\ell$ . Clearly $\mathcal{C}(a_{1}),\mathcal{C}(a_{2})\in\mathsf{Word}_{\hat{\sigma}}$ , so this finishes the proof of the claim. $\hfill\vartriangleleft$

Let

\displaystyle r_{1}=\frac{\sum_{x\in X_{1}}|\hat{\sigma}(x)|}{|X_{1}|}=\frac{\sum_{j\in[k^{\prime}]}\big|\hat{\sigma}(u_{j})\big|}{k^{\prime}}

and

\displaystyle r_{2}=\frac{\sum_{x\in X_{2}}|\hat{\sigma}(x)|}{|X_{2}|}=\frac{\sum_{\ell\in[k^{\prime\prime}]}\big|\hat{\sigma}(v_{\ell})\big|}{k^{\prime\prime}}.

Now we distinguish two cases.

1.

If more than an $\varepsilon$ fraction of the positions $\ell\in[k^{\prime\prime}]$ satisfy $|\hat{\sigma}(v_{\ell})|\leq 2r$ , then Claim 21 implies that $\mathsf{Word}_{\hat{\sigma}}$ collides on more than an $\varepsilon$ fraction of the positions $\ell\in[k^{\prime\prime}]$ , i.e., $\big|\mathrm{ColSet}(\mathsf{Word}_{\hat{\sigma}})\big|>\varepsilon k^{\prime\prime}$ . Recall (2), i.e.,

$\mathrm{Col}_{\varepsilon}(\mathcal{C}(\mathbb{F}_{p}^{k}))\geq\sqrt{\frac{2\varepsilon k^{\prime\prime}}{k}}>4(1-\varepsilon)rk^{\prime}.$

Since $\mathsf{Word}_{\hat{\sigma}}\subseteq\mathcal{C}(\mathbb{F}_{p}^{k})$ by (3), the definition of the collision number yields

$\big|\mathsf{Word}_{\hat{\sigma}}\big|\geq\mathrm{Col}_{\varepsilon}(\mathcal{C}(\mathbb{F}_{p}^{k}))>4(1-\varepsilon)rk^{\prime}.$

By the definition (6) of $\mathsf{Word}_{\hat{\sigma}}$ we deduce

$\displaystyle\big|\mathsf{Word}_{\hat{\sigma}}\big|=$ $\displaystyle\left|\bigcup_{u_{j}\in X_{1}}\bigcup_{(w_{1},w_{2})\in\hat{\sigma}(u_{j})}\{w_{1},w_{2}\}\right|$

$\displaystyle\leq$ $\displaystyle\sum_{u_{j}\in X_{1}}\left|\bigcup_{(w_{1},w_{2})\in\hat{\sigma}(u_{j})}\{w_{1},w_{2}\}\right|\leq\sum_{u_{j}\in X_{1}}2\big|\hat{\sigma}(u_{j})\big|.$

It follows that

$r_{1}=\frac{\sum_{j\in[k^{\prime}]}\big|\hat{\sigma}(u_{j})\big|}{k^{\prime}}>\frac{4(1-\varepsilon)rk^{\prime}}{2k^{\prime}}=2(1-\varepsilon)r.$
2.

There are at most $\varepsilon$ fraction of $\ell\in[k^{\prime\prime}]$ with $|\hat{\sigma}(v_{\ell})|\leq 2r$ . Or equivalently, there are at least $(1-\varepsilon)$ fraction of $\ell\in[k^{\prime\prime}]$ with $|\hat{\sigma}(v_{\ell})|\geq 2r+1$ . Then

$r_{2}=\frac{\sum_{\ell\in[k^{\prime\prime}]}\big|\hat{\sigma}(v_{\ell})\big|}{k^{\prime\prime}}\geq\frac{(1-\varepsilon)k^{\prime\prime}(2r+1)+\varepsilon k^{\prime\prime}}{k^{\prime\prime}}=2(1-\varepsilon)r+1>2(1-\varepsilon)r.$

So both cases lead to (5) as desired. $\hfill\blacktriangleleft$

With some proper replication, the unbalanced $(r_{1},r_{2})$ -gap can be turned into a balanced one, which yields the desired $r$ -average list unsatisfiability.

Lemma 22.

For any bipartite $2$ CSP instance $\Pi=(X_{1}\mathbin{\dot{\cup}}X_{2},\Sigma,\Phi)$ and $r>1$ we can compute in polynomial time a $2$ CSP instance $\Pi^{\prime}=(X^{\prime},\Sigma^{\prime},\Phi^{\prime})$ with

|X^{\prime}|=2|X_{1}||X_{2}|

such that

Completeness.

If $\Pi$ is satisfiable, then so is $\Pi^{\prime}$ .

Soundness.

Let $r\geq 1$ . If $\Pi$ is not $(r_{1},r_{2})$ -average list satisfiable for every $r_{1},r_{2}\geq 1$ with $r_{1}+r_{2}\leq 2r$ , then $\Pi^{\prime}$ is not $r$ -average list satisfiable. Or equivalently, if $\Pi^{\prime}$ is $r$ -average list satisfiable, then for some $r_{1},r_{2}\geq 1$ with $r_{1}+r_{2}\leq 2r$ the bipartite $\Pi$ is $(r_{1},r_{2})$ -average list satisfiable.

Furthermore, if $\Pi$ is rectangular, then so is $\Pi^{\prime}$ .

Proof.

Let

\displaystyle k_{1}=|X_{1}|

and

\displaystyle k_{2}=|X_{2}|.

The desired $\Pi^{\prime}=(X^{\prime},\Sigma^{\prime},\Phi^{\prime})$ is constructed as below.

Variables.

$X^{\prime}$ consists of $k_{2}$ copies of $X_{1}$ and $k_{1}$ copies of $X_{2}$ , i.e., $X^{\prime}=X^{\prime}_{1}\mathbin{\dot{\cup}}X^{\prime}_{2}$ where

\displaystyle X^{\prime}_{1}=\big\{x^{(i)}\;\big|\;\text{$x\in X_{1}$ and $i\in[k_{2}]$}\big\}

and

\displaystyle X^{\prime}_{2}=\big\{x^{(i)}\;\big|\;\text{$x\in X_{2}$ and $i\in[k_{1}]$}\big\}.

Note, $|X^{\prime}_{1}|=|X^{\prime}_{2}|=k_{1}k_{2}$ , therefore $\Pi^{\prime}$ contains $2k_{1}k_{2}$ many variables.

Alphabets.

$\Sigma^{\prime}=\bigcup_{x\in X^{\prime}}\Sigma^{\prime}_{x}$ where:

$\blacksquare$

For every $x\in X_{1}$ and $i\in[k_{2}]$ , let $\Sigma^{\prime}_{x^{(i)}}=\Sigma_{x}$ . Recall that $\Sigma_{x}\subseteq\Sigma$ is the alphabet for the variable $x$ in the original $2$ CSP instance $\Pi$ .
$\blacksquare$

Similarly, for every $x\in X_{2}$ and $i\in[k_{1}]$ let $\Sigma^{\prime}_{x^{(i)}}=\Sigma_{x}$ .

Constraints.

For every constraint $\varphi=(x_{1}x_{2},C)\in\Phi$ with $x_{1}\in X_{1}$ and $x_{2}\in X_{2}$ , $i_{1}\in[k_{2}]$ , and $i_{2}\in[k_{1}]$ we have a constraint

\varphi^{i_{1},i_{2}}=\left(x_{1}^{(i_{1})}x_{2}^{(i_{2})},C\right)\in\Phi^{\prime}.

That is, $\varphi^{i_{1},i_{2}}$ is a copy of $\varphi$ where the variable $x_{1}$ is replaced by its $i_{1}$ -th copy $x_{1}^{(i_{1})}$ and $x_{2}$ by its $i_{2}$ -th copy $x_{2}^{(i_{2})}$ . It immediately implies that if $\Pi$ is rectangular, then $\Pi^{\prime}$ is rectangular too.
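The replication step can be sketched in a few lines. The representation is our assumption: a copy $x^{(i)}$ is modelled as the pair `(x, i)` with 0-based `i`, and `constraints` maps a pair of variables to its relation; the function name `replicate` is hypothetical.

```python
def replicate(X1, X2, constraints):
    """Build the variables and constraints of Pi' from a bipartite Pi.
    X1 gets k2 = |X2| copies, X2 gets k1 = |X1| copies, and every constraint
    is copied for every pair of copy indices (i1, i2)."""
    k1, k2 = len(X1), len(X2)
    X1p = [(x, i) for x in X1 for i in range(k2)]
    X2p = [(x, i) for x in X2 for i in range(k1)]
    Phi_p = {
        ((x1, i1), (x2, i2)): C
        for (x1, x2), C in constraints.items()
        for i1 in range(k2)
        for i2 in range(k1)
    }
    return X1p, X2p, Phi_p
```

In particular both sides of $\Pi^{\prime}$ have $k_{1}k_{2}$ variables, and each constraint of $\Pi$ gives rise to $k_{1}k_{2}$ constraints of $\Pi^{\prime}$ , so the construction is polynomial in the size of $\Pi$ .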

Again the completeness is immediate. Towards the soundness, let $\hat{\sigma}^{\prime}:X^{\prime}\to 2^{\Sigma^{\prime}}$ be a satisfying $r$ -average multi-assignment for $\Pi^{\prime}$ . In particular,

r=\frac{\sum_{x\in X^{\prime}}\big|\hat{\sigma}^{\prime}(x)\big|}{|X^{\prime}|}=\frac{\sum_{x\in X^{\prime}_{1}}\big|\hat{\sigma}^{\prime}(x)\big|+\sum_{x\in X^{\prime}_{2}}\big|\hat{\sigma}^{\prime}(x)\big|}{|X^{\prime}_{1}|+|X^{\prime}_{2}|}=\frac{\sum_{x\in X^{\prime}_{1}}\big|\hat{\sigma}^{\prime}(x)\big|+\sum_{x\in X^{\prime}_{2}}\big|\hat{\sigma}^{\prime}(x)\big|}{2k_{1}k_{2}}.

We set

\displaystyle r_{1}=\frac{\sum_{x\in X^{\prime}_{1}}\big|\hat{\sigma}^{\prime}(x)\big|}{|X^{\prime}_{1}|}=\frac{\sum_{x\in X^{\prime}_{1}}\big|\hat{\sigma}^{\prime}(x)\big|}{k_{1}k_{2}}

and

\displaystyle r_{2}=\frac{\sum_{x\in X^{\prime}_{2}}\big|\hat{\sigma}^{\prime}(x)\big|}{|X^{\prime}_{2}|}=\frac{\sum_{x\in X^{\prime}_{2}}\big|\hat{\sigma}^{\prime}(x)\big|}{k_{1}k_{2}}.

(10)

It follows that

r_{1}+r_{2}=\frac{\sum_{x\in X^{\prime}_{1}}\big|\hat{\sigma}^{\prime}(x)\big|+\sum_{x\in X^{\prime}_{2}}\big|\hat{\sigma}^{\prime}(x)\big|}{k_{1}k_{2}}=2r.

Note that

X^{\prime}_{1}=\dot{\bigcup}_{i\in[k_{2}]}\big\{x^{(i)}\mid x\in X_{1}\big\}.

(11)

Therefore,

$\displaystyle r_{1}k_{1}k_{2}$	$\displaystyle=r_{1}|X^{\prime}_{1}|$	(by $|X^{\prime}_{1}|=k_{1}k_{2}$ )
	$\displaystyle=\sum_{x\in X^{\prime}_{1}}\big|\hat{\sigma}^{\prime}(x)\big|$	(by (10))
	$\displaystyle=\sum_{i\in[k_{2}]}\sum_{x\in X_{1}}\big|\hat{\sigma}^{\prime}(x^{(i)})\big|.$	(by (11))

Hence, there exists an $i_{1}\in[k_{2}]$ such that

\sum_{x\in X_{1}}\big|\hat{\sigma}^{\prime}(x^{(i_{1})})\big|\leq r_{1}k_{1},\quad\text{or equivalently}\ \ \frac{\sum_{x\in X_{1}}\big|\hat{\sigma}^{\prime}(x^{(i_{1})})\big|}{|X_{1}|}\leq r_{1}

by $|X_{1}|=k_{1}$ . Arguing similarly for $X_{2}$ we get an $i_{2}\in[k_{1}]$ such that

\frac{\sum_{x\in X_{2}}\big|\hat{\sigma}^{\prime}(x^{(i_{2})})\big|}{|X_{2}|}\leq r_{2}.

Finally we define a multi-assignment $\hat{\sigma}$ for the original instance $\Pi$ by

\hat{\sigma}(x)=\begin{cases}\hat{\sigma}^{\prime}(x^{(i_{1})})&\text{if $x\in X_{1}$}\\ \hat{\sigma}^{\prime}(x^{(i_{2})})&\text{if $x\in X_{2}$}.\end{cases}

By the above argument, $\hat{\sigma}$ is $(r_{1},r_{2})$ -average. Moreover, it satisfies $\Pi$ , since $\hat{\sigma}^{\prime}$ satisfies $\Pi^{\prime}$ . $\hfill\blacktriangleleft$
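The averaging argument in this proof is constructive: it suffices to pick, on each side, the copy of smallest total size. A minimal sketch follows, assuming that a copy $x^{(i)}$ is modelled as the pair `(x, i)` with 0-based `i` and that the names in `X1` and `X2` are distinct; the function name `project` is ours.

```python
def project(sigma_p, X1, X2):
    """Extract a multi-assignment for Pi from one for Pi'.
    sigma_p maps copies (x, i) to sets of values; X1 has k2 = |X2| copies
    and X2 has k1 = |X1| copies. The lightest copy on each side has total
    size at most the average over all copies."""
    k1, k2 = len(X1), len(X2)
    i1 = min(range(k2), key=lambda i: sum(len(sigma_p[(x, i)]) for x in X1))
    i2 = min(range(k1), key=lambda i: sum(len(sigma_p[(x, i)]) for x in X2))
    sigma = {x: sigma_p[(x, i1)] for x in X1}
    sigma.update({x: sigma_p[(x, i2)] for x in X2})
    return sigma
```

The resulting multi-assignment satisfies $\Pi$ because, for every constraint $\varphi$ of $\Pi$ , the copy $\varphi^{i_{1},i_{2}}$ is a constraint of $\Pi^{\prime}$ over exactly the selected copies.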

Putting all pieces together, we have Theorem 16.

Proof of Theorem 16.

We give an FPT reduction from instances in the Baby PIH (Theorem 13) to Avg- $r$ -Gap- $\mathrm{2CSP}$ . Then, since the Baby PIH holds under $\mathsf{W[1]}\neq\textup{{FPT}}$ , we deduce that the Average Baby PIH also holds under $\mathsf{W[1]}\neq\textup{{FPT}}$ .

For any $2$ CSP instance $\Pi_{0}=(X_{0},\Sigma_{0},\Phi_{0})$ , we can construct a bipartite $2$ CSP instance $\Pi_{1}=(X_{1},\Sigma_{1},\Phi_{1})$ by Lemma 20, and then construct an Avg- $r$ -Gap- $\mathrm{2CSP}$ instance $\Pi=(X,\Sigma,\Phi)$ from $\Pi_{1}$ by Lemma 22. Trivially, $\Pi$ is satisfiable when $\Pi_{0}$ is satisfiable. When $\Pi_{0}$ is not $2r$ -list satisfiable, $\Pi_{1}$ is not $(r_{1},r_{2})$ -average list satisfiable for all $r_{1},r_{2}$ with $r_{1}+r_{2}\leq 2(1-\varepsilon)r$ , and thus $\Pi$ is not $(1-\varepsilon)r$ -average list satisfiable. Furthermore, $\Pi$ has rectangular relations because $\Pi_{1}$ has rectangular relations.

Moreover, the running time of this reduction can be bounded by

f(|X_{0}|+|\Phi_{0}|+1/\varepsilon+r)|\Sigma_{0}|^{O(1)}

for a computable function $f$ , and

|X|+|\Phi|\leq f(|X_{0}|+|\Phi_{0}|+1/\varepsilon+r)

as well, so the reduction is an FPT reduction. $\hfill\blacktriangleleft$

3.2 Average Baby PIH on Dense and Sparse Instances

In this section we prove Theorem 2, which is divided into two separate lemmas to help with readability. For our purposes, a 2CSP instance $\Pi=(X,\Sigma,\Phi)$ is dense if $|\Phi|=\omega(|X|)$ ; or it is sparse, if $|\Phi|=O(|X|)$ .

Lemma 23.

Under $\mathsf{W[1]}\neq\textup{{FPT}}$ , the Average Baby PIH holds for all Avg- $r$ -Gap- $\mathrm{2CSP}$ instances that are dense.

Proof.

The reduction in the proof of Lemma 20 yields complete bipartite 2CSP instances $\Pi_{1}=(X_{1}\mathbin{\dot{\cup}}X_{2},\Sigma_{1},\Phi_{1})$ , i.e., for each $x_{1}\in X_{1}$ and $x_{2}\in X_{2}$ , there exists a constraint $\varphi=(x_{1}x_{2},C)\in\Phi_{1}$ . Then the reduction in the proof of Lemma 22 makes $|X_{2}|$ copies of $X_{1}$ and $|X_{1}|$ copies of $X_{2}$ , while keeping the constraints between each pair of copies. So in the final instance $\Pi=(X,\Sigma,\Phi)$ from the proof of Theorem 16, the number of constraints is

|\Phi|=|X_{1}|^{2}|X_{2}|^{2}=\frac{|X|^{2}}{4}.

Now consider any function $h\in\omega(1)$ . We produce a new instance $\Pi^{\prime}=(X^{\prime},\Sigma^{\prime},\Phi^{\prime})$ by simply taking $t$ disjoint copies of $\Pi$ , where $t$ is chosen as the minimum number satisfying

h(t|X|)\geq\frac{|X|}{4}.

Note that there is no constraint between different copies. Then, the new parameter is $|X^{\prime}|=t|X|$ , and

|\Phi^{\prime}|=t|\Phi|=\frac{t|X|^{2}}{4}=\frac{|X|}{4}|X^{\prime}|\leq h(|X^{\prime}|)|X^{\prime}|.

It’s clear that this reduction runs in FPT time. Also, if $\Pi$ is satisfiable, then $\Pi^{\prime}$ is satisfiable. If any satisfying multi-assignment to $\Pi$ must have total size more than $r|X|$ , then any satisfying multi-assignment to $\Pi^{\prime}$ must assign each copy of $\Pi$ more than $r|X|$ values, so in total more than $r|X^{\prime}|$ values, preserving the gap. $\hfill\blacktriangleleft$
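The choice of $t$ in this padding argument can be illustrated by a direct search. The sketch below is an assumption-laden toy: `min_copies` is a hypothetical name, `n` stands for $|X|$ , and the search terminates only if `h` is unbounded, as is the case for $h\in\omega(1)$ .

```python
def min_copies(n, h):
    """Smallest t >= 1 with h(t * n) >= n / 4, found by linear search.
    The number of iterations depends only on n and h, not on the alphabet."""
    t = 1
    while 4 * h(t * n) < n:
        t += 1
    return t

# Toy example with h(x) = number of bits of x, a slowly growing function.
n = 40
t = min_copies(n, lambda x: x.bit_length())
num_vars = t * n               # |X'| = t |X|
num_constraints = t * n * n // 4   # |Phi'| = t |X|^2 / 4
```

For $|X|=40$ and this particular $h$ the search stops at $t=13$ , giving $|X^{\prime}|=520$ and $|\Phi^{\prime}|=5200=h(|X^{\prime}|)\,|X^{\prime}|$ .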

Lemma 24.

Let $r>1$ . If there exists a constant $c>0$ such that no FPT algorithm can solve Avg- $r$ -Gap- $\mathrm{2CSP}$ on instance $\Pi=(X,\Sigma,\Phi)$ with $|\Phi|\leq c\cdot r|X|$ , i.e., $\Pi$ is sparse, then the PIH holds.

Proof Sketch.

Let $\Pi=(X,\Sigma,\Phi)$ be a NO instance of Avg- $r$ -Gap- $\mathrm{2CSP}$ . Consider any (standard) assignment $\sigma:X\rightarrow\Sigma$ and suppose that $\sigma$ violates $t$ constraints. Then one can simply add at most $2t$ values to $\sigma$ and obtain a satisfying multi-assignment $\hat{\sigma}$ with total size at most $|X|+2t$ . Since $\Pi$ is a NO instance, we have $|X|+2t>r|X|$ . Thus,

t>\frac{r-1}{2}|X|=\frac{r-1}{2c\cdot r}\cdot c\cdot r|X|\geq\frac{r-1}{2c\cdot r}|\Phi|.

In other words, any assignment to $\Pi$ must violate a constant fraction of the constraints in $\Pi$ . This gives a reduction from Avg- $r$ -Gap- $\mathrm{2CSP}$ to PIH. $\hfill\blacktriangleleft$
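The repair step used in this proof sketch can be written out as follows. This is a minimal sketch under the assumption that every constraint has a nonempty relation; the name `repair` and the dictionary representation are ours.

```python
def repair(sigma, constraints):
    """Turn an assignment into a satisfying multi-assignment.
    sigma maps variables to single values; constraints maps (x1, x2) to a
    nonempty set of allowed pairs. For each violated constraint one allowed
    pair is added, so at most 2t values are added in total.
    Returns the multi-assignment and the number t of violated constraints."""
    sigma_hat = {x: {a} for x, a in sigma.items()}
    t = 0
    for (x1, x2), C in constraints.items():
        if (sigma[x1], sigma[x2]) not in C:
            t += 1
            a, b = next(iter(C))  # assumes C is nonempty
            sigma_hat[x1].add(a)
            sigma_hat[x2].add(b)
    return sigma_hat, t
```

Contrapositively, on a NO instance no assignment can violate as few as $\frac{r-1}{2}|X|$ constraints, which is the inequality derived above.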

Putting Lemma 23 and Lemma 24 together, we obtain Theorem 2. As already mentioned in the introduction, this result indicates that the current barrier to the $\mathsf{W[1]}$ -hardness of the PIH is the lack of reduction for Avg- $r$ -Gap- $\mathrm{2CSP}$ on sparse instances, i.e., instances with linearly many constraints.

References

[1] Noga Alon, Dana Moshkovitz, and Shmuel Safra. Algorithmic construction of sets for k-restrictions. ACM Trans. Algorithms, 2(2):153–177, 2006. doi:10.1145/1150334.1150336.
[2] Sanjeev Arora, Carsten Lund, Rajeev Motwani, Madhu Sudan, and Mario Szegedy. Proof verification and the hardness of approximation problems. J. ACM, 45(3):501–555, 1998. doi:10.1145/278298.278306.
[3] Sanjeev Arora and Shmuel Safra. Probabilistic checking of proofs: A new characterization of NP. J. ACM, 45(1):70–122, 1998. doi:10.1145/273865.273901.
[4] Libor Barto and Marcin Kozik. Combinatorial gap theorem and reductions between promise CSPs. In Joseph (Seffi) Naor and Niv Buchbinder, editors, Proceedings of the 2022 ACM-SIAM Symposium on Discrete Algorithms, SODA 2022, Virtual Conference / Alexandria, VA, USA, January 9 - 12, 2022, pages 1204–1220. SIAM, 2022. doi:10.1137/1.9781611977073.50.
[5] Yijia Chen, Yi Feng, Bundit Laekhanukit, and Yanlin Liu. Simple combinatorial construction of the k ${}^{\mbox{o(1)}}$ -lower bound for approximating the parameterized k-Clique. In 2025 Symposium on Simplicity in Algorithms (SOSA), pages 263–280. Society for Industrial and Applied Mathematics, 2025. doi:10.1137/1.9781611978315.21.
[6] Yijia Chen and Bingkai Lin. The constant inapproximability of the parameterized Dominating Set problem. SIAM J. Comput., 48(2):513–533, 2019. doi:10.1137/17M1127211.
[7] Marek Cygan, Fedor V. Fomin, Lukasz Kowalik, Daniel Lokshtanov, Dániel Marx, Marcin Pilipczuk, Michal Pilipczuk, and Saket Saurabh. Parameterized Algorithms. Springer, 2015. doi:10.1007/978-3-319-21275-3.
[8] Irit Dinur. The PCP theorem by gap amplification. J. ACM, 54(3):12, 2007. doi:10.1145/1236457.1236459.
[9] Rodney G. Downey and Michael R. Fellows. Fundamentals of Parameterized Complexity. Texts in Computer Science. Springer, 2013. doi:10.1007/978-1-4471-5559-1.
[10] Jörg Flum and Martin Grohe. Parameterized Complexity Theory. Texts in Theoretical Computer Science. An EATCS Series. Springer, 2006. doi:10.1007/3-540-29953-X.
[11] Venkatesan Guruswami, Bingkai Lin, Xuandi Ren, Yican Sun, and Kewen Wu. Almost optimal time lower bound for approximating parameterized Clique, CSP, and more, under ETH. CoRR, abs/2404.08870, 2024. doi:10.48550/arXiv.2404.08870.
[12] Venkatesan Guruswami, Bingkai Lin, Xuandi Ren, Yican Sun, and Kewen Wu. Parameterized Inapproximability Hypothesis under Exponential Time Hypothesis. In Bojan Mohar, Igor Shinkar, and Ryan O’Donnell, editors, Proceedings of the 56th Annual ACM Symposium on Theory of Computing, STOC 2024, Vancouver, BC, Canada, June 24-28, 2024, pages 24–35. ACM, 2024. doi:10.1145/3618260.3649771.
[13] Venkatesan Guruswami, Xuandi Ren, and Sai Sandeep. Baby PIH: parameterized inapproximability of min CSP. In Rahul Santhanam, editor, 39th Computational Complexity Conference, CCC 2024, July 22-25, 2024, Ann Arbor, MI, USA, volume 300 of LIPIcs, pages 27:1–27:17. Schloss Dagstuhl – Leibniz-Zentrum für Informatik, 2024. doi:10.4230/LIPICS.CCC.2024.27.
[14] Karthik C. S. and Subhash Khot. Almost polynomial factor inapproximability for parameterized k-Clique. In Shachar Lovett, editor, 37th Computational Complexity Conference, CCC 2022, July 20-23, 2022, Philadelphia, PA, USA, volume 234 of LIPIcs, pages 6:1–6:21. Schloss Dagstuhl – Leibniz-Zentrum für Informatik, 2022. doi:10.4230/LIPICS.CCC.2022.6.
[15] Karthik C. S., Bundit Laekhanukit, and Pasin Manurangsi. On the parameterized complexity of approximating Dominating Set. J. ACM, 66(5):33:1–33:38, 2019. doi:10.1145/3325116.
[16] Karthik C. S. and Inbal Livni Navon. On hardness of approximation of parameterized Set Cover and Label Cover: Threshold graphs from error correcting codes. In Hung Viet Le and Valerie King, editors, 4th Symposium on Simplicity in Algorithms, SOSA 2021, Virtual Conference, January 11-12, 2021, pages 210–223. SIAM, 2021. doi:10.1137/1.9781611976496.24.
[17] Shuangle Li, Bingkai Lin, and Yuwei Liu. Improved lower bounds for approximating parameterized nearest codeword and related problems under ETH. In Karl Bringmann, Martin Grohe, Gabriele Puppis, and Ola Svensson, editors, 51st International Colloquium on Automata, Languages, and Programming, ICALP 2024, July 8-12, 2024, Tallinn, Estonia, volume 297 of LIPIcs, pages 107:1–107:20. Schloss Dagstuhl – Leibniz-Zentrum für Informatik, 2024. doi:10.4230/LIPICS.ICALP.2024.107.
[18] Bingkai Lin. A simple gap-producing reduction for the parameterized Set Cover problem. In Christel Baier, Ioannis Chatzigiannakis, Paola Flocchini, and Stefano Leonardi, editors, 46th International Colloquium on Automata, Languages, and Programming, ICALP 2019, July 9-12, 2019, Patras, Greece, volume 132 of LIPIcs, pages 81:1–81:15. Schloss Dagstuhl – Leibniz-Zentrum für Informatik, 2019. doi:10.4230/LIPICS.ICALP.2019.81.
[19] Bingkai Lin. Constant approximating k-Clique is W[1]-hard. In Samir Khuller and Virginia Vassilevska Williams, editors, STOC ’21: 53rd Annual ACM SIGACT Symposium on Theory of Computing, Virtual Event, Italy, June 21-25, 2021, pages 1749–1756. ACM, 2021. doi:10.1145/3406325.3451016.
[20] Bingkai Lin, Xuandi Ren, Yican Sun, and Xiuhan Wang. Constant approximating parameterized k-SetCover is W[2]-hard. In Nikhil Bansal and Viswanath Nagarajan, editors, Proceedings of the 2023 ACM-SIAM Symposium on Discrete Algorithms, SODA 2023, Florence, Italy, January 22-25, 2023, pages 3305–3316. SIAM, 2023. doi:10.1137/1.9781611977554.CH126.
[21] Daniel Lokshtanov, M. S. Ramanujan, Saket Saurabh, and Meirav Zehavi. Parameterized complexity and approximability of Directed Odd Cycle Transversal. In Shuchi Chawla, editor, Proceedings of the 2020 ACM-SIAM Symposium on Discrete Algorithms, SODA 2020, Salt Lake City, UT, USA, January 5-8, 2020, pages 2181–2200. SIAM, 2020. doi:10.1137/1.9781611975994.134.
[22] Carsten Lund and Mihalis Yannakakis. On the hardness of approximating minimization problems. J. ACM, 41(5):960–981, 1994. doi:10.1145/185675.306789.
[23] Pasin Manurangsi. Tight running time lower bounds for strong inapproximability of maximum k-Coverage, Unique Set Cover and related problems (via t-wise agreement testing theorem). In Shuchi Chawla, editor, Proceedings of the 2020 ACM-SIAM Symposium on Discrete Algorithms, SODA 2020, Salt Lake City, UT, USA, January 5-8, 2020, pages 62–81. SIAM, 2020. doi:10.1137/1.9781611975994.5.

Appendix A From Average Baby PIH to Inapproximability of $k$ -ExactCover

We present a proof relying on a construction that slightly differs from the one in [13]. Their proof makes use of the $(T,m)$ -set gadget [22, 1] that was previously used to show the hardness of approximating SetCover problem. On the other hand, our proof develops a novel composition of the collision number of ECCs (recall Definition 17) with the following well-known combinatorial object.

Definition 25 (Hypercube Partition System).

Let $A, B$ be two sets. Then the $(A,B)$ -hypercube partition system is defined by

$\blacksquare$

the universe $\mathcal{M}=A^{B}\;\Big(=\big\{z\;\big|\;\text{$z:B\to A$ is a function}\big\}\Big)$ , and
$\blacksquare$

a collection of subsets $\big{\{}P_{x,y}\big{\}}_{x\in B,y\in A}$ where each $P_{x,y}=\big{\{}z\in\mathcal{M}\;\big{|}\;z(x)=y\big{\}}$ .
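For small sets the hypercube partition system can be enumerated explicitly. In the sketch below, whose name `hypercube_partition_system` and representation are our assumptions, a function $z:B\to A$ is stored as a tuple indexed by the position of $x$ in the list `B`.

```python
from itertools import product

def hypercube_partition_system(A, B):
    """Return the universe M = A^B and the sets P[(x, y)] = {z in M : z(x) = y}.
    A function z: B -> A is represented as a tuple of length |B|."""
    A, B = list(A), list(B)
    universe = frozenset(product(A, repeat=len(B)))
    P = {
        (x, y): frozenset(z for z in universe if z[j] == y)
        for j, x in enumerate(B)
        for y in A
    }
    return universe, P
```

For every fixed $x\in B$ the sets $P_{x,y}$ with $y\in A$ partition $\mathcal{M}$ , whereas for $A=[2]$ two sets $P_{x,1}$ and $P_{x^{\prime},2}$ with $x\neq x^{\prime}$ miss every $z$ with $z(x)=2$ and $z(x^{\prime})=1$ .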

Theorem 26 (cf. Theorem 21 in [13]).

Assume that the Average Baby PIH holds on all 2CSP instances with rectangular relations. Then $k$ -ExactCover cannot be approximated in FPT time within any constant factor. More precisely, for every constant $r>1$ no FPT algorithm, on a given $k$ -SetCover instance $\Pi=(S,U)$ with size $n$ and $k\geq 1$ , can distinguish between the following two cases:

$\blacksquare$

We can choose $k$ disjoint sets in $S$ whose union is $U$ .
$\blacksquare$

$U$ is not the union of any $r k$ sets in $S$ .

Proof.

Let $\Pi=(X,\Sigma,\Phi)$ be an Avg- $r$ -Gap- $\mathrm{2CSP}$ instance with rectangular relations. We set $k=|X|$ . Moreover, for each rectangular constraint $\varphi_{j}=(x_{i_{1}}x_{i_{2}},C_{j})\in\Phi$ we use $Q_{j}$ to denote the underlying set and $\pi_{j},\rho_{j}:\Sigma\to Q_{j}$ the associated mappings as in Definition 14. That is, for every $a,b\in\Sigma$ , it holds that $(a,b)\in C_{j}$ if and only if $\pi_{j}(a)=\rho_{j}(b)$ . Then we set

t=\max_{\varphi_{j}\in\Phi}|Q_{j}|.

(12)

Clearly, we can assume without loss of generality

t\leq|\Pi|.

Now we reduce $\Pi$ to a $k$ -ExactCover instance. To that end, we choose a further alphabet $\Delta$ whose size is a prime and satisfies

\max\left\{\left\lceil\log t\right\rceil,2^{2r^{2}k^{2}}\right\}\leq|\Delta|\leq 2\max\left\{\left\lceil\log t\right\rceil,2^{2r^{2}k^{2}}\right\}.

Moreover, let

d=\left\lceil\frac{2r^{2}k^{2}\log t}{\log|\Delta|}\right\rceil.

This leads to the following code with very large distance (here we simply use Reed-Solomon code again)

\text{Enc}:\Delta^{\left\lceil\frac{\log t}{\log|\Delta|}\right\rceil}\to\Delta^{d}.

Plugging

k\leftarrow\left\lceil\frac{\log t}{\log|\Delta|}\right\rceil,\;m\leftarrow d,\;p\leftarrow|\Delta|,\;\text{and}\;\varepsilon\leftarrow 1/2

in Theorem 18 we conclude that the $1/2$ -collision number of Enc is

\mathrm{Col}_{1/2}(\text{Enc})\geq\sqrt{\frac{d}{\log t/\log|\Delta|}}>rk.

Observe that (12) implies that, for every $\varphi_{j}\in\Phi$ , every element of $Q_{j}$ can be identified with a string in $\Delta^{\left\lceil\frac{\log t}{\log|\Delta|}\right\rceil}$ , i.e., the domain of Enc.

Then, for each variable $x\in X$ and each of its possible values $a\in\Sigma$ , we define a set $S_{x,a}$ as follows. For each constraint $\varphi_{j}=(x_{i_{1}}x_{i_{2}},C_{j})\in\Phi$ with associated set $Q_{j}$ and mappings $\pi_{j},\rho_{j}:\Sigma\to Q_{j}$ , and for each $\ell\in[d]$ , we construct a $([2],\Delta)$ -hypercube partition system

\left(\mathcal{M}^{(j,\ell)},\{P_{u,v}^{(j,\ell)}\}_{u\in\Delta,v\in[2]}\right).

(13)

Then for each $(a,b)\in C_{j}$ we add $P^{(j,\ell)}_{\text{Enc}(\pi_{j}(a))[\ell],1}$ to $S_{x_{i_{1}},a}$ and similarly $P^{(j,\ell)}_{\text{Enc}(\rho_{j}(b))[\ell],2}$ to $S_{x_{i_{2}},b}$ . Finally, let the universe be

U=\dot{\bigcup}_{\varphi_{j}\in\Phi,\ell\in[d]}\mathcal{M}^{(j,\ell)},\quad\text{and}\quad\mathcal{S}=\big\{S_{x,a}\;\big|\;\text{$x\in X$ and $a\in\Sigma$}\big\}.

For the completeness, let $\sigma:X\to\Sigma$ be a satisfying assignment of $\Pi$ . It is routine to check that $\{S_{x,\sigma(x)}\}_{x\in X}$ is a partition of $U$ .

For the soundness, assume that every satisfying multi-assignment of $\Pi$ has total size at least $r k$ (cf. Definition 7). Let $\mathcal{S}^{\prime}\subseteq\mathcal{S}$ be a cover of $U$ . Consider the multi-assignment that maps every variable $x\in X$ to $\big\{a\in\Sigma\;\big|\;S_{x,a}\in\mathcal{S}^{\prime}\big\}$ . If this multi-assignment satisfies $\Pi$ , then our assumption implies $|\mathcal{S}^{\prime}|\geq rk$ . Otherwise, assume that there exists some constraint $\varphi_{j}=(x_{i_{1}}x_{i_{2}},C_{j})\in\Phi$ which is not satisfied. Note that the above multi-assignment assigns $x_{i_{1}}$ to $E_{1}=\big\{a\in\Sigma\;\big|\;S_{x_{i_{1}},a}\in\mathcal{S}^{\prime}\big\}$ and $x_{i_{2}}$ to $E_{2}=\big\{b\in\Sigma\;\big|\;S_{x_{i_{2}},b}\in\mathcal{S}^{\prime}\big\}$ . Since $\varphi_{j}$ is not satisfied, for all $(a,b)\in E_{1}\times E_{2}$ we have $\text{Enc}(\pi_{j}(a))\neq\text{Enc}(\rho_{j}(b))$ . However, for each $\ell\in[d]$ , since $\mathcal{M}^{(j,\ell)}$ is covered by $\mathcal{S}^{\prime}$ , there must exist $a\in E_{1}$ and $b\in E_{2}$ with $\text{Enc}(\pi_{j}(a))[\ell]=\text{Enc}(\rho_{j}(b))[\ell]$ . Therefore, the set $\{\pi_{j}(a)\}_{a\in E_{1}}\cup\{\rho_{j}(b)\}_{b\in E_{2}}$ collides on all coordinates $\ell\in[d]$ , hence it must have size at least $\mathrm{Col}_{1/2}(\text{Enc})$ . We deduce

|\mathcal{S}^{\prime}|\geq|E_{1}|+|E_{2}|\geq\big{|}\{\pi_{j}(a)\}_{a\in E_{1}}\cup\{\rho_{j}(b)\}_{b\in E_{2}}\big{|}\geq\mathrm{Col}_{1/2}(\text{Enc})>rk.
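The first step of the soundness argument, reading a multi-assignment off a cover $\mathcal{S}^{\prime}$ and testing whether it satisfies $\Pi$, can be sketched as follows. This is an illustration only. It assumes that a multi-assignment satisfies a constraint when some pair of assigned values does, that its total size is the sum of the sizes of the assigned value sets, and that constraints are given by their projections as $C_{j}=\{(a,b)\mid\pi_{j}(a)=\rho_{j}(b)\}$. All function names and the toy parity constraint are hypothetical.

```python
from collections import defaultdict
from itertools import product


def cover_to_multi_assignment(chosen):
    """Map each variable x to {a : S_{x,a} is in the cover}; chosen holds the pairs (x, a)."""
    multi = defaultdict(set)
    for x, a in chosen:
        multi[x].add(a)
    return dict(multi)


def satisfies(multi, constraints):
    """Assumed semantics: every constraint is met by some pair of assigned values."""
    return all(
        any(pi(a) == rho(b)
            for a, b in product(multi.get(x1, ()), multi.get(x2, ())))
        for x1, x2, pi, rho in constraints
    )


def total_size(multi):
    """Sum of the sizes of the value sets; equals the number of chosen sets."""
    return sum(len(values) for values in multi.values())


# Toy data (hypothetical): one parity constraint between variables 0 and 1.
parity = lambda a: a % 2
constraints = [(0, 1, parity, parity)]

chosen_ok = [(0, 1), (0, 2), (1, 0)]  # E_1 = {1, 2}, E_2 = {0}: the pair (2, 0) agrees
chosen_bad = [(0, 1), (1, 0)]         # E_1 = {1}, E_2 = {0}: no pair agrees

multi_ok = cover_to_multi_assignment(chosen_ok)
multi_bad = cover_to_multi_assignment(chosen_bad)
```

Since distinct pairs $(x,a)$ index distinct sets, `total_size` of the derived multi-assignment equals $|\mathcal{S}^{\prime}|$, which is why the bound $rk$ transfers directly in the satisfying case. The unsatisfied case is the one handled by the collision number.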

Finally, in each hypercube partition system (13) it holds that

\big{|}\mathcal{M}^{(j,\ell)}\big{|}=2^{|\Delta|}\leq 4^{\left\lceil\log t\right\rceil}+4^{2^{2r^{2}k^{2}}}\leq|\Pi|^{2}+4^{2^{2r^{2}k^{2}}},

and there are at most $\binom{k}{2}d\leq k^{2}r^{2}k^{2}\log t\leq r^{2}k^{4}\log|\Pi|$ such systems. The size of the universe $U$ is thus at most $g(r,k)|\Pi|^{3}$ for some appropriate computable function $g:\mathbb{N}^{2}\to\mathbb{N}$ , while the parameter of the $k$ -ExactCover instance remains $k=|X|$ . It follows easily that the running time of this reduction is FPT. $\hfill\blacktriangleleft$
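For concreteness, the bound on $|U|$ can be spelled out as below. This is a sketch of one admissible choice of $g$, not necessarily the one the authors have in mind. It uses only the two estimates stated above together with $\log|\Pi|\leq|\Pi|$ and $|\Pi|\geq 1$, and it assumes that $r$ is rounded up to an integer if necessary so that $g$ takes values in $\mathbb{N}$.

```latex
\begin{aligned}
|U| &= \sum_{\varphi_{j}\in\Phi,\ \ell\in[d]} \big|\mathcal{M}^{(j,\ell)}\big|
     \leq r^{2}k^{4}\log|\Pi| \cdot \Big(|\Pi|^{2}+4^{2^{2r^{2}k^{2}}}\Big) \\
    &\leq r^{2}k^{4}\,|\Pi| \cdot \Big(1+4^{2^{2r^{2}k^{2}}}\Big)|\Pi|^{2}
     = g(r,k)\,|\Pi|^{3},
\qquad g(r,k) := r^{2}k^{4}\Big(1+4^{2^{2r^{2}k^{2}}}\Big).
\end{aligned}
```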

Combining Theorem 26 and Theorem 16, we obtain:

Theorem 27.

For any constant $r>1$ , $r$ -approximating $k$ -ExactCover is $\mathsf{W[1]}$ -hard.

References

[1] Noga Alon, Dana Moshkovitz, and Shmuel Safra. Algorithmic construction of sets for k-restrictions. ACM Trans. Algorithms, 2(2):153–177, 2006. doi:10.1145/1150334.1150336.

[2] Sanjeev Arora, Carsten Lund, Rajeev Motwani, Madhu Sudan, and Mario Szegedy. Proof verification and the hardness of approximation problems. J. ACM, 45(3):501–555, 1998. doi:10.1145/278298.278306.

[3] Sanjeev Arora and Shmuel Safra. Probabilistic checking of proofs: A new characterization of NP. J. ACM, 45(1):70–122, 1998. doi:10.1145/273865.273901.

[4] Libor Barto and Marcin Kozik. Combinatorial gap theorem and reductions between promise CSPs. In Joseph (Seffi) Naor and Niv Buchbinder, editors, Proceedings of the 2022 ACM-SIAM Symposium on Discrete Algorithms, SODA 2022, Virtual Conference / Alexandria, VA, USA, January 9 - 12, 2022, pages 1204–1220. SIAM, 2022. doi:10.1137/1.9781611977073.50.

[5] Yijia Chen, Yi Feng, Bundit Laekhanukit, and Yanlin Liu. Simple combinatorial construction of the $k^{o(1)}$-lower bound for approximating the parameterized k-Clique. In 2025 Symposium on Simplicity in Algorithms (SOSA), pages 263–280. Society for Industrial and Applied Mathematics, 2025. doi:10.1137/1.9781611978315.21.

[6] Yijia Chen and Bingkai Lin. The constant inapproximability of the parameterized Dominating Set problem. SIAM J. Comput., 48(2):513–533, 2019. doi:10.1137/17M1127211.

[7] Marek Cygan, Fedor V. Fomin, Lukasz Kowalik, Daniel Lokshtanov, Dániel Marx, Marcin Pilipczuk, Michal Pilipczuk, and Saket Saurabh. Parameterized Algorithms. Springer, 2015. doi:10.1007/978-3-319-21275-3.

[8] Irit Dinur. The PCP theorem by gap amplification. J. ACM, 54(3):12, 2007. doi:10.1145/1236457.1236459.

[9] Rodney G. Downey and Michael R. Fellows. Fundamentals of Parameterized Complexity. Texts in Computer Science. Springer, 2013. doi:10.1007/978-1-4471-5559-1.

[10] Jörg Flum and Martin Grohe. Parameterized Complexity Theory. Texts in Theoretical Computer Science. An EATCS Series. Springer, 2006. doi:10.1007/3-540-29953-X.

[11] Venkatesan Guruswami, Bingkai Lin, Xuandi Ren, Yican Sun, and Kewen Wu. Almost optimal time lower bound for approximating parameterized Clique, CSP, and more, under ETH. CoRR, abs/2404.08870, 2024. doi:10.48550/arXiv.2404.08870.

[12] Venkatesan Guruswami, Bingkai Lin, Xuandi Ren, Yican Sun, and Kewen Wu. Parameterized Inapproximability Hypothesis under Exponential Time Hypothesis. In Bojan Mohar, Igor Shinkar, and Ryan O’Donnell, editors, Proceedings of the 56th Annual ACM Symposium on Theory of Computing, STOC 2024, Vancouver, BC, Canada, June 24-28, 2024, pages 24–35. ACM, 2024. doi:10.1145/3618260.3649771.

[13] Venkatesan Guruswami, Xuandi Ren, and Sai Sandeep. Baby PIH: parameterized inapproximability of min CSP. In Rahul Santhanam, editor, 39th Computational Complexity Conference, CCC 2024, July 22-25, 2024, Ann Arbor, MI, USA, volume 300 of LIPIcs, pages 27:1–27:17. Schloss Dagstuhl – Leibniz-Zentrum für Informatik, 2024. doi:10.4230/LIPICS.CCC.2024.27.

[14] Karthik C. S. and Subhash Khot. Almost polynomial factor inapproximability for parameterized k-Clique. In Shachar Lovett, editor, 37th Computational Complexity Conference, CCC 2022, July 20-23, 2022, Philadelphia, PA, USA, volume 234 of LIPIcs, pages 6:1–6:21. Schloss Dagstuhl – Leibniz-Zentrum für Informatik, 2022. doi:10.4230/LIPICS.CCC.2022.6.

[15] Karthik C. S., Bundit Laekhanukit, and Pasin Manurangsi. On the parameterized complexity of approximating Dominating Set. J. ACM, 66(5):33:1–33:38, 2019. doi:10.1145/3325116.

[16] Karthik C. S. and Inbal Livni Navon. On hardness of approximation of parameterized Set Cover and Label Cover: Threshold graphs from error correcting codes. In Hung Viet Le and Valerie King, editors, 4th Symposium on Simplicity in Algorithms, SOSA 2021, Virtual Conference, January 11-12, 2021, pages 210–223. SIAM, 2021. doi:10.1137/1.9781611976496.24.

[17] Shuangle Li, Bingkai Lin, and Yuwei Liu. Improved lower bounds for approximating parameterized nearest codeword and related problems under ETH. In Karl Bringmann, Martin Grohe, Gabriele Puppis, and Ola Svensson, editors, 51st International Colloquium on Automata, Languages, and Programming, ICALP 2024, July 8-12, 2024, Tallinn, Estonia, volume 297 of LIPIcs, pages 107:1–107:20. Schloss Dagstuhl – Leibniz-Zentrum für Informatik, 2024. doi:10.4230/LIPICS.ICALP.2024.107.

[18] Bingkai Lin. A simple gap-producing reduction for the parameterized Set Cover problem. In Christel Baier, Ioannis Chatzigiannakis, Paola Flocchini, and Stefano Leonardi, editors, 46th International Colloquium on Automata, Languages, and Programming, ICALP 2019, July 9-12, 2019, Patras, Greece, volume 132 of LIPIcs, pages 81:1–81:15. Schloss Dagstuhl – Leibniz-Zentrum für Informatik, 2019. doi:10.4230/LIPICS.ICALP.2019.81.

[19] Bingkai Lin. Constant approximating k-Clique is W[1]-hard. In Samir Khuller and Virginia Vassilevska Williams, editors, STOC ’21: 53rd Annual ACM SIGACT Symposium on Theory of Computing, Virtual Event, Italy, June 21-25, 2021, pages 1749–1756. ACM, 2021. doi:10.1145/3406325.3451016.

[20] Bingkai Lin, Xuandi Ren, Yican Sun, and Xiuhan Wang. Constant approximating parameterized k-SetCover is W[2]-hard. In Nikhil Bansal and Viswanath Nagarajan, editors, Proceedings of the 2023 ACM-SIAM Symposium on Discrete Algorithms, SODA 2023, Florence, Italy, January 22-25, 2023, pages 3305–3316. SIAM, 2023. doi:10.1137/1.9781611977554.CH126.

[21] Daniel Lokshtanov, M. S. Ramanujan, Saket Saurabh, and Meirav Zehavi. Parameterized complexity and approximability of Directed Odd Cycle Transversal. In Shuchi Chawla, editor, Proceedings of the 2020 ACM-SIAM Symposium on Discrete Algorithms, SODA 2020, Salt Lake City, UT, USA, January 5-8, 2020, pages 2181–2200. SIAM, 2020. doi:10.1137/1.9781611975994.134.

[22] Carsten Lund and Mihalis Yannakakis. On the hardness of approximating minimization problems. J. ACM, 41(5):960–981, 1994. doi:10.1145/185675.306789.

[23] Pasin Manurangsi. Tight running time lower bounds for strong inapproximability of maximum k-Coverage, Unique Set Cover and related problems (via t-wise agreement testing theorem). In Shuchi Chawla, editor, Proceedings of the 2020 ACM-SIAM Symposium on Discrete Algorithms, SODA 2020, Salt Lake City, UT, USA, January 5-8, 2020, pages 62–81. SIAM, 2020. doi:10.1137/1.9781611975994.5.

\begin{aligned}
r_{1}k_{1}k_{2}&=r_{1}\|X^{\prime}_{1}\|&&\text{(by $\|X^{\prime}_{1}\|=k_{1}k_{2}$)}\\
&=\sum_{x\in X^{\prime}_{1}}\big{\|}\hat{\sigma}^{\prime}(x)\big{\|}&&\text{(by (10))}\\
&=\sum_{i\in[k_{2}]}\sum_{x\in X_{1}}\big{\|}\hat{\sigma}^{\prime}(x^{(i)})\big{\|}.&&\text{(by (11))}
\end{aligned}

\begin{aligned}
\big{\|}\mathsf{Word}_{\hat{\sigma}}\big{\|}&=\left\|\bigcup_{u_{i}\in X_{1}}\bigcup_{(w_{1},w_{2})\in\hat{\sigma}(u_{i})}\{w_{1},w_{2}\}\right\|\\
&\leq\sum_{u_{i}\in X_{1}}\left\|\bigcup_{(w_{1},w_{2})\in\hat{\sigma}(u_{i})}\{w_{1},w_{2}\}\right\|\leq\sum_{u_{i}\in X_{1}}2\big{\|}\hat{\sigma}(u_{i})\big{\|}
\end{aligned}
