Abstract 1 Introduction 2 Technical Overview of Our Main Results 3 Preliminaries 4 FPT for PARTIALCONRBDS parameterized by 𝒕 5 (𝟏,𝟏𝜺)-approximation for PARTIALCONRBDS 6 Conclusion References

FPT Approximations for Connected Maximum Coverage

Tanmay Inamdar ORCID Indian Institute of Technology Jodhpur, India Satyabrata Jana ORCID University of Warwick, Coventry, UK Madhumita Kundu ORCID University of Bergen, Norway Daniel Lokshtanov ORCID Department of Computer Science, University of California Santa Barbara, CA, USA Saket Saurabh ORCID The Institute of Mathematical Sciences, HBNI, Chennai, India
University of Bergen, Norway
Meirav Zehavi ORCID Ben-Gurion University of the Negev, Be’er-Sheva, Israel
Abstract

We revisit connectivity-constrained coverage through a unifying model, Partial Connected Red-Blue Dominating Set (PartialConRBDS). Given a bipartite graph G=(RB,E) with red vertices R and blue vertices B, an auxiliary connectivity graph Gconn on R, and integers k,t, the task is to find a set SR with |S|k such that Gconn[S] is connected and S dominates at least t blue vertices. This formulation captures connected variants of Maximum Coverage [Hochbaum–Rao, Inf. Proc. Lett., 2020; D’Angelo–Delfaraz, AAMAS 2025], Partial Vertex Cover, and Partial Dominating Set [Khuller et al., SODA 2014; Lamprou et al., TCS 2021] via standard encodings.

Limits to parameterized tractability.

PartialConRBDS is 𝖶[𝟣]-hard parameterized by k even under strong restrictions: it remains hard when Gconn is a clique or a star and the incidence graph G is 3-degenerate, or when G is K2,2-free.

Inapproximability.

For every ε>0, there is no polynomial-time (1, 11e+ε)-approximation unless 𝖯=𝖭𝖯. Moreover, under ETH, no algorithm running in f(k)no(k) time achieves an g(k)-approximation for k for any computable function g(), or for any ε>0, a (11e+ε)-approximation for t.

Graphical special cases.

Partial Connected Dominating Set is 𝖶[𝟤]-hard parameterized by k and inherits the same ETH-based f(k)no(k) inapproximability bound as above; Partial Connected Vertex Cover is 𝖶[𝟣]-hard parameterized by k.

These hardness boundaries delineate a natural “sweet spot” for study: within appropriate structural restrictions on the incidence graph, one can still aim for fine-grained (FPT) approximations.
Our algorithms. We solve PartialConRBDS exactly by reducing it to Relaxed Directed Steiner Out-Tree in time (2e)tn𝒪(1). For biclique-free incidences (i.e., when G excludes Kd,d as an induced subgraph), we obtain two complementary parameterized schemes:

  • An Efficient Parameterized Approximation Scheme (EPAS) running in time 2𝒪(k2d/ε)n𝒪(1) that either returns a connected solution of size at most k covering at least (1ε)t blue vertices, or correctly reports that no connected size-k solution covers t; and

  • A Parameterized Approximation Scheme (PAS) running in time 2𝒪(kd(k2+logd))n𝒪(1/ε) that either returns a connected solution of size at most (1+ε)k covering at least t blue vertices, or correctly reports that no connected size-k solution covers t.

Together, these results chart the boundary between hardness and FPT-approximability for connectivity-constrained coverage.

Keywords and phrases:
Partial Dominating Set, Connectivity, Maximum Coverage, FPT Approximation, Fixed-parameter Tractability
Funding:
Tanmay Inamdar: The author is supported by Anusandhan National Research Foundation (ANRF), grant number ANRF/ECRG/2024/004144/PMS, and IIT Jodhpur Research Initiation Grant, grant number I/RIG/TNI/20240072.
Satyabrata Jana: Supported by the Engineering and Physical Sciences Research Council (EPSRC) via the project MULTIPROCESS (grant no. EP/V044621/1).
Daniel Lokshtanov: The author is supported by NSF grant 2505099: Collaborative Research: AF Medium: Structure and Quasi-Polynomial Time Algorithms.
Saket Saurabh: The author is supported by the European Research Council (ERC) under the European Union’s Horizon 2020 research and innovation programme (grant agreement No. 819416).
Meirav Zehavi: The author is supported by Israel Science Foundation (ISF), grant no. 1470/24, and by European Research Council (ERC) grant no. 101039913 (PARAPATH).
Copyright and License:
[Uncaptioned image] © Tanmay Inamdar, Satyabrata Jana, Madhumita Kundu, Daniel Lokshtanov, Saket Saurabh, and Meirav Zehavi; licensed under Creative Commons License CC-BY 4.0
2012 ACM Subject Classification:
Mathematics of computing Graph algorithms
; Theory of computation Graph algorithms analysis ; Theory of computation Parameterized complexity and exact algorithms
Related Version:
Full Version: https://arxiv.org/abs/2601.08639
Editor:
Shubhangi Saraf

1 Introduction

In the last decade, FPT approximation has enjoyed a tremendous amount of success in obtaining near-optimal approximations for NP-hard optimization problems that resist polynomial-time approximations, as well as exact FPT algorithms. Indeed, this paradigm has led to the best-known approximation guarantees for many fundamental problems, including Min k-Cut [31], k-Median/k-Means clustering [8], k-Edge Separator [18], Balanced Separator [15], to mention a few. The field’s recent momentum owes much to the emergence of accompanying hardness-of-approximation frameworks in the FPT setting, which delineate the achievable frontier and indicate when to stop pursuing better ratios or faster schemes [25, 6, 42, 7, 28, 29, 20, 19, 2].

Possibly second only to clustering and treewidth, the strongest testament to the success of the FPT approximation paradigm lies in its applications to Max Coverage and its variants. In the classical Max Coverage problem, we are given a set system (𝒰,) and a positive integer k, where 𝒰 is a universe of n elements and is a family of m subsets of 𝒰. The goal is to select a subfamily of size k that covers as many elements of 𝒰 as possible. A well-known greedy algorithm achieves a tight (11/e)-approximation [13, 21], and the problem is known to be W[2]-hard when parameterized by k [14].

Recent results have established that this approximation factor remains tight even if one allows FPT running time. Specifically, Manurangsi [33] showed that assuming Gap-ETH, no algorithm can achieve an approximation ratio better than (11/e) in time f(k)(|𝒰|+||)o(k), even with the promise that there exist k sets that cover the entire universe. Very recently, this result was strengthened by Guruswami et al. [20] by weakening the assumption to ETH. Despite this barrier, significantly better results can be obtained for Max Coverage and its variants when the underlying set system has additional structural properties. This line of research was initiated by Marx [35], who gave an FPT (1ε)-approximation algorithm running in time 𝒪(2𝒪(k2/ε))111We use 𝒪() to suppress polynomial factors in the input size, i.e., 𝒪(T)=T|I|𝒪(1), where |I| denotes the input size. for the Partial Vertex Cover (PVC) problem. Here, the task is to select k vertices in a graph that cover as many edges as possible – a special case of Max Coverage. Subsequent works extended this to more general settings. In particular, for set systems where each element appears in at most d sets (d=2 for PVC), the running time was improved to 𝒪((d/ε)𝒪(k)) [40, 32]. More recently, Jain et al. [24] further generalized these results to Kd,d-free set systems, where no d sets share d common elements.

Building on the success of FPT approximation for the vanilla Max Coverage, attention has shifted to coverage with additional constraints, a class of problems previously studied in the classical polynomial-time approximation regime. Many of these classical results have been extended to obtain stronger FPT approximations for coverage problems with fairness [23], matroid [39, 23], and capacity [30] constraints, when the underlying set systems are structurally well-behaved.

Connectivity Constraints.

Among the constrained variants, connectivity-constrained coverage has received considerable attention in the classical setting. Motivated from applications in sensor and social networks, Khuller et al. [26] introduced Connected Partial Dominating Set, where, given a graph G, the goal is to find a minimum size connected subset SV(G) that dominates at least t vertices. For this problem, they designed a polynomial-time 𝒪(logΔ)-approximation, where Δ is the maximum degree in G. For the complementary problem, called Connected Budgeted Dominating Set – where the goal is to find a connected vertex-subset of size at most k that dominates the maximum number of vertices – they designed an (112(11e))-approximation (this was also independently obtained in [27]). Subsequently, Hochbaum et al. [22] introduced a more general Connected Max Coverage problem, which is a variant of Max Coverage, where the selected sub-family needs to induce a connected subgraph in another connectivity graph additionally provided in the input. For this problem, they gave an ((11e)max{1R1k,1k})-approximation, where R is the radius of the connectivity graph. Very recently, D’Angelo and Delfaraz [11] designed bicriteria approximations for the same problem; specifically, polylogarithmic approximations for the number of elements covered with a solution of size (1+ε)k.

However, to the best of our knowledge, these problems have not yet been systematically explored within the parameterized approximation framework. The goal of our work is twofold:

  1. 1.

    To formulate a general model of connectivity-constrained coverage that is expressive enough to capture a broad range of problems studied in the prior polynomial-time literature, and

  2. 2.

    To develop new techniques within the FPT approximation paradigm for designing approximation schemes for the two fundamental minimization and maximization problems that naturally arise in this framework.

Our model separates the constraint layer – a companion graph Gconn that can enforce connectivity, independence/packing, or fault tolerance – from the coverage layer – the incidence bipartite graph Gcov=(RB,E) that represents the set-element incidences from a hypergraph. This separation lets us handle a wide range of set-system (hypergraph) families, including bounded-rank and biclique-free incidences, bounded VC-dimension and geometric ranges, as well as multi-coverage and weighted variants. In the next subsection, we formally define our model and demonstrate its expressivity by showing how previously studied problems can be naturally captured within this framework.

1.1 Our Model of Connectivity-Constrained Coverage

We introduce the following problem that is central to this work.

Partial Connected Red-Blue Dominating Set (PartialConRBDS)

 

Input: An instance =(Gconn,Gcov,k,t), where

  • Gconn=(R,E) is an arbitrary graph, called the connectivity graph,

  • Gcov=(RB,E) is a bipartite graph, called the coverage graph, and

  • k and t are non-negative integers.

Question:   Does there exist a vertex subset SR such that,

  1. (1)

    |S|k,

  2. (2)

    Gconn[S] is connected, and

  3. (3)

    |NGcov(S)|t ?

Let us unpack the definition. The input consists of two graphs Gconn and Gcov, and two non-negative integers k and t. Here, Gcov is a bipartite graph with vertex set RB, that is to be thought of as the incidence graph of a set system – the “red side” (R) corresponds to the set family , and the “blue side” (B) corresponds to the universe 𝒰, with an edge representing set-element containments. Notice that under this interpretation, Max Coverage corresponds to selecting a k-sized subset of R that dominates the maximum number of vertices in B. Next, we have a connectivity graph Gconn, whose vertex set is also R. This graph is used to model the connectivity of the solution, i.e., a solution SR is required to induce a connected subgraph of Gconn.

While the decision question asks whether there exists such a connected solution of size at most k that dominates at least t vertices in B, two natural optimization variants stem from it: find a connected solution S that

  1. (i)

    maximizes |NGcov(S)| s.t. |S|k, and

  2. (ii)

    minimizes |S| s.t. |NGcov(S)|t.

To handle both variants in a unified way, we define the notion of an (α,β)-approximation algorithm: such an algorithm either finds a connected solution S such that |S|αk and |NGcov(S)|βt; or correctly outputs that there is no SR satisfying the original requirements.

Modeling prior problems.

Here we describe how we can model the problems from the prior literature as instances of PartialConRBDS. Consider Connected Partial/Budgeted Dominating Set, studied in [26, 27], where we are given a graph G, and we want to find a connected dominating set (with dual objectives). To model it as PartialConRBDS, we let GconnG, and Gcov is a bipartite graph on the vertex set RB, where R=V(G), and B is another copy of V(G). For each uV(G), we add the edges between u and the copies of all wNG[u] in the set B. Note that there is a close coupling between Gconn and Gcov while modeling Connected Partial Dominating Set, which is not always the case. Indeed, consider the Connected Max Coverage problem studied in [22, 11], where we are given a set system (𝒰,), and a graph G with V(G)=. To model it as PartialConRBDS, we let Gcov be the set-element incidence graph (as described above), and let GconnG. At the other extreme, the vanilla Max Coverage can be modeled by letting Gconn be a clique, rendering the connectivity requirement redundant.

Partial Hitting Set is the “dual variant”, where the roles of elements of the universe 𝒰 and the sets of are reversed ([17]) – we want to select a minimum number of elements from 𝒰 that hits at least t elements. This again can be modeled as PartialConRBDS by letting Gcov be the set-element incidence graph as before; however with the roles of R and B flipped (R corresponds to 𝒰, and B to ), and Gconn is a clique defined on the vertex set that corresponds to 𝒰.

1.2 Our Algorithmic Results

PartialConRBDS is 𝖶[𝟣]-hard parameterized by k even under strong restrictions: it remains hard when Gconn is a clique or a star and the incidence (coverage) graph Gcov is 3-degenerate, and even when Gcov is K2,2-free. For every ε>0, there is no polynomial-time (1, 11/e+ε)-approximation unless 𝖯=𝖭𝖯; moreover, under ETH, no algorithm running in time f(k)no(k) achieves either a (g(k),1)-approximation for any computable function g() (with respect to the size bound k) or a (1, 11/e+ε)-approximation (with respect to the coverage threshold t). The graphical special cases reflect the same picture: Partial Connected Dominating Set is 𝖶[𝟤]-hard parameterized by k and inherits the ETH-based f(k)no(k) inapproximability bounds above, while Partial Connected Vertex Cover is 𝖶[𝟣]-hard parameterized by k. For more details see Section 1.3. These hardness boundaries delineate a natural “sweet spot”: under suitable structural restrictions on the incidence graph one can still aim for fine-grained (FPT) approximation schemes. All of these results on parameterized and approximation lower bounds are established by modeling already existing problems such as Max Coverage, Connected Partial/Budgeted Dominating Set (See full version). Also the details of some sections/results marked with can be found in the full version of the paper.

A natural question is: which restrictions on the incidence graph suffice? Recall that Partial Vertex Cover admits a (1ε)-approximation (for the coverage target t) [35, 32, 40] and a +1 additive approximation (for the size bound k) [24]. In fact, these guarantees hold when 𝒟 is the class of incidence graphs of Kd,d-free set systems (i.e., no d sets share d common elements). A natural next step is to ask whether the same techniques extend to connectivity-constrained coverage – namely, when the chosen set SR must also satisfy graph-side constraints imposed by an arbitrary companion graph Gconn. Answering this question (almost) affirmatively is the main technical contribution of our work.

Our main algorithmic contribution is parameterized approximation schemes for the two natural optimization variants of PartialConRBDS when the coverage graph Gcov is Kd,d-free. Specifically, we prove the following two theorems. The first of the following two theorems is when we insist on a connected solution of size at most k, and wish to approximate the number of blue vertices dominated by the solution.

Theorem 1.

For any ε>0, there exists an (1,1ε)-approximation algorithm with running time 2𝒪(k2d/ε)||𝒪(1) for an instance of PartialConRBDS where the coverage graph is Kd,d-free, and the connectivity graph is arbitrary. That is, this algorithm takes an instance =(Gconn,Gcov,k,t) such that Gcov is Kd,d-free, runs in time 2𝒪(k2d/ε)||𝒪(1) and either (i) outputs SR of size at most k such that Gconn[S] is connected and |NGcov(S)|(1ε)t; or (ii) correctly reports that there exists no SR of size at most k such that Gconn[S] is connected, and |NGcov(S)|t.

We prove a complementary result, where we approximate the solution size, but insist that it dominates at least t blue vertices. More formally, we prove the following theorem.

Theorem 2 ().

For any ε>0, there exists an (1+ε,1)-approximation algorithm with running time 2𝒪(kd(k2+logd))||𝒪(1/ε) for an instance of PartialConRBDS, where the coverage graph is Kd,d-free, and the connectivity graph is arbitrary. That is, this algorithm takes an instance =(Gconn,Gcov,k,t) such that Gcov is Kd,d-free, runs in time 2𝒪(kd(k2+logd))||𝒪(1/ε), and either (i) outputs SR of size at most (1+ε)k such that Gconn[S] is connected and |NGcov(S)|t; or (ii) correctly reports that there exists no SR of size at most k such that Gconn[S] is connected, and |NGcov(S)|t.

We reiterate that both of the following results are in the setting where 𝒞 is arbitrary and 𝒟 is a family of bipartite incidence graphs that are Kd,d-free for some d1. Recall that Khuller et al. gave an 𝒪(logΔ)-approximation for Partial Connected Dominating Set. Our result above (Theorem 2) improves upon this result by returning an (1+ε)-approximation in 𝒪(2kΔ(k2+logΔ)) time.

Neighborhood Sparsifiers: A Conceptual Contribution.

A natural approach to solve our problem is to assign a weight to each red vertex proportional to its coverage degree in Gcov – say w(v)|NGcov(v)| and then pick a connected k-set in Gconn with maximum total weight, i.e., a maximum-weight k-vertex tree in Gconn (solvable in 2𝒪(k)n𝒪(1) time; see Section 5). The obstacle is additivity: overlaps in neighborhoods mean that a single weight function w:R0 satisfying w(S)εt|NGcov(S)|w(S)+εtfor all SR,|S|k (and computable in FPT time) appears out of reach.

Instead, when Gcov is Kd,d-free, we construct a small family of surrogate weight functions, one of which has the desired property. This reduces the task to, for each weight function, solving a maximum-weight k-tree in Gconn and taking the best outcome. Technically, the family arises from a combinatorial sparsification lemma for Gcov: via a greedy step coupled with random separation (later derandomized), we obtain a sparsified incidence graph that preserves the neighborhoods of all k-sets. In the sparsified instance, the simple per-vertex degree w(v)=𝖽𝖾𝗀sparse(v) serves as the required weight, yielding the desired approximation once lifted back to the original graph.

Lemma 3 (Neighborhood Sparsifiers for Kd,d-free coverage).

Let =(Gconn,Gcov,k,t) be an instance of PartialConRBDS with Gcov Kd,d-free. There exist =f(k,d,ε)log|B| weight functions {w1,,w},wi:R0, computable in time g(k,d,ε)|B|𝒪(1), such that the following holds. For every SR with |S|=k, for all j[] with wj(S)εt|NGcov(S)|wj(S), where wj(S)vSwj(v).

We believe that Lemma 3 will find applications beyond this work, in the spirit of seminal cut and spectral sparsifiers [3, 41]. We do not prove the lemma in the form stated here, but it follows from our results. We will include a proof of this specific version in the final version of the paper.

1.3 Potential for a Dichotomy Program

Our results motivate a clean classification program separating regimes that admit FPT-approximation from those that are provably hard. We study the (parameterized) complexity of PartialConRBDS over “(𝒞×𝒟)” families of instances, where the constraint graph Gconn ranges over a class 𝒞 and the coverage/incidence graph Gcov ranges over a class 𝒟. We instantiate this framework by varying 𝒞 (e.g., bounded treewidth, clique/star) and 𝒟 (e.g., biclique-free, bounded rank/degeneracy), thereby charting the frontier between FPT-approximability and hardness. We exemplify this by varying 𝒞 and 𝒟 in a few different ways.

𝒞 = cliques, 𝒟 = arbitrary.

As mentioned earlier, this models an arbitrary instance of Max Coverage, and therefore PartialConRBDS inherits all its positive and negative results. We explicitly spell out these approximation and parameterized lower bounds in order to contrast them with our positive algorithmic results.

𝒞 = max degree Δ, 𝒟 = arbitrary.

In this case, we can enumerate nΔ𝒪(k) connected vertex-subsets of Gconn of size at most k, and check whether any subset dominates at least t blue vertices in Gcov. Therefore, in this case PartialConRBDS is FPT parameterized by k+Δ. Note, in particular, that this captures the special case when Gconn is a path (in this case, in fact, the problem is polynomial-time solvable). It may be tempting to conjecture that PartialConRBDS is (fixed-parameter) tractable even when 𝒞 is the class of trees. This, however, turns out not to be the case.

𝒞 = stars, 𝒟 = arbitrary.

In this case, we can again model an arbitrary instance of Max Coverage, by adding a dummy set that maps to the central vertex of the star, and all the original sets mapping to the leaves of the star. Thus, all the intractable results carry over from Max Coverage.

𝒞 = arbitrary, 𝒟 = arbitrary.

In this most general case, we design an FPT algorithm for PartialConRBDS parameterized by t, which runs in time 𝒪((2e)t) in Section 4. This shows that, parameterized by the desired coverage, the problem is fixed-parameter tractable in spite of the connectivity requirements.

𝒞 = arbitrary, 𝒟 = each vertex in R has degree at most ΔR.

In this case, note that any k-sized set SR has |NGcov(S)|kΔ, implying that if t>kΔR then we can conclude that we have a No-instance. Otherwise, the FPT algorithm from the previous paragraph implies that PartialConRBDS is FPT parameterized by k+ΔR.

𝒞 = arbitrary, 𝒟 = each vertex in B has degree at most ΔB.

Note that when ΔB=2, this models the coverage of edges by vertices in a graph. Therefore, when Gconn is a clique or a star, one can model Partial Vertex Cover, which is known to be W[1]-hard parameterized by k; and PartialConRBDS inherits this hardness.

2 Technical Overview of Our Main Results

Next, we describe some of the technical ideas that lead to our algorithmic results Theorem 1 and Theorem 2. Due to the inherent nature of the problem, our algorithms and the corresponding arguments need to juggle between the two graphs Gconn and Gcov, and their interplay with a hypothetical solution that we seek. These arguments go via a couple of auxiliary graphs that make finding a solution easier, albeit at the expense of a small loss in the quality of the solution (either in terms of the solution size, or in terms of the number of dominated blue vertices).

Overview of the proof of Theorem 1.

Suppose that the input instance =(Gconn,Gcov,k,t) is a Yes-instance of PartialConRBDS, where Gconn is arbitrary and Gcov is Kd,d-free. Let SR=V(Gconn)=V(Gcov) be an (unknown) solution of size k that is connected in Gconn and dominates at least t blue vertices.

1. Defining and working with a conflict graph.

We construct an auxiliary graph, called the conflict graph, denoted as Gconf, where V(Gconf)=R, and there is an edge between two vertices iff the number of common blue neighbors of the two vertices is a “significant fraction” of t (we will use εtk2). Using the fact that Gcov is Kd,d-free, one can show that the maximum degree of Gconf is bounded by some f(k,d,ε), notably it is independent of t (cf. Lemma 6). That is, for each red vertex u, the number of other red vertices that have a significant overlap with u in terms of blue domination is bounded by f(k,d,ε). Notice that some pairs of vertices from S (our unknown solution) can have significant overlap, but maybe not all. Therefore, Gconf[S] may be disconnected with at most k connected components, denoted by 𝒞. Since Δ(Gconf)f(k,d,ε), we can use the technique of random separation to obtain a collection 𝒞 of induced connected subgraphs of Gconf such that, 𝒞𝒞 (even though S is unknown to us). The success probability of this step is inverse FPT in k,d, and ε; and this step can be derandomized using known combinatorial tools. Let us assume henceforth that we have such a collection 𝒞 in our hand. The algorithmic task is to figure out which set of connected components from 𝒞 together form the desired solution.

2. Using 𝒞 to sparsify Gcov.

Recall that our unknown solution S is partitioned across 𝒞 in such a way that, either all or none of the vertices of a component in 𝒞 are a part of S. We process each blue vertex u as follows: whenever u has edges to two or more red neighbors within the same component of 𝒞, we arbitrarily delete all but one of these edges. We repeat this until every blue vertex has at most one neighbor in each component. The resulting graph Gspar is a subgraph of Gcov. This way, each blue vertex u has at most one neighbor in Gspar within any component of 𝒞, and that neighbor will be responsible for counting the domination of u. For each red vertex rR, we define its weight as the number of its blue neighbors in the sparsified graph Gspar. Note that, while a solution may span across multiple components, and hence some domination overlap across different components may still remain, the construction of the conflict graph guarantees that such inter-component overlap is a negligible fraction of t. Recall that the different weight functions mentioned in Lemma 3 correspond to different weight functions obtained in this manner, corresponding to different colorings from the random separation step.

3. Using sparsified weights to find an approximate solution.

We now view Gconn as a weighted graph, where the weight of each vertex is defined from the sparsification step. Then, we find a connected subgraph of Gconn on at most k vertices with maximum total weight. This can be done using an application of the classical color-coding approach [1]. If there exists a feasible solution S of size k, it corresponds to a connected subgraph with total weight at least t. Conversely, if we identify a connected subgraph S with total weight at least t, then even after discounting the small amount of over-counting across different components, we are guaranteed that |NGcov(S)|(1ε)t. This completes the description of Theorem 1.

Overview of the proof of Theorem 2.

As before, suppose that the given instance is a Yes-instance and let S be an unknown solution. In this setting, the goal of (1+ε,1)-approximation is flipped: we no longer insist that the solution has size exactly k (we allow solutions of size up to (1+ε)k); but instead require that it dominates at least t blue vertices.

1. A first attempt, and why it fails.

We begin by running the algorithm from the previous subsection with a suitable choice of δ, obtaining an (1,1δ)-approximate solution S of size k that covers at least (1δ)t blue vertices. If |NGcov(S)|t, we are done. Otherwise, the solution is only slightly short; but remember that we are allowed to add a few more vertices to reach the required coverage.

Let H be the set of the g(k,d,δ) highest-degree red vertices. A combinatorial lemma (originally from [24]) guarantees that, when Gcov is Kd,d-free, at least one of the following holds.

  1. (i)

    the unknown solution intersects H, i.e., HS, or

  2. (ii)

    there exists a vertex hH such that S{h} dominates at least t blue vertices

This suggests a natural strategy: if case (i) holds, then we add h to the solution, and recurse. Otherwise, in case (ii), we simply add h to S, yielding a solution of size k+1, which is much smaller than the allowed (1+ε)k. But there is a catch: we also require that the solution be connected in Gconn, and there is no guarantee that Gconn[S{h}] is connected. If h happened to be adjacent to, or close to, a vertex of S, this would be easy to fix, but this does not hold in general.

2. A structural lemma to the rescue.

To overcome this, we first provide a structural lemma that may be of an independent interest: since Gconn[S] is connected, there exists a subset CS of size at most 1/ε such that every vertex of S is at distance at most εk from C. We “guess” C by iterating over all subsets of R of size at most 1/ε, and in each iteration, we require that the guessed set C must be a part of our solution that we will build. During recursion, C may grow as we discover additional vertices that must belong to S; but the original C serves the purpose of “cheaply connecting” a vertex to the solution.

3. Amended recursive strategy.

With this additional seed C, we run the previous algorithm again to find a set S of size k containing C that dominates at least (1δ)t blue vertices (this requires generalizing the previous algorithm so that we can provide to it an additional terminal set–here C–that must be contained in the solution). At this point, we focus only on high-degree vertices that are near C in Gconn, since S must be contained in this neighborhood. We then apply the lemma from Step 1:

  • In case (i), we guess hH that is guaranteed to belong to S. Then we branch on all possibilities for h, extend C by including it, and recurse.

  • In case (ii), for some hH, the set S{h} dominates at least t blue vertices. Because h is close to C, we can connect it to S by adding a path P in Gconn of length at most εk. The resulting set SP is connected, has size at most (1+ε)k, and dominates at least t blue vertices. Iterating over all hH, we return a solution whenever this case applies.

This concludes the overview of the proof of Theorem 2. We note that the running time of this (1+ε,1)-approximation algorithm is of the form h(k,d)||𝒪(1/ε), in contrast to the (1,1ε)-approximation of Theorem 1, which runs in time f(k,d,ε)||𝒪(1). A natural question is whether the dependence on ε in Theorem 2 can be improved, i.e., whether one can achieve a running time of the form h(k,d,ε)||𝒪(1). In analogy with PTAS versus EPTAS, this corresponds to improving from a PAS to an EPAS 222EPAS stands for Efficient Parameterized Approximation Scheme, i.e., a (1±ε)-approximation in time f(κ,ε)|I|𝒪(1) for some parameter κ. PAS stands for Parameterized Approximation Scheme, where the running time can be f(κ,ε)|I|g(ε)..

However, obtaining such an improvement would be unlikely due to the following simple reason. Indeed, suppose for any ε>0 we could compute a connected solution of size at most (1+ε)k that covers at least t blue vertices. Setting ε12k would then yield a solution of size at most k+12, which must in fact be of size at most k. This would give an exact FPT algorithm for PartialConRBDS parameterized by k and d; however the problem is W[1]-hard when parameterized by k even when d=2 (cf. Partial Vertex Cover). Thus, Theorem 2 achieves, in a precise sense, the best possible form of approximation scheme, apart from potential improvements to the 𝖥𝖯𝖳 factor of the running time.

3 Preliminaries

Let [n] be the set of integers {1,,n}. For a graph G, we denote the set of vertices of G by V(G) and the set of edges by E(G). For a directed graph D, we denote the set of vertices of D by V(D) and the set of edges by A(D). We denote an edge of an undirected graph by uv and an arc of a directed graph by (u,v). For a subset X of vertices XV(G), we use the notation GX to mean the graph G[V(G)X]. For a vertex subset SV(G), G[S] denotes the subgraph of G induced on S, i.e., V(G[S])=S,E(G[S])={uv:u,vS,uvE(G)}. For a graph G and a vertex vV(G), we define open neighborhood of a vertex as NG(v){uuvE(G)} and closed neighborhood of a vertex as NG[v]NG(v){v}. Further for a set XV(G), open neighborhood of X is defined as NG(X)(xXNG(x))X and closed neighborhood of X is defined as NG[X]NG(X)X. We use standard terminology from the book of Diestel [12] for those graph-related terms that are not explicitly defined here. We refer to [9] for an introduction to the area of parameterized complexity and related terminology.

4 FPT for PARTIALCONRBDS parameterized by 𝒕

In this section we design an algorithm for PartialConRBDS, parameterized by t. The result is obtained by giving an FPT reduction to Relaxed Directed Steiner Out-Tree problem. A directed out-tree (or arborescence) is a digraph whose underlying undirected graph is a tree and that has a unique root r such that every arc is directed away from r. Equivalently, the root r has in-degree 0 and every other vertex has in-degree exactly 1. The problem Relaxed Directed Steiner Out-Tree is defined as follows:

Relaxed Directed Steiner Out-Tree (RDSOT)

 Input: A directed graph D=(V,A), a set of terminals TV, an integer p.

Question: Is there a directed out-tree DD with |V(D)|p and TV(D)?

If in addition, we are given an additional distinguished vertex rV and ask to check whether D contain a directed out-tree on at most p vertices that is rooted at r and that contains all the vertices of T, then that problem is referred as Directed Steiner Out-Tree. Misra et al. [36] gave an algorithm for Directed Steiner Out-Tree that runs in time 2|T|n𝒪(1). Note that we can also solve Relaxed Directed Steiner Out-Tree in time 2|T|n𝒪(1) by “guessing” (i.e., enumerating n choices for) the root vertex r, and solving the resulting instance of Directed Steiner Out-Tree. Now we use this result to design a randomized algorithm for PartialConRBDS.

Randomized reduction from PARTIALCONRBDS to RDSOT.

Let =(Gconn,Gcov,k,t) be the given instance of PartialConRBDS. Let f:B[t] denote an arbitrary function (referred to as a coloring), where each vertex of B is assigned an integer from [t]. For each i, let Bi denote the vertices with color i according to f, i.e., Bi={vvB,f(v)=i} (we omit the dependence of f in the notation for the sake of brevity).

Now, we construct a directed graph Df that will serve as the input for the eventual instance of Relaxed Directed Steiner Out-Tree. We define

V(D):=V(Gcov)T, where T={τi:i[t]}.

The arc set A(Df) is constructed by adding the following three types of arcs.

  • Type 1: For every edge uvE(Gconn), add both arcs (u,v) and (v,u) to A(Df).

  • Type 2: For every edge uvE(Gcov) with uR,vB, add the arc (u,v) to A(Df).

  • Type 3: For each i[t], add the arcs {(v,τi):vBi} to A(Df).

Let p=k+2t, and let f=(Df,T,p) be the resulting instance of Relaxed Directed Steiner Out-Tree.

Our randomized algorithm works on the given instance =(Gconn,Gcov,k,t) as follows. First, it obtains a random f:B[t], i.e., for each uB, it independently and uniformly assigns a color from [t]. Then, it constructs the instance f=(Df,T,p) of RDSOT as described above. Then, it runs the 𝒪(2|T|)=𝒪(2t) time algorithm of [36], which returns Yes iff f is a Yes-instance of RDSOT. It repeats this procedure (of selecting a random coloring f and solving the resulting f) independently up to et times. If in any of the iterations, the algorithm of [36] outputs Yes, then our algorithm reports that the original instance of PartialConRBDS is a Yes-instance. Otherwise, if in all iterations the algorithm returns No, then the algorithm outputs that is a No-instance of PartialConRBDS.

Theorem 4 ().

PartialConRBDS admits a deterministic (resp. randomized) algorithm that runs in time 𝒪((2e)t+o(t)) (resp. 𝒪((2e)t) and outputs the correct answer with probability at least 11/e).

5 (𝟏,𝟏𝜺)-approximation for PARTIALCONRBDS

In this section we design an FPT algorithm (parameterized by k+d+1/ε) that achieves a (1,1ε)-bicriteria approximation for PartialConRBDS when the coverage graph Gcov is Kd,d-free. If t<k2dε, we invoke the exact algorithm of Theorem 4, which (when a solution exists) returns a set of size at most k dominating at least t vertices. When tk2dε, assuming there exists a solution of size at most k dominating at least t vertices, we compute a set SR with |S|k, Gconn[S] connected, and S dominating at least (1ε)t vertices of B in Gcov. Combining these two cases yields Theorem 17. Since the small-t case is handled by Theorem 4, the remainder of this section focuses on the regime tk2dε. We next present the algorithmic components and then assemble them into the full procedure, referring back to the corresponding subsections as needed.

Suppose =(Gconn,Gcov,k,t) is a Yes-instance of PartialConRBDS; i.e., there exists SR with |S|k such that

  1. (P1)

    Gconn[S] is connected, and

  2. (P2)

    |NGcov(S)|t.

We will refer to these properties as (𝖯1) and (𝖯2) throughout this section.

5.1 Step 1: Construction of the Conflict Graph

In this subsection, we define the notion of a conflict graph as follows.

Definition 5 (conflict graph).

Let Gconn and Gcov denote the connectivity and coverage graphs, respectively, as given in the input. We define conflict graph, denoted by Gconf as follows: the vertex set V(Gconf)V(Gconn)=R, and E(Gconf){uv:u,vRand|NGcov(u)NGcov(v)|εtk2}.

Note that the conflict graph can be constructed in time quadratic in the size of Gcov. In the following lemma, we bound the maximum degree of the conflict graph.

Lemma 6 (Restatement of Lemma 4.1 from [24]).

Suppose the coverage graph Gcov is Kd,d-free, and tk2dε. Then, the maximum degree ΔΔ(Gconf) of the conflict graph, Gconf as in Definition 5 is at most (d1)(ek2ε)d, where e denotes the base of natural logarithm.

Proof.

The proof is essentially identical to that of Lemma 4.1 of [24], but we describe it here for the sake of completeness, using the terminology used in this paper. Recall that, Gconn,Gcov,Gconf denote the connectivity, coverage and conflict graphs respectively. We show that the degree of every vertex vV(Gconf), that is 𝖽𝖾𝗀Gconf(v) is bounded. To this end, fix an arbitrary vertex vV(Gconf). WLOG, we only focus on the vertices with 𝖽𝖾𝗀Gconf(v)1; vertices with degree 0 trivially have bounded degree in Gconf. Let us consider the sets PNGconf(v)R and QNGcov(v)B. Now consider the induced subgraph Gcov[PQ] of the original coverage graph Gcov. Let this induced subgraph be Gcovv.

Let 𝒦 denote the set of all K1,d’s in the graph Gcovv, where one vertex (the center vertex of the star) belongs to NGconf(v)R and its d neighbors belong to NGcov(v)B. Since every uP has at least εtk2d neighbors in Q, i.e., |NGcovv(u)|εtk2d, it follows that each uP participates in at least (εtk2d) distinct copies of K1,d. Therefore, |𝒦||P|(εtk2d).

Further, we claim that |𝒦|(d1)(|Q|d). Assume towards contradiction that |𝒦|>(d1)(|Q|d). For each set ZQ of size exactly d, let κ(Z) denote the number of K1,d’s in 𝒦 that all the vertices of Z together participate in. Then, it follows that (d1)(|Q|d)<|𝒦|=ZQ:|Z|=dκ(Z)(|Q|d)κ(Zmax), where ZmaxargmaxZQ:|Z|=dκ(Z). This implies that κ(Zmax)>d1, i.e., κ(Zmax)d. However, this implies the existence of a Kd,d in Gcovv, which is a subgraph of Gcov, a contradiction. Therefore, we obtain that,

|P|(εtk2d)|𝒦| (d1)(|Q|d)<(d1)(td) (Since |Q|<t, else singleton solution)
|P| (d1)(td)(εtk2d)
(d1)(etd)d(εtk2d)d (Since (nk)k(nk)(enk)k)
(d1)(ek2ε)d

This concludes the proof.

5.2 Step 2: Isolating Connected Components of 𝑺 in 𝑮conf

In this section we apply the classical random separation technique [5] to probabilistically isolate the solution vertices from their Gconf-neighbors in any Yes-instance.

Let f:V(Gconf){purple,green} be a random coloring in which each vertex of Gconf is colored independently

purple with probability p11+Δandgreenwith probability 1p=Δ1+Δ,

where ΔΔ(Gconf). For a coloring f, write Vpurplef1(purple) and Vgreenf1(green).

Our objective for the coloring is the following “good separation” when the instance is a Yes-instance with witness S:

  1. (i)

    SVpurple, and

  2. (ii)

    NGconf(S)Vgreen.

Since |S|k and |NGconf(S)|Δk, we have

Pr[good separation]p|S|(1p)|NGconf(S)|(11+Δ(Δ1+Δ)Δ)kΔk2𝒪(k).

Derandomization.

The random-coloring step can be derandomized using standard tools such as (n,p,p2)-splitters (see, e.g., Exercises 5.15 and 5.21 in [10]) without affecting the final running time333Note that Δ=Ω(k2d/ε) implies logΔ=Ω(logk).. Rather than presenting a randomized algorithm, we directly give a deterministic one via an (n,p,q) lopsided-universal family of functions.

Let U be the ground set of vertices (so U=V(Gconf) and n|U|). An (n,p,q) lopsided-universal family is a collection ={f1,,f} of functions fi:U{purple,green} such that for every A(Up) and B(UAq) there exists f with Af1(purple) and Bf1(green).

Proposition 7 ([1, 10]).

There is an algorithm that, given n, p, and q, constructs an (n,p,q) lopsided-universal family of size ((p+q)2p)(p+q)𝒪(1)logn in time ((p+q)2p)(p+q)𝒪(1)nlogn. In particular, for p=k and q=k(d1)(ek2ε)d, the size of is at most (1ε)𝒪(dklogk)logn.

For our purposes we set pkandqk(d1)(ek2ε)d. Equivalently, let Δ(d1)(ek2ε)d; then q=kΔ. By Proposition 7, this yields an (n,p,q) lopsided-universal family of size (1ε)𝒪(dklogk)logn. If in addition Δ(Gconf)Δ, then kΔ(Gconf)q, so the same family suffices for enforcing Sf1(purple) and NGconf(S)f1(green).

Filtering of Connected Components.

For every coloring f we proceed as follows. Delete all green vertices, i.e., set Vgreenf1(green) and form GconffGconfVgreen. The remaining graph Gconff is a disjoint union of connected components, each an induced subgraph of Gconf on the purple vertices Vpurplef1(purple). Finally, discard every purple component C with |C|>k; let 𝒞k(f) denote the family of surviving components (those with |C|k).

Motivation for the filtering step.

Assume the instance is a Yes-instance and let SR, |S|k, satisfy (𝖯1) and (𝖯2). While Gconn[S] is connected, the conflict graph Gconf need not be a subgraph of Gconn: there may exist u,vR that are nonadjacent in Gconn but adjacent in Gconf due to a large overlap of their neighborhoods in Gcov (so uvE(Gconf) but uvE(Gconn)). Conversely, Gconf[S] need not be connected even though Gconn[S] is: some pairs u,vS that are adjacent in Gconn can become nonadjacent in Gconf by the very definition of the conflict graph (so uvE(Gconn) but uvE(Gconf)).

Let C1,,C be the connected components of Gconf[S], where 1|S|k. Clearly S=i=1V(Ci). (When convenient, we do not distinguish between a component C and its vertex set V(C).)

By the choice of the lopsided-universal family , there exists f with SVpurple and NGconf(S)Vgreen. For this good f, deleting Vgreen removes no vertex of S, and the purple subgraph Gconff=GconfVgreen still contains Gconf[S] unchanged; in particular, each Ci remains a connected purple component. Since |V(Ci)||S|k, all these components survive the size filter. Thus, the filtering step preserves every piece of S we care about while discarding large purple components that cannot be part of any size-k solution.

Constructing Gconf𝐩𝐮𝐫𝐩𝐥𝐞: Recall that 𝒞k(f) denotes the family of surviving (purple) components, i.e., the connected components of GconfVgreen whose sizes are at most k. Fix a coloring f and enumerate

𝒞𝒞k(f)={C1,C2,,Cs},so that1|Ci|kfor all i[s].

We refer to the filtered purple subgraph as

GconfpurpleGconf[Vpurple]CVpurpleC is a purple component|C|>kC,

so each Ci𝒞 is a connected component of Gconfpurple with 1|Ci|k. (When clear from context, we drop the explicit dependence on f.) The next observation follows from the construction of Gconfpurple.

Observation 8.

V(Gconfpurple)V(Gconf)=V(Gconn)=R.

5.3 Step 3: Construction of Sparsified Graph

The goal here is to compute a bipartite graph which is called as sparsified graph (will be defined shortly) which will eventually help us find an approximate solution. To this end, we first consider the induced bipartite subgraph of Gcov defined on the vertices V(Gconfpurple) and B, namely Gcov[V(Gconfpurple)B]. Based on Gcov[V(Gconfpurple)B] and Gconfpurple, we construct another bipartite graph, called sparsified graph, denoted by Gspar as follows:

Construction of Gspar

 

  • V(Gspar)V(Gconfpurple)B (two parts of the bipartite graph).

  • Let 𝒞={C1,Cs} be the set of components in Gconfpurple. Now we have the following Sparsification process which helps to construct the edges in the sparsified graph.

    Sparsification Process: For every bB and every Ci𝒞, let Eb denote the set of edges of Gcov with one endpoint at b and the other endpoint at a neighbor of b in the component Ci in graph Gcov. More formally, Eb{xbE(Gcov)xV(Ci)NGcov(b)}.

    Now we construct E(Gspar) as follows: for every bB, if Eb, then add an arbitrary edge from Eb to E(Gspar).

We now summarize few key properties of Gspar.

Properties of Gspar

 

Let 𝒞={C1,Cs} be the set of connected components in Gconfpurple and Gspar be the graph defined above.

Property 9.

For every connected component Ci𝒞 of Gconfpurple and every subset of vertices XV(Ci), it holds that,

(i) |NGspar(X)|=xX|NGspar(x)|=xX𝖽𝖾𝗀Gspar(x) (1)
(ii) NGspar(X)NGcov(X) (ifXV(Ci)) (2)
(iii) NGspar(V(Ci))=NGcov(V(Ci)) (ifX=V(Ci)) (3)

 

Property 10.

Consider any pair of vertices xV(Ci) and yV(Cj) where i,j[s]. It holds that,

(i)|NGspar(x)NGspar(y)|<εtk2 when ij and (4)
(ii)NGspar(x)NGspar(y)= when i=j (5)

Proof for Property 9.

  1. (i)

    Equation 1 follows directly from the construction of Gspar, obtained as a result of Sparsification Process. More specifically, it follows that for every vertex bB and for every component Ci𝒞, the vertex b is adjacent to at most one vertex of Ci in Gspar. Thus, the vertex b contributes either 0 or 1 to each of the three terms in Equation 1, and in each case, its contribution is same. This establishes the desired property.

  2. (ii)

    It is clear that NGspar(X)NGcov(X), because every vertex vNGspar(X) must also belong to NGcov(X), given that Gspar is a subgraph of Gcov.

  3. (iii)

    Moreover, when X=V(Ci) and bB is a vertex that has a neighbor in Ci in the graph Gspar, then by the definition of the Sparsification Process, we have that b is in fact adjacent to exactly one vertex of Ci in Gspar. Thus b contributes 1 to both sides of Equation 3.

Proof for Property 10.

  1. (i)

    Equation 4 follows directly from the property of Gconf. Note that, NGspar(x)NGspar(y)NGcov(x)NGcov(y) since Gspar is a subgraph of Gcov. Further since xV(Ci),yV(Cj) for ij, we have that x and y are non-adjacent in Gconf, this implies that |NGcov(x)NGcov(y)|<εtk2 (by definition) which further implies that |NGspar(x)NGspar(y)|<εtk2.

  2. (ii)

    Suppose, for the sake of contradiction, there exists a pair of vertices x,yV(Ci) for some Ci satisfying NGspar(x)NGspar(y). Let zNGspar(x)NGspar(y). This contradicts the property of Gspar, since Gspar is obtained as a result of Sparsification Process, which ensures that both edges xz and yz cannot simultaneously appear in E(Gspar).

In the next section, we assign weights to the vertices of Gconfpurple, where each weight reflects the number of vertices dominated by that vertex in Gspar.

5.4 Step 4: Introducing vertex weights to 𝑽(𝑮conf𝐩𝐮𝐫𝐩𝐥𝐞)

In this section, we assign weights to the vertices of the graph Gconfpurple. Let w:V(Gconfpurple)0 be a weight function defined on V(Gconfpurple) as follows:

w(v)=|NGspar(v)|=𝖽𝖾𝗀Gspar(v)for every vV(Gconfpurple) (6)

By Observation 8, we have V(Gconfpurple)V(Gconf)=V(Gconn)=R. Therefore, the assigned weights are restricted to a subset of the vertices in R. Before proceeding further, it is important to note the following assumptions, which will be maintained throughout the subsequent discussion.

We now state a crucial lemma that, via the weight function defined above, provides a lower bound on the Gcov-coverage of any set YV(Gconfpurple) with |Y|k; equivalently, it lower-bounds |NGcov(Y)| for all such Y.

Lemma 11.

For any subset YV(Gconfpurple) of size at most k, it holds that

|NGcov(Y)||NGspar(Y)|yYw(y)εt (7)

Proof.

By Inclusion-exclusion Principle, we have

|NGspar(Y)| = yY|NGspar(y)|x,yY|NGspar(x)NGcov(y)| (8)
+ x,y,zY|NGspar(x)NGspar(y)NGspar(z)|
() yY|NGspar(y)|x,yY|NGspar(x)NGspar(y)| (9)
= yYw(y)x,yY|NGspar(x)NGspar(y)| (10)

Here, inequality () holds because, in Line 9, we truncate the Inclusion–Exclusion formula by discarding all terms from the 3rd term onward, thereby obtaining a lower bound on the original expression. Line 10 follows from Equation 6. Now, it remains to provide an upper bound on x,yY|NGspar(x)NGspar(y)|.

Recall that 𝒞={C1,Cs} is the set of connected components in Gconfpurple. For each i[s], consider the restriction of the connected component Ci to Y, namely CiYV(Ci)Y. Note that Gconfpurple[CiY] need not be connected. Without loss of generality, we assume that for every i[s], CiY, i.e., it contains at least one vertex from Y; otherwise, we discard the empty components from the collection {CiY,,CsY}. Thus we can write,
x,yY|NGspar(x)NGspar(y)|=i[s],x,yCiY|NGspar(x)NGspar(y)|+i,j[s],ijxCiY,yCjY|NGspar(x)NGspar(y)| (11)

Notice that due to Equation 5 of Property 10, we have that for every Ci𝒞 and for every x,yV(Ci), it holds that NGspar(x)NGspar(y)=. Hence the first term of Equation 11 becomes 0.

Moreover, by Equation 4 in Property 10, for all distinct i,j[s] and all vertices xCiYV(Ci) and yCjYV(Cj), we have

|NGspar(x)NGspar(y)|<εtk2,

since x and y are nonadjacent in the conflict graph Gconf. Since |Y|k, there are at most (k2) pairs in Y. Hence the second component of Equation 11 is upper bounded by (k2)εtk2k2εtk2=εt. Substituting these values in Equation 11, we obtain

x,yY|NGspar(x)NGspar(y)|εt

Substituting all these values in Equation 7, we get that |NGspar(Y)|yYw(y)εt. Furthermore, since Gspar is a subgraph of Gcov, we have

|NGcov(Y)||NGspar(Y)|.

(Equivalently, this follows by repeatedly applying Equation 2 from Property 9.) This proves the lemma.

We now require a converse to Lemma 11: an upper bound on the Gcov-coverage of any set XV(Gconfpurple) with |X|k, expressed in terms of the weight function defined above; equivalently, we seek to upper bound |NGcov(X)| for all such X. However, we cannot establish this bound for arbitrary X. Instead, we prove it for a particular class of sets, which is sufficient for our purposes. The next definition characterizes this class.

Definition 12 (Component-respecting set).

Let 𝒞={C1,,Cs} be the collection of all connected components of Gconfpurple. A set ZV(Gconfpurple) is component-respecting (w.r.t.Gconfpurple) if for every i[s], either CiZ or ZCi=. Equivalently, Z is a union of some components from 𝒞, in other words, there exists I[s] with Z=iICi.

Lemma 13.

For any component-respecting set XV(Gconfpurple), it holds that

xXw(x)|NGcov(X)| (12)

Proof.

We know that,

xXw(x)=xX|NGspar(x)|(due to the definition in Equation 6) (13)

Thus, it suffices to show that

xX|NGspar(x)||NGcov(X)| (14)

Every vertex bNGcov(X)B contributes 1 to the right-hand side of Equation 14, since every neighbor of X in Gcov is counted once in |NGcov(X)|. To establish Lemma 13, it suffices to show that every vertex bNGcov(X)B contributes at least 1 to the left-hand side of Equation 14. Recall that 𝒞={C1,,Cs} denotes the set of connected components in Gconfpurple. Consider an arbitrary vertex bNGcov(X) and an arbitrary component Ci𝒞 of Gconfpurple such that bNGcov(V(Ci)). Note that such a component exists since bNGcov(X). Since X is component-respecting with respect to Gconfpurple, it holds that V(Ci)X=V(Ci), i.e., V(Ci)X. Thus, by Equation 3, we have NGspar(V(Ci))=NGcov(V(Ci)). Hence, by the construction of Gspar, we have that b is adjacent to some xV(Ci) in Gspar, i.e., bNGspar(x). Hence b contributes 1 to |NGspar(x)| in the sum on the left-hand side of Equation 14 which concludes the Lemma.

5.5 Step 5: Finding a Maximum Weighted Subtree in 𝑮conn[𝑽(𝑮conf𝐩𝐮𝐫𝐩𝐥𝐞)]

In this section we work in the vertex-weighted graph Gconn[V(Gconfpurple)], endowed with the weight function w defined above (in Equation 6). Our goal is to find a subtree on at most k vertices maximizing total weight (the weight of a tree is the sum of the weights of its vertices). We formalize this as the following subproblem.

Maximum-Weight k-Tree:

Given a graph G, a weight function γ:V(G)0, and an integer k, find a subtree TGG with |V(TG)|k maximizing γ(TG)vV(TG)γ(v).

We solve this via Weighted Tree Isomorphism.

Weighted Tree Isomorphism:

Given a host graph G, a tree T, and a weight function γ:V(G)0, find (if it exists) a subgraph TGG isomorphic to T and of maximum total weight γ(TG).

Proposition 14.

Weighted Tree Isomorphism can be solved in time 2𝒪(h)n𝒪(1), where h=|V(T)| and n=|V(G)|.

Proposition 14 follows from standard techniques. One route is the classic color-coding framework of Alon, Yuster, and Zwick [1, Thm. 6.3], whose dynamic program extends to the weighted objective by maximizing (rather than merely detecting) over states; see the discussion following [1, Thm. 6.3] (cf. also [38]). Alternatively, representative-family techniques yield the same running time with a weight-aware state space [16].

To solve Maximum-Weight k-Tree, we enumerate all non-isomorphic trees on at most k vertices and, for each such tree T, run Weighted Tree Isomorphism (Proposition 14). Otter [37] showed that the number of non-isomorphic (unrooted) trees on h vertices is th=2.956h. Moreover, all non-isomorphic rooted trees on h vertices can be generated in time 𝒪(thh) by the algorithm of Beyer and Hedetniemi [4].

Proposition 15 ([37, 4]).

The number of non-isomorphic trees on h vertices is th=2.956h. Furthermore, all non-isomorphic rooted trees on h vertices can be enumerated in time 𝒪(thh).

Combining Proposition 15 with Proposition 14 yields:

Lemma 16.

Maximum-Weight k-Tree can be solved in time 2𝒪(k)n𝒪(1).

Proof.

There are tk=2.956k, kk, non-isomorphic trees on k vertices. For each such tree T, Weighted Tree Isomorphism runs in 2𝒪(k)n𝒪(1) time (Proposition 14). Hence the total running time is tk2𝒪(k)n𝒪(1)=2𝒪(k)n𝒪(1).

5.6 Putting all the Pieces Together

By assembling all the components and present the complete algorithm as pseudocode in Algorithm 1, we now obtain the main theorem of this section, and prove it by combining the tools and ideas developed above.

Algorithm 1 AlgPartialConnRBDS.
Theorem 17.

Let =(Gcov,Gconn,k,t) be an instance of PartialConRBDS, where Gcov and Gconn are the coverage and connectivity graphs, respectively. If Gcov is Kd,d-free, then for every ε(0,1) there is an algorithm running in time 𝒪(2𝒪(k2d/ε)) that either

  1. (i)

    outputs a set SR with |S|k, Gconn[S] connected, and |NGcov(S)|(1ε)t, or

  2. (ii)

    correctly concludes that no size-k set dominates at least t vertices in Gcov.

Proof.

We first show that, if the instance is a Yes-instance, our algorithm returns a set SR with |S|k, Gconn[S] connected, and |NGcov(S)|(1ε)t. Let SR be a size-k witness satisfying (𝖯1) (i.e., Gconn[S] is connected) and (𝖯2) (i.e., |NGcov(S)|t).

Small t. If t<k2dε, algorithm invokes the exact FPT algorithm of Theorem 4 to decide and, if possible, return a size-k connected solution covering at least t vertices.

Large t. Assume tk2dε. The algorithm implements the purple/green separation by enumerating an (n,p,q) lopsided-universal family (Proposition 7) with p=k and q=k(d1)(ek2ε)d. By the defining property of such families, there exists a good f with SVpurple and NGconf(S)Vgreen. Fix this f.

We delete all green vertices and retain only purple components of size at most k (the filtering step); this preserves each connected piece Ci of Gconf[S]. We then define the vertex-weight function w on V(Gconfpurple) (equivalently, on the host Gconn[V(Gconfpurple)]), as in Step 4 (Section 5.4). Finally, let Tf be a maximum-weight subtree on k vertices in Gconn[V(Gconfpurple)], computed via Lemma 16.

We can potentially take SV(Tf). Since V(Tf)V(Gconfpurple) and |V(Tf)|=k, by Lemma 11 we have

|NGcov(V(Tf))|𝔣V(Tf)w(𝔣)εt. (15)

Recall that S has |S|k and Gconn[S] connected; let T be any spanning tree of Gconn[S]. Since SVpurple and each connected piece Ci has size at most k, the filtering step preserves Gconn[S], hence TGconn[V(Gconfpurple)]. Because Tf is a maximum-weight k-vertex tree in Gconn[V(Gconfpurple)],

𝔣V(Tf)w(𝔣)vV(T)w(v)𝔰Sw(𝔰). (16)

Moreover, by Lemma 13 applied to S and using |NGcov(S)|t,

𝔰Sw(𝔰)|NGcov(S)|t. (17)

Substituting (16) and (17) into (15) yields

|NGcov(V(Tf))| 𝔣V(Tf)w(𝔣)εt
𝔰Sw(𝔰)εt
tεt=(1ε)t.

Thus S=V(Tf) satisfies |S|k, Gconn[S] is connected, and |NGcov(S)|(1ε)t.

Our algorithm outputs either T~, the maximum-weight k-vertex subtree in Gconn[V(Gconfpurple)] over all colorings f, or if w(T~)<t. Since there exists a good f with w(Tf)𝔰Sw(𝔰)t, we have w(T~)w(Tf)t, and thus the algorithm never returns . Set SV(T~). Because V(T~)V(Gconfpurple) and |V(T~)|=k, by Lemma 11 we obtain

|NGcov(V(T~))| 𝔣V(T~)w(𝔣)εt
𝔣V(Tf)w(𝔣)εt
𝔰Sw(𝔰)εt
tεt=(1ε)t.

Hence S satisfies the coverage guarantee. Since our guarantee is conditioned on Yes-instances and we do not require bounds for the No-instance case, this completes the description of the algorithm’s correctness and approximation analysis.

Running time.

Small-t case. When t<k2dε we invoke the exact routine of Theorem 4, which runs in time

Tsmall=𝒪(2𝒪(t))𝒪(2𝒪(k2d/ε)).

Large-t case. For tk2dε we enumerate an (n,p,q) lopsided-universal family (Proposition 7) with p=k and q=k(d1)(ek2ε)d, whose size satisfies ||(1ε)𝒪(dklogk)logn. For each f, all preprocessing – forming Gconfpurple, filtering to components of size k, and computing the needed sparsifiers – is polynomial in n. The only exponential step per f is solving Maximum-Weight k-Tree in the host Gconn[V(Gconfpurple)], which takes 2𝒪(k)n𝒪(1) time (Lemma 16). Hence

Tlarge=𝒪(||2𝒪(k))=𝒪((1ε)𝒪(dklogk)2𝒪(k)).

Combined bound. Taking the worse of the two regimes and hiding polynomial factors, the overall running time is

𝒪(2𝒪(k2d/ε)),

which matches the bound stated in Theorem 17. This conludes the proof.

6 Conclusion

We introduced Partial Connected Red-Blue Dominating Set (PartialConRBDS) as a unifying model for many connectivity-constrained coverage problems that have been studied in prior polynomial-time approximation literature. The expressivity of our model comes from having two separate graphs (i) Gconn for imposing structural properties on the solution (in our case, connectivity), and (ii) Gcov for modeling hypergraph incidences (which we use for requiring certain amount of coverage). We hope that this will serve as a model for understanding a variety of problems from this two-layer perspective.

As for our algorithmic contributions, we began with an exact FPT algorithm parameterized by t. In the realm of FPT approximation, we focused on biclique-free instances, and designed an EPAS that finds a size-k connected set that dominates at least (1ε)t blue vertices; and a complementary PAS that finds a connected set of size at most (1+ε)k that covers t blue vertices. Our key tool is a small family of surrogate weight functions built via neighborhood sparsification procedure, which lets us reduce the search to maximum-weight k-trees in Gconn. Among the lower bound results, the main takeaway is that the problem inherits strong 𝖶-hardness and approximation lower bounds from Max Coverage, when Gcov is arbitrary. Together, these results mark a clear line between what is possible with FPT approximation and what is not, and point to a clean classification by the structure of the coverage and constraint graphs.

Future directions.

  • Faster algorithms. Improve the running time (and parameter dependence) of our EPAS/PAS.

  • Broader tractable classes. Identify larger families of coverage instances where the problem remains FPT (beyond biclique-free), e.g. bounded VC-dimension, or geometric incidences.

  • Other constraints. Study variants where the constraint layer enforces independence/packing, degree bounds, or fault tolerance (e.g., r-connectivity) instead of (or in addition to) connectivity.

  • Lossy kernels. It would be ideal to have lossy kernels as in the vanilla setting [34]; however, we can show Connected Partial Vertex Cover admits no lossy kernels.

References

  • [1] Noga Alon, Raphael Yuster, and Uri Zwick. Color-coding. J. ACM, 42(4):844–856, 1995. doi:10.1145/210332.210337.
  • [2] Mitali Bafna, Karthik C. S., and Dor Minzer. Near optimal constant inapproximability under ETH for fundamental problems in parameterized complexity. In Michal Koucký and Nikhil Bansal, editors, Proceedings of the 57th Annual ACM Symposium on Theory of Computing, STOC 2025, Prague, Czechia, June 23-27, 2025, pages 2118–2129. ACM, 2025. doi:10.1145/3717823.3718257.
  • [3] András A. Benczúr and David R. Karger. Approximating s-t minimum cuts in Õ(n2) time. In Gary L. Miller, editor, Proceedings of the Twenty-Eighth Annual ACM Symposium on the Theory of Computing, Philadelphia, Pennsylvania, USA, May 22-24, 1996, pages 47–55. ACM, 1996. doi:10.1145/237814.237827.
  • [4] Terry Beyer and Sandra Mitchell Hedetniemi. Constant time generation of rooted trees. SIAM J. Comput., 9(4):706–712, 1980. doi:10.1137/0209055.
  • [5] Leizhen Cai, Siu Man Chan, and Siu On Chan. Random separation: A new method for solving fixed-cardinality optimization problems. In Hans L. Bodlaender and Michael A. Langston, editors, Parameterized and Exact Computation, Second International Workshop, IWPEC 2006, Zürich, Switzerland, September 13-15, 2006, Proceedings, volume 4169 of Lecture Notes in Computer Science, pages 239–250. Springer, 2006. doi:10.1007/11847250_22.
  • [6] Parinya Chalermsook, Marek Cygan, Guy Kortsarz, Bundit Laekhanukit, Pasin Manurangsi, Danupon Nanongkai, and Luca Trevisan. From gap-exponential time hypothesis to fixed parameter tractable inapproximability: Clique, dominating set, and more. SIAM J. Comput., 49(4):772–810, 2020. doi:10.1137/18M1166869.
  • [7] Yijia Chen and Bingkai Lin. The constant inapproximability of the parameterized dominating set problem. SIAM J. Comput., 48(2):513–533, 2019. doi:10.1137/17M1127211.
  • [8] Vincent Cohen-Addad, Anupam Gupta, Amit Kumar, Euiwoong Lee, and Jason Li. Tight FPT Approximations for k-Median and k-Means. In Christel Baier, Ioannis Chatzigiannakis, Paola Flocchini, and Stefano Leonardi, editors, 46th International Colloquium on Automata, Languages, and Programming, ICALP 2019, July 9-12, 2019, Patras, Greece, volume 132 of LIPIcs, pages 42:1–42:14. Schloss Dagstuhl – Leibniz-Zentrum für Informatik, 2019. doi:10.4230/LIPIcs.ICALP.2019.42.
  • [9] Marek Cygan, Fedor V. Fomin, Łukasz Kowalik, Daniel Lokshtanov, Dániel Marx, Marcin Pilipczuk, Michał Pilipczuk, and Saket Saurabh. Parameterized algorithms, volume 5. Springer, 2015. doi:10.1007/978-3-319-21275-3.
  • [10] Marek Cygan, Fedor V. Fomin, Lukasz Kowalik, Daniel Lokshtanov, Dániel Marx, Marcin Pilipczuk, Michal Pilipczuk, and Saket Saurabh. Parameterized Algorithms. Springer, 2015. doi:10.1007/978-3-319-21275-3.
  • [11] Gianlorenzo D’Angelo and Esmaeil Delfaraz. Approximation algorithms for connected maximum coverage. In Sanmay Das, Ann Nowé, and Yevgeniy Vorobeychik, editors, Proceedings of the 24th International Conference on Autonomous Agents and Multiagent Systems, AAMAS 2025, Detroit, MI, USA, May 19-23, 2025, pages 538–546. International Foundation for Autonomous Agents and Multiagent Systems / ACM, 2025. doi:10.5555/3709347.3743569.
  • [12] Reinhard Diestel. Graph theory, volume 173. Springer Nature, 2025.
  • [13] Irit Dinur and David Steurer. Analytical approach to parallel repetition. In David B. Shmoys, editor, Symposium on Theory of Computing, STOC 2014, New York, NY, USA, May 31 - June 03, 2014, pages 624–633. ACM, 2014. doi:10.1145/2591796.2591884.
  • [14] Rodney G. Downey and Michael R. Fellows. Fixed-parameter tractability and completeness I: basic results. SIAM J. Comput., 24(4):873–921, 1995. doi:10.1137/S0097539792228228.
  • [15] Uriel Feige and Mohammad Mahdian. Finding small balanced separators. In Jon M. Kleinberg, editor, Proceedings of the 38th Annual ACM Symposium on Theory of Computing, Seattle, WA, USA, May 21-23, 2006, pages 375–384. ACM, 2006. doi:10.1145/1132516.1132573.
  • [16] Fedor V. Fomin, Daniel Lokshtanov, Fahad Panolan, and Saket Saurabh. Efficient computation of representative families with applications in parameterized and exact algorithms. J. ACM, 63(4):29:1–29:60, 2016. doi:10.1145/2886094.
  • [17] Rajiv Gandhi, Samir Khuller, and Aravind Srinivasan. Approximation algorithms for partial covering problems. J. Algorithms, 53(1):55–84, 2004. doi:10.1016/J.JALGOR.2004.04.002.
  • [18] Anupam Gupta, Euiwoong Lee, Jason Li, Pasin Manurangsi, and Michal Wlodarczyk. Losing treewidth by separating subsets. In Timothy M. Chan, editor, Proceedings of the Thirtieth Annual ACM-SIAM Symposium on Discrete Algorithms, SODA 2019, San Diego, California, USA, January 6-9, 2019, pages 1731–1749. SIAM, 2019. doi:10.1137/1.9781611975482.104.
  • [19] Venkatesan Guruswami, Bingkai Lin, Xuandi Ren, Yican Sun, and Kewen Wu. Parameterized inapproximability hypothesis under exponential time hypothesis. In Bojan Mohar, Igor Shinkar, and Ryan O’Donnell, editors, Proceedings of the 56th Annual ACM Symposium on Theory of Computing, STOC 2024, Vancouver, BC, Canada, June 24-28, 2024, pages 24–35. ACM, 2024. doi:10.1145/3618260.3649771.
  • [20] Venkatesan Guruswami, Bingkai Lin, Xuandi Ren, Yican Sun, and Kewen Wu. Almost Optimal Time Lower Bound for Approximating Parameterized Clique, CSP, and More, under ETH. In Michal Koucký and Nikhil Bansal, editors, Proceedings of the 57th Annual ACM Symposium on Theory of Computing, STOC 2025, Prague, Czechia, June 23-27, 2025, pages 2136–2144. ACM, 2025. doi:10.1145/3717823.3718130.
  • [21] Dorit S Hochbaum and Anu Pathria. Analysis of the greedy approach in problems of maximum k-coverage. Naval Research Logistics (NRL), 45(6):615–627, 1998.
  • [22] Dorit S. Hochbaum and Xu Rao. Approximation algorithms for connected maximum coverage problem for the discovery of mutated driver pathways in cancer. Inf. Process. Lett., 158:105940, 2020. doi:10.1016/J.IPL.2020.105940.
  • [23] Tanmay Inamdar, Pallavi Jain, Daniel Lokshtanov, Abhishek Sahu, Saket Saurabh, and Anannya Upasana. Satisfiability to coverage in presence of fairness, matroid, and global constraints. In Karl Bringmann, Martin Grohe, Gabriele Puppis, and Ola Svensson, editors, 51st International Colloquium on Automata, Languages, and Programming, ICALP 2024, July 8-12, 2024, Tallinn, Estonia, volume 297 of LIPIcs, pages 88:1–88:18. Schloss Dagstuhl – Leibniz-Zentrum für Informatik, 2024. doi:10.4230/LIPIcs.ICALP.2024.88.
  • [24] Pallavi Jain, Lawqueen Kanesh, Fahad Panolan, Souvik Saha, Abhishek Sahu, Saket Saurabh, and Anannya Upasana. Parameterized Approximation Schemes for Biclique-Free Max k-Weight SAT and Max Coverage. ACM Trans. Algorithms, August 2025. Just Accepted. doi:10.1145/3763238.
  • [25] Karthik C. S., Bundit Laekhanukit, and Pasin Manurangsi. On the parameterized complexity of approximating dominating set. J. ACM, 66(5):33:1–33:38, 2019. doi:10.1145/3325116.
  • [26] Samir Khuller, Manish Purohit, and Kanthi K. Sarpatwar. Analyzing the optimal neighborhood: Algorithms for partial and budgeted connected dominating set problems. SIAM J. Discret. Math., 34(1):251–270, 2020. doi:10.1137/18M1212094.
  • [27] Ioannis Lamprou, Ioannis Sigalas, and Vassilis Zissimopoulos. Improved budgeted connected domination and budgeted edge-vertex domination. Theor. Comput. Sci., 858:1–12, 2021. doi:10.1016/J.TCS.2021.01.030.
  • [28] Bingkai Lin. The parameterized complexity of the k-biclique problem. J. ACM, 65(5):34:1–34:23, 2018. doi:10.1145/3212622.
  • [29] Bingkai Lin. A simple gap-producing reduction for the parameterized set cover problem. In Christel Baier, Ioannis Chatzigiannakis, Paola Flocchini, and Stefano Leonardi, editors, 46th International Colloquium on Automata, Languages, and Programming, ICALP 2019, July 9-12, 2019, Patras, Greece, volume 132 of LIPIcs, pages 81:1–81:15. Schloss Dagstuhl – Leibniz-Zentrum für Informatik, 2019. doi:10.4230/LIPIcs.ICALP.2019.81.
  • [30] Daniel Lokshtanov, Abhishek Sahu, Saket Saurabh, Vaishali Surianarayanan, and Jie Xue. Parameterized approximation for capacitated d-hitting set with hard capacities. In Yossi Azar and Debmalya Panigrahi, editors, Proceedings of the 2025 Annual ACM-SIAM Symposium on Discrete Algorithms, SODA 2025, New Orleans, LA, USA, January 12-15, 2025, pages 1565–1592. SIAM, 2025. doi:10.1137/1.9781611978322.48.
  • [31] Daniel Lokshtanov, Saket Saurabh, and Vaishali Surianarayanan. A parameterized approximation scheme for min k-cut. SIAM J. Comput., 53(6):S20–205, 2024. doi:10.1137/20M1383197.
  • [32] Pasin Manurangsi. A Note on Max k-Vertex Cover: Faster FPT-AS, Smaller Approximate Kernel and Improved Approximation. In Jeremy T. Fineman and Michael Mitzenmacher, editors, 2nd Symposium on Simplicity in Algorithms, SOSA 2019, January 8-9, 2019, San Diego, CA, USA, volume 69 of OASIcs, pages 15:1–15:21. Schloss Dagstuhl – Leibniz-Zentrum für Informatik, 2019. doi:10.4230/OASIcs.SOSA.2019.15.
  • [33] Pasin Manurangsi. Tight running time lower bounds for strong inapproximability of maximum k-coverage, unique set cover and related problems (via t-wise agreement testing theorem). In Shuchi Chawla, editor, Proceedings of the 2020 ACM-SIAM Symposium on Discrete Algorithms, SODA 2020, Salt Lake City, UT, USA, January 5-8, 2020, pages 62–81. SIAM, 2020. doi:10.1137/1.9781611975994.5.
  • [34] Pasin Manurangsi. Improved FPT approximation scheme and approximate kernel for biclique-free max k-weight SAT: greedy strikes back. Theor. Comput. Sci., 1028:115033, 2025. doi:10.1016/J.TCS.2024.115033.
  • [35] Dániel Marx. Parameterized complexity and approximation algorithms. Comput. J., 51(1):60–78, 2008. doi:10.1093/comjnl/bxm048.
  • [36] Neeldhara Misra, Geevarghese Philip, Venkatesh Raman, Saket Saurabh, and Somnath Sikdar. FPT algorithms for connected feedback vertex set. J. Comb. Optim., 24(2):131–146, 2012. doi:10.1007/S10878-011-9394-2.
  • [37] Richard Otter. The number of trees. Annals of Mathematics, 49(3):583–599, 1948.
  • [38] Jürgen Plehn and Bernd Voigt. Finding minimally weighted subgraphs. In Rolf H. Möhring, editor, Graph-Theoretic Concepts in Computer Science, 16rd International Workshop, WG ’90, Berlin, Germany, June 20-22, 1990, Proceedings, volume 484 of Lecture Notes in Computer Science, pages 18–29. Springer, 1990. doi:10.1007/3-540-53832-1_28.
  • [39] François Sellier. Parameterized matroid-constrained maximum coverage. In Inge Li Gørtz, Martin Farach-Colton, Simon J. Puglisi, and Grzegorz Herman, editors, 31st Annual European Symposium on Algorithms, ESA 2023, September 4-6, 2023, Amsterdam, The Netherlands, volume 274 of LIPIcs, pages 94:1–94:16. Schloss Dagstuhl – Leibniz-Zentrum für Informatik, 2023. doi:10.4230/LIPIcs.ESA.2023.94.
  • [40] Piotr Skowron and Piotr Faliszewski. Chamberlin-courant rule with approval ballots: Approximating the maxcover problem with bounded frequencies in FPT time. J. Artif. Intell. Res., 60:687–716, 2017. doi:10.1613/JAIR.5628.
  • [41] Daniel A. Spielman and Shang-Hua Teng. Spectral sparsification of graphs. SIAM J. Comput., 40(4):981–1025, 2011. doi:10.1137/08074489X.
  • [42] Michal Wlodarczyk. Parameterized inapproximability for steiner orientation by gap amplification. In Artur Czumaj, Anuj Dawar, and Emanuela Merelli, editors, 47th International Colloquium on Automata, Languages, and Programming, ICALP 2020, July 8-11, 2020, Saarbrücken, Germany (Virtual Conference), volume 168 of LIPIcs, pages 104:1–104:19. Schloss Dagstuhl – Leibniz-Zentrum für Informatik, 2020. doi:10.4230/LIPIcs.ICALP.2020.104.