Improved Approximation for Pathwidth One Vertex Deletion and Parameterized Complexity of Its Variants
Abstract
The pathwidth of a graph is a measure of how path-like the graph is. The Pathwidth One Vertex Deletion (POVD) problem asks whether, given an undirected graph and an integer , one can delete at most vertices from so that the remaining graph has pathwidth at most one. This is a natural variation of the classical Feedback vertex Set (FVS) problem, where the deletion of at most vertices results in a graph of treewidth at most one. In this work, we investigate POVD in the realm of approximation algorithms. We first design a -approximation algorithm for POVD running in polynomial time. Then, using this constant factor approximation algorithm, we obtain a randomized parameterized approximation algorithm for POVD running in time , that improves the fastest existing running times for approximation ratios in the range . Here the constant depends on the approximation factor alone and has value , which lies in the range , when .
Taking inspiration from two extensively studied problems, namely Connected FVS and Independent FVS, we investigate two variations of the POVD problem from the perspective of parameterized algorithms. These variations are the connected variant, called Connected pathwidth One Vertex Deletion (CPOVD) and the independent variant, called Independent Pathwidth One Vertex Deletion (IPOVD). While in CPOVD the subgraph induced by the vertices to be deleted needs to be connected, in IPOVD it needs to be independent. Specifically, we show the following results.
-
CPOVD can be solved in time and admits no polynomial kernel unless NP co-NP/poly.
-
IPOVD can be solved in time, and admits a kernel of size .
Keywords and phrases:
Pathwidth, Parameterized complexity, Approximation, KernelizationFunding:
Satyabrata Jana: Supported by the Engineering and Physical Sciences Research Council (EPSRC) via the project MULTIPROCESS (grant no. EP/V044621/1).Copyright and License:
2012 ACM Subject Classification:
Theory of computation Design and analysis of algorithmsEditors:
C. Aiswarya, Ruta Mehta, and Subhajit RoySeries and Publisher:
Leibniz International Proceedings in Informatics, Schloss Dagstuhl – Leibniz-Zentrum für Informatik
1 Introduction
Treewidth is a fundamental graph parameter that measures how “tree-like” the graph is. The notion of treewidth was introduced by Robertson and Seymour in their seminal Graph Minors series [27]. Formally, it is defined via tree decompositions, where the goal is to map a graph into a tree-like structure of subsets of vertices of the graph, called bags, in a way that minimizes the size of the largest bag minus one. Treewidth plays a central role in the field of parameterized complexity and has been extensively studied due to its wide applicability in algorithm design, particularly for fixed-parameter tractable (FPT) algorithms. Many computationally hard problems become efficiently solvable when restricted to graphs of bounded treewidth.
A closely related parameter is pathwidth, which is a notion closely related to treewidth, and was also introduced by Robertson and Seymour in the Graph Minors series [26]. Here the underlying decomposition structure is restricted to paths instead of trees. While treewidth measures the tree-likeness of a graph, pathwidth measures how “path-like” the graph is.
The Feedback Vertex Set (FVS) problem is a classic graph modification problem where given a graph and an integer , the objective is to delete at most vertices to obtain an acyclic graph (i.e., a forest). This problem can be viewed through the lens of treewidth, as forests are exactly the graphs with treewidth at most one. Due to this we can refer FVS problem as Treewidth One Vertex Deletion. FVS has been intensively studied in the FPT framework [12, 13, 16], leading to numerous algorithmic breakthroughs. Several variants of FVS have also been investigated. Two notable examples are: Connected Feedback Vertex Set (CFVS) [1, 6, 22] and Independent Feedback Vertex Set (IFVS)[2, 17, 21, 28], where in CFVS the subgraph of induced by solution vertices needs to be connected, and in IFVS it needs to be independent. Currently, the most efficient algorithm known for IFVS is provided by Li and Pilipczuk, which runs in time [17]. Regarding CFVS, Cygan et al.[7] gave the fastest known randomized algorithm for CFVS, achieving a runtime of .
Many classical graph problems such as Vertex Cover, Feedback Vertex Set (FVS), Odd Cycle Transversal (OCT), and Dominating Set (DOM) have been extensively studied in the fields of parameterized complexity and graph algorithms. These problems are central to both theoretical research and practical applications, and their algorithmic properties – especially in the fixed-parameter tractability (FPT) setting. In recent years, increasing attention has been devoted to connected and independent variants of these problems, which often impose additional structural constraints and lead to new algorithmic challenges. Problems such as Connected Vertex Cover [14, 23], Connected DOM [11], Connected OCT [7] have emerged as natural extensions, where solutions are required to induce connected subgraphs. Similarly, Independent Dominating Set [18], Independent OCT [20], Independent - Separator [20], Independent Multicut [19], Independent Directed FVS [19], Independent Cutset [25] introduce independence constraints.
1.1 Pathwidth One Vertex Deletion
Analogous to FVS or Treewidth One Vertex Deletion, Philip et al.[24] introduced the Pathwidth-One Vertex Deletion (POVD) problem. In problem POVD, given a graph and an integer , the goal is to delete at most vertices such that the remaining graph has pathwidth at most one. We define the problem formally as follows.
| Pathwidth One Vertex Deletion (POVD) | |
|---|---|
| Input: | A graph , a positive integer . |
| Question: | Does there exist such that and pathwidth of is at most one? |
It is known that a graph has pathwidth at most one if and only if it is a disjoint union of caterpillars [29]. A caterpillar is a special kind of tree where all vertices are within distance one of a central path called the spine [24, 29]. Hence, graphs of pathwidth at most one form a subclass of forests, exhibiting even simpler structure.
It follows from a general NP-hardness result of Lewis and Yannakakis [15] that this problem is NP-complete. Therefore, the POVD problem has attracted some attention in the parameterized complexity community and several FPT algorithms have been developed for this problem. Philip et al. [24] gave an algorithm for POVD with running time111We use notation to hide factors polynomial in the input size. and a kernel with vertices. Cygan et al. [8] improved these results by giving an algorithm with running time and a kernel with edges. The current fastest FPT algorithm for POVD has a running time of [29]. Very recently, Esmer and Kulik [10] gave a parameterized approximation algorithm for POVD with a running time of for some , achieving approximation factors lying in the range .
1.2 Our Contributions
Following the line of research on FVS variants such as CFVS and IFVS, in this paper, we initiate the study of two natural extensions of POVD, one capturing independence, and the other emphasizing connectivity. Our research looks into these variants, revealing their intricacies with the following theme.
Connectivity vs Independence
Formally, we study the following connected and independent variants of POVD.
| Connected Pathwidth One Vertex Deletion (CPOVD) | |
|---|---|
| Input: | A graph , a positive integer . |
| Question: | Is there a vertex subset of size at most such that is connected and pathwidth of is at most one? |
| Independent Pathwidth One Vertex Deletion (IPOVD) | |
|---|---|
| Input: | A graph , a positive integer . |
| Question: | Is there a vertex subset of size at most such that is independent and pathwidth of is at most one? |
We obtain the following results for CPOVD and IPOVD.
-
IPOVD admits a kernel of size and can be solvable in time .
-
CPOVD does not admit a polynomial kernel but can be solvable in time .
In Section 4, we develop a kernel of size for IPOVD. Our approach builds on the framework of established kernelization algorithms for POVD [8, 24] and IFVS [21]. However, due to the inherited generality of our problem, we need to introduce substantial modifications to the reduction rules and perform a more detailed analysis. The size of our kernel matches, up to a constant factor, the best-known kernel size for IFVS. In contrast to IPOVD, in Section 5, we prove that CPOVD does not admit a polynomial kernel parameterized by the solution size unless NP co-NP/poly. This negative result follows via a straightforward reduction from the Connected Vertex Cover problem, which is also known not to have a polynomial kernel under the same assumption.
On the algorithmic side, in Section 6, we give an FPT algorithm for CPOVD running in time . The core idea is to first eliminate certain obstructive structures from the graph through branching, simplifying the instance. Taking advantage of this simplified graph, we then invoke an FPT algorithm for Group Steiner Tree in a black-box fashion with carefully constructed input, allowing us to test whether any minimal POVD solution can be extended to a connected set that meets the size constraint. Beyond the kernel for IPOVD, in Section 7 we also present an FPT algorithm for IPOVD running in time . This algorithm is conceptually similar to the earlier FPT approach for CPOVD: we first branch to remove some of the obstructive structures, and then, again with the help of the simplified structure of the residual graph, check whether any minimal POVD solution of size at most is independent.
Interestingly, despite the active study of approximation algorithms for other vertex deletion problems, no polynomial-time approximation algorithm better than the trivial -approximation was previously known for POVD. We provide a -approximation algorithm for POVD running in polynomial time. This algorithm is based on two key observations that any pathwidth-one vertex deletion set also forms a feedback vertex set and, POVD can be solved in polynomial time when the input graph is acyclic.
Furthermore, we investigate POVD in the realm of parameterized approximation. Recently, Esmer and Kulik [10], presented a parameterized approximation algorithm for POVD that achieve an approximation guarantee with factors in the range in randomized FPT running time, that improves over the best known parameterized algorithm for POVD. In Section 3.3, with the help of the -approximation algorithm, we obtain another parameterized approximation algorithm that improves upon the existing randomized FPT running times of algorithms given by [10] for approximation ratios in the range . Due to space constraints, some proofs have been moved to Appendix (Appendix A).
2 Preliminaries
All the graphs we consider are simple, undirected and unweighted graphs. For a graph , we denote the set of vertices of the graph by and the set of edges of the graph by . We denote by , where the graph is clear from the context. An edge with endpoint and is denoted by . For a set , the subgraph of induced by is denoted by and it is defined as the subgraph of with vertex set and edge set . We denote the subgraph as . For any graph and , a vertex is said to be a neighbour of if . The open neighbourhood of is denoted by and it is defined as . We denote degree of a vertex by where . The closed neighbourhood of a vertex is . By , we denote the maximum degree of a vertex in . We denote a polynomial function of as . A set is a pathwidth-one vertex deletion set (povd) if has pathwidth at most one. If is an independent set, then is an independent povd (ipovd). If is connected, then is a connected povd (cpovd).
Parameterized Complexity.
We refer to [5] for an introduction to the area. A parameterized problem where is a string over and a parameter is fixed-parameter tractable (or FPT) if it can be solved in time for some computable function . A kernelization algorithm, or in short, kernelization, for a parameterized problem is an algorithm that, given , outputs in time polynomial in , a pair such that (a) if and only if and (b) , where is some computable function depending only on . The output of kernelization is referred to as the kernel and the function is referred to as the size of the kernel. If , then we say that admits a polynomial kernel.
Branching Algorithm.
We need to define the notion of a branching tree for the analysis of our algorithms. An algorithm or a procedure on an instance of a problem is called a branching algorithm if can potentially recursively call itself on instances for such that output of depends on the output of the recursive calls. A run of the algorithm on an instance is said to have executed a branching step, if it recursively calls itself on at least two instances. A branching tree associated with an algorithm and an input instance is a tree where the root node of the tree is associated with the instance . If on an instance , branching step is executed and recursive call is made on the instances , then the node associated with in the branching tree would have children, each associated with one of the instances on which the recursive call was made. If there is no branching step executed on any of the instances associated with a node in the branching tree, then that node would be a leaf node in the branching tree.
Now, we state some useful known results on graphs with pathwidth at most one.
Fact 1 ([29, Lemma 1]).
Let be a graph. The following assertions are equivalent.
-
The pathwidth of G is at most .
-
G is a caterpillar forest.
-
G is acyclic and does not contain a subgraph isomorphic to (see Figure 1).
Fact 2 ([24, Lemma 3]).
Let , where is a cycle of length . If is a graph that does not contain any element of as a subgraph, then each connected component of is either a tree or a cycle with zero or more pendant vertices (“hairs”) attached to it.
3 Approximation for POVD
In this section, we describe the approximation algorithms for Pathwidth One Vertex Deletion. We first obtain a -approximation algorithm for POVD that runs in polynomial time. Then using this polynomial time approximation and results of [10], we obtain parameterized -approximation for every , the running time for which scales between the running times of the best known FPT algorithm for POVD () and the polynomial time -approximation algorithm designed in Section 3.2 (). Before introducing these approximation techniques, we begin by proving a helpful result: POVD can be solved in polynomial time on acyclic graphs.
3.1 Polynomial Time Algorithm Solving POVD on Acyclic Graphs
We first solve POVD on trees using a simple greedy approach. By Fact 1, the obstructions are cycles and the graph . Since the input is a tree, the only possible obstruction is , which has seven vertices, see Figure 1.
Given a tree , we first root it at an arbitrary vertex using BFS. We then process the tree in a bottom-up manner to detect occurrences of . Whenever a copy of is encountered for the first time in this traversal, we select a vertex in the highest layer (closest to the root) of that copy. We include in the solution set and remove together with the entire subtree rooted at from . The process is then repeated: we again scan the tree bottom-up, locate the next copy of , pick a highest-layered vertex , and remove its subtree.
The algorithm terminates once the tree contains no copy of . At that point, the set of selected vertices is returned as the solution.
We now argue the correctness of this approach.
Claim 1.
The above algorithm returns a minimum pathwidth-one vertex deletion set when the input graph is a tree.
Proof.
Let be a tree and let be a copy of in such that our algorithm identifies during its first execution and selects one of the topmost vertices from into the solution. To prove correctness, it suffices to show that for any other , say in intersecting , the vertex lies in . If this holds, then choosing is always as good as choosing any other vertex from .
Assume for contradiction that , while . Then there must exist a vertex with . Since is chosen as one of the topmost vertices of , the vertex must occur at the same level or lower than in the rooted tree. This would imply that there exist two distinct root-to- paths in : one passing through and another through that avoids . Consequently, this would create a cycle in , contradicting the assumption that is a tree. Hence, must belong to every intersecting , establishing the claim.
Since BFS runs in polynomial time and each scan for also takes polynomial time, the overall running time of the greedy algorithm is polynomial. Thus, POVD can be solved in polynomial time on trees. Moreover, since every connected component of an acyclic graph is itself a tree, we can apply the same approach independently on each component of an acyclic graph. This yields the following theorem.
Theorem 2.
POVD can be solved in polynomial time on acyclic graphs.
3.2 Polynomial Time 3-approximation for POVD
In this section, we prove the following result.
Theorem 3.
There exists a polynomial time -approximation algorithm for POVD.
The approach for our approximation algorithm is based on two key observations that any pathwidth-one vertex deletion set also forms a feedback vertex set and, as shown in the previous subsection, POVD can be solved in polynomial time when the input graph is acyclic. Our approximation algorithm proceeds as follows.
-
Next, we compute the minimum pathwidth-one vertex deletion set for using the polynomial-time algorithm described in Section 3.1. Let this set be . Finally, we return the set .
Since is a pathwidth-one vertex deletion for , must be a pathwidth-one vertex deletion for . Also, the above-described algorithm runs in polynomial time, as can be found in polynomial time using one of the -approximation algorithms for Feedback Vertex Set, and for , we again need polynomial time only because of the result in Section 3.1. We next argue for the size bound of in the following claim.
Claim 4.
For any given graph , the size of , returned by the algorithm described above, can be at most three times the size of an optimal pathwidth-one vertex deletion set of .
Proof.
Let be an optimal pathwidth-one vertex deletion for the graph . We want to prove that . We know that is a caterpillar forest. Also, let be an optimal fvs for . Since is also acyclic, we must have . Again, since is a -approximate fvs solution, we get On the other side, is an optimal pathwidth-one vertex deletion for , which implies . Combining these, we get,
3.3 Parameterized Approximation for POVD
In this subsection, we improve the running time of the parameterized approximation algorithm for POVD introduced by Esmer and Kulik [10], for approximation factors between and . Recall that in [10], the authors provided running time for all approximation factors in the range from to . However, due to our -approximation described in the previous subsection, it is no longer necessary to consider approximation factors greater than in the parameterized setting for POVD. By combining our -approximation with the “Sampling with a black box” technique given in [10], we obtain faster -approximation algorithms for all such that .
Esmer and Kulik [10] showed that a -approximation for a vertex deletion minimization problem can be obtained in time time, where the precondition is an -approximation algorithm for the problem that runs in time with , and that has a sampling step where success probability of each step is at least . Here denotes Kullback-Leibler divergence222Kullback-Leibler Divergence: given two numbers , the Kullback-Leibler divergence of and is defined to be . The formula of is as follows.
| (1) |
Here, and are defined to be the unique numbers which satisfy
| (2) |
Note that for POVD, it was previously known that an -approximation with (which is same as an exact algorithm) can be obtained in time [29], and there is also a sampling step for POVD that succeeds with probability [10]. Building on this precondition, a -approximation has been developed in [10], which runs in time for every , where
| (3) |
| 1.0 | 3.888 | 7.0000 | 3.888 | 2.0 | 2.0417 | 2.0000 | 2.0000 |
| 1.1 | 3.6412 | 5.0846 | 3.6412 | 2.1 | 1.9391 | 1.8660 | 1.8660 |
| 1.2 | 3.4100 | 4.2041 | 3.4100 | 2.2 | 1.8498 | 1.7411 | 1.7411 |
| 1.3 | 3.1936 | 3.6324 | 3.1936 | 2.3 | 1.7713 | 1.6245 | 1.6245 |
| 1.4 | 2.9908 | 3.2220 | 2.9908 | 2.4 | 1.7018 | 1.5157 | 1.5157 |
| 1.5 | 2.8010 | 2.9102 | 2.8010 | 2.5 | 1.6399 | 1.4142 | 1.4142 |
| 1.6 | 2.6232 | 2.6642 | 2.6232 | 2.6 | 1.5844 | 1.3195 | 1.3195 |
| 1.7 | 2.4567 | 2.4647 | 2.4567 | 2.7 | 1.5346 | 1.2311 | 1.2311 |
| 1.7615 | 2.3596 | 2.3596 | 2.3596 | 2.8 | 1.4896 | 1.1486 | 1.1486 |
| 1.8 | 2.3007 | 2.2973 | 2.2973 | 2.9 | 1.4487 | 1.0717 | 1.0717 |
| 1.9 | 2.1604 | 2.1435 | 2.1435 | 3.0 | 1.4115 | 1.0000 | 1.0000 |
Now, because of our new -approximation with that runs in time with (since it is a polynomial time -approximation), from Equation 1, we can obtain another parameterized -approximation for every , running in time , where
Now, we compute . Note that, to evaluate for , we need the value of . Let , then from Equation 2, we have
This implies,
Thus, we now have two different parameterized -approximation algorithms for POVD, running in time and , respectively. By comparing and , we observe that when , while when .
Combining these observations, we obtain the following theorem.
Theorem 5.
For each , POVD has a -approximation algorithm running in time , where
4 Kernelization for Independent POVD
In this section, we present a polynomial kernel of size for IPOVD. Given an instance , our kernelization procedure first reduces it to an equivalent instance of a more general problem called Ext-IPOVD. In Ext-IPOVD, the input is a triple , and the goal is to determine whether there exists an independent pathwidth-one vertex deletion set in that is disjoint from the set . For any instance of IPOVD, we can create an equivalent instance of Ext-IPOVD by simply setting . Note that when , we can solve an input instance of IPOVD in polynomial time by enumerating all possible subsets of size at most . Thus, for a given instance , we assume that .
We then design a kernel of size for Ext-IPOVD. Once we obtain that we then convert it back into an equivalent instance of IPOVD of size . The following lemma formalizes this transformation.
Lemma 6.
Given an instance of size for Ext-IPOVD, we can construct an equivalent instance for IPOVD of size .
Proof.
Let be an instance of Ext-IPOVD with size . From , we create an equivalent instance of IPOVD as follows. We define the vertex set by extending with additional vertices . The edge set is formed the following way. We extend the edge set by adding the the following edges. For each , we add the edges , and then add an edge for every .
Observe that any ipovd solution for of size at most must include the vertex . Otherwise, would need to include at least one vertex from each of the disjoint pairs , contradicting the size bound of . Since is an independent set that includes , it cannot contain any vertex from . Therefore, forms an independent set in of size at most , disjoint from . Furthermore, as is an induced subgraph of , is also a valid povd solution for . Conversely, let be an ipovd solution of size at most for the instance . Since avoids all vertices in , the set forms a valid ipovd solution for of size at most . Finally, since the size of is bounded by and we only introduce new vertices and additional edges, the total size of remains bounded by .
So now we turn our attention to kernelization for Ext-IPOVD. We develop several reduction rules for this purpose and apply them exhaustively to any given instance of Ext-IPOVD. When multiple reduction rules are applicable simultaneously, we always choose the one with the smallest index. Once no further reduction rules apply, we show that the size of the resulting instance is bounded by .
Note that certain reduction rules can produce trivial Yes instances (the instance ) or trivial No instances (the instance ) for Ext-IPOVD. Some of our rules are slight adaptations of known reductions, while others are specifically tailored to this problem. Below, we present these rules and provide proofs of correctness for all non-trivial rules. We begin by describing three simple reduction rules.
Reduction Rule 1.
If and has pathwidth at least two, return a trivial No instance.
Reduction Rule 2.
If and has pathwidth at most one, return a trivial Yes instance.
Reduction Rule 3.
If has a connected component such that is caterpillar, remove from the graph. Return the instance .
The proof of the correctness of the above three reduction rules is straightforward. Below we present few other rules (Reduction Rules 4-8), correctness for each reduction rule, are provided in the Appendix (Section A.1).
Reduction Rule 4.
Given an instance , if a vertex has more than pendant neighbors, retain any of them, ensuring that at least one is from if has any pendant neighbor in . Let be the set of deleted pendant neighbors, the resulting graph, and update . Return the instance .
Now, we will state two known reduction rules given in [24] with slight modification.
Reduction Rule 5.
Let be a vertex in such that there exists a matching of size in the graph , where each edge in has at least one endpoint in the neighborhood . If , then return trivial No instance. Otherwise, remove from the graph, add all vertices in to the set , and decrease by one. Return the instance .
Reduction Rule 6.
Let be a path in such that, for every vertex with , all its neighbors other than and are pendant vertices. If , contract the edge in , and assign the resulting vertex in if both and are in . Denote the resulting graph by . The new instance is where
Towards proving the correctness of Reduction 6, we use the following observation.
Observation 7 ([24, Observation 2]).
-
1.
Let be a graph that contains at least one cycle. Any graph obtained from by contracting an edge of also contains at least one cycle (possibly with parallel edges).
-
2.
Let be a graph that contains a as a subgraph. If is a graph obtained from by contracting an edge where either or (or both) is not part of any in , then also contains a as a subgraph.
Let be a vertex in a graph , and let . An -flower passing through is a set of distinct cycles in such that each cycle contains and no two cycles share any vertex other than . The vertex is said to be at the center of the flower.
Reduction Rule 7.
Let be a vertex of the graph such that is the center of a -flower. If , then return trivial No instance. Otherwise, remove from the graph, add all vertices in to the set , and decrease by one. Return the instance
Lemma 8.
Let . Consider an Ext-IPOVD instance . Assume that there is a vertex of degree at least and Reduction Rules 1-7 cannot be applied. Then one can, in polynomial time, find disjoint sets of vertices and a function satisfying the following properties:
-
1.
every vertex in is connected by a single edge to ,
-
2.
is an independent set in ,
-
3.
, and each vertex from has a neighbor in ,
-
4.
the sets are disjoint for any two different (as before we denote the elements of by in and call them the private neighbors of ).
The above mentioned lemma is similar to Lemma 7 in [8]. The only difference lies in the bound on . Since our graph is simple, the maximum number of non-pendant neighbors of can be at most , whereas in their case it is . The proof of our Lemma 8 follows directly from the proof of Lemma 7 in [8].
Reduction Rule 8.
Let be a vertex of degree at least , and assume that none of the earlier reduction rules are applicable. Let and be the nonempty sets obtained by applying Lemma 8. Construct the graph by deleting all but two edges between and the vertices , for each . Then, put all vertices in to . Return the instance
If the kernelization algorithm for Ext-IPOVD returns a trivial Yes or trivial No instance, it is straightforward to observe that the size of such instances is constant. Therefore, we now focus on the case where the kernelization produces a non-trivial instance . Assuming the original input instance is a Yes-instance, we will argue that the size of is bounded by . By correctness of the reduction rules, we have that is a Yes-instance if and only if is a Yes-instance. Assume that is a Yes-instance and let be a solution of size at most . Since no reduction rule increases the value of , it follows that . We now partition the set into the following subsets.
Lemma 9.
If the input instance of Ext-IPOVD is Yes instance, then , and .
Proof.
We assume that none of the reduction rules are applicable to the instance . We now proceed to bound the size of each individually.
Vertices in have exactly one neighbor, which must lie in . Since and, by Reduction 4, each vertex in can have at most such pendant neighbors, the size of is bounded by .
Next, observe that if or , then must lie on the spine of a caterpillar in the subgraph . Suppose there are vertices from in a caterpillar. Then, in the subgraph induced by these vertices and their neighbors, there exists a matching of size at least . Now consider a vertex that is adjacent to more than vertices of . In the graph induced by these neighbors and their adjacent vertices, we can obtain a matching of size at least , where each matching edge has at least one endpoint adjacent to . This would trigger Reduction 5, contradicting our assumption. Therefore, each vertex in can be adjacent to at most vertices of . By the same reasoning, each vertex in can also be adjacent to at most vertices of . Hence, when Reduction 5 is not applicable, the total number of vertices in each of and is bounded by .
Notice that, for every , there is a private vertex such that is an edge. Thus, if there are such vertices adjacent to vertices in , then . Moreover, in the subgraph of induced by these and their neighbors in , there exists a matching of size . Consequently, if there is a vertex adjacent to such vertices with neighbors in , then Reduction 5 would be applicable. Therefore, for each vertex in , there can be at most such vertices , implying that .
Observe that the vertices of and lie on the spine of a caterpillar in , such that neither they nor their pendant neighbors are adjacent to . Since Reduction 6 is no longer applicable, in any spine of a caterpillar in , there cannot be more than nine consecutive vertices from . Additionally, because Reduction 6 is no longer applicable, there must be at least one vertex from , , or after any sequence of at most nine consecutive vertices from in such a spine. Therefore,
Note that the neighbors of any vertex in belong to some with . Let be adjacent to some . As before, there can be at most such vertices (otherwise Reduction 5 would be applicable). For any vertex in with , there can be at most adjacent vertices in by Reduction 4. Therefore, in total, the size of is bounded by
Furthermore, if none of the reduction rules are applicable, then for any , we have: where . Therefore, the total number of vertices adjacent to is bounded by Hence, (note that according to our initial assumption, ).
Hence, finally we get
Similarly, the number of edges with one endpoint in and the other in is at most Since is a caterpillar forest, the total number of edges in is at most Finally, the number of edges in is bounded by Therefore, the total number of edges in is at most
Now we are ready to give our final reduction rule, the correctness of which follows from the above lemma.
Reduction Rule 9.
Let be the reduced instance after applying Reduction 1-8 exhaustively. If , or , then return trivial No instance.
It is easy to verify that every reduction rule can be executed within polynomial time. Therefore, we have a kernelization algorithm for Ext-IPOVD, that either correctly returns a trivial Yes instance or a trivial No instance, or produces an equivalent instance of size . On the reduced Ext-IPOVD instance, we apply Lemma 6 to construct an equivalent IPOVD instance whose size is also bounded by .
Now, we can conclude the kernelization algorithm with the following theorem.
Theorem 10.
IPOVD admits a kernel of size .
5 Kernel Lower Bound for CPOVD
In this section, we show that CPOVD does not admit a polynomial kernel when parameterized by the solution size , unless NP co-NP/poly. We establish this by providing a parameter-preserving reduction from the Connected Vertex Cover problem to CPOVD.
Theorem 11.
CPOVD does not admit polynomial kernel unless NP co-NP/poly.
Proof.
We give a straightforward parameter-preserving reduction from Connected Vertex Cover to CPOVD. In Connected Vertex Cover, we are given an undirected graph along with an integer and our goal is to check whether there is a subset of size at most such that is connected and contains at least one end-point of each edge. It is kwown that Connected Vertex Cover does not admit a polynomial kernel unless NP co-NP/poly [9].
For an instance of Connected Vertex Cover, we construct an instance of CPOVD as follows: and and we set . We claim that has a connected vertex cover of size at most if and only if has a connected pathwidth-one vertex deletion set of size at most .
In the forward direction, let is a solution to Connected Vertex Cover. Now consider the graph . As is vertex cover, in , the degree of every vertex is at most one, because at least one of or belongs to . Now as is a vertex cover of , the graph is a disjoint union of stars. Clearly, is connected since is connected. Hence is a connected pathwidth-one vertex deletion set of size at most in .
In the backward direction, let has a connected pathwidth-one vertex deletion set of size at most . We show that the vertex set is a connected vertex cover of of size at most . By the construction of , there is a cycle for every edge . Therefore, by Fact 1, every POVD solution must include at least one vertex from each . In other words, contains at least one vertex from for every edge . Suppose contains for some edge . Since is connected and is adjacent only to and in , must also include at least one of or . On the other hand, if does not contain for some edge , then it must contain at least one of or to cover the cycle. Thus, in all cases, contains at least one endpoint of every edge in , implying that is a vertex cover of of size at most . Finally, removing the vertices from does not disconnect the induced subgraph in or in , since each is either a leaf or part of a cycle in . Therefore, is a connected vertex cover of of size at most . This completes the proof.
6 FPT Algorithm for CPOVD
In this section, we obtain the following result for CPOVD.
Theorem 12.
CPOVD can be solved in time.
Overview.
By Fact 2, if a graph does not contain , , or as a subgraph, then each connected component of is either a tree or a cycle with zero or more hairs. To compute a set of vertices whose removal makes the graph free of , , and as subgraphs, we can simply identify such a structure in the graph by brute-force search (in time) and branch on it. This results in at most -way branching, leading to at most different possible paths from the root to the leaves in the branching tree. Also, notice that any solution to POVD as well as CPOVD must contain all the vertices of one such path as a subset. In this algorithm, we first branch on the forbidden structures (, , and ), and then we call the algorithm of Group Steiner Tree at most times (once for each branching path).
Before going into the details of the algorithm, let us see Group Steiner Tree problem.
| Group Steiner Tree | |
|---|---|
| Input: | An undirected graph , vertex-disjoint subsets , and . |
| Question: | Does contain a tree on at most vertices that intersects each , ? |
The following result is known for the Group Steiner Tree problem.
Theorem 13.
[22, Lemma 3] Group Steiner Tree can be solved in time.
Now we describe our algorithm for CPOVD in detail. For a given instance of CPOVD, we first check whether contains a , , or as a subgraph (by exhaustive search) and branch on such a structure if it exists. In each branching step, the parameter is decreased by one. And update the graph to and parameter to . We stop branching on an instance when either or contains none of , , or as a subgraph. Since contains 7 vertices, each branching step generate at most 7 sub-instances, leading to a 7-way branching. As decreases by one at each step, the height of the branching tree is at most , and the total number of root-to-leaf paths is bounded by .
Let denote the collection of vertex sets corresponding to each branching path, where each set consists of at most vertices removed along that path. That is, if , then there exists a root-to-leaf path in the branching tree such that is the set of deleted vertices, and . Clearly, , and for each , every connected component of is either a tree or a cycle with zero or more hairs. Now for each , we check whether there is a connected povd solution of size at most containing the vertex set . Let . Also Let be the vertex sets corresponding to the cycle components in . If then we immediately conclude that no such solution exists that contains the vertex set . Else, i.e., when , we invoke the Group Steiner Tree problem on the input graph with the vertex-disjoint subsets with the budget parameter . If Group Steiner Tree calls returns Yes, we return Yes for the input instance of CPOVD; else if it returns No, then we immediately conclude that no such solution that contains the vertex set . Now at the end, if for each , we have that no such solution exists that contains the vertex set we return No for the input instance of CPOVD. We provide the correctness of the above-described algorithm for an input instance of CPOVD in the Section A.2.
Proof of Theorem 12.
We show that our algorithm for CPOVD runs in time. Recall that the branching step is at most a 7-way branching, producing nodes in the branching tree. At each node of the branching tree, we spend only polynomial time, since detecting a , , or subgraph, or reporting their absence, can be done in polynomial time. Therefore, the total time spent in branching is . Since there are leaves in the branching tree, the number of distinct root-to-leaf paths is also bounded by , and thus . Consequently, we make at most calls to Group Steiner Tree. For any , we call Group Steiner Tree only if . By Theorem 13, each such Group Steiner Tree instance can be solved in , that is, in time. Therefore, solving all Group Steiner Tree instances requires at most time. Combining the branching and the Group Steiner Tree solving phases, the total running time is bounded by . Hence, the proof.
7 FPT Algorithm for IPOVD
In this section, we present an FPT algorithm for IPOVD with a running time of . This algorithm also begins by branching on occurrences of , , and subgraphs. Once this branching process finishes, we have at most root-to-leaf paths in the branching tree, and every minimal independent povd solution must fully include one of these paths. For each such path , we check whether it can be extended to a solution to IPOVD, where a povd is said to be a solution to IPOVD if and is an independent set in . This verification for each fixed can be done in polynomial time.
Algorithm.
We now describe the algorithm in more detail. Given an instance of IPOVD, we iteratively remove occurrences of , , and by branching. Whenever such a structure exists in , we branch over its vertices, reducing by one at each branching step. Branching stops when we reach to an instance where either or no longer contains any of these subgraphs. Once the branching tree is constructed, we process each root-to-leaf path as follows: we check whether forms an independent set, and whether every cycle in has at least one vertex with no neighbors in . If this is true, we select exactly one such vertex from each cycle and collect them into a set . If , we return as a valid IPOVD solution. If no such path extends to such a solution, we return that is a No instance of IPOVD.
Correctness.
As established earlier, there are at most different root-to-leaf paths in the branching tree. Let be the collection of these paths. Every minimal IPOVD solution must completely contain one of the paths in . Therefore, for each , we test whether it can be extended to a solution of size at most . Observe that this requires to be an independent set. Since contains no , , or , by Fact 2 each connected component of is either a tree or a cycle (possibly with pendant vertices). Let be the number of cycles in . Any solution must pick at least one vertex from each such cycle. Therefore, constructing by selecting one vertex per cycle with no neighbors in suffices. Since the cycles are disjoint, we do not need to worry about independence among vertices in . This establishes the correctness of the algorithm.
Running Time.
Constructing the branching tree takes time . For each of the paths , checking whether it extends to a valid solution requires only polynomial time. Thus, the total time is . Additionally, since a polynomial kernel of size exists for IPOVD, we first run kernelization on the input instance . If kernelization returns a trivial Yes or No instance, we return the corresponding answer Yes or No, respectively, to IPOVD. Otherwise, it produces an equivalent instance with vertices, to which we apply the algorithm above. This results in a total running time of . Hence, we obtain the following theorem.
Theorem 14.
IPOVD admits an FPT algorithm with running time .
8 Conclusion
We studied two variants of the POVD problem: the connected version (CPOVD) and the independent version (IPOVD). We showed that CPOVD is solvable in time but admits no polynomial kernel unless NP co-NP/poly. In contrast, IPOVD can be solved in time with a polynomial kernel of size . Improving these running times, reducing the kernel size for IPOVD, improving the polynomial-time approximation factor, or establishing lower bounds are interesting avenues for future research.
References
- [1] Ankit Abhinav, Satyabrata Jana, Nidhi Purohit, Abhishek Sahu, and Saket Saurabh. Parameterized complexity of feedback vertex set with connectivity constraints. In Rastislav Královic and Vera Kurková, editors, SOFSEM 2025: Theory and Practice of Computer Science - 50th International Conference on Current Trends in Theory and Practice of Computer Science, SOFSEM 2025, Bratislava, Slovak Republic, January 20-23, 2025, Proceedings, Part I, volume 15538 of Lecture Notes in Computer Science, pages 23–36. Springer, 2025. doi:10.1007/978-3-031-82670-2_3.
- [2] Akanksha Agrawal, Sushmita Gupta, Saket Saurabh, and Roohani Sharma. Improved algorithms and combinatorial bounds for independent feedback vertex set. In Jiong Guo and Danny Hermelin, editors, 11th International Symposium on Parameterized and Exact Computation, IPEC 2016, August 24-26, 2016, Aarhus, Denmark, volume 63 of LIPIcs, pages 2:1–2:14. Schloss Dagstuhl – Leibniz-Zentrum für Informatik, 2016. doi:10.4230/LIPIcs.IPEC.2016.2.
- [3] Vineet Bafna, Piotr Berman, and Toshihiro Fujito. A 2-approximation algorithm for the undirected feedback vertex set problem. SIAM J. Discret. Math., 12(3):289–297, 1999. doi:10.1137/S0895480196305124.
- [4] Fabián A. Chudak, Michel X. Goemans, Dorit S. Hochbaum, and David P. Williamson. A primal-dual interpretation of two 2-approximation algorithms for the feedback vertex set problem in undirected graphs. Oper. Res. Lett., 22(4-5):111–118, 1998. doi:10.1016/S0167-6377(98)00021-2.
- [5] Marek Cygan, Fedor V. Fomin, Łukasz Kowalik, Daniel Lokshtanov, Dániel Marx, Marcin Pilipczuk, Michał Pilipczuk, and Saket Saurabh. Parameterized algorithms, volume 5. Springer, 2015. doi:10.1007/978-3-319-21275-3.
- [6] Marek Cygan, Jesper Nederlof, Marcin Pilipczuk, Michal Pilipczuk, Johan M. M. van Rooij, and Jakub Onufry Wojtaszczyk. Solving connectivity problems parameterized by treewidth in single exponential time. In Rafail Ostrovsky, editor, IEEE 52nd Annual Symposium on Foundations of Computer Science, FOCS 2011, Palm Springs, CA, USA, October 22-25, 2011, pages 150–159. IEEE Computer Society, 2011. doi:10.1109/FOCS.2011.23.
- [7] Marek Cygan, Jesper Nederlof, Marcin Pilipczuk, Michal Pilipczuk, Johan M. M. van Rooij, and Jakub Onufry Wojtaszczyk. Solving connectivity problems parameterized by treewidth in single exponential time. ACM Trans. Algorithms, 18(2):17:1–17:31, 2022. doi:10.1145/3506707.
- [8] Marek Cygan, Marcin Pilipczuk, Michal Pilipczuk, and Jakub Onufry Wojtaszczyk. An improved FPT algorithm and a quadratic kernel for pathwidth one vertex deletion. Algorithmica, 64(1):170–188, 2012. doi:10.1007/s00453-011-9578-2.
- [9] Michael Dom, Daniel Lokshtanov, and Saket Saurabh. Incompressibility through colors and ids. In Susanne Albers, Alberto Marchetti-Spaccamela, Yossi Matias, Sotiris E. Nikoletseas, and Wolfgang Thomas, editors, Automata, Languages and Programming, 36th International Colloquium, ICALP 2009, Rhodes, Greece, July 5-12, 2009, Proceedings, Part I, volume 5555 of Lecture Notes in Computer Science, pages 378–389. Springer, 2009. doi:10.1007/978-3-642-02927-1_32.
- [10] Baris Can Esmer and Ariel Kulik. Sampling with a black box: Faster parameterized approximation algorithms for vertex deletion problems. In Keren Censor-Hillel, Fabrizio Grandoni, Joël Ouaknine, and Gabriele Puppis, editors, 52nd International Colloquium on Automata, Languages, and Programming, ICALP 2025, July 8-11, 2025, Aarhus, Denmark, volume 334 of LIPIcs, pages 39:1–39:20. Schloss Dagstuhl – Leibniz-Zentrum für Informatik, 2025. doi:10.4230/LIPICS.ICALP.2025.39.
- [11] Fedor V. Fomin, Fabrizio Grandoni, and Dieter Kratsch. Solving connected dominating set faster than 2. Algorithmica, 52(2):153–166, 2008. doi:10.1007/S00453-007-9145-Z.
- [12] Yoichi Iwata and Yusuke Kobayashi. Improved analysis of highest-degree branching for feedback vertex set. Algorithmica, 83(8):2503–2520, 2021. doi:10.1007/s00453-021-00815-w.
- [13] Satyabrata Jana, Daniel Lokshtanov, Soumen Mandal, Ashutosh Rai, and Saket Saurabh. Parameterized approximation scheme for feedback vertex set. In Jérôme Leroux, Sylvain Lombardy, and David Peleg, editors, 48th International Symposium on Mathematical Foundations of Computer Science, MFCS 2023, August 28 to September 1, 2023, Bordeaux, France, volume 272 of LIPIcs, pages 56:1–56:15. Schloss Dagstuhl – Leibniz-Zentrum für Informatik, 2023. doi:10.4230/LIPICS.MFCS.2023.56.
- [14] R. Krithika, Diptapriyo Majumdar, and Venkatesh Raman. Revisiting connected vertex cover: FPT algorithms and lossy kernels. Theory Comput. Syst., 62(8):1690–1714, 2018. doi:10.1007/S00224-017-9837-Y.
- [15] John M. Lewis and Mihalis Yannakakis. The node-deletion problem for hereditary properties is np-complete. J. Comput. Syst. Sci., 20(2):219–230, 1980. doi:10.1016/0022-0000(80)90060-4.
- [16] Jason Li and Jesper Nederlof. Detecting feedback vertex sets of size k in O (2.7k) time. ACM Trans. Algorithms, 18(4):34:1–34:26, 2022. doi:10.1145/3504027.
- [17] Shaohua Li and Marcin Pilipczuk. An improved FPT algorithm for independent feedback vertex set. Theory Comput. Syst., 64(8):1317–1330, 2020. doi:10.1007/s00224-020-09973-w.
- [18] Ching-Hao Liu, Sheung-Hung Poon, and Jin-Yong Lin. Independent dominating set problem revisited. Theor. Comput. Sci., 562:1–22, 2015. doi:10.1016/J.TCS.2014.09.001.
- [19] Daniel Lokshtanov, Fahad Panolan, Saket Saurabh, Roohani Sharma, and Meirav Zehavi. Covering small independent sets and separators with applications to parameterized algorithms. ACM Trans. Algorithms, 16(3):32:1–32:31, 2020. doi:10.1145/3379698.
- [20] Dániel Marx, Barry O’Sullivan, and Igor Razgon. Finding small separators in linear time via treewidth reduction. ACM Trans. Algorithms, 9(4):30:1–30:35, 2013. doi:10.1145/2500119.
- [21] Neeldhara Misra, Geevarghese Philip, Venkatesh Raman, and Saket Saurabh. On parameterized independent feedback vertex set. Theor. Comput. Sci., 461:65–75, 2012. doi:10.1016/j.tcs.2012.02.012.
- [22] Neeldhara Misra, Geevarghese Philip, Venkatesh Raman, Saket Saurabh, and Somnath Sikdar. FPT algorithms for connected feedback vertex set. J. Comb. Optim., 24(2):131–146, 2012. doi:10.1007/s10878-011-9394-2.
- [23] Daniel Mölle, Stefan Richter, and Peter Rossmanith. Enumerate and expand: Improved algorithms for connected vertex cover and tree cover. Theory Comput. Syst., 43(2):234–253, 2008. doi:10.1007/S00224-007-9089-3.
- [24] Geevarghese Philip, Venkatesh Raman, and Yngve Villanger. A quartic kernel for pathwidth-one vertex deletion. In Dimitrios M. Thilikos, editor, Graph Theoretic Concepts in Computer Science - 36th International Workshop, WG 2010, Zarós, Crete, Greece, June 28-30, 2010 Revised Papers, volume 6410 of Lecture Notes in Computer Science, pages 196–207, 2010. doi:10.1007/978-3-642-16926-7_19.
- [25] Johannes Rauch, Dieter Rautenbach, and Uéverton S. Souza. Exact and parameterized algorithms for the independent cutset problem. J. Comput. Syst. Sci., 148:103598, 2025. doi:10.1016/J.JCSS.2024.103598.
- [26] Neil Robertson and Paul D. Seymour. Graph minors. i. excluding a forest. J. Comb. Theory B, 35(1):39–61, 1983. doi:10.1016/0095-8956(83)90079-5.
- [27] Neil Robertson and Paul D. Seymour. Graph minors. II. algorithmic aspects of tree-width. J. Algorithms, 7(3):309–322, 1986. doi:10.1016/0196-6774(86)90023-4.
- [28] Yinglei Song. An improved parameterized algorithm for the independent feedback vertex set problem. Theor. Comput. Sci., 535:25–30, 2014. doi:10.1016/j.tcs.2014.03.031.
- [29] Dekel Tsur. Faster algorithm for pathwidth one vertex deletion. Theor. Comput. Sci., 921:63–74, 2022. doi:10.1016/j.tcs.2022.04.001.
Appendix A Appendix
A.1 Correctness of Reduction Rules 4-8
Lemma 15.
Reduction 4 is correct.
Proof.
Let denote the set of all pendant neighbors of in , and let be the set of pendant neighbors of that are deleted from to obtain . Since is a subgraph of and , if is a Yes instance, then is also a Yes instance. For the reverse direction, assume that is a Yes instance. Then there exists a solution for such that and . We claim that is also a solution for and satisfies .
Suppose, for the sake of contradiction, that is not a solution for . Then there exists a structure (say, ) in such that . Since is a solution for , it follows that must contain some vertex . Because and , there exists a vertex such that . Let . Then is also a in satisfying , which contradicts the assumption that is a solution for .
Finally, since , , and , we have . Therefore, is a solution for of size at most and disjoint from , implying that is a Yes instance. This completes the proof.
Lemma 16.
Reduction 5 is correct.
Proof.
Notice that, in this case, if there is a solution of size at most , then that solution must include . Otherwise, to eliminate all the in , we would need to delete strictly more than vertices. Thus, if , there cannot be a size solution disjoint from . On the other hand, if , then deleting , decreasing by one, and adding to is safe, since necessarily belongs to any valid solution.
Lemma 17.
Reduction 6 is correct.
Proof.
First, we assume that is a Yes instance. We want to show that is also a Yes instance. Let be an ipovd of size at most disjoint from . There are two cases to consider: either or exactly one of and belongs to . If , then the same set is an ipovd for of size at most disjoint from , since the contraction does not create any new cycles or new . Next, suppose exactly one of or belongs to . Then the contracted vertex in . Notice that and cannot be part of any , and if they belong to any cycle, then , , , and all participate in that cycle. Therefore, if either or is in , then the set is an ipovd for of size at most disjoint from . Otherwise, the set is an ipovd for of size at most disjoint from .
In converse direction, assume that is a Yes instance, and let be a solution for of size at most . If , then by Observation 7, the set is a solution for of size at most . Otherwise, if , then at least one of or was not in in . Let that vertex be , where is either or . Then again by Observation 7, the set is a solution for of size at most .
Lemma 18.
Reduction 7 is correct.
Proof.
The correctness of Reduction 7 is quite similar to the correctness of Reduction 5, since any POVD solution must cover all cycles, and thus any size POVD solution must include in this case. Therefore, if , there cannot be a size POVD solution. Otherwise, if , then deleting , decreasing by one, and adding to is safe, as necessarily belongs to any valid solution.
Lemma 19.
Reduction 8 is correct.
Proof.
First, we will show that if is a minimal ipovd of size for disjoint from , then is also a ipovd of size for and remains disjoint from . To establish this, it suffices to show that .
For contradiction, suppose that for some . Since is an independent set, both and are not in . Also observe that in , there are vertex-disjoint paths between and , each passing through a different vertex of . This implies that at least vertices of must be included in , which contradicts the assumption that . Therefore, we have .
For the converse direction, let be an ipovd of size for the graph , disjoint from . We need to show that is an ipovd for as well. Note that by the way Reduction 8 has been applied, for each , there exist indices and such that the cycle is present in . To hit this cycle, at least one of the vertices or must belong to . Since , the set is independent and remains disjoint from .
Now, suppose for contradiction that is not a ipovd in . Then there must be some cycle or in . Since there was no such cycle or in , any such structure in must include an edge of the form for some . However, for each , there are exactly two such edges in , and their presence would imply a cycle or in , contradicting our assumption. Hence, is indeed a ipovd of size for disjoint from .
A.2 Missing proof details for Theorem 12
Correctness.
We will argue the correctness of the above-described algorithm for an input instance of CPOVD in the following two claims.
Claim 20.
The algorithm returns Yes if and only if is a Yes instance.
Proof.
If the algorithm returns Yes, then at least one call to Group Steiner Tree must have returned Yes, corresponding to some . If is a solution to the Group Steiner Tree instance for this call, then by the way the instance was constructed, we have , and also contains at least one vertex from each cycle in . Thus, is acyclic and -free (since is , , and -free). Moreover, by the properties of the Group Steiner Tree solution corresponding to , is connected and . Hence, is a valid solution for CPOVD.
If the algorithm returns No because for each either or the algorithm return No when it call to corresponding Group Steiner Tree. In the first case, if , then there cannot be a size- pathwidth-one vertex deletion set, since any such set must include all vertices of some together with at least one vertex from each of the cycles in , exceeding the budget . If second case, the algorithm must made a call to Group Steiner Tree and returned No, Towards contradiction, suppose there exists a solution to CPOVD for . Then there must be some such that . Moreover, must contain at least one vertex from each cycle in . Since is a solution to CPOVD, we have that is connected and . This implies that would be a solution to the Group Steiner Tree instance corresponding to , contradicting the assumption that every call to Group Steiner Tree returned No.
