All-Subsets Important Separators with Applications to Sample Sets, Balanced Separators and Vertex Sparsifiers in Directed Graphs
Abstract
Given a directed graph with vertices and edges, a parameter and two disjoint subsets , we show that the number of all-subsets important separators, which is the number of - important vertex separators of size at most over all and , is at most , where , and that they can be enumerated in time . This is a generalization of the folklore result stating that the number of - important separators for two fixed sets and is at most (first implicitly shown by Chen, Liu and Lu Algorithmica ’09). From this result, we obtain the following applications:
-
1.
We give a construction for detection sets and sample sets in directed graphs, generalizing the results of Kleinberg (Internet Mathematics’ 03) and Feige and Mahdian (STOC’ 06) to directed graphs.
-
2.
Via our new sample sets, we give the first FPT algorithm for finding balanced separators in directed graphs parameterized by , the size of the separator. Our algorithm runs in time .
-
3.
Additionally, we show a approximation algorithm for finding balanced separators in directed graphs in polynomial time. This improves the best known approximation guarantee of and matches the known guarantee in undirected graphs by Feige, Hajiaghayi and Lee (SICOMP’ 08).
-
4.
Finally, using our algorithm for listing all-subsets important separators, we give a deterministic construction of vertex cut sparsifiers in directed graphs when we are interested in preserving min-cuts of size upto between bipartitions of the terminal set. Our algorithm constructs a sparsifier of size and runs in time , where is the number of terminals, and the sparsifier additionally preserves the set of important separators of size at most between bipartitions of the terminals.
Keywords and phrases:
directed graphs, important separators, sample sets, balanced separatorsCategory:
Track A: Algorithms, Complexity and GamesFunding:
Euiwoong Lee: Supported in part by NSF grant CCF-2236669 and Google.Copyright and License:
![[Uncaptioned image]](x1.png)
2012 ACM Subject Classification:
Theory of computation Fixed parameter tractability ; Theory of computation Graph algorithms analysisEditors:
Keren Censor-Hillel, Fabrizio Grandoni, Joël Ouaknine, and Gabriele PuppisSeries and Publisher:

1 Introduction
The study of parameterized algorithms, or fixed-parameter tractability (FPT) (see [9, 12]) along with the classical area of approximation algorithms, has emerged as one of the most promising ways to cope with NP-completeness. Given a decision problem with input size , together with a parameter , a parameterized (or FPT) algorithm is an algorithm running in time that decides . Over the last few years, parameterized algorithms have been studied for most well-studied NP-complete problems. A particular focus has been on graph cut optimization problems, including Multiway Cut, Multicut, Minimum Bisection, Feedback Vertex Set, Balanced Separator [28, 15, 5, 30, 11, 10, 22]. One of the most important tools for graph cut problems has been the technique of important separators (see [29] for a brief survey), which has formed a building block in parameterized algorithms for many of these and other problems [28, 5, 6, 7, 24, 25].
In this work, our contribution is two-fold. First, we show a new structural result on important separators. Next, we show that using this result, combined with other techniques, leads to interesting consequences. We show that one can compute sample sets in directed graphs. This in turn allows us to obtain both an FPT algorithm and an improved approximation algorithm for finding small balanced separators in directed graphs. We also show a deterministic construction of vertex cut sparsifiers in directed graphs. All our algorithms are simple and concise modulo standard results in approximation and parameterized algorithms.
1.1 All-Subsets Important Separators
Important separators have proved to be a very important tool in the design of many FPT algorithms, including fundamental problems such as Multiway Cut, Multicut, Directed Feedback Arc Set. We refer to Chapter 8 of [9] for various applications of important separators to design parameterized algorithms. Given two disjoint subsets of vertices and another set of vertices (which may intersect ) in a directed graph, we say that is an - separator if there is no directed path from any vertex of to any vertex of in 111 denotes the graph obtained from by deleting the vertex set along with its incident edges. For such a separator , define to be the set of vertices reachable from in . Then an - separator is called an important - separator if it is (a) inclusion-wise minimal - so that for any , is not an - separator and (b) for any other - separator with , we do not have .222Throughout the paper, we use the notation to mean that is a proper subset of . Informally, important separators are those for which there is no other separator which is further away from without increasing the cut-size. Marx [28] first introduced the notion of important separators and showed a bound of on the number of important - separators of size at most . Later this bound was improved to (implicit in the FPT algorithm for Vertex Multiway Cut by Chen, Liu and Lu [6])333While these results focused mostly on the undirected version, the same proof works in directed graphs.. Over the last few years, this result has been used extensively to obtain FPT algorithms for various central parameterized problems [30, 7, 11, 25]. In this work, we show a new structural result on important separators in directed graphs: Given a directed graph and two disjoint subsets , we show that the number of - important separators across all and is at most , where . Note that the trivial bound is , which follows directly from the fact that there are at most important - separators for any fixed . Previously, no such result is known even for undirected graphs.
Throughout this paper, given a graph, we will denote by the number of vertices and by the number of edges/arcs.
Theorem 1 (All-Subsets Important Separators).
Let be a digraph, be a positive integer and let and be disjoint sets of source and sink vertices. Then there are at most - important separators of size across all and . Further, they can be enumerated in time .
It is not difficult to show that this result is essentially tight: we show this formally in the next lemma, whose proof is deferred to the appendix.
Lemma 2.
For any positive integer , the following two statements hold.
-
1.
There exist infinitely many positive integers , such that for each , there is a directed graph and disjoint subsets of vertices with and so that there are at least - important separators of size at most across all choices , .
-
2.
There exist infinitely many positive integers , such that for each , there is a directed graph and disjoint subsets of vertices with and so that there are at least - important separators of size at most across all choices , .
In the next few subsections, we show that this simple structural result has various interesting consequences, allowing us to obtain many new results which were previously only known for undirected graphs.
1.2 Detection Sets and Sample Sets in Directed Graphs
Kleinberg [18] introduced the concept of a detection set in a graph. The principal motivation for this concept was that of a network failure: given a network, can we compute a small set of representative nodes so that for any small set of failure nodes that cuts communication between two large subsets of nodes, there are two representatives that cannot communicate? Formally, Kleinberg defined an detection set for an undirected graph as a set of terminals which satisfies the following property. First, define a network failure as a set of vertices with so that can be partitioned into where and there are no edges between and . Then for every such (vertex) failure set , the set must intersect with at least two components of . Kleinberg [18] showed that there is a detection set of size . For the edge failure version, he showed a bound of which was subsequently improved to by Kleinberg et al. [19], where is the size of the global minimum cut in the graph. Feige and Mahdian [15] showed an improved bound on (vertex) detection sets: they show a bound of . Further, they showed a bound of for the edge failure version, removing the dependence on .
Feige and Mahdian [15] also studied a strengthening of this notion of detection sets called sample sets for undirected graphs: on a high level, these are a small set of terminals which represent all the small cuts of the graph in proportion to their size, up to some additive error. Concretely, given an undirected graph and a parameter , they show that there exists a set of terminals with , such that for any vertex set of size and every connected component of , we have . Further, the set can be obtained by simple random sampling, and [15] shows that any random subset of size is a sample set with constant probability.
The most important feature of these results is that the size of the detection set (or sample set) does not depend on . It is then natural to ask if such results are possible for directed graphs. Given a digraph , let us define a network failure as a set of vertices with , so that can be partitioned into two parts and , each of size at least , such that there is no arc from to . The analogous question for detection sets is then: is there a set of terminals with , such that for any network failure there exists a pair so that there is no path from to in ? Similarly, the analogous question for sample set becomes: is there a set of terminals with for some function , so that for any set of vertices with , and any strongly connected component (SCC) of , we have ? We answer both these questions in the affirmative, showing that one can in fact asymptotically match the same bound as in the undirected case, with and .
Theorem 3.
For any directed graph , given parameters and , there is an absolute constant such that there is an detection set of size . Further, a random set of vertices must be an detection set with probability at least .
Theorem 4.
For any directed graph , given parameters and , there is an absolute constant such that there is an sample set of size . Further, a random set of vertices must be an sample with probability at least .
While we believe that this is of independent interest, we also show a similar connection to parameterized and approximation algorithms for finding balanced cuts in directed graphs along the lines of the undirected case as in [15], as discussed in the following subsections. In fact, by using a slightly more nuanced analysis, our results generalize that of [15] even for undirected graphs.
Finally, we note that one can easily prove that there exists an absolute constant such that a random subset of size is a sample set with constant probability in a directed graph. This follows from a simple application of Chernoff bounds and a union bound noting that the number of vertex sets of size at most is at most ( in the case of edge sets). However, for most applications this bound is too weak. For instance, our parameterized algorithm for Directed Balanced Separator has an exponential dependency in the size of the sample set, and hence to show results parameterized by , it is essential that the size of the sample set does not depend on .
1.3 Directed Balanced Cuts
One of the most well-studied graph partitioning problems, both from the parameterized and approximation algorithms point of view, is the problem of Minimum Bisection. Given an undirected graph and a parameter , the goal of the Minimum Bisection problem is to obtain a partition of the graph into two equal sized parts, such that the number of cut edges is at most . Minimum Bisection was shown to be fixed-parameter tractable by Cygan et al. [11] and the current best parameterized algorithm is due to [10] who show an algorithm with running time . Räcke [33] gave an approximation for (the optimization version of) Minimum Bisection. If only an approximate bisection where both sides have vertices is desired, then this problem is essentially the Balanced Separator problem which is known to have an FPT algorithm running in time due to Feige and Mahdian [15] and an approximation algorithm in polynomial time using the seminal result of Arora, Rao and Vazirani [3]. Feige et al. [14] showed that in fact this guarantee can be made , and also showed that one can compute vertex separators with the same approximation ratio. In directed graphs, the Minimum Bisection problem has been typically studied as Directed Bisection: Given a directed graph with even number of vertices, is it possible to partition the vertex set into two equal parts and so that the number of arcs from to is at most ? This question was first raised by Feige and Yahalom [16]. When , they referred to this problem as Oneway Cuts, and showed that even this problem is NP-hard. However, given a Oneway Cuts instance which admits a solution, if one relaxes the requirement so that the algorithm can output a partition of such that there are no arcs directed from to and where , the problem now becomes tractable, and they show a polynomial time algorithm for Oneway Cuts. Madathil et al. [27] showed that the Directed Bisection problem is FPT with respect to , even when one requires exactly, on a subclass of directed graphs called semi-complete digraphs, which are the class of directed graphs where for every pair of vertices and , there is an arc from to or an arc from to . In terms of approximation algorithms, Agarwal et al. [1] showed an approximation for Directed Balanced Separator , the directed analogue of Balanced Separator, while Even et al. [13] showed an approximation, which is the best known approximation guarantee depending only on .
However, there is no prior work on the fixed-parameter tractability of Directed Balanced Separator or Directed Bisection for general on general directed graphs, even when we relax the requirement of finding a bisection to that of finding an approximate bisection, that is, find a partition with so that the number of arcs from to is at most .
Our result makes the first progress on this problem. Before we state our results, we define the Directed Balanced Separator problem formally. Since all our results work for both the vertex and edge versions, we state them together as one problem. We adapt our definition from the definition of Balanced Separator in [15], whose results we generalize. For completeness, we recall the definition of Balanced Separator in [15].
Balanced Separator Parameter: Input: Undirected graph Question: Is there a set of vertices (edges) with , so that in , every connected component has size at most ?
Directed Balanced Separator Parameter: Input: Directed graph Question: Is there a set of vertices (arcs) with , so that in , every strongly connected component has size at most ?
For the sake of clarity and comparison, we state the main result of [15]. Given an undirected graph , we say that a set of vertices/edges is a -balanced separator if every connected component of has size at most .
Theorem 5 ([15]).
Given an instance of Balanced Separator with there is a randomized algorithm, that for any , runs in time and with constant probability outputs either (a) a set of vertices (edges) of size at most such that every connected component of has size at most or (b) concludes correctly that there is no -balanced separator of size at most .
The following theorem, which directly generalizes the result of Theorem 5 is our main result. Given a directed graph , we say that a set of vertices/arcs is a -balanced separator if every strongly connected component (SCC) of has size at most .
Theorem 6.
Given an instance of Directed Balanced Separator there is a randomized algorithm, that for any , runs in time and with constant probability outputs either (a) a set of vertices (arcs) of size at most such that every strongly connected component of has size at most or (b) concludes correctly that there is no -balanced separator of size at most .
We observe that our algorithm has a running time of for any , matching the run-time of Theorem 5 for while also extending to any parameter . We also observe that Theorem 6 implies Theorem 5, since given any undirected graph , one can create a directed graph on the same vertex set, so that for every edge , we have the two arcs and we can apply Theorem 6 to obtain Theorem 5.
Our algorithm for Directed Balanced Separator can be used to solve (approximate) Directed Bisection as well. To see this, given a graph , observe that in any bisection so that the number of arcs going from to is at most , the set of these at most arcs form a -balanced separator, so that in , every strongly connected component is of size at most . Using Theorem 6 with , we can find a set of arcs with that forms a balanced separator. Finally, note that the strongly connected components of the graph form a Directed Acyclic Graph (DAG), and each strongly connected component of has size at most . It follows that there is a topological ordering of these strongly connected components of , such that there is no arc from to for with . Therefore we can pick some prefix of strongly connected components in the topological ordering of to obtain a set , so that both , and in , there are no arcs from to . It follows that the set of arcs forms an (approximate) directed bisection. Note that the approximation is only in the balance, not in the number of arcs cut.
Next, we show a approximation for Directed Balanced Separator. This improves both the approximation of [1] and the approximation given by Even et al. [13] for approximating Directed Balanced Separator in polynomial time.
Theorem 7.
There is an approximation to Directed Balanced Separator in polynomial time. Formally, given an instance of Directed Balanced Separator with , suppose there is a set of vertices (arcs) with , so that every strongly connected component of has at most vertices. Then there is a polynomial time randomized algorithm that with constant probability finds a set of vertices (arcs) with so that in , every strongly connected component has size at most for some depending on .
1.4 Vertex Cut Sparsifiers
Vertex sparsification is a fundamental problem in various settings. Broadly, given a graph and a terminal set , a vertex sparsifier is a smaller graph (with size typically depending only on ) that preserves some (cut based) property of the terminals . Moitra [32] first introduced a version of vertex sparsification in undirected graphs. Chuzhoy [8] generalized this notion by allowing Steiner nodes in the sparsifier. Both these notions preserve edge cuts between bi-partitions of the terminal set.
For our setting, we focus on directed graphs and vertex cuts. Given a directed graph and a set of terminals along with an integer , a vertex cut sparsifier for is another graph with , so that for every partition of (so that and ), if the size of an - vertex min-cut is at most in , then the size of an - vertex min-cut is the same in both in and . In other words, the goal is to find a vertex sparsifier that preserves vertex min-cuts of size at most between sets of terminals. Here we allow the deletion of terminals as well.
Kratsch and Wahlström [20, 21] first studied this notion of vertex sparsification for vertex cuts (without the parameter ) and showed that given , there is a randomized polynomial time algorithm that obtains a vertex sparsifier for with . This bound was improved to by [17] for the special case of directed acyclic graphs. Their algorithm runs in linear time for fixed , but is still randomized as it needs to compute a representation of gammoids.
One can then ask the question as to what is known about deterministic algorithms. Recently, Misra et al. [31] showed that a representation of a gammoid with rank over a ground set of elements can be constructed in time deterministic time. The technique of [20] needs to compute a representation of a gammoid whose ground set has size , the number of vertices and rank , the number of terminals. We therefore obtain the following result.
Theorem 8 ([20], [31]).
Given a -vertex graph and a terminal set , there is a deterministic algorithm that runs in time and computes a vertex cut sparsifier for of size at most .
However, this algorithm (which is an XP algorithm in the notation of parameterized complexity) can be easily improved to a (still deterministic) FPT algorithm. The reason for this is simple: there are at most partitions of the terminal set . For each partition , we can compute an directed min-vertex cut . Note that this min-cut is of size at most , since one can simply delete all the terminals to obtain a cut. This gives a set of vertices. It can now be shown that we can apply the closure operation to the set , where we simply delete all vertices of , and for each vertex pair such that there are vertices with with we add an arc . This results in a sparsifier of size . One can now set in Theorem 8 to obtain a sparsifer of size in time . The total running time is .
A similar line of research considers edge and vertex cuts in undirected graphs. For edge cuts in undirected graphs, Chalermsook et al. [4] showed an upper bound of which was later improved to by Liu [23]. Both these algorithms are randomized. Saranurak and Yingchareonthawornchai [34] considered the vertex version in undirected graphs, and gave a deterministic algorithm to compute a sparsifier of size in time .
However, no improvement on the FPT algorithm with running time based on Theorem 8 is known for deterministic algorithms for vertex cut sparsifiers in directed graphs. We show the following result, which shows that if one is interested in preserving cuts of size at most , then a better deterministic algorithm is possible.
Theorem 9.
Given a directed graph , a set of terminals and integer , there is a deterministic algorithm that runs in time and computes a vertex sparsifier for of size at most , where . Additionally, satisfies , with the property that for every partition of and every subset with , is an - important separator in if and only if and is an - important separator in .
While the dependence on in the exponent for the size of the sparsifier seems undesirable, Theorem 9 shows that the vertex sparsifier constructed by our algorithm is more powerful: it in-fact “preserves” all - important separators for any partition of . Additionally, if one wants only a vertex sparsifier, we note that simply running the algorithm in Theorem 8 after applying our result in Theorem 9 gives a vertex sparsifier of size , but now the running time is . Thus, together, we obtain a vertex cut sparsifier of size in time .
Corollary 10.
Given a graph and a terminal set , there is a deterministic algorithm that runs in time and obtains a vertex cut sparisifer for with vertices.
1.5 Techniques and Overview
In this section we give a high-level overview of our techniques. For the sake of simplicity and clarity we sometimes omit small details in this section.
All-Subset Important separators
Our result on counting all-subset important separators is obtained using the connection between important separators and closest sets. For simplicity, in this overview, suppose that we are given a single source and a set of target vertices . Our goal in this simplified setting shall be to bound the number of important separators of size exactly across all , and we will show a weaker bound of . Before we describe the main idea, we remark that our eventual goal will be to show that every important separator of size for some is an important separator for some with . It is easy to see that this suffices, since there are only choices for , and there are at most such important separators of size for a fixed choice of .
Fix , and consider an important separator of size . Consider the (vertex) min-cut between and 444Throughout this paper, we allow deletion of source and sink vertices in vertex cuts. If this min-cut has size strictly less than , cannot be important: would allow more vertices to be reachable from while having a size strictly less than . This means that by itself must be a min-cut. Using simple cut-flow duality, it must be the case that there are vertex disjoint paths from to (some) vertices in .
However, this flow property can be strengthened: it is easy to see that we can in fact assume that every such important separator is the min-cut “closest” to . (We say is closest to if is the unique min-cut.) But it can be shown that closest sets have a stronger flow property: we can now conclude that for every vertex , there are paths from to , which are vertex disjoint except that two of these paths both start at (Lemma 12).
Why does this help? Fix a , and fix the paths from to . Suppose the endpoints of these paths are . Consider any set of vertices that is a separator of size that is “closer” to than . (Here “closer” means that the set of vertices reachable from in is a superset of that in ). Then must delete ! For if not, since there are paths from to which are vertex disjoint except paths which begin at , and , cannot be a separator.
Applying the above argument for every and letting allows us to conclude that there is no separator that is closer to than . Thus, in fact, among all the sets of size at most separating from , is a set which is closest to . This means must be an important separator. But , and hence we are done. Our actual result uses a slightly more subtle argument using augmenting paths to obtain such a with .
Here we remark that Lemma 4.10 of Lokshtanov et al. [26] showed a similar result: it claims that there is a set with such that any important separator is an important separator. However, there is a gap in the proof which leads to an unavoidable factor of loss. Indeed, as we show in Lemma 30, there is a graph , source vertex and for which is necessary, and hence our result is tight. Our high-level idea is flow-based, similar to [26]. But as described above, the direct application of this flow-based idea gives a bound of , which we refine using an augmenting path based argument to obtain the tight bound of .
Detection sets and Sample sets for Directed Graphs
To obtain detection sets and sample sets for directed graphs, on a high level, we follow the approach of Feige and Mahdian [15]. We focus on sample sets in this brief overview. First, we consider the family of sets that arise as a strongly connected component after deleting an arbitrary vertices in the given digraph. Our key contribution is to show that this family has VC-dimension . Using the standard results on -samples, similar to that in [15], it then follows that a random set of vertices is a sample set with constant probability. Thus the question becomes: how can we bound the VC-dimension of the family of sets formed by strongly connected components after deleting some vertices? We note that the techniques in [15] to bound the VC-dimension do not extend to directed graphs, and hence a different approach is needed. We accomplish this by showing a connection to our result on all-subset important separators.
Recall the definition of VC-dimension: given a set family on a universe , the VC-dimension is the size of the largest set shattered by . Here is shattered by if for every subset there exists an with . Thus to prove a bound of for VC-dimension for our case, it is enough to show that there exists some constant so that any set of terminals with cannot be shattered by the set family which consists of the strongly connected components in across all sets with . Fix any of size for large enough which we choose later. For the sake of contradiction, assume that the set can be shattered. Fix a “pattern” , . Since can be shattered by , there exists a set of at most vertices, so that in , there is a strongly connected component that satisfies . Fix some terminal . Then it must be the case that is the exact set of terminals that can both reach and can be reached from in . Equivalently, it is the exact set of terminals that can be reached from in both and , where is obtained by reversing every arc of . This motivates us to define for each , the collection of subsets of terminals that can be reached from after deleting a set of size at most . Formally,
there exists , such that is reachable from | |||
If we can bound for any graph by a function , then applying this result twice, once on and once on , will bound the number of such patterns with by . Then a simple union bound over all will give that the number of patterns is at most . If this is less than whenever , then we have a contradiction to the fact that can be shattered. Thus the key question is to obtain , a good bound on .
We do this as follows. Suppose some . Then there exists a set of at most vertices after whose deletion the set of reachable terminals from is exactly . We then show that there is a important separator of size at most whose deletion gives the same reachability set as that after deleting . But now using our bound on important separators, we can show that . It is then easy to see that whenever for some large enough (absolute) constant .
FPT algorithm for Directed Balanced Separators
To obtain the algorithm for Directed Balanced Separator, we follow the approach of [15] in the undirected case. For simplicity, in this overview, we work with vertex separators, assume that , and we will only return a -balanced separator (as opposed to the balance guaranteed in our result). We are given that there is a set of vertices with , so that every strongly connected component of has at most vertices. We first compute an sample set of size using our result on sample sets, Theorem 4. Since is a sample set, each SCC of has at most terminals. We will try to find a set of vertices such that and every SCC of has at most terminals. The property of sample sets will then again imply that is a -balanced separator.
Consider the toplogical ordering of the SCC’s of , so that there is no arc from to for with . Consider the smallest index so that the union contains at least terminals. Since no component contains more than terminals, it follows that contains at most terminals. Also, by definition, contains at most terminals.
The algorithm proceeds as follows. First, guess and . This can be done in time since . Next, we compute a directed min-vertex cut between and . Since is a vertex cut between these sets of size at most , this min-cut must be of size at most as well. But then the set of vertices is a -balanced separator with respect to the set . Using the property of sample sets, it follows that is a -balanced separator in .
In order to make the loss in the balance factor only an additive we need a slightly more sophisticated argument. We accomplish this using a reduction to the Skew Separator problem, where we are given pairs , and the goal is to separate from all , , for each , by deleting minimum number of vertices. This problem, which is a special case of Directed Multicut, was first defined by [5] in their FPT algorithm for Directed Feedback Arc Set.
Approximation algorithm for Directed Balanced Separators
Our approximation algorithm proceeds similar to the FPT algorithm. The only key technical difference is that to compute a balanced separator with respect to the terminal set, we cannot guess where the terminals of the sample set are (this requires FPT time); instead we use the algorithm of [1] to obtain a balanced separator with respect to the sample set. While the algorithm of [1] has an approximation ratio of , when we need a balanced separator with respect to a terminal set , this algorithm can be made to work with an approximation ratio of . On a high level, the reason is quite simple: the structure theorem of Arora, Rao and Vazirani [3] is a statement about vectors. As long as we apply the structure theorem to only vectors, we get an approximation ratio of .
Vertex Sparsifiers
Our result on vertex sparsifiers is similar in spirit to the result by Kratsch and Wahlström [20] and proceeds by identifying irrelevant vertices. However instead of using techniques based on matroids, we use our result on all-subsets important separators. Given a terminal set , we are interested in preserving cuts across partitions of the terminal set of size up to . We will define an irrelevant vertex as a vertex that is (a) not in and (b) not in any - important separator of size for any partition of . By our result on all-subsets important separators, we can bound the number of relevant (non-irrelevant) vertices by , and identify this set in time . Thus at least vertices are irrelevant. Let be the set of irrelevant vertices. We now apply the standard operation of “closing” the irrelevant vertices , which is to delete each , and for every pair where is an in-neighbour of and is an out-neighbour of for some , to add the arc . This operation is the vertex equivalent of edge contraction. The key observation here is that the closure operation can be applied to the entire set at once as opposed to applying it to one vertex of and restarting the whole algorithm. This helps us achieve a linear-time algorithm. Thus we obtain another graph with , where .
In this short version, in the interest of space, we present our result on all-subsets important separators and our result on sample sets. Our other results can be found in the linked full version.
2 All-Subsets Important Separators
The goal of this section is to prove our result on all-subsets important separators. See 1
As described in the introduction, our proof proceeds by capturing the relationship between important separators and closest sets. We then use well-known connectivity properties of closest sets to obtain our result.
Definition 11 (Closest set).
For sets of vertices , is closest to if is the unique vertex mincut between and .
We use the following well-known fact about closest cuts.
Lemma 12 (Lemma 18 of [17]).
is closest to if and only if for each vertex , there are paths from to that are vertex-disjoint except at , which appears in exactly two of these paths.
Definition 13 (Minimal separators).
Given such that , is called an - (inclusion-wise) minimal separator if in , there is no path from any to any , and further, for every , there is a path between some and in .
Definition 14.
Given , the reachability set of the source vertices after removing , that is, the set of all vertices reachable from some vertex of via a directed path after removing , is denoted by .
Definition 15.
Given with , a minimal - separator is called an - important separator if for every - separator with , does not hold.
We will drop the subscript or the superscript in when the graph or the set of source vertices is clear from the context.
Informally, the idea of important separators is similar to that of a closest set: they are separators which cannot be pushed “closer” to the sink, without increasing the cut-size. The next lemma captures this relationship formally.
Lemma 16.
Given , is an important - separator if and only if (a) is a minimal - separator and (b) is closest to .
Proof.
Notice that by definition, any important - separator must be a minimal - separator. First, suppose to the contrary that is an important - separator but is not closest to . Let be a vertex min-cut between and with . Since is not closest to , such a indeed does exist. Clearly, . We claim that , which will contradict that is an important separator. First, let us show that . Suppose for some vertex , but . Since is a min-cut, there are vertex disjoint paths between each vertex and some vertex such that no internal vertex of these paths is from . Consider a path from some vertex to in . There must be such a path, since . Let be a vertex of on this path: there has to be such a vertex since . Then is reachable from in . Also, there is a path from to which does not contain any vertex of . It follows that there is a path from to in , which is a contradiction since is an - separator. Since is a minimal - separator and , it follows that there exists a , and hence .
Next, suppose that is both closest to and is also a minimal - separator. Assume for the sake of contradiction that is not important. Since is not important, there is another separator with which is a minimal separator, and further . Since , there exists a . By Lemma 12, for every vertex there exists a collection of paths from to which are vertex disjoint except at , which appears twice. Since does not delete , it follows that there is at least one such path from some vertex to some vertex which survives in . We next show that there must be a path from some vertex of to in and this will imply a path from to in , a contradiction.
Consider any path from some vertex to , all of whose internal vertices are not in . Such a path must exist, for otherwise will not be a minimal - separator since will also be an - separator. Every internal vertex on this path is reachable from in . But , which means every internal vertex on this path is reachable from in as well. This in turn means that there is a path from to in , a contradiction.
Lemma 17 (Theorem 8.51 of [9]).
Given a digraph and two disjoint sets , there are at most - important separators of size .
Equipped with these results, we are now ready to prove our main theorem in this section which bounds the number of important separators between across all subsets and .
See 1
Proof.
Let be an important - separator for some and with . We will show that is also an - important separator for some and with and .
By Lemma 16, it must be the case that is closest to . Also, there are vertex disjoint paths from each vertex of to some vertex of . Also, we know that for every there exists disjoint paths from to which are vertex disjoint except at which appears twice.
Consider the vertex-flow network obtained by adding a super-source and a super-sink . We add outgoing arcs from to each vertex of , and incoming arcs to from each vertex of . The capacity of each vertex is . Let be the (maximum) flow corresponding to any set of vertex disjoint paths between and , and let the endpoints of these paths in be the set . Now we do the following operation for each vertex . Consider the modified flow network obtained from where the underlying graph is same, but the vertex capacity of is raised to . is not a maximum flow in , since there exists paths from to which are vertex disjoint except at which appears in two of these paths. Thus, there is an augmenting path for in . We augment the flow along this augmenting path to obtain a new flow . The new flow has one additional endpoint in , so that the endpoints of the paths of this new flow are either in or at the newly added vertex . Let us denote by the set of the new endpoints.
Let . Notice that , since each for some vertex for each . Also since is minimal, there must exist paths for each from some vertex of to which do not contain any other vertex of . Let . We now claim that is also an important separator.
First, observe that must be a minimal - separator: there exists a path from some vertex of to each vertex containing no other vertex of , and similarly there exists a path from each vertex of to some vertex in containing no other vertex of . Now suppose that is not an - important separator. Then there exists another - minimal separator with with . Since , there exists a vertex . Consider the set and the set of paths from to which are vertex disjoint except at , which appears twice. Since does not delete , at least one of these paths from some vertex to survives in . By construction, there is a path from some vertex to , whose every internal vertex is not in . Since every internal vertex of is in , it must also be in . It follows that there is a path from to in . Appending this to the path from to we obtain a path from to in , contradicting the fact that is an - separator.
Finally, it is clear that the number of - important separators where and with and is at most times the number of ways of choosing and , and hence is at most . Furthermore, once we fix and , all the important - separators of size can be enumerated in time , see Theorem 8.51 of [9]. The set of all subsets of of size at most and the set of all subsets of of size at most can be enumerated in time and respectively. This completes the proof.
3 Detection Sets and Sample Sets in Directed Graphs
The goal of this section is to prove our result on detection sets and sample sets in directed graphs. The following theorems are our main results in this section. See 3 See 4
We start by defining a notion of nets and recalling the definitions for detection sets and sample sets from the overview. These definitions naturally extend those in [15] for undirected graphs.
Definition 18 (Net).
Given a directed graph , an net is a set of terminals satisfying the following property: for every set of vertices with , the following two conditions are met:
-
1.
For every SCC of with we have
-
2.
Let be the union of all except one SCC in such that , then we have
Definition 19 (Detection Set).
Given a directed graph , an detection set is a set of terminals satisfying the following property: for every set of vertices with such that can be partitioned into with and there are no arcs from to , there exists such that there is no - path in .
Definition 20 (Sample Set).
Given a directed graph , an sample set is a set of terminals satisfying the following property: for every set of vertices with , and every SCC of , we have
We note that these definitions are almost identical to the one in [15], with the only difference being that we deal with directed graphs and strongly connected components in lieu of undirected graphs and connected components. Before proving the main results of this section, we show that -nets are also -detection sets. The proof is similar to that in undirected graphs.
Lemma 21.
Every -net is also an detection set.
Proof.
Let be an -net for . Consider a set of vertices of size at most , so that admits a partition with such that there are no arcs from to . Consider an SCC of , and let be the union of all SCC’s of except . Then clearly , hence there exists a terminal . Let be the SCC of containing . Now consider which is the union of all SCC’s of except . Again , therefore there is a terminal . But clearly and lie in different SCC’s of , which means there is either no - path or no - path in , hence satisfying our detection set property.
Thus we henceforth focus on obtaining -nets and samples.
First, we observe that showing a VC-dimension bound directly implies samples. This is similar to the approach in [15]. We start with a few definitions.
Definition 22 (Shattering a set of elements).
Suppose we are given a set system consisting of a family of sets , where each set in consists of elements from a universe . We say that a subset is shattered by , if for every subset , there exists a set so that .
Definition 23 (VC-dimension).
Given a set system , its VC-dimension is the size of the largest subset that can be shattered by .
Theorem 24 (-net theorem, see [15]).
Let be a set system with VC-dimension , and universe size . Then for every , there exists an absolute constant such that a random subset of size is an -net with probability at least . Concretely, satisfies for every satisfying with probability at least .
Theorem 25 (-sample theorem, see [15]).
Let be a set system with VC-dimension , and universe size . Then for every , there exists an absolute constant such that a random subset of size is an -sample with probability at least . Concretely, satisfies for every with probability at least .
Let be the family of sets consisting of all possible sets which are either (a) a strongly connected component after deleting some vertex set of size at most or (b) the union of all but one strongly connected components after deleting some vertex set of size at most . Then it suffices to prove a VC-dimension bound of for this family of sets with the universe as the vertex set - this would then directly imply Theorems 3 and 4 using Theorems 24 and 25. Thus we will henceforth focus on upper-bounding the VC-dimension of the set system .
Towards this, we consider a slightly different problem of reachability from a single source after deleting vertices.
Definition 26 (Single source reachability profile).
Given a graph , a source vertex and a set of sink vertices with , we define the single source reachability profile of as In other words, is the collection of subsets of reachable from after deleting some set of at most vertices from .
The next theorem bounds the size of .
Theorem 27.
Given a graph , source vertex and sink vertices , .
We prove Theorem 27 using our result on all-subset important separators, Theorem 1. We do this as follows. Given and , suppose there exists a set of vertices so that is the subset of reachable from in . Then in the following lemma, Lemma 28, we show that there is in fact an - important separator of size , so that the subset of reachable from in remains . The bound on all-subset important separators will then imply the bound on the reachability profile.
Lemma 28.
Suppose there exists a set of vertices so that is the subset of reachable from in . Then there is an - important separator of size , so that the subset of reachable from in remains .
Proof.
Without loss of generality, we assume that is a minimal separator (otherwise we consider the subset of that is minimal, this still has the same reachability set ). Let be the vertex mincut between and that is closest to . We have since itself is a vertex cut between and .
We show that the reachability profile after removing is also . In other words, a vertex is reachable from after removing iff it is reachable after removing .
Consider some sink vertex , so that is not reachable after removing . Then, every – path contains a vertex in . Since is a cut between and , this path must also contain a vertex in . This proves one direction.
Suppose that some vertex is not reachable after removing . Then, every – path contains a vertex in . Suppose for contradiction that some – path does not contain a vertex in . Consider the prefix of the path leading to a vertex in . Append to it a path from to that only contains one vertex in (at the start) and no vertex of . Such a path must exist since is a min-cut between and . The result is a path from to that does not contain a vertex in . This contradicts the set being the reachability profile after removing .
Finally, we show that is an important - separator. By Lemma 16 it is enough to show that is closest to and that is a minimal - separator. It follows from the construction of that must be closest to . Thus it is enough to show that is a minimal - separator. Since is a min-cut between and , for every vertex , there exists a path from some vertex to some vertex through that does not contain any other vertex of . Since is a minimal - separator, there is a path from to each vertex that does not include any other vertex from . Further does not contain any vertex from : if it contains some vertex , it must be the case that . Then it follows that there is a path from to in . Since is a min-cut between and , there is a path from each vertex of to some vertex of that does not have any internal vertex from . In particular, there is a path from to some vertex of which does not include any vertex of . This path, when appended to the path from to in , results in a path from to some vertex of which does not contain any vertex of , which in turn contradicts the fact that is a cut between and . Finally, appending with gives a path from to that contains , while containing no other vertex of . Since such a path exists for any , it follows that is minimal, and hence is an important - separator.
Proof of Theorem 27.
Suppose . Lemma 28 implies that there is an important - separator so that the reachability set of in is exactly . Using Theorem 1 the number of such important separators is at most , and hence the number of such is at most .
Next, we show why Theorem 27 implies a VC-dimension bound.
Theorem 29.
Let be the family of sets over consisting of all possible sets which are either (a) a strongly connected component after deleting some vertex set of size at most or (b) the union of all but one strongly connected components after deleting some vertex set of size at most . Then the VC-dimension of is .
Proof.
From the definition of VC-dimension, it is enough to show that there no set of size that can be shattered, for some constant . Suppose a set of terminals can be shattered. We will show that .
Consider some , with , and suppose that after deleting some subset of vertices of size at most , some strongly connected component of satisfies . Fix any , and let . Let denote the graph with arcs reversed. Then by definition of an SCC, is the exact subset of reachable from in both and . But this means for some and .
Together, we obtain that if is for some SCC after deleting some vertices, there must exist a vertex such that for some and . But then the number of choices for is at most .
Alternatively, suppose that where is the union of all but one strongly connected component, say , after deleting some set of vertices. Suppose , so that . Consider some . Then it follows that where and . But then the number of choices for is at most , and hence the number of choices for is at most .
Thus the total number of choices for across both cases is at most (accounting for in the first case, and in the second) which is not less than only when , finishing the proof.
References
- [1] Amit Agarwal, Moses Charikar, Konstantin Makarychev, and Yury Makarychev. approximation algorithms for min uncut, min 2cnf deletion, and directed cut problems. In Proceedings of the thirty-seventh annual ACM symposium on Theory of computing, pages 573–581, 2005.
- [2] Aditya Anand, Euiwoong Lee, Jason Li, and Thatchaphol Saranurak. All-subsets important separators with applications to sample sets, balanced separators and vertex sparsifiers in directed graphs. CoRR, 2025. arXiv:2504.20027.
- [3] Sanjeev Arora, Satish Rao, and Umesh Vazirani. Expander flows, geometric embeddings and graph partitioning. Journal of the ACM (JACM), 56(2):1–37, 2009. doi:10.1145/1502793.1502794.
- [4] Parinya Chalermsook, Syamantak Das, Yunbum Kook, Bundit Laekhanukit, Yang P Liu, Richard Peng, Mark Sellke, and Daniel Vaz. Vertex sparsification for edge connectivity. In Proceedings of the 2021 ACM-SIAM Symposium on Discrete Algorithms (SODA), pages 1206–1225. SIAM, 2021. doi:10.1137/1.9781611976465.74.
- [5] J Chen, Y Liu, S Lu, B O’Sullivan, and Igor Razgon. A fixed-parameter algorithm for the directed feedback vertex set problem. Journal of the ACM, 55(5):1–19, 2008. doi:10.1145/1411509.1411511.
- [6] Jianer Chen, Yang Liu, and Songjian Lu. An improved parameterized algorithm for the minimum node multiway cut problem. Algorithmica, 55(1):1–13, 2009. doi:10.1007/S00453-007-9130-6.
- [7] Rajesh Chitnis, MohammadTaghi Hajiaghayi, and Dániel Marx. Fixed-parameter tractability of directed multiway cut parameterized by the size of the cutset. SIAM Journal on Computing, 42(4):1674–1696, 2013. doi:10.1137/12086217X.
- [8] Julia Chuzhoy. On vertex sparsifiers with steiner nodes. In Proceedings of the forty-fourth annual ACM symposium on Theory of computing, pages 673–688, 2012. doi:10.1145/2213977.2214039.
- [9] Marek Cygan, Fedor V Fomin, Łukasz Kowalik, Daniel Lokshtanov, Dániel Marx, Marcin Pilipczuk, Michał Pilipczuk, and Saket Saurabh. Parameterized algorithms, volume 4(8). Springer, 2015. doi:10.1007/978-3-319-21275-3.
- [10] Marek Cygan, Paweł Komosa, Daniel Lokshtanov, Marcin Pilipczuk, Michał Pilipczuk, Saket Saurabh, and Magnus Wahlström. Randomized contractions meet lean decompositions. ACM Transactions on Algorithms (TALG), 17(1):1–30, 2020. doi:10.1145/3426738.
- [11] Marek Cygan, Daniel Lokshtanov, Marcin Pilipczuk, Michał Pilipczuk, and Saket Saurabh. Minimum bisection is fixed-parameter tractable. SIAM Journal on Computing, 48(2):417–450, 2019. doi:10.1137/140988553.
- [12] Rodney G Downey, Michael R Fellows, et al. Fundamentals of parameterized complexity, volume 4. Springer, 2013.
- [13] Guy Even, Joseph Naor, Satish Rao, and Baruch Schieber. Fast approximate graph partitioning algorithms. SIAM Journal on Computing, 28(6):2187–2214, 1999. doi:10.1137/S0097539796308217.
- [14] Uriel Feige, Mohammadtaghi Hajiaghayi, and James R Lee. Improved approximation algorithms for minimum weight vertex separators. SIAM Journal on Computing, 38(2):629–657, 2008. doi:10.1137/05064299X.
- [15] Uriel Feige and Mohammad Mahdian. Finding small balanced separators. In Proceedings of the thirty-eighth annual ACM symposium on Theory of computing, pages 375–384, 2006. doi:10.1145/1132516.1132573.
- [16] Uriel Feige and Orly Yahalom. On the complexity of finding balanced oneway cuts. Information processing letters, 87(1):1–5, 2003. doi:10.1016/S0020-0190(03)00251-5.
- [17] Zhiyang He, Jason Li, and Magnus Wahlström. Near-linear-time, optimal vertex cut sparsifiers in directed acyclic graphs. In 29th Annual European Symposium on Algorithms (ESA 2021). Schloss Dagstuhl – Leibniz-Zentrum für Informatik, 2021.
- [18] Jon Kleinberg. Detecting a Network Failure. Internet Mathematics, 1(1):37–55, 2003. doi:10.1080/15427951.2004.10129077.
- [19] Jon Kleinberg, Mark Sandler, and Aleksandrs Slivkins. Network failure detection and graph connectivity. SIAM Journal on Computing, 38(4):1330–1346, 2008. doi:10.1137/070697793.
- [20] Stefan Kratsch and Magnus Wahlström. Representative sets and irrelevant vertices: New tools for kernelization. In 2012 IEEE 53rd Annual Symposium on Foundations of Computer Science, pages 450–459. IEEE, 2012. doi:10.1109/FOCS.2012.46.
- [21] Stefan Kratsch and Magnus Wahlström. Representative sets and irrelevant vertices: New tools for kernelization. Journal of the ACM (JACM), 67(3):1–50, 2020. doi:10.1145/3390887.
- [22] Jason Li and Jesper Nederlof. Detecting feedback vertex sets of size k in time. ACM Transactions on Algorithms (TALG), 18(4):1–26, 2022.
- [23] Yang P Liu. Vertex sparsification for edge connectivity in polynomial time. In 14th Innovations in Theoretical Computer Science Conference (ITCS 2023), volume 251, page 83. Schloss Dagstuhl – Leibniz-Zentrum für Informatik, 2023.
- [24] Daniel Lokshtanov and Dániel Marx. Clustering with local restrictions. Information and Computation, 222:278–292, 2013. doi:10.1016/J.IC.2012.10.016.
- [25] Daniel Lokshtanov, Pranabendu Misra, MS Ramanujan, Saket Saurabh, and Meirav Zehavi. Fpt-approximation for fpt problems. In Proceedings of the 2021 ACM-SIAM Symposium on Discrete Algorithms (SODA), pages 199–218. SIAM, 2021.
- [26] Daniel Lokshtanov, Maadapuzhi-Sridharan Ramanujan, Saket Saurabh, Roohani Sharma, and Meirav Zehavi. Wannabe bounded treewidth graphs admit a polynomial kernel for directed feedback vertex set. ACM Trans. Comput. Theory, 17(1), February 2025. doi:10.1145/3711669.
- [27] Jayakrishnan Madathil, Roohani Sharma, and Meirav Zehavi. A sub-exponential fpt algorithm and a polynomial kernel for minimum directed bisection on semicomplete digraphs. Algorithmica, 83:1861–1884, 2021. doi:10.1007/S00453-021-00806-X.
- [28] Dániel Marx. Parameterized graph separation problems. Theoretical Computer Science, 351(3):394–406, 2006. doi:10.1016/J.TCS.2005.10.007.
- [29] Dániel Marx. Important separators and parameterized algorithms. In International Workshop on Graph-Theoretic Concepts in Computer Science, pages 5–10. Springer, 2011. doi:10.1007/978-3-642-25870-1_2.
- [30] Daniel Marx and Igor Razgon. Fixed-parameter tractability of multicut parameterized by the size of the cutset. SIAM Journal on Computing, 43(2):355, 2014.
- [31] Pranabendu Misra, Fahad Panolan, MS Ramanujan, and Saket Saurabh. Linear representation of transversal matroids and gammoids parameterized by rank. Theoretical Computer Science, 818:51–59, 2020. doi:10.1016/J.TCS.2018.02.029.
- [32] Ankur Moitra. Approximation algorithms for multicommodity-type problems with guarantees independent of the graph size. In 2009 50th Annual IEEE Symposium on Foundations of Computer Science, pages 3–12. IEEE, 2009. doi:10.1109/FOCS.2009.28.
- [33] Harald Räcke. Optimal hierarchical decompositions for congestion minimization in networks. In Proceedings of the fortieth annual ACM symposium on Theory of computing, pages 255–264, 2008. doi:10.1145/1374376.1374415.
- [34] Thatchaphol Saranurak and Sorrachai Yingchareonthawornchai. Deterministic small vertex connectivity in almost linear time. In 2022 IEEE 63rd Annual Symposium on Foundations of Computer Science (FOCS), pages 789–800. IEEE, 2022. doi:10.1109/FOCS54457.2022.00080.
Appendix A Appendix
See 2
Proof.
Essentially, we will construct undirected graphs: these graphs can be made directed simply by adding for every undirected edge , the directed edges and .
For the first part, consider the star on vertices for some . Let consist of the single center vertex, and let consist of all the leaves. Observe that for any of size at most , the separator which consists of exactly the vertices is an important - separator of size at most . Thus there are at least such separators.
For the second part, consider again a star, but this time we blow up the center vertex to a clique of size . Formally, consider the graph on vertices, where is a clique of size , and forms the remaining vertices. Each vertex of is adjacent to each vertex of , and not adjacent to any vertex of . Thus the graph is a star with a core of size and leaves which form the set . Now for every subset with , itself forms an - important separator of size at most . Thus there are at least such separators.
Lemma 30 (Tight example for important separator preservation).
There is a graph , source vertex , integer , sink vertices with and an - important separator with , such that is not an - important separator for any .
Proof.
Consider the graph in Figure 2. is an important - separator of size . However, clearly is not an important - important separator for any , since we must have for some and this means that or is an - separator closer to than , contradicting the importance of .