Tight Bounds for Connected Odd Cycle Transversal Parameterized by Clique-Width
Abstract
Recently, Bojikian and Kratsch [ICALP 2024] presented a novel approach to tackle connectivity problems parameterized by clique-width (), based on counting (modulo 2) the number of representations of partial solutions, while allowing for possibly multiple representations to exist for the same partial solution. Using this technique, they got a SETH-tight bound of for the Steiner Tree problem, which was left open by Hegerfeld and Kratsch [ESA 2023]. We use the same technique to solve the Connected Odd Cycle Transversal problem in time . Moreover, we prove that our result is tight by providing a SETH-based lower bound excluding algorithms with running time . This answers another question of Hegerfeld and Kratsch [ESA 2023].
Keywords and phrases:
Parameterized complexity, connected odd cycle transversal, clique-widthCopyright and License:
2012 ACM Subject Classification:
Theory of computation Parameterized complexity and exact algorithmsAcknowledgements:
We are grateful to Vera Chekan for her detailed comments on presentation, and to Falko Hegerfeld for several discussions on the lower bound presented here.Editors:
Akanksha Agrawal and Erik Jan van LeeuwenSeries and Publisher:
Leibniz International Proceedings in Informatics, Schloss Dagstuhl – Leibniz-Zentrum für Informatik
1 Introduction
Parameterized complexity studies the fine-grained complexity of problems by analyzing the dependence of the running time on different measures of the input (called parameters) in addition to its size. Typical parameters are the solution size or structural measures relating, e.g., to sparsity or the existence of certain decompositions of the input. We refer to [12] for an introduction to the field. A central goal is to determine whether a problem with chosen parameter can be solved in time where is an arbitrary computable function and is the input size. Such problems are called fixed-parameter tractable and form the class FPT. While in general the dependence on may be arbitrary, there is much interest in making this as low as possible and possibly establishing conditional optimality of obtained time bounds.
The most studied parameter in this direction is the treewidth (of a graph), a measure that sparks interest in different areas of theoretical computer science, not only parameterized complexity. Roughly, the treewidth of a graph is the smallest value such that can be fully decomposed along non-crossing separators of size . FPT algorithms with single exponential dependence on the treewidth of the graph have long been known for many problems. However, one would still try to improve the exponential dependence on the parameter, by improving the base of the exponent in the running time. In 2010, Lokshtanov et al. [33, 34] provided the first tight lower bounds for problems parameterized by treewidth, proving that known bases for some benchmark problems cannot be improved assuming the strong exponential time hypothesis (SETH), a widely used hypothesis that, informally speaking, conjectures that the naive solution of the SAT problem by testing all possible assignments is essentially optimal [27, 28]. This has inspired a series of SETH based lower bounds for structural parameters [11, 16, 36, 32, 22, 24].
Connectivity problems like Steiner Tree or Feedback Vertex Set, however, resisted single exponential running time algorithms for a long time. A major obstacle had been keeping track of different connectivity patterns in the boundary of the graph, representing different partial solutions in this graph. This seemed to necessitate a factor of in the running time. In 2011, Cygan et al. [14, 15] showed that this issue can be overcome for many (but not all) connectivity problems, by means of their Cut&Count technique. The core idea is to count the number of connected solutions of a problem modulo 2 in single exponential time by counting the ways that (possibly disconnected) solution candidates can be cut. They used the isolation lemma [35] to solve the underlying decision problem with high probability. Finally, they also proved that many of the resulting bases were tight under SETH.
Relying on the Cut&Count technique, tight bounds for connectivity problems were also derived relative to parameters such as pathwidth [13], cutwidth [6], modular treewidth [25], and clique-width [24, 8]. In this paper, we focus on Clique-width, a graph parameter defined by the smallest value , such that a given graph can be built recursively from the disjoint union of -labeled graphs, by allowing to add bicliques between the vertices of two label classes, and to relabel all vertices in a class. Clique-width is a more general parameter than treewidth, since cliques have clique-width at most , and unbounded treewidth, while graphs of treewidth have bounded clique-width (at most ) [10, 9].
The first single exponential time algorithms for connectivity problems parameterized by clique-width, to the best of our knowledge, were given by Bergougnoux and Kanté [2]. However, the first SETH-tight bounds were given by Hegerfeld and Kratsch [24], where they provided algorithms, based on the Cut&Count technique, and matching SETH-based lower bounds for the Connected Vertex Cover and the Connected Dominating Set problems. They left open however, the tight complexity of other connectivity problems such as Steiner Tree and the Connected Odd Cycle Transversal. A major obstacle towards finding the optimal base of the exponent, as they mention, is a gap between the rank of a matrix defined by the interactions of partial solutions of the underlying problem (called the compatibility matrix), and its largest triangular submatrix. Except for a handful examples (Feedback Vertex Set [] [6]), the rank of such matrices has usually been used directly to derive the optimal running time, while current lower bound techniques usually match the size of a largest triangular submatrix.
Very recently Bojikian and Kratsch [8] introduced a new technique called “isolating a representative” closing the gap for the Steiner Tree problem. Their technique is based on a smaller set of possible representations of connectivity patterns. They showed that one can count the number of these representations (modulo 2) in time proportional to the size of a largest triangular submatrix of the corresponding compatibility matrix. They introduced the notion of action-sequences to distinguish different representations of the same solution, and using the isolation lemma to also isolate a unique representation, they were able to solve the underlying decision problem in the same running time, matching known lower bounds.
Our Technique.
Following the approach of Bojikian and Kratsch, we provide an optimal (modulo SETH) algorithm for the Connected Odd Cycle Transversal problem parameterized by clique-width with running time , proving the following theorem (we refer to Section 2 for definitions):
Theorem 1.
There exists a one-sided error Monte-Carlo algorithm (with false negatives only), that given a graph together with a -expression of , and a positive integer , solves the Connected Odd Cycle Transversal problem in time , and outputs the correct answer with probability at least .
Our algorithm counts the number of representations of solutions modulo 2. We achieve this by double representation of partial solutions. The first of which is existential, in the sense that we replace each family of partial solutions at a node of the clique-expression with a smaller family, such that the former completes into a solution in the full graph if and only if the representing family does. In the second step we define an even smaller family, that correctly counts (modulo 2) the number of extensions of the first family into solutions in the rest of the graph. Finally, we assign weights to different actions defined to build the first representing family, and we use the isolation lemma [35] to isolate a single representation with high probability. We make use of fast convolution methods over power lattices [4, 24, 6].
We prove the tightness of our algorithm under SETH by a parameter-preserving reduction from the -SAT problem. To the best of our knowledge, this is the largest known single exponential base for a natural problem parameterized by a structural parameter. This imposes a challenge when designing the lower bound. As is usual, the lower bound even works for the linear version of the parameter, i.e., for linear clique-width [1, 20, 26], and we obtain the following theorem:
Theorem 2.
Given a graph of linear clique-width at most , assuming SETH, there exists no algorithm that solves the Connected Odd Cycle Transversal problem in time even when a linear -expression of is provided with .
Related work.
Clique-width was first studied by Courcelle and Olariu [10] showing that problems expressible in can be solved in FPT time when parameterized by clique-width. However, the resulting algorithms could have a non-elementary dependence on the parameter value , and hence, are probably not tight for most relevant problems. Espelage et al. [17] provided XP algorithms for some problems not expressible in logic. Nonetheless, some problems were proven to be para-NP-hard when parameterized by clique-width, and hence probably do not admit a polynomial time algorithm on graphs of bounded clique-width [21].
The SETH-tight lower bounds by Lokshtanov et al. [34] were followed by series of tight bounds for structural parameters [6, 36, 19, 30, 32]. Iwata and Yoshida [29] proved that Vertex Cover has the same base parameterized by treewidth and clique-width, providing one of the earliest tight bounds for clique-width. Further tight bounds were obtained for counting perfect matchings, graph coloring and other problems [11, 18, 24, 31, 32].
Finally, we make use of fast convolution methods. These methods are usually used to combine different partial solutions in different parts of the graph along a common boundary set efficiently. Björklund et al. [3] introduced fast convolution over the subset lattice solving the Steiner Tree problem parameterized by the number of terminals more efficiently. Fast subset convolution, and variations thereof have since become a standard technique to handle a join node in dynamic programming algorithms over tree decompositions [37, 15]. Björklund et al. [4] showed that one can compute the so-called join-product over a lattice more efficiently, if the underlying lattice admits a small number of so-called irreducible elements. Building on this result, Hegerfeld and Kratsch [24] provided a fast convolution technique over power lattices resulting in tight algorithms for connectivity problems parameterized by clique-width.
Organization.
In Section 2 we provide preliminaries and introduce some notation. In Section 3 we define connectivity patterns and pattern operations. In Section 4 we describe the dynamic programming algorithm and prove its running time. In Section 5 we define action-sequences, and we use them to prove the correctness of the algorithm. Finally, in Section 6 we present the lower bound. We conclude with final remarks in Section 7.
2 Preliminaries
We denote by the set and by the set . A labeled graph is a graph together with a labeling function . We usually omit and assume that it is given implicitly with . We define a clique-expression as a well-formed expression defined by the following operations on labeled graphs:
-
Introduce vertex for : constructs a graph with a single vertex labeled .
-
Union : the disjoint union of and .
-
Relabel for : change the label of all vertices labeled to .
-
Join for : add all edges between vertices labeled and vertices labeled .
We denote the graph resulting from a clique-expression by , and the constructed labeling function by . We associate with a clique-expression a syntax tree in the natural way, and associate with each node the corresponding operation. Let . We omit when clear from the context. The subtree rooted at each node induces a subexpression . We define , , , , , and . We also define the set as the set of all introduce nodes in , and as the set of all introduce and join nodes in . Given a set , we define . For a mapping , we denote by the mapping , i.e. the restriction of to .
We call a clique-expression linear, if it holds for each union node with children and that is an introduce operation. We say that is -labeled, if it holds that for all . We say that a clique-expression is a -expression if is a -labeled graph for all . We define the (linear) clique-width of a graph , denoted by ( resp.) as the smallest value such that there exists a (linear) -expression , with isomorphic to . We can assume without loss of generality, that any given -expression defining a graph uses at most union operations, and at most unary operations [2, 10].
A mapping , for , is a proper -coloring of , iff for all it holds that . A graph is -colorable if it admits a proper -coloring. A set is an odd cycle transversal of , if is -colorable. In this case, we call a proper -coloring of a witness. In the Connected Odd Cycle Transversal problem, given is a graph and a positive integer , asked is whether admits an odd cycle transversal of size at most , such that induces a connected subgraph of .
We denote the symmetric difference by . A weight function is a mapping from a set to some ring (usually ). Given , we define . Let be a mapping (not explicitly defined as a weight function). For , we define . For , we define in a natural way. Finally, we define the operation with and call it exclusive . We define the union of two mappings over disjoint domains in a natural way.
A short introduction to lattices can be found in the full version. For a lattice , we define the set of join-irreducible elements as the set of elements where for all with it holds that or . We say that a lattice is given in the join representation if its elements are given as -bit strings, together with the elements of , and an algorithm that computes the join for and .
Let be two vectors over a ring . We define the -product where for it holds that . We refer to the full version for details.
Conventions.
Along the upper bound, let be the input graph and let be the target value in the Connected Odd Cycle Transversal instance. Let be a fixed vertex that we choose later. Let and . Let a -expression of for some value , and be the corresponding syntax tree. Let be the root of . A partial solution at is a set of vertices , representing the part of the solution introduced so far, such that is bipartite.
For a function , we denote by the class , where the star hides any polynomial factor of . We define the ground set , and call its elements labels. Let . Let be some fixed value, and a weight function, both fixed later. We define the intervals and , and we use the indices and to iterate over and respectively. We skip defining these indices repeatedly to avoid redundancy.
3 Connectivity patterns
Patterns are structures that represent connectivity in labeled graphs. We use the definition of patterns and some of their properties from [8]. A pattern is a set of subsets of , such that there exists exactly one set that contains the element in it. We call this set the zero-set of . We denote by the family of all patterns. Let be set of labels that appear in at least one set of , and the set of labels that appear as singletons in (excluding the element ). We define the set of incomplete labels . We denote a pattern by , but only if this notation does not cause confusion. We also omit the zero-set when it is a singleton.
Definition 3.
Given a labeled graph and a set , we define the pattern as the sets of labels appearing in each connected component of . Formally, for each connected component , we add the set to the pattern. We add the label to the set corresponding to the component that contains , or as singleton if .
Such patterns carry enough information about the connectivity of partial solutions. We define pattern operations, that allow us to build connectivity patterns corresponding to different partial solutions recursively over the nodes of (see [8] for details).
Definition 4.
We define the operations join, relabel, union, and add over patterns as follows: The union operation of two patterns is defined as the pattern resulting from the union of and by replacing the zero-sets of both patterns by the union of these two sets. The relabel operation () changes all occurrences of the label into the label . The join operation () exhaustively merges all sets of and that intersect. Finally, we define the add operation as if , or as otherwise.
As we shall see later, these operations allow to build all patterns corresponding to all partial solutions of a fixed size over . This can be done by deciding at each leaf node, whether to include the introduced vertex in the solution or not. One can then compute these families at each node by applying the corresponding pattern operations on the families at the children nodes of this node. It is not hard to see that a union node corresponds to a union operation, a relabel node corresponds to a relabel operation, and a join node to an add operation. A correctness proof of this is given later implicitly, when proving that the so-called solution-sequences correspond to all partial solutions of a given weight.
Now we define the compatibility relation over patterns. This relation indicates whether two partial solutions in two labeled subgraphs join into a connected graph by connecting all vertices of the same label between them. Formally, we call two patterns and compatible (), if their join is a single set.
This notion of compatibility will allow us to define two kinds of representation, an existential one (representation), and a (modulo 2) count-preserving one (parity-representation). Based on these, we identify two special families of patterns, namely the family of complete patterns and the family of -patterns . We will show later that one can represent each family of patterns by a family of complete patterns, and parity-represent each family of complete patterns by a family of -patterns.
Definition 5.
A pattern is complete if it holds that , i.e. each label that appears in this pattern, appears as a singleton as well. We call a complete pattern a -pattern, if it consists only of a zero-set and singletons. We denote the family of complete patterns by and the family of -patterns by .
Lemma 6.
The pattern is the only complete pattern compatible with the pattern .
We will show that all defined pattern operations preserve (parity-)representation, in the sense that if represents , then represents as well. Therefore, we can replace the families of patterns of all partial solutions over by representations thereof, which allows us to restrict the dynamic programming algorithm to -patterns only, correctly counting (modulo 2) the number of all representations of all solutions. Therefore, the running time will depend on the number of -patterns over labels instead of the number of all patterns.
Finally, we use the isolation lemma to show that, by assigning random weights to the different representations of different partial solutions, we get a unique minimum-weight representation with high probability. This allows to solve the decision problem in the given time. The running time is implied by the number of -patterns over labels, combined with a proper -coloring of the remaining part of the graph.
Before we present the algorithm, we still need to define the notion of actions and the operation . Actions are series of “forget” and “fix” operations over patterns, that build a representation of a pattern by deciding whether a given label will still be relevant (in the future) for deciding the connectivity of a solution. Using actions, we build an equivalent representations of all partial solutions recursively over using complete patterns only. The operation is then used to create a parity-representation thereof, consisting of -patterns only.
Definition 7.
Given a pattern and a label , we define the fix and forget operations, where adds the singleton to , and removes the label from all sets of . Now we define actions for patterns with and , by either fixing or forgetting the labels in independently.
It was shown in [8] that represents . Hence, by applying the actions for on a pattern with , we get a family of complete patterns that represents . It is not hard to see that applying pattern operations on -patterns yields patterns with . In particular, is closed under union and relabeling, whereas for and , it holds that .
Definition 8.
Let . We define as follows: Let , and let be all non-singleton sets of different from . We define for as follows:
Then we define . Given a set , we define .
Note that each pattern of results from by removing all non-singleton sets different from , and adding some subset of their union to .
4 The Algorithm
Now we present our main result, namely a dynamic programming algorithm over . We start by defining the family of states that we use to index the dynamic programming tables.
CS-Patterns.
We use the family , similar to [8], to count the number of weighted representations of partial solutions in for each . We note that a -pattern is uniquely defined by its set of labels and its zero-set; Given the set of labels and the zero-set of respectively with , we define .
We define the partial ordering over , where for it holds that if and only if and . Clearly, defines a lattice, with where , and . Hence, the join operation over this lattice corresponds to the union operation over -patterns .
Colorings and bipartition.
We define the set , and call it the set of basic colors, where stands for black, and stands for white. We define the set of colors , where each element of corresponds to a different subset of in the natural way. We define the ordering over given by inclusion over the corresponding subsets, namely, given by the two chains and . We denote by the join operation, and by the meet operation over the underlying lattice. We call two colors compatible (denoted by ), if .
We call a mapping a coloring. We denote the family of all colorings of by . We define the ordering over colorings, where for , it holds that if it holds that for all . Finally, we denote by the join operation over the underlying lattice (pointwise join). For and , we define the coloring as , and for . For and , let with and for .
Dynamic programming tables.
We define the family of states as the set of combinations of a -pattern and a coloring. For , the pattern indicates the connectivity of a partial solution, and the coloring indicates the basic colors assigned to the vertices of each label class in a witness of this partial solution. This coloring allows to extend a partial solution, preserving a proper 2-coloring of the rest of the graph.
We define the ordering over with if and only if and . It holds that is the product of two lattices, and hence, it holds that , where and .
Description of the algorithm.
Now we describe the algorithm. Intuitively, actions (Definition 7) allow to build (existential) representations of partial solutions by a dynamic programming scheme over using complete patterns. However, since the number of complete patterns is at least the number of all partitions of , we seek to reduce the space of the states by representing (in a count-preserving manner) these patterns using -patterns only.
We define the vectors for and , that constitute the dynamic programming tables of our algorithm, and we show that these vectors can be computed in time . In the following sections, we will prove the correctness of the algorithm in two stages. In the first stage we show that any pattern, corresponding to a partial solution at a node , can be represented by a set of complete patterns resulting from applying different actions at each node in . We assign different weights to the actions at each node resulting in different weights for different representations of partial solutions. In the second stage, we show that the tables count for each -pattern , the number of representations of weight of solutions of size that are compatible with . By appropriately choosing the weight function, we show that this suffices to solve the Connected Odd Cycle Transversal problem with high probability.
Algorithm 1.
For , and for two values , we define the vectors recursively over as follows:
-
Introduce node : Let if , and otherwise. Let and . We set all values to zero, for all values of and , and then we add one to each of the following entries:
where the first two entries correspond to adding the vertex to a partial solution and computing a representation thereof, and the latter two entries correspond to the two different partial colorings of this vertex.
-
Relabel node : For each state we define
-
Join node : For , if , we set . Otherwise, we define
Intuitively, this corresponds to iterating over all patterns at , applying an add operation to , and then applying all four actions to the resulting pattern to compute a representation thereof. We add parity-representations of the resulting patterns to .
-
Union node : We define
Now we show how to compute these tables in time . We assume that an update of a single entry can be done in logarithmic time in the size of the tables (hence, polynomial in ). Although processing a union node in the naive way requires time polynomial in , we apply a fast convolution technique to compute these tables more efficiently.
In particular, we use a result from [23], which informally states that the -product over powers of a finite lattice can be computed efficiently. In the full version, we define a simple lattice over the set . We show that the lattice is isomorphic to the -th power of this lattice. It follows by the mentioned result that the -product over can be computed in time implying the same running time (up to a polynomial factor) to process a union node. The running time of the algorithm is a direct consequence of this result, since the tables at a join or a relabel node can be computed in a straight-forward way by iterating over all entries of the tables at a child node and updating the right indices accordingly.
Corollary 9.
All tables for all values of can be computed in time .
5 Solution representation
Now we introduce solution-sequences and action-sequences. A solution-sequence is defined by the actions taken at all introduce nodes in a subtree of to create a partial solution (together with a witness thereof). We show a bijective mapping between solution-sequences at a node and partial solutions (combined with witnesses) at that node. An action-sequence is defined by the actions taken at both introduce and join nodes in a subtree of , encoding the forget and fix operations along . Each action-sequence corresponds to a different representation of a partial solution. Intuitively, the set of all action-sequences extending a solution-sequence build a full complete-pattern representation of the corresponding partial solution.
Definition 10.
Given a labeled graph , a set and a mapping , we define the coloring corresponding to by assigning to each label the color defined as follows: let if , and otherwise, and let if , and otherwise. We define .
Intuitively, is the coloring defined by assigning to each label the join of all basic colors assigned to vertices labeled in , or if no such vertex exists. Hence, it indicates the colors assigned to each label in a witness of a partial solution.
From now on, we shorthand for , and for .
Definition 11.
A solution-sequence is a mapping . The cost of is defined as . The solution-subsequence induced by a node is the restriction of to . We define the pattern in a natural way, where for an introduce node, if we define if , and if . We define otherwise. For different nodes , is then defined by applying the corresponding pattern operation to the patterns for children nodes of . We also define the coloring . For an introduce node , we define if , if , and if . For a relabel node, we define . For a join node, we define , and for a union node we define . We say that is a valid solution-sequence, if for each join node , it holds that and are compatible.
We define an action-sequence as a mapping with the cost of defined by . The weight of is defined by . The action-subsequence at a descendant of is defined by the restriction of to . Finally, we define the pattern in a similar way to , where for an introduce node we apply a forget operation if , and a fix if . For a join node, we follow the add operation by the applying the action on the resulting pattern. The coloring is defined in the same way as where at an introduce node we define replace by . We define a valid action-sequence in the same way as for solution-sequences. We say that is the pair generated by .
The notion of valid sequences ensures the validity of witnesses, as it forbids adding edges between vertices of the same color. Now we show the correspondence between solution-sequences and partial solutions at a node .
Definition 12.
Given a set and a mapping , we define the solution-sequence where for the vertex introduced at we have if , if , and if .
This forms a bijective mapping between the family of partial solutions at a node and the family of all solution-sequences at . In the full version, we show that , , . We also prove that is a proper 2-coloring of if and only if is valid. Now we show that action-sequences represent partial solutions.
Definition 13.
A family represents another family , if it holds for all that is consistent with a pattern of iff if it is consistent with a pattern of . The family parity-represents over a family , if it holds for each that is consistent with an odd number of patterns of iff it is consistent with an odd number of patterns of .
We show in the full version (and from [8]) that pattern operations and actions preserve both representation and parity-representation over , in the sense that if (parity-) represents , then (parity-)represents . It was also shown in [8] that the four actions over a pattern with represent . Therefore, one can show by induction over that the family of patterns corresponding to valid action-sequences at a node represents the family of patterns corresponding to valid solution-sequences at , and hence, represents all partial solutions at as well. The following result follows:
Corollary 14.
Let . There exist , and a valid action-sequence of cost and weight at generating the pair , if and only if there exists a connected odd cycle transversal of size in containing .
All that is left to show is that the algorithm correctly counts (modulo 2) the number of valid action-sequences of cost and weight at each node that generate a pair , where for some complete pattern . In order to show this, we prove that parity-represents over for any . Based on this, together with the fact that all defined operations preserve parity-representation over , we prove the following lemma:
Lemma 15.
For each node , for all values , and all and , it holds that , if and only if there exists an odd number of valid action-sequences at of cost and weight generating the pair with .
The following corollary follows directly from Lemma 6:
Corollary 16.
It holds for all , and all that iff there exists an odd number of valid action-sequences of cost and weight at generating .
So far, we have shown that the algorithm correctly counts the number of action-sequences of a specific weight, generating a pair for any coloring . We make use of the isolation lemma [35], to show that by a careful choice of the value and the weight function , we get a unique minimum weight sequence with high probability if a solution exists.
Lemma 17.
Assume that contains a connected odd cycle transversal of size containing . Fix , and choose independently and uniformly at random in . Let be a minimum weight valid action-sequence of cost generating a pair for any coloring . Then is unique with probability at least .
Proof sketch of Theorem 1.
Fix and as in Lemma 17. For each fix , and compute tables for all , and all . The algorithm accepts if there exist a choice of , a coloring and a weight such that , and rejects otherwise. The algorithms runs in time by Corollary 9.
The correctness follows by the following observation: There exists a solution of cost if and only if, for any vertex in this solution, there exists a valid action-sequence at of cost generating the pair for some . If such a sequence exists, then by Lemma 17 a unique minimum weight sequence exists with probability at least . Hence, it follows from Corollary 16 that if a solution exists, then there exist and such that with probability at least . On the other hand, if no such sequence exists, then it follows by Corollary 16 that for all values , any coloring and any choice of .
6 Lower bound
We prove Theorem 2 through a parameterized reduction from the -SAT problem for each constant , proving that an algorithm running in time for some contradicts SETH. Let be the given instance with clauses over variables . We refer to the full version for details. We start by a formal statement of SETH.
Conjecture 18 ([27, 28]).
For any positive value there exists an integer such that the -SAT problem cannot be solved in time , where is the number of variables.
Construction of the graph.
We build by combining copies of constant components, called gadgets. We distinguish three kinds of gadgets: path gadgets, decoding gadgets, and clause gadgets. We distinguish a vertex in called the root, that is forced to every solution, and another vertex . The connectivity of a solution can be proven by showing that all vertices are connected to by a path. The vertex will not be contained in any optimal solution, and its color serves as reference for a witness of an optimal solution.
Let be a constant that we fix later, and . We partition the variables into groups each of size (except for the last one). Let and . The graph consists of path-like structures, called path sequences. Each path sequence is a long sequence of path gadgets. Consecutive path gadgets are connected by small bicliques, giving a path sequence its narrow path-like structure. Let be the path sequences of . We group them into groups of size each, calling each of these groups a path bundle. For a path bundle and , we call the set of the th path gadgets on each path sequences in a (simple) bundle . Finally, we call the set consisting of the -th path gadget of each path sequence a column . We attach a decoding gadget to each bundle and a clause gadget to each column. See Figure 1 for illustration.
In each path gadget , We distinguish 3 entry vertices and three exit vertices. We add a biclique between the exit vertices of each path gadget, and the entry vertices of the following path gadget. We attach a special gadget to each entry or exit vertex of a path gadget called a simple path gadget. We define the set of simple states . A solution defines a simple state in each simple path gadget. The states defined by the simple path gadgets at the entry vertices of a path gadget combine to one of 12 specific states (see full version). The combination of states over a bundle corresponds to a Boolean assignment over the variable group . The clause gadget and all decoding gadgets on the th column enforce a partial assignment of some variable group to satisfying a corresponding clause. Finally, the bicliques restrict transitions of states along consecutive path gadgets, ensuring that in any solution there exists a long segment where all path gadgets share the same state along each path. This ensures that the corresponding Boolean assignment satisfies all clauses.
Budget.
We choose the value in the output instance by defining a family of pairwise vertex-disjoint components in (mostly the gadgets of ). For each component , we define a value . We define as the sum of for all . In the full version, we show that any solution must contain vertices in each component . By the tightness of , it follows that a solution of size contains exactly vertices in each component , and no other vertices.
Correctness.
We show the correctness of our reduction in two steps. First we show that has a satisfying assignment if and only if admits a solution of size . Second, we show that the graph has linear clique-width at most for some constant . Using these, we show that given an algorithm running in time for some , we can choose the constant large enough, such that the -SAT problem for any value of can be solved in time for some positive value , contradicting SETH.
7 Conclusion
In this work, we have provided a single-sided error Monte Carlo algorithm for the Connected Odd Cycle Transversal problem with running time , presenting a new application of the “isolating a representative” technique by [6]. We proved that the running time is tight assuming SETH. This closes the second open problem posed by Hegerfeld and Kratsch [24]. However, one more interesting benchmark problem is still open, namely the Feedback Vertex Set problem. As mentioned in their paper, solving this problem using the Cut&Count technique is challenging, since a direct application would require counting edges induced by partial solutions. Another approach would be to follow the approach of Bojikian and Kratsch [8]. The difficulty of such application lays in finding a low -rank representation of partial solutions.
Since both related techniques – the Cut&Count technique [15] and the rank-based approach [5] – were used to solve connectivity problems more efficiently under different structural parameters, it is natural to ask whether the approach of Bojikian and Kratsch [8] can likewise be adapted to different settings. In particular, one would ask whether this results in tight bounds for different connectivity problems under different parameters, where there also exists a gap between the rank of a corresponding compatibility matrix, and the size of a largest triangular submatrix thereof.
References
- [1] Isolde Adler and Mamadou Moustapha Kanté. Linear rank-width and linear clique-width of trees. Theor. Comput. Sci., 589:87–98, 2015. doi:10.1016/j.tcs.2015.04.021.
- [2] Benjamin Bergougnoux and Mamadou Moustapha Kanté. Fast exact algorithms for some connectivity problems parameterized by clique-width. Theor. Comput. Sci., 782:30–53, 2019. doi:10.1016/j.tcs.2019.02.030.
- [3] Andreas Björklund, Thore Husfeldt, Petteri Kaski, and Mikko Koivisto. Fourier meets möbius: fast subset convolution. In David S. Johnson and Uriel Feige, editors, Proceedings of the 39th Annual ACM Symposium on Theory of Computing, San Diego, California, USA, June 11-13, 2007, pages 67–74. ACM, 2007. doi:10.1145/1250790.1250801.
- [4] Andreas Björklund, Thore Husfeldt, Petteri Kaski, Mikko Koivisto, Jesper Nederlof, and Pekka Parviainen. Fast zeta transforms for lattices with few irreducibles. ACM Trans. Algorithms, 12(1):4:1–4:19, 2016. doi:10.1145/2629429.
- [5] Hans L. Bodlaender, Marek Cygan, Stefan Kratsch, and Jesper Nederlof. Deterministic single exponential time algorithms for connectivity problems parameterized by treewidth. Inf. Comput., 243:86–111, 2015. doi:10.1016/j.ic.2014.12.008.
- [6] Narek Bojikian, Vera Chekan, Falko Hegerfeld, and Stefan Kratsch. Tight bounds for connectivity problems parameterized by cutwidth. In Petra Berenbrink, Patricia Bouyer, Anuj Dawar, and Mamadou Moustapha Kanté, editors, 40th International Symposium on Theoretical Aspects of Computer Science, STACS 2023, March 7-9, 2023, Hamburg, Germany, volume 254 of LIPIcs, pages 14:1–14:16. Schloss Dagstuhl – Leibniz-Zentrum für Informatik, 2023. doi:10.4230/LIPIcs.STACS.2023.14.
- [7] Narek Bojikian and Stefan Kratsch. Tight algorithm for connected odd cycle transversal parameterized by clique-width. CoRR, abs/2402.08046, 2024. doi:10.48550/arXiv.2402.08046.
- [8] Narek Bojikian and Stefan Kratsch. A tight monte-carlo algorithm for steiner tree parameterized by clique-width. In Karl Bringmann, Martin Grohe, Gabriele Puppis, and Ola Svensson, editors, 51st International Colloquium on Automata, Languages, and Programming, ICALP 2024, July 8-12, 2024, Tallinn, Estonia, volume 297 of LIPIcs, pages 29:1–29:18. Schloss Dagstuhl – Leibniz-Zentrum für Informatik, 2024. doi:10.4230/LIPICS.ICALP.2024.29.
- [9] Derek G. Corneil and Udi Rotics. On the relationship between clique-width and treewidth. SIAM J. Comput., 34(4):825–847, 2005. doi:10.1137/S0097539701385351.
- [10] Bruno Courcelle and Stephan Olariu. Upper bounds to the clique width of graphs. Discret. Appl. Math., 101(1-3):77–114, 2000. doi:10.1016/S0166-218X(99)00184-5.
- [11] Radu Curticapean and Dániel Marx. Tight conditional lower bounds for counting perfect matchings on graphs of bounded treewidth, cliquewidth, and genus. In Robert Krauthgamer, editor, Proceedings of the Twenty-Seventh Annual ACM-SIAM Symposium on Discrete Algorithms, SODA 2016, Arlington, VA, USA, January 10-12, 2016, pages 1650–1669. SIAM, 2016. doi:10.1137/1.9781611974331.ch113.
- [12] Marek Cygan, Fedor V. Fomin, Lukasz Kowalik, Daniel Lokshtanov, Dániel Marx, Marcin Pilipczuk, Michal Pilipczuk, and Saket Saurabh. Parameterized Algorithms. Springer, 2015. doi:10.1007/978-3-319-21275-3.
- [13] Marek Cygan, Stefan Kratsch, and Jesper Nederlof. Fast hamiltonicity checking via bases of perfect matchings. J. ACM, 65(3):12:1–12:46, 2018. doi:10.1145/3148227.
- [14] Marek Cygan, Jesper Nederlof, Marcin Pilipczuk, Michal Pilipczuk, Johan M. M. van Rooij, and Jakub Onufry Wojtaszczyk. Solving connectivity problems parameterized by treewidth in single exponential time. CoRR, abs/1103.0534, 2011. arXiv:1103.0534.
- [15] Marek Cygan, Jesper Nederlof, Marcin Pilipczuk, Michal Pilipczuk, Johan M. M. van Rooij, and Jakub Onufry Wojtaszczyk. Solving connectivity problems parameterized by treewidth in single exponential time. ACM Trans. Algorithms, 18(2):17:1–17:31, 2022. doi:10.1145/3506707.
- [16] Baris Can Esmer, Jacob Focke, Dániel Marx, and Pawel Rzazewski. List homomorphisms by deleting edges and vertices: tight complexity bounds for bounded-treewidth graphs. CoRR, abs/2210.10677, 2022. doi:10.48550/arXiv.2210.10677.
- [17] Wolfgang Espelage, Frank Gurski, and Egon Wanke. How to solve np-hard graph problems on clique-width bounded graphs in polynomial time. In Andreas Brandstädt and Van Bang Le, editors, Graph-Theoretic Concepts in Computer Science, 27th International Workshop, WG 2001, Boltenhagen, Germany, June 14-16, 2001, Proceedings, volume 2204 of Lecture Notes in Computer Science, pages 117–128. Springer, 2001. doi:10.1007/3-540-45477-2_12.
- [18] Robert Ganian, Thekla Hamm, Viktoriia Korchemna, Karolina Okrasa, and Kirill Simonov. The fine-grained complexity of graph homomorphism parameterized by clique-width. In Mikolaj Bojanczyk, Emanuela Merelli, and David P. Woodruff, editors, 49th International Colloquium on Automata, Languages, and Programming, ICALP 2022, July 4-8, 2022, Paris, France, volume 229 of LIPIcs, pages 66:1–66:20. Schloss Dagstuhl – Leibniz-Zentrum für Informatik, 2022. doi:10.4230/LIPIcs.ICALP.2022.66.
- [19] Carla Groenland, Isja Mannens, Jesper Nederlof, and Krisztina Szilágyi. Tight bounds for counting colorings and connected edge sets parameterized by cutwidth. In Petra Berenbrink and Benjamin Monmege, editors, 39th International Symposium on Theoretical Aspects of Computer Science, STACS 2022, March 15-18, 2022, Marseille, France (Virtual Conference), volume 219 of LIPIcs, pages 36:1–36:20. Schloss Dagstuhl – Leibniz-Zentrum für Informatik, 2022. doi:10.4230/LIPIcs.STACS.2022.36.
- [20] Frank Gurski and Egon Wanke. On the relationship between nlc-width and linear nlc-width. Theor. Comput. Sci., 347(1-2):76–89, 2005. doi:10.1016/j.tcs.2005.05.018.
- [21] Frank Gurski and Egon Wanke. Vertex disjoint paths on clique-width bounded graphs. Theor. Comput. Sci., 359(1-3):188–199, 2006. doi:10.1016/j.tcs.2006.02.026.
- [22] Falko Hegerfeld and Stefan Kratsch. Towards exact structural thresholds for parameterized complexity. In Holger Dell and Jesper Nederlof, editors, 17th International Symposium on Parameterized and Exact Computation, IPEC 2022, September 7-9, 2022, Potsdam, Germany, volume 249 of LIPIcs, pages 17:1–17:20. Schloss Dagstuhl – Leibniz-Zentrum für Informatik, 2022. doi:10.4230/LIPIcs.IPEC.2022.17.
- [23] Falko Hegerfeld and Stefan Kratsch. Tight algorithms for connectivity problems parameterized by clique-width. CoRR, abs/2302.03627, 2023. doi:10.48550/arXiv.2302.03627.
- [24] Falko Hegerfeld and Stefan Kratsch. Tight algorithms for connectivity problems parameterized by clique-width. In Inge Li Gørtz, Martin Farach-Colton, Simon J. Puglisi, and Grzegorz Herman, editors, 31st Annual European Symposium on Algorithms, ESA 2023, September 4-6, 2023, Amsterdam, The Netherlands, volume 274 of LIPIcs, pages 59:1–59:19. Schloss Dagstuhl – Leibniz-Zentrum für Informatik, 2023. doi:10.4230/LIPIcs.ESA.2023.59.
- [25] Falko Hegerfeld and Stefan Kratsch. Tight algorithms for connectivity problems parameterized by modular-treewidth. In Daniël Paulusma and Bernard Ries, editors, Graph-Theoretic Concepts in Computer Science - 49th International Workshop, WG 2023, Fribourg, Switzerland, June 28-30, 2023, Revised Selected Papers, volume 14093 of Lecture Notes in Computer Science, pages 388–402. Springer, 2023. doi:10.1007/978-3-031-43380-1_28.
- [26] Pinar Heggernes, Daniel Meister, and Charis Papadopoulos. Characterising the linear clique-width of a class of graphs by forbidden induced subgraphs. Discret. Appl. Math., 160(6):888–901, 2012. doi:10.1016/j.dam.2011.03.018.
- [27] Russell Impagliazzo and Ramamohan Paturi. On the complexity of k-sat. J. Comput. Syst. Sci., 62(2):367–375, 2001. doi:10.1006/jcss.2000.1727.
- [28] Russell Impagliazzo, Ramamohan Paturi, and Francis Zane. Which problems have strongly exponential complexity? J. Comput. Syst. Sci., 63(4):512–530, 2001. doi:10.1006/jcss.2001.1774.
- [29] Yoichi Iwata and Yuichi Yoshida. On the equivalence among problems of bounded width. In Nikhil Bansal and Irene Finocchi, editors, Algorithms - ESA 2015 - 23rd Annual European Symposium, Patras, Greece, September 14-16, 2015, Proceedings, volume 9294 of Lecture Notes in Computer Science, pages 754–765. Springer, 2015. doi:10.1007/978-3-662-48350-3_63.
- [30] Bart M. P. Jansen and Jesper Nederlof. Computing the chromatic number using graph decompositions via matrix rank. Theor. Comput. Sci., 795:520–539, 2019. doi:10.1016/j.tcs.2019.08.006.
- [31] Ioannis Katsikarelis, Michael Lampis, and Vangelis Th. Paschos. Structural parameters, tight bounds, and approximation for (k, r)-center. Discret. Appl. Math., 264:90–117, 2019. doi:10.1016/j.dam.2018.11.002.
- [32] Michael Lampis. Finer tight bounds for coloring on clique-width. SIAM J. Discret. Math., 34(3):1538–1558, 2020. doi:10.1137/19M1280326.
- [33] Daniel Lokshtanov, Dániel Marx, and Saket Saurabh. Known algorithms on graphs of bounded treewidth are probably optimal. CoRR, abs/1007.5450, 2010. arXiv:1007.5450.
- [34] Daniel Lokshtanov, Dániel Marx, and Saket Saurabh. Known algorithms on graphs of bounded treewidth are probably optimal. ACM Trans. Algorithms, 14(2):13:1–13:30, 2018. doi:10.1145/3170442.
- [35] Ketan Mulmuley, Umesh V. Vazirani, and Vijay V. Vazirani. Matching is as easy as matrix inversion. Comb., 7(1):105–113, 1987. doi:10.1007/BF02579206.
- [36] Bas A. M. van Geffen, Bart M. P. Jansen, Arnoud A. W. M. de Kroon, and Rolf Morel. Lower bounds for dynamic programming on planar graphs of bounded cutwidth. J. Graph Algorithms Appl., 24(3):461–482, 2020. doi:10.7155/jgaa.00542.
- [37] Johan M. M. van Rooij, Hans L. Bodlaender, and Peter Rossmanith. Dynamic programming on tree decompositions using generalised fast subset convolution. In Amos Fiat and Peter Sanders, editors, Algorithms - ESA 2009, 17th Annual European Symposium, Copenhagen, Denmark, September 7-9, 2009. Proceedings, volume 5757 of Lecture Notes in Computer Science, pages 566–577. Springer, 2009. doi:10.1007/978-3-642-04128-0_51.
