Abstract 1 Introduction 2 Definitions and preparatory lemmas 3 String graphs of odd girth larger than 𝟏𝟏 are 𝟖-colorable 4 Approximation algorithm for Vertex Cover References

An 11/6-Approximation Algorithm for Vertex Cover on String Graphs

Édouard Bonnet ORCID Univ Lyon, CNRS, ENS de Lyon, Université Claude Bernard Lyon 1, LIP UMR5668, France Paweł Rzążewski ORCID Warsaw University of Technology, Poland
University of Warsaw, Poland
Abstract

We present a 1.8334-approximation algorithm for Vertex Cover on string graphs given with a representation, which takes polynomial time in the size of the representation; the exact approximation factor is 116. Recently, the barrier of 2 was broken by Lokshtanov, Panolan, Saurabh, Xue, and Zehavi [SoGC ’24] with a 1.9999-approximation algorithm. Thus we increase by three orders of magnitude the distance of the approximation ratio to the trivial bound of 2. Our algorithm is very simple. The intricacies reside in its analysis, where we mainly establish that string graphs without odd cycles of length at most 11 are 8-colorable. Previously, Chudnovsky, Scott, and Seymour [JCTB ’21] showed that string graphs without odd cycles of length at most 7 are 80-colorable, and string graphs without odd cycles of length at most 5 have bounded chromatic number.

Keywords and phrases:
Approximation algorithms, string graphs, Vertex Cover, Coloring, odd girth
Funding:
Édouard Bonnet: supported by the French National Research Agency through the project TWIN-WIDTH with reference number ANR-21-CE48-0014.
Paweł Rzążewski: funded in part by National Science Centre SONATA BIS grant number
2024/54/E/ST6/00094.
Copyright and License:
[Uncaptioned image] © Édouard Bonnet and Paweł Rzążewski; licensed under Creative Commons License CC-BY 4.0
2012 ACM Subject Classification:
Theory of computation Approximation algorithms analysis
Related Version:
Full Version: https://arxiv.org/abs/2409.18820 [4]
Funding:
For the purpose of Open Access, the author has applied a CC-BY public copyright licence to any Author Accepted Manuscript (AAM) version arising from this submission.
Editors:
Oswin Aichholzer and Haitao Wang

1 Introduction

The Vertex Cover problem111As an optimization problem, given a graph G, find a smallest possible vertex subset S such that GS (i.e., the subgraph of G induced by all the vertices not in S) is edgeless. is one of Karp’s 21 NP-complete problems [12]. While it admits several easy polynomial-time 2-approximation algorithms, it is an open question if an approximation factor of 2ε can be achieved for some ε>0. If the unique games conjecture (UGC) holds, then the answer to the latter question is negative [15]. Under the sole 𝖯𝖭𝖯 assumption, it is currently only known that Vertex Cover cannot be approximated within ratio less than 2 [13, 9, 14].

On several graphs classes, better approximation algorithms of Vertex Cover exist. For instance, this problem can be exactly solved in polynomial time on bipartite graphs, as it reduces to a maximum flow problem [16]. It also admits a polynomial-time approximation scheme (PTAS) on planar graphs by Baker’s technique [2]. On the contrary, a (2ε)-approximation algorithm for Vertex Cover with ε(0,1/2] on triangle-free graphs would imply the same holding for general graphs. Indeed, one can observe that any vertex cover intersects each triangle on at least two vertices. One can thus repeatedly include all three vertices of a triangle in the approximate solution. Once the input graph has no triangle left, one calls the (2ε)-approximation algorithm. This strategy can be seen to achieve approximation factor max(3/2,2ε) on general graphs. In particular, the former algorithm on triangle-free graphs would refute the UGC. Thus we say that (2ε)-approximating Vertex Cover on triangle-free graphs is UGC-hard. It would be interesting to establish a dichotomy splitting the hereditary (i.e., closed under vertex removals) graph classes on which we know a (2ε)-approximation algorithm from those on which this task is UGC-hard.

The class of string graphs consists of every intersection graph of connected sets in a planar graph. Alternatively string graphs are the intersection graphs of curves in the planes, called strings. The set of strings is called a geometric representation or string representation. The geometric representation may be convenient for drawing and in the proofs, but it is not so easy to handle as input. For that, we will also adopt a more combinatorial approach, based on the first definition of string graphs. A representation of a string graph G is a planar graph P and as many connected vertex subsets of P as G has vertices; two vertices are joined by an edge if and only if their corresponding subsets intersect. A caveat is that finding a (geometric) representation is NP-hard [19]. Furthermore, some string graphs G require representations whose size (i.e., the number of vertices of P or the number of crossings in a geometric representation) is exponential in |V(G)| [20].

Recently Lokshtanov, Panolan, Saurabh, Xue, and Zehavi [23] presented a (2ε)-approximation algorithm for Vertex Cover on string graphs, for some small positive ε104. In this paper, we give a simpler algorithm with improved approximation factor 11/6=1.8333

Theorem 1.

Vertex Cover admits an 116-approximation algorithm in string graphs given with a representation, whose running time is polynomial in the size of a representation.

Our algorithm works as follows. We first get rid of all the odd cycles of length at most 11. This is done as in the abovementioned reduction to triangle-free graphs. While there is an odd cycle C with |V(C)|11, any vertex cover contains at least 6 vertices of C. We thus include all vertices of C in our approximate solution, and remove them from the graph. This ensures an 11/6 approximation factor, if such a factor can be achieved in the resulting graph. We can thus assume that our input graph has odd girth222The (odd) girth of a graph G is the length of a shortest (odd) cycle of G. more than 11. As the standard linear-programming (LP) formulation of Vertex Cover is half-integral [26], we can further assume that the vertex cover contains at least half of the vertices (see Theorem 14). Note that these opening steps are also present in [23].

We now deviate from the previous algorithm, and simply bound the chromatic number of string graphs of odd girth larger than 11.

Theorem 2.

Every string graph of odd girth larger than 11 is 8-colorable, and given a representation, an 8-coloring can be found in time polynomial in the representation.

A largest color class in a proper 8-coloring has size at least n/8 in an n-vertex graph. Thus, its complement is a vertex cover of size at most 7n/8. As the optimum solution has size at least n/2, this yields a 7/4=1.75 factor, and we conclude. Therefore, the main technical content of the paper is the proof of Theorem 2.

Outline of the proof of Theorem 2.

Our strategy goes as follows. We make a breadth-first search (BFS) from some arbitrary vertex u0 in each connected component of the graph G. We reserve colors 1,2,3,4 for the even-indexed BFS layer, and 5,6,7,8 for the odd-indexed BFS layer. We thus need to 4-color each connected component X of each subgraph induced by a single layer. Let R be a string representation of the entire graph, and R[X] its restriction to X. As any face in the arrangement R[X] can be made the infinite face, we can assume that the string of u0 lies on the infinite face of R[X]. Now R[X] has a nice property: The minimal topological disk D that contains it is such that each string of R[X] intersects at least one string (in RR[X]) that itself crosses D, the boundary of D. Indeed, such a string is given by a neighbor in the previous layer.

We now consider the string representation RH made by X and at least one neighbor of the previous layer for each vertex of X, and intersect it by D. Each string of X remains in one piece, whereas the strings of the previous layer are possibly split into several strings in D. Let H be the intersection graph of RH. We further layer X with the distance in H to a fixed vertex wV(H)V(X). Let Xk be the vertices of H at distance exactly k from w in H.

We are left with proving that H[Xk] is bipartite. For the sake of contradiction, we assume that there is an odd cycle C in H[Xk]. Our goal is to show that this odd cycle is contained in a ball of small radius around some vertex, yielding a contradiction in the form of a short odd cycle. For that, we establish that there is another odd cycle C (built from C) such that the string of w is contained in a face F made by few strings of NH[C], with most of the rest of C not intersecting F. A string defining F is then at a small distance of every string of C since a (shortest) path from w to a vertex of the rest of C has to cross the boundary of F. After which, the path has only a constant number of steps to reach its target since vertices of NH[C] are at distance k1, k, or k+1 from w. The short odd cycle in H implies the existence of a short odd cycle in G, a contradiction.

Chromatic number of intersection graphs without short (odd) cycles.

The chromatic number of intersection graphs of a given girth has a long history. Erdős and Gyárfás asked if girth-4 (i.e., triangle-free) segment intersection graphs have bounded chromatic number [10]. Kostochka and Nešetřil [17] raised the same question for 1-string graphs.333The 1-strings are strings every pair of which intersects at most once; note that it is not a property of particular objects but rather their arrangement. Both questions were answered in the negative in [27]. It was further shown by Walczak [28] that there are triangle-free segment intersection graphs without independent sets of size linear in the number of vertices. Nevertheless, Kostochka and Nešetřil [18] proved that 1-string graphs of girth at least 8 are 3-colorable, and asked [18, Problem 3] whether 8 could be replaced by 5. This has been confirmed for outer 1-strings by Das, Mukherjee, and Sahoo [7].

More relevant to our Vertex Cover application, the chromatic number of (1-)string graphs of a given odd girth has also been considered. (Note indeed that, in the design of a (2ε)-approximation algorithm for Vertex Cover, whereas one can remove odd cycles up to any fixed size, the same trick does not apply to 4-vertex cycles.) McGuinness established that 1-string graphs of odd girth at least 7 have bounded chromatic number [24, 25]. Chudnovsky, Scott, and Seymour [6, Corollary from 12.4 and 12.5] further showed that string graphs of odd girth at least 9 are 80-colorable, and string graphs of odd girth at least 7 have bounded chromatic number. With proper adjustments, the former result can be turned into an algorithm that inputs a representation of any string graph G of odd girth at least 9, and outputs an 80-coloring of G in time polynomial in the representation. However, this would yield a significantly worse approximation factor for Vertex Cover.

The reason intersection graphs of 1-strings with girth at least 8 are 3-chromatic [18] is that they are 2-degenerate, i.e., (all their induced subgraphs) have a vertex of degree at most 2, which gives a recursive 3-coloring strategy. While string graphs of large odd girth are not d-degenerate for any d (they contain arbitrarily large bipartite complete graphs), we conjecture that they are nevertheless 3-colorable.

Conjecture 3.

There is an integer k such that the class of string graphs of odd girth at least k is 3-chromatic.

A stronger form of Conjecture 3 is that it holds for k=7.

Limits of the method.

The following observation summarizes the trade-off between required odd girth and bound on the chromatic number, in how they impact the approximation ratio.

Observation 4.

Let 𝒞 be a class such that there is a polynomial-time algorithm to c-color the graphs of 𝒞 of odd girth at least an odd positive integer g. Then Vertex Cover admits a polynomial-time 2max(g2g1,c1c)-approximation algorithm in 𝒞.

We apply Observation 4 on represented string graphs with g=13 and c=8. Showing a counterpart of Theorem 2 with g=11 and c10 would improve the ratio to 9/5=1.8, and with g=9 and c8, to 7/4=1.75. Past this point, our 8-coloring starts being the bottleneck. The last possible step of this method would be to get parameters g=7 and c6, for a ratio of 5/3=1.666 Indeed we recall that string graphs of odd girth at least 5 have unbounded chromatic number [27], and do not necessarily have linear-size independent sets [28]. On the complexity side, we do not expect that Vertex Cover is APX-hard (i.e., NP-hard to approximate within some constant ratio r>1) on represented string graphs as a quasipolynomial-time approximation scheme (QPTAS) exists [1, 11]. However, if a PTAS also exists, it will have to be found with a different approach than ours.

Let us emphasize that Theorem 2 is the only place where our algorithm actually requires a representation of the input to be given. Thus, an algorithm coloring string graphs with no short odd cycles, that works directly on the input graph G (not its representation), would yield an approximation algorithm for Vertex Cover in string graphs whose complexity is polynomial in the number of vertices of G. However, if we are only interested in the decision variant of the problem, the existential statement in Theorem 2 is sufficient to get the following result.

Theorem 5.

Given a graph G and an integer k, in time polynomial in |V(G)| we can distinguish the following cases:

  1. 1.

    G is a string graph and Vertex Cover(G)k, and

  2. 2.

    G is a string graph and Vertex Cover(G)>11k/6.

If none of the cases applies, the algorithm terminates but its output is unspecified.

Let us remark that the authors of [23] claim that their algorithm can actually return a 1.9999-approximate solution in time polynomial in the number of vertices of the input graph but this is, unfortunately, not true. Indeed, one of the crucial steps in their argument is the application of a result of Lee [22, Theorem 4.2], whose proof has recently been realized to be flawed444The inequality at the sixth line of the proof of Lemma 4.14 need not hold. [21, 3]. Seemingly the result [22, Theorem 4.2] can be reproven in a different way but only for region intersection graphs (a generalization of string graphs) [8], and the new approach requires that the representation is given (and still the final approximation ratio is much closer to 2 than the one given by Theorem 1). Similarly, it is claimed that the approach in [23] applies for all proper induced-minor-closed classes, but the problem lies in the same place: we are not aware of any way to fix the proof of Lee [22, Theorem 4.2] in such a general setting.

We leave as open questions if Vertex Cover admits, for some constant ε>0, a (2ε)-approximation algorithm on string graphs given without representation (which is very likely to be the case), and on classes excluding a fixed graph H as an induced minor.555i.e, H cannot be obtained from graphs of the class by removing vertices and contracting edges

2 Definitions and preparatory lemmas

If ij are two non-negative integers, we denote by [i,j] the set {i,i+1,,j1,j}, and [i] is a short-hand for [1,i].

Graphs and odd cycles.

We denote by V(G) and E(G) the set of vertices and edges of a graph G, respectively. A graph H is an induced subgraph (resp. subgraph) of a graph G if H can be obtained from G by vertex deletions (resp. by vertex and edge deletions). For SV(G), the subgraph of G induced by S, denoted G[S], is obtained by removing from G all the vertices that are not in S. Then GS is a short-hand for G[V(G)S]. A set XV(G) is connected (in G) if G[X] has a single connected component.

If C is a cycle, once a direction along the cycle is fixed, we may denote by C[xy] the subpath of C from xV(C) to yV(C), turning in this direction.

Lemma 6.

Let G be a graph that has an induced odd cycle C, and let vV(G)V(C) have at least two neighbors in V(C). Then there is a subpath of C that forms with v an induced odd cycle of G.

Proof.

Let v1,,vh list the neighbors of v on C, starting at an arbitrary vertex and turning in a fixed (arbitrary) direction. The cycles C1,,Ch, each with vertex set {v}C[vivi(modh)+1] for i[h], respectively, have a combined number of edges of |V(C)|+2h; an odd number since |V(C)| is odd. Hence at least one of C1,,Ch has odd length.

Let G be a graph and vV(G). We denote by NG(v) and NG[v], the open, respectively closed, neighborhood of v in G. For a non-negative integer r, we denote by NGr(v) (resp. NG=r(v)) the set of vertices of G at distance at most r (resp. exactly r) from v. We may call NGr(v) the ball of radius r around v (in G). We will rely on the observation that if a possibly long odd cycle is contained in a ball of small radius, then there is also a short odd cycle (in this ball).

Lemma 7.

Let G be a graph, vV(G), and r be a positive integer. Suppose G[NGr(v)] contains an odd cycle C. Then G (even G[NGr(v)]) has an odd cycle of length at most 2r+1. Furthermore, if NG=r(v)V(C) is an independent set of G, then G (even G[NGr1(v)]) has an odd cycle of length at most 2r1.

Proof.

Perform a breadth-first search (BFS) starting at v in G[NGr(v)]. By definition, this breadth-first search has exactly r+1 layers L0,L1,,Lr, with Li:=NG=i(v) for each i[0,r]. At least one Li is not an independent set, otherwise G[NGr(v)] would be bipartite, and would not contain any odd cycle. Say x,yLi are adjacent, and let zLi be their lowest common ancestor in the BFS tree (possibly z=v and i=0). Then, there is a cycle of length 2(ii)+12r+1 in G[NGr(v)], consisting of the edge xy and the paths from z to x and from z to y in the BFS tree. Furthermore, if NG=r(v)V(C) is an independent set of G, then the odd cycle C has at least one of its edges with both endpoints in some LiLr, which makes an odd cycle of length at most 2r1.

String representations, and some useful notions and properties.

string is a subset of 2 homeomorphic to a closed segment. We may call a path-connected subset of a string ssubstring of s. If R is a collection of strings in the plane, we denote their intersection graph by GR, with one vertex per string, and an edge between every pair of intersecting strings. For any vertex vV(GR), we denote by sR(v) the string representing v in R. Conversely, we may denote by vR(s) the vertex represented by the string s. We typically drop the subscript when R is clear from the context. If X is an induced subgraph of GR, we may denote by R[X] the subset of strings of R corresponding to vertices of X.

face of a geometric arrangement made by a collection 𝒮 of strings, or more specifically of curves, in 2 is a connected component of 2s𝒮s. A closed face is the topological closure of a face. If D is a topological disk, and more generally a face, we denote by D its boundary. For instance, the curve D defines two faces, one of which is finite (DD), and admits D as a closed face. We may denote by F¯ the closure of an (open) face F.

If P is an induced path represented by strings s1,,sh (i.e., si and sj intersect whenever |ij|=1), as1 and bsh, we denote by s[a,P,b] a minimal path-connected subset of i[h]si containing a and b. In particular s[a,P,b] is a string with endpoints a and b, coinciding with each si on a substring of positive length. Although s[a,P,b] is not necessarily uniquely defined, every further claim will hold regardless of the actual choice for s[a,P,b]. It can thus be thought as a short-hand for: fix any minimal path-connected subset of i[h]si containing a and b, and denote it s[a,P,b].

Figure 1: Left: 5-vertex path as the intersection graph of strings, where the first string of the path contains point a, and the last string of the path contains point b. Right: illustration of s[a,P,b].

It will be convenient to extend the latter notation to induced cycles, that is, to allow s1 and sh to be adjacent. If P may be a cycle, then s[a,P,b] is defined more specifically in the case when s1 and sh indeed intersect. If there is a point cs1sh and two substrings s1s1,shsh with endpoints a,c and b,c, respectively, such that (s1sh)2ih1si=, then s[a,P,b]:=s1sh. If not, then there is a (non-self-intersecting) string that starts at a, follows non-empty substrings of s1,,sh in this order, and ends at b. We fix, as before, s[a,P,b] to be any such string.

We will need the following lemma.

Lemma 8.

Let D be a topological disk in the plane. Let VW be a set of strings all contained in D, whose intersection graph is a cycle C, such that

  • |W|2,

  • every string of W has one or both endpoints in D,

  • no other point of a string of VW intersects D, and

  • no two strings of W (hence of VW) intersect at a point of D.

Let sp be a string contained in D with at least one endpoint p in D, such that sp does not intersect any string of VW.

Then there are two strings ssW, and h strings s1,,shV with v(s),v(s1),, v(sh),v(s) a subpath of C (or C itself), and sp is contained in a closed face F¯ of the arrangement {D,s,s1,,sh,s} such that F¯ does not intersect any string of VW outside of s,s1,,sh,s, the two strings of VW intersecting s (including s1), and the two string of VW intersecting s (including sh).

Proof.

Let t,t be two distinct strings of W, with qtD and qtD. Let P1,P2 be the two paths from v(t) to v(t) in C. We define the strings t1:=s[q,P1,q] and t2:=s[q,P2,q]. We denote by q,q and q,q the two minimal path-connected subsets of D containing q and q. Let F1,F1 be the two finite (open) faces with boundary q,qt1 and q,qt1, respectively. Similarly let F2,F2 be the two finite faces with boundary q,qt2 and q,qt2, respectively.

Figure 2: Illustration of the proof of Lemma 8. The strings of W are in blue. Say, we initially pick ttW as depicted. The v(t)v(t) subpath of C enclosing sp contains a string in W, namely t′′. As qpq′′ turns as qpq along D, t will be set to t′′, and the new P (green strings) is entirely in V, so the process stops. The strings s:=t,s:=t′′, the green strings, and D form a closed face containing sp that, among the strings of the rest of C, only the two purple strings (may) intersect.

As C is an induced cycle, at most one of F1¯,F1¯ intersects the string of some vertex in V(C)NC[P1]. Without loss of generality, suppose that F1¯ does not intersect any string of V(C)NC[P1]. As sp does not intersect t1, it is contained in the closed face F1¯ or in the closed face F1¯. If sp is contained in F1¯, we set P:=P1{v(t),v(t)} and F:=F1, and proceed to the next paragraph. Otherwise, we observe that sp is contained in F2¯, and F2¯ does not intersect any string of V(C)NC[P2]. In which case we set P:=P2{v(t),v(t)} and F:=F2.

At this point, we have all the requirements of the lemma except that some strings of P may be in W. Let P be v1,,v with v1 a neighbor of v(t), and v a neighbor of v(t). While there is some string t′′W with v(t′′)=viV(P), we let q′′t′′D and distinguish two cases. If q,p,q′′ turn along D in the same direction as q,p,q, we set t:=t′′ and P:=v1,,vi1. Otherwise we set t:=t′′ and P:=vi+1,,v. (Note that |V(P)| decreases by at least one in each case.) When we exit the while loop, s:=t,s:=tW, and the subpath P of C (now only made of strings of V) satisfy the lemma statement. A visual recap is provided by Figure 2.

3 String graphs of odd girth larger than 𝟏𝟏 are 𝟖-colorable

This section is devoted to the proof of Theorem 2. Let G be a string graph of odd girth strictly greater than 11, and R be a string representation of G. We may color each connected component of G independently, thus assume that G is connected. Let u0V(G) be an arbitrary vertex. For each i0, let Li:=NG=i(u0)V(G) be the set of vertices at distance exactly i from u0. Note that {u0}=L0,L1,L2, partition V(G), and that there may be an edge between xLi and yLj only if |ij|1.

Our goal is to 4-color G[Li] for each non-negative integer i. We are then done, since we can use two disjoint 4-color palettes for G[iis oddLi] and G[iis evenLi]. We fix i, and assume that i2 since L0 is a singleton, and L1 is an independent set, due to the condition on the odd girth. Note that it is sufficient to 4-color each connected component of G[Li]. Let X be the vertex set of any connected component of G[Li].

Definition of the topological disk 𝑫.

Let R[X] be the string representation of G[X] obtained by keeping the strings of R corresponding to vertices of X. Let D0 be the topological disk whose boundary is the boundary of the infinite face in the arrangement R[X]. If s(u0) is contained in D0, we draw R on a sphere, place any arbitrary free point (i.e., not occupied by a string of R) of the face containing s(u0) in R[X] at its North pole, and consider the stereographic projection of this string representation. The latter is such that s(u0) is on the infinite face of the string arrangement R[X]. Thus we can in fact assume that s(u0) is not contained in D0.

Note that every string of R[X] is contained in D0, and apart from those of Li1Li+1X no string of R intersects D0. Let DD0 be a very slightly augmented topological disk such that D0DD, the property that no string outside Li1Li+1X intersects D is preserved, no intersection of two strings of R lies on D, and, for any point qD, no string of R intersects D at q without crossing it at q. We observe that every string of Li1 with a neighbor in V(X) intersects D, and that every vertex of X has a neighbor in Li1.

Definition of the auxiliary graph 𝑯.

Let RH be the set of strings formed by DR[(Li1NG(V(X)))V(X)], and H its intersection graph. Every vertex of X corresponds to a single vertex in H, whereas each vertex of Li1NG(V(X)) may split in several strings in H as the corresponding string in R may enter and exit D several times. However, the strings of RH that correspond to the same vertex in Li1NG(V(X)) are pairwise non-intersecting and thus they form an independent set in H. Fix an arbitrary vertex wLi1NG(V(X)). Let Xj be the subset of vertices of X at distance exactly j from w in H. Again, X1,X2, partition X. By the previous remarks, it is sufficient to show that that for any positive integer k, the graph G[Xk]=H[Xk] is bipartite. We fix k, and show this fact in the next section.

3.1 𝑮[𝑿𝒌] is bipartite

For the sake of contradiction assume that there is an induced odd cycle C in G[Xk]. We denote by v1,,vh the vertices of C. For every [h], we fix some wNH(v)X. Such a vertex exists since every vertex of Li has at least one neighbor in Li1 (possibly split into several strings in RH, at least one of which intersects s(v)). Observe that w1,,wh are not necessarily pairwise distinct. We set W:={w1,,wh}, with 1|W|h. We say that string s(vi) (and, by extension, vertex vi) is simply attached in a cycle C containing vi if wi has only one neighbor in V(C), namely vi. We denote by r(wi) one (arbitrary, if both endpoints are in D) endpoint of wi that is in D.

Our plan is to show that there is an odd cycle C contained in the ball of radius 6 around some vertex z in H such that NH=6(z)V(C) is an independent set. This implies, by Lemma 7, the existence of an odd cycle of length at most 11 in H, which in turn implies, as we will see, that the same happens in G. To do this, we exhibit a path P^ in H[V(C)W] on at most four vertices, whose strings define with D a (finite) closed face F containing s(w) (recall that w is the arbitrary BFS root in H), and an odd cycle C in H[V(C)W] “mostly” contained in DF. We then show that z can be chosen among the vertices of P^.

Let us first establish the following lemma.

Lemma 9.

There is an induced odd cycle C in H such that V(C)V(C)W and one of the following items holds

  • C=C and every string of V(C) is simply attached in C, or

  • V(C)W={wi}, for some wiW, and V(C){wi} induces a subpath of C whose internal vertices are all simply attached in C, or

  • |V(C)W|2, and there is a subpath x,vi,,vj,y of C (or C itself) such that xy, {x,y}W, {vi,,vj}V(C) may be empty, s(vi+1),,s(vj1) are all simply attached in C, and s(w) is contained in a closed face of {D,s(x),s(vi),s(vi+1),,s(vj1), s(vj),s(y)} that does not intersect any string of C but s(x),s(vi),s(vi+1),,s(vj1), s(vj),s(y) and the other neighbor in C of x and of y.

Proof.

If all the strings of C are simply attached, we are done as the first item holds. Otherwise there is some wiW with at least two neighbors in V(C). By Lemma 6, there is an induced odd cycle C1 such that V(C1) comprises wi and a subpath of C.

Let p1, and suppose we have defined Cp. If Cp satisfies the second or the third item of the lemma, we are done; so we assume otherwise. We will show that we can find an induced cycle Cp+1 in H[V(C)W] with fewer vertices in V(C) than Cp has.

First, suppose that V(Cp)W is a singleton {x}. As Cp does not satisfy the second outcome, there is a vertex viV(Cp){x}V(C) that is not simply attached nor adjacent to x in Cp. We then obtain Cp+1 by applying Lemma 6 to the pair Cp,wi.

Suppose instead that Cp has at least two vertices in W. By Lemma 8, s(w) is contained in a closed face F of the arrangement {D,s(x),s(vi),s(vi+1),, s(vj1),s(vj),s(y)} with xy and {x,y}W such that F does not intersect any string of C outside s(x),s(x),s(vi),s(vi+1),, s(vj1), s(vj),s(y),s(y) where x,y are the unique vertices in NC(x){vi},NC(y){vj}, respectively. As Cp does not satisfy the third item, |ji|2 and there is some vi with i+1ij1 such that s(vi) is not simply attached in Cp. We then obtain Cp+1 by applying Lemma 6 to the pair Cp,wi; see Figure 3.

In both cases, the number of vertices of V(C) present in the current odd cycle drops by at least one when going from Cp to Cp+1. Thus, after at most h=|V(C)| iterations, the procedure will return an odd cycle Cq in H[V(C)W] that satisfies the second or the third statement of the lemma.

Figure 3: The string wiW with several neighbors in V(Cp), and the new odd induced cycle Cp+1 obtained by Lemma 6. Lemma 8 will then locate s(w) as enclosed by x,wi and some (here, three) strings of V. For legibility, a string may be labeled by its corresponding vertex.

We can then obtain the announced milestone.

Lemma 10.

There is an induced odd cycle C in H[V(C)W], two distinct vertices x,y in W, possibly part of C, and a subpath P of C on 0, 1, or 2 vertices, all in V(C), such that

  • if V(P), then x is adjacent to one endpoint of P, and y, to the other (possibly equal) endpoint of P, and if V(P)=, then x and y are adjacent,

  • one finite closed face F1 of the arrangement {D,s[r(x),xPy,r(y)]} contains s(w), and

  • the other finite closed face F2 contains every string of V(C)NC[V(P){x,y}].

Proof.

By Lemma 9, there is an induced odd cycle C in H[V(C)W], two vertices xyW, possibly part of C, and a possibly-empty subpath P of C such that

  • V(P)V(C) and each internal vertex of P is simply attached in C,

  • xPy is a path or a cycle (we allow x and y to be adjacent, even when V(P)),

  • {D,s[r(x),xPy,r(y)]} has two finite closed faces F1s(w), and

  • F2 containing every string of V(C)NC[V(P){x,y}].

Indeed, we directly get this outcome if the third item of Lemma 9 holds. If instead the first item of Lemma 9 holds, we set x:=wi and y:=wj for any vivjV(C)=V(C) such that wiwjE(H). This is possible to ensure since H is triangle-free and |V(C)|3. The path P is then chosen as the subpath of C from vi to vj such that Ds(x)uV(P)s(u)s(y) separates s(w) from V(C)NC[P]. If finally the second item of Lemma 9 holds, we set x:=wi for any viV(C) such that wiV(C), and y is defined as the only vertex of WV(C). We then set P as previously.

Now, while the path P has at least three vertices and s[r(x),xPy,r(y)] passes through the strings of P, let vi be an internal vertex of P. By construction, s(vi) is simply attached in C. Let P be the subpath of P going from the endpoint of P neighboring x to vi, and P′′ be the subpath of P going from vi to its other endpoint (neighboring y). Then, either P and the pair xwiW or P′′ and the pair wiy satisfy the four items of the previous paragraph. In the former case, we set P:=P and y:=wi, whereas in the latter, we set P:=P′′ and x:=wi; see Figure 4.

Figure 4: Example of strings xyW and the subpath P (three green strings) of C. The internal node of P is simply attached to C. This is the case when x should be updated (to the vertex of the cyan string). The new path P (darker green) has two vertices, thus the process stops.

We are done when |V(P)|2 or when s[r(x),xPy,r(y)] is contained in s(x)s(y). In both cases, we set x:=x and y:=y, and in the latter case, we set P to be empty.

Following the notations of Lemma 10, we define z as the neighbor of x in P if V(P), and as x otherwise.

Lemma 11.

V(C)NH6(z).

Proof.

We keep the notations of Lemma 10. First we observe that any vertex in NC[V(P){x,y}] is at distance at most 3 of z in H. It remains to see that every vertex z of C such that s(z) is contained in F2 is at distance at most 6 from z in H.

Note that every vertex of W is at distance k1, k, or k+1 from w in H, since every vertex of C is at distance exactly k from w in H, and every vertex of W has a neighbor in V(C). As s(w) is in F1, a (shortest) path from w to z has to contain a vertex y such that s(y) intersects s[r(x),xPy,r(y)]. Since vertices of V(P){x,y} are at distance at least k1 from w, vertex y is at distance at least k2 from w. Now, as vertices of C are at distance at most k+1 from w, vertex y is at distance at most 3 from z. In turn, z is at distance at most 3 from y, and we conclude.

We further show that no edge of C can lie within NH=6(z).

Lemma 12.

NH=6(z)V(C) is an independent set of H or H admits an odd cycle of length at most 9.

Proof.

The proof of Lemma 11 shows that for some vertex zV(C) to be at distance 6 from z in H, there should be a 4-edge path from y to z. Thus two vertices z,z′′ adjacent in C and at distance 6 from z in H entail an odd cycle of length at most 9 in H, built from y, the corresponding two (non-necessarily edge-disjoint) 4-edge paths, z, and z′′.

Lemmas 11, 12, and 7 imply the existence of an odd cycle Co of length at most 11 in H. To reach the final contradiction, we build an odd cycle of at most the same length in G.

Lemma 13.

Let Co be an odd cycle in H. Then G has an odd cycle of length at most |V(Co)|.

Proof.

We initialize a represented odd cycle C^ to Co realized by the corresponding strings of RH. We make an induction on the number of strings of C^ not part of the representation R of G. When this number is 0, we conclude since G contains the cycle C^ as a subgraph. Let y1,,yqV(C^) be all the vertices of C^ whose strings are substrings of s(y) for some yLi1NG(V(X)).

We further assume that starting at y1V(C^), and turning in some fixed (arbitrary) direction along C^, one encounters y1,y2,,yq in this order. Let, for every a[q], a be the number of edges of C^[yaya(modq)+1]. As, |V(C^)|=a[q]a, at least one a is odd. Recall that {y1,,yq} is an independent set in H, hence every a is strictly greater than 1. Then the string s(y) and those of the internal vertices in the path between ya and ya(modq)+1 form an odd cycle of length at least 3 and at most |V(C^)|. This defines the new represented odd cycle C^, and concludes the proof.

3.2 8-coloring algorithm

To claim Theorem 2, we finally need to check that, given a representation P,𝒮 of a string graph G of odd girth larger than 11, one can compute an 8-coloring of G in time polynomial in |V(P)|. Recall that P is a planar graph and 𝒮 is a set of non-empty connected sets in P in one-to-one correspondence with V(G) such that two distinct connected sets of 𝒮 intersect if and only if the corresponding vertices of G are adjacent. Note that, as G is in particular triangle-free, |V(G)|2|V(P)|.

We remind the reader that we compute one BFS in G, and fewer than |V(G)| BFSes in auxiliary graphs H of size at most (|V(P)|+1)|V(G)|. After this we simply 2-color bipartite graphs whose combined number of vertices is at most |V(G)|. Thus, we shall just detail how to compute each auxiliary graph H.

Let X be the component of G[Li] giving rise to H. We start by adding every vertex of X to V(H). We compute the set YV(P) of all the vertices contained in an element of 𝒮 corresponding to a vertex of X. Let uV(P) be a vertex in the connected set of u0. Let YY be the vertices vY for which there is a path in P from u to v whose internal vertices are all in V(P)Y. Note that Y can be computed in polynomial time in |V(P)| by checking if u and y are in the same connected component of P(Y{v}). The set Y is the combinatorial counterpart of D. In particular, we do not need to change the representation if u is in a finite face “made by Y.” This was merely helpful in the correctness proof.

For each vertex vLi1NG(V(X)) (alternatively we can only keep at least one neighbor per vertex of X), we add to V(H) one vertex for each connected component of SY in P, where S is the connected set of v. Note that SY for every vertex set S of such a connected component. This step adds to H fewer than |V(G)||V(P)| vertices. Graph H is finally defined as the intersection graph of all the connected sets of vertices in V(H), which can be computed in time polynomial in |V(P)|.

4 Approximation algorithm for Vertex Cover

For a graph G, let 𝗏𝖼(G) denote the size of a minimum vertex cover in G. We will use the following result of Chlebík and Chlebíková [5], which is a slight strengthening of the well-known Nemhauser–Trotter theorem [26].

Theorem 14 (Chlebík and Chlebíková [5], Nemhauser and Trotter [26]).

Given a graph G, one can compute in polynomial time a partition of V(G) into V0,V1/2,V1, such that

  1. 1.

    there are no edges between V0 and V1/2 or within V0,

  2. 2.

    𝗏𝖼(G[V1/2])12|V1/2|, and

  3. 3.

    every minimum vertex cover S of G satisfies V1SV1V1/2.

Finally, we are ready to prove Theorem 1, which we restate for convenience.

See 1

Proof.

Let G be the input graph, given along with a representation. The algorithm has three phases.

Phase one.

We initialize X=. If G contains an odd cycle of length at most 11, we include all its vertices into X and remove them from the graph. We repeat this step exhaustively; clearly this can be done in polynomial time. Let G be the graph obtained after the last iteration of the process.

Phase two.

We call the algorithm from Theorem 14 on the graph G in order to obtain three sets V0,V1/2, and V1. We denote Y=V1 and G′′=G[V1/2]. Note that G′′ is a string graph with odd girth larger than 11 and the representation of G′′ can be easily obtained from the representation of G by removing objects corresponding to vertices in V(G)V(G′′).

Phase three.

We apply the algorithm from Theorem 2 to find a proper coloring of G′′ with at most 8 colors. Let c be the color that appears most frequently, and let Z be set of vertices of G′′ not colored c. Clearly |Z|78|V(G′′)|, so |V(G′′)|87|Z|. The algorithm returns Q=XYZ.

Analysis.

First, let us argue that Q is indeed a vertex cover. Since XQ, it is enough to show that QV(G)=YZ is a vertex cover of G. Note that by the first property in Theorem 14 and since V1=Y, all edges of G not contained in G′′ are covered by Q. So we are left with showing that QV(G′′)=Z is a vertex cover of G′′. This is clearly true, as the complement of Z in G′′ is an independent set (of color c).

Now let us analyze the approximation factor. Let S be an optimum solution, i.e., a vertex cover of G of size 𝗏𝖼(G). Note that for each odd cycle C removed in the first phase, SC must contain at least |C|2 vertices in order to cover all the edges of C. As each removed cycle has at most 11 vertices, we conclude that |SX|611|X|.

Note that SX is a vertex cover of G. By the third property in Theorem 14 we observe that Y=V1S, and so S(XY) is a vertex cover of G′′. By the second property in Theorem 14 we obtain that |S(XY)|12|V(G′′)|.

Summing up, we obtain

|S|= |SX|+|SY|+|S(XY)|611|X|+|Y|+12|V(G′′)|
611|X|+|Y|+47|Z|611|XYZ|=611|Q|,

which means that |Q|116|S|=116𝗏𝖼(G). This completes the proof.

The proof above is easily adapted to show Theorem 5. The first two phases remain unchanged. Let 𝒞 be the family of odd cycles found in phase one and let Y be the set found in phase two. Once we reach phase three, we do not call the coloring algorithm on G′′, as its running time might not be polynomial in |V(G)|. Instead, we check if C𝒞|V(C)|/2+|Y|+|V(G′′)|/2>k and, if so, we reject the instance. Note that the sum in the expression above is a lower bound on 𝗏𝖼(G), so this step is correct. Otherwise, we accept the instance. The approximation guarantee is estimated as in the proof of Theorem 1.

References

  • [1] Anna Adamaszek, Sariel Har-Peled, and Andreas Wiese. Approximation schemes for independent set and sparse subsets of polygons. J. ACM, 66(4):29:1–29:40, 2019. doi:10.1145/3326122.
  • [2] Brenda S. Baker. Approximation algorithms for NP-complete problems on planar graphs. J. ACM, 41(1):153–180, 1994. doi:10.1145/174644.174650.
  • [3] Édouard Bonnet, Jędrzej Hodor, Tuukka Korhonen, and Tomáš Masařík. Treewidth is polynomial in maximum degree on weakly sparse graphs excluding a planar induced minor. CoRR, abs/2312.07962, 2023. doi:10.48550/arXiv.2312.07962.
  • [4] Édouard Bonnet and Paweł Rzążewski. An 11/6-approximation algorithm for vertex cover on string graphs. CoRR, abs/2409.18820, 2024. doi:10.48550/arXiv.2409.18820.
  • [5] Miroslav Chlebík and Janka Chlebíková. Improvement of Nemhauser-Trotter theorem and its applications in parametrized complexity. In Torben Hagerup and Jyrki Katajainen, editors, Algorithm Theory - SWAT 2004, 9th Scandinavian Workshop on Algorithm Theory, Humlebaek, Denmark, July 8-10, 2004, Proceedings, volume 3111 of Lecture Notes in Computer Science, pages 174–186. Springer, 2004. doi:10.1007/978-3-540-27810-8_16.
  • [6] Maria Chudnovsky, Alex Scott, and Paul D. Seymour. Induced subgraphs of graphs with large chromatic number. V. Chandeliers and strings. J. Comb. Theory B, 150:195–243, 2021. doi:10.1016/J.JCTB.2021.05.001.
  • [7] Sandip Das, Joydeep Mukherjee, and Uma kant Sahoo. Outer 1-string graphs of girth at least five are 3-colorable. In Extended Abstracts EuroComb 2021: European Conference on Combinatorics, Graph Theory and Applications, pages 593–598. Springer, 2021.
  • [8] James Davies. private communication.
  • [9] Irit Dinur, Subhash Khot, Guy Kindler, Dor Minzer, and Muli Safra. Towards a proof of the 2-to-1 games conjecture? In Ilias Diakonikolas, David Kempe, and Monika Henzinger, editors, Proceedings of the 50th Annual ACM SIGACT Symposium on Theory of Computing, STOC 2018, Los Angeles, CA, USA, June 25-29, 2018, pages 376–389. ACM, 2018. doi:10.1145/3188745.3188804.
  • [10] András Gyárfás. Problems from the world surrounding perfect graphs. Applicationes Mathematicae, 19(3-4):413–441, 1987.
  • [11] Sariel Har-Peled. Approximately: Independence implies vertex cover. CoRR, abs/2308.00840, 2023. doi:10.48550/arXiv.2308.00840.
  • [12] Richard M. Karp. Reducibility among combinatorial problems. In Raymond E. Miller and James W. Thatcher, editors, Proceedings of a symposium on the Complexity of Computer Computations, held March 20-22, 1972, at the IBM Thomas J. Watson Research Center, Yorktown Heights, New York, USA, The IBM Research Symposia Series, pages 85–103. Plenum Press, New York, 1972. doi:10.1007/978-1-4684-2001-2_9.
  • [13] Subhash Khot, Dor Minzer, and Muli Safra. On independent sets, 2-to-2 games, and grassmann graphs. In Hamed Hatami, Pierre McKenzie, and Valerie King, editors, Proceedings of the 49th Annual ACM SIGACT Symposium on Theory of Computing, STOC 2017, Montreal, QC, Canada, June 19-23, 2017, pages 576–589. ACM, 2017. doi:10.1145/3055399.3055432.
  • [14] Subhash Khot, Dor Minzer, and Muli Safra. Pseudorandom sets in grassmann graph have near-perfect expansion. In Mikkel Thorup, editor, 59th IEEE Annual Symposium on Foundations of Computer Science, FOCS 2018, Paris, France, October 7-9, 2018, pages 592–601. IEEE Computer Society, 2018. doi:10.1109/FOCS.2018.00062.
  • [15] Subhash Khot and Oded Regev. Vertex cover might be hard to approximate to within 2-epsilon. J. Comput. Syst. Sci., 74(3):335–349, 2008. doi:10.1016/J.JCSS.2007.06.019.
  • [16] Dénes Kőnig. Gráfok és mátrixok. Matematikai és Fizikai Lapok, 38:116–119, 1931.
  • [17] Alexandr Kostochka and Jaroslav Nešetřil. Chromatic number of geometric intersection graphs. In 1995 Prague Midsummer Combinatorial Workshop, volume 95.309, pages 43–45. KAM Series, 1995.
  • [18] Alexandr V. Kostochka and Jaroslav Nešetřil. Coloring relatives of intervals on the plane, I: chromatic number versus girth. Eur. J. Comb., 19(1):103–110, 1998. doi:10.1006/EUJC.1997.0151.
  • [19] Jan Kratochvíl. String graphs. II. recognizing string graphs is np-hard. J. Comb. Theory B, 52(1):67–78, 1991. doi:10.1016/0095-8956(91)90091-W.
  • [20] Jan Kratochvíl and Jiří Matoušek. String graphs requiring exponential representations. J. Comb. Theory B, 53(1):1–4, 1991. doi:10.1016/0095-8956(91)90050-T.
  • [21] James R. Lee. private communication.
  • [22] James R. Lee. Separators in region intersection graphs. In Christos H. Papadimitriou, editor, 8th Innovations in Theoretical Computer Science Conference, ITCS 2017, January 9-11, 2017, Berkeley, CA, USA, volume 67 of LIPIcs, pages 1:1–1:8. Schloss Dagstuhl – Leibniz-Zentrum für Informatik, 2017. doi:10.4230/LIPICS.ITCS.2017.1.
  • [23] Daniel Lokshtanov, Fahad Panolan, Saket Saurabh, Jie Xue, and Meirav Zehavi. A 1.9999-approximation algorithm for vertex cover on string graphs. In Wolfgang Mulzer and Jeff M. Phillips, editors, 40th International Symposium on Computational Geometry, SoCG 2024, June 11-14, 2024, Athens, Greece, volume 293 of LIPIcs, pages 72:1–72:11. Schloss Dagstuhl – Leibniz-Zentrum für Informatik, 2024. doi:10.4230/LIPICS.SOCG.2024.72.
  • [24] Sean McGuinness. Colouring arcwise connected sets in the plane I. Graphs Comb., 16(4):429–439, 2000. doi:10.1007/PL00007228.
  • [25] Sean McGuinness. Colouring arcwise connected sets in the plane II. Graphs Comb., 17(1):135–148, 2001. doi:10.1007/PL00007235.
  • [26] George L. Nemhauser and Leslie E. Trotter Jr. Properties of vertex packing and independence system polyhedra. Math. Program., 6(1):48–61, 1974. doi:10.1007/BF01580222.
  • [27] Arkadiusz Pawlik, Jakub Kozik, Tomasz Krawczyk, Michał Lasoń, Piotr Micek, William T. Trotter, and Bartosz Walczak. Triangle-free intersection graphs of line segments with large chromatic number. J. Comb. Theory B, 105:6–10, 2014. doi:10.1016/J.JCTB.2013.11.001.
  • [28] Bartosz Walczak. Triangle-free geometric intersection graphs with no large independent sets. Discret. Comput. Geom., 53(1):221–225, 2015. doi:10.1007/S00454-014-9645-Y.