Abstract 1 Introduction 2 Hardness of Approximation 3 Bicriteria Approximation for GraphDD 4 SubmodCover and SupmodDD 5 Conclusion References

On Deleting Vertices to Reduce Density in Graphs and Supermodular Functions

Karthekeyan Chandrasekaran ORCID Grainger College of Engineering, University of Illinois, Urbana-Champaign, IL, USA Chandra Chekuri Grainger College of Engineering, University of Illinois, Urbana-Champaign, IL, USA Shubhang Kulkarni ORCID Grainger College of Engineering, University of Illinois, Urbana-Champaign, IL, USA
Abstract

We consider deletion problems in graphs and supermodular functions where the goal is to reduce density. In Graph Density Deletion (GraphDD), we are given a graph G=(V,E) with non-negative vertex costs and a non-negative parameter ρ0 and the goal is to remove a minimum cost subset S of vertices such that the densest subgraph in GS has density at most ρ. This problem has an underlying matroidal structure and generalizes several classical problems such as vertex cover, feedback vertex set, and pseudoforest deletion set for appropriately chosen ρ1 and all of these classical problems admit a 2-approximation. In sharp contrast, we prove that for every fixed integer ρ>1, GraphDD is hard to approximate to within a logarithmic factor via a reduction from SetCover, thus showing a phase transition phenomenon. Next, we investigate a generalization of GraphDD to monotone supermodular functions, termed Supermodular Density Deletion (SupmodDD). In SupmodDD, we are given a monotone supermodular function f:2V0 via an evaluation oracle with element costs and a non-negative integer ρ0 and the goal is remove a minimum cost subset SV such that the densest subset according to f in VS has density at most ρ. We show that SupmodDD is approximation equivalent to the well-known Submodular Cover problem; this implies a tight logarithmic approximation and hardness for SupmodDD; it also implies a logarithmic approximation for GraphDD, thus matching our inapproximability bound. Motivated by these hardness results, we design bicriteria approximation algorithms for both GraphDD and SupmodDD.

Keywords and phrases:
Combinatorial Optimization, Approximation Algorithms, Randomized Algorithms, Hardness of Approximation, Densest Subgraph, Supermodular Functions, Submodular Set Cover
Category:
Track A: Algorithms, Complexity and Games
Copyright and License:
[Uncaptioned image] © Karthekeyan Chandrasekaran, Chandra Chekuri, and Shubhang Kulkarni; licensed under Creative Commons License CC-BY 4.0
2012 ACM Subject Classification:
Theory of computation Graph algorithms analysis
; Theory of computation Approximation algorithms analysis
Related Version:
Full version of this work is available at: https://arxiv.org/abs/2503.08828 [6]
Acknowledgements:
Shubhang Kulkarni thanks Kishen Gowda for engaging in preliminary discussions about approximations for feedback vertex set and pseudoforest deletion set. We thank anonymous reviewers for pointing out an error, and for suggesting improvements to a previous version of this work.
Funding:
This work was supported in part by NSF grant CCF-2402667.
Editors:
Keren Censor-Hillel, Fabrizio Grandoni, Joël Ouaknine, and Gabriele Puppis

1 Introduction

The densest subgraph problem in graphs (DSG) is a core primitive in graph and network mining applications. In DSG, we are given a graph G=(V,E) and the goal is to find λG:=maxSV|E(S)|/|S|, where E(S) is the set of edges with both end vertices in S. DSG is not only interesting for its applications but is a fundamental problem in algorithms and combinatorial optimization with several connections to graph theory, matroids, and submodularity. Many recent works have explored various aspects of DSG and related problems from both theoretical and practical perspectives [23, 4, 28, 30, 9, 20, 12, 13, 8, 26]. A useful feature of DSG is its polynomial-time solvability. This was first seen via a reduction to network flow [19, 27] but another way to see it is by considering a more general problem, namely the densest supermodular subset problem (DSS): Given a supermodular function f:2V0 via evaluation oracle, the goal is to find λf:=maxSVf(S)/|S|. One can easily see that DSG is a special case of DSS by noting that for any graph G, the function f:2V defined by f(S)=|E(S)| for every SV is a supermodular function. It is well-known and easy to see that DSS and DSG can be solved in polynomial-time by a simple reduction to submodular function minimization. Several other problems that are studied in graph and network mining can be seen as special cases of DSS. Recent work has demonstrated the utility of the supermodularity lens in understanding greedy heuristics and approximation algorithms for DSG and these problems [9, 30, 20, 21].

Density Deletion Problems.

In this work we consider several interrelated vertex deletion problems that aim to reduce the density. We start with the graph density deletion problem.

Definition 1 (ρ-GraphDD).

For a fixed constant ρ, the ρ-graph density deletion problem, denoted ρ-GraphDD, is defined as follows:

Input: Graph G=(V,E) and vertex costs c:V0
Goal: argmin{uScS:SV and λGSρ}.

This deletion problem naturally generalizes to supermodular functions as defined below. We recall that a set function f:2V is (i) submodular if f(A)+f(B)f(AB)+f(AB) for every A,BV, (ii) supermodular if f is submodular, (iii) non-decreasing if f(A)f(B) for every ABV, and (iv) normalized if f()=0. We observe that non-negative normalized supermodular functions are non-decreasing. For a function f:2V and SV, we define fVS as the function f restricted to the ground set VS. The evaluation oracle for the function takes a subset SV as input and returns the function value of the set S.

Definition 2 (ρ-SupmodDD).

For a fixed constant ρ, the ρ-supermodular density deletion problem, denoted ρ-SupmodDD, is defined as follows:

Input: Integer-valued normalized supermodular function f:2V0 via an
evaluation oracle and element costs c:V0
Goal: argmin{uScu:SV and λfVSρ}.

When the density threshold ρ is part of input, we use GraphDD and SupmodDD to refer to these problems. It is easy to see that GraphDD (and hence SupmodDD) is NP-Hard from a general result on vertex deletion problems [25]. Our goal is to understand the approximability of these problems.

Motivations and Connections.

While the deletion problems are natural in their formulation, to the best of our knowledge, GraphDD has only recently been explicitly defined and explored. Bazgan, Nichterlein and Vazquez Alferez [2] defined and studied this problem from an FPT perspective. As pointed out in their work, given the importance of DSG and DSS in various applications to detect communities and sub-groups of interest, it is useful to consider the robustness (or sensitivity) of the densest subgraph to the removal of vertices. In this context, we mention the classical work of Cunningham on the attack problem [11] which can be seen as the problem of deleting edges to reduce density; this edge deletion problem can be solved in polynomial time for integer parameters ρ via matroidal and network flow techniques. In addition to their naturalness and the recent work, we are motivated to consider GraphDD and SupmodDD owing to their connections to several classical vertex deletion problems as well as a matroidal structure underlying GraphDD that we articulate next.

We observe that 0-GraphDD is equivalent to the vertex cover problem: requiring density of 0 after deleting S is equivalent to S being a vertex cover of G. One can also see, in a similar fashion, that 1-GraphDD is equivalent to the pseudoforest deletion set problem, denoted PFDS– where the goal is to delete vertices so that every connected component in the remaining graph has at most one cycle, and (11/|V|)-GraphDD is equivalent to the feedback vertex set problem, denoted FVS– where the goal is to delete vertices so that the remaining graph is acyclic (see [5] for formal definitions of both problems). Vertex cover, PFDS, and FVS admit 2-approximations, and moreover this bound cannot be improved under the Unique Games Conjecture (UGC) [22]. We note that while 2-approximations for vertex cover are relatively easy, 2-approximations for FVS and PFDS are non-obvious [1, 3, 10]. Until very recently there was no polynomial-time solvable linear program (LP) that yielded a 2-approximation for FVS and PFDS. In fact, the new and recent LP formulations [5] for FVS and PFDS are obtained via connections to Charikar’s LP-relaxation for DSG [7]. Fujito [18] unified the 2-approximations for vertex cover, FVS, and PFDS via primal-dual algorithms by considering a more general class of matroidal vertex deletion problem on graphs that is relevant to our work. This abstract problem, denoted MatroidFVS 111We use the feedback vertex set terminology in our naming of the MatroidFVS problem since the goal is to pick a min-cost subset of vertices to cover all circuits of the matroid defined on the edges of a graph. This generalizes FVS which is MatroidFVS where the matroid of interest is the graphic matroid on the input graph., is defined below.

Definition 3 (MatroidFVS).

The Matroid Feedback Vertex Set problem, denoted MatroidFVS, is defined as follows:

Input: Graph G=(V,E), vertex costs c:V0, and
Matroid =(E,) with being the collection of independent sets
    (via an independence testing oracle)
Goal: argmin{uScu:SV and E[VS]}.

Fujito [18] obtained a 2-approximation for MatroidFVS for the class of “uniformly sparse” matroids [24]. It is not difficult to show that vertex cover, FVS, and PFDS can be cast as special cases of MatroidFVS where the associated matroids are “uniformly sparse”. Consequently, Fujito’s result unifies the 2-approximations for these three fundamental problems.

We now observe some non-trivial connections between ρ-GraphDD, MatroidFVS and ρ-SupmodDD. We can show that ρ-GraphDD is a special case of MatroidFVS for every integer ρ: indeed, ρ-GraphDD corresponds to MatroidFVS where the matroid ρ is the ρ-fold union of the 1-cycle matroid defined on the edge set of the input graph. Although it is not obvious, we can show that MatroidFVS is a special case of 1-SupmodDD. We refer the reader to Appendix A of the full version [6] of this work for details regarding these connections, and the problems in the right column in Figure 1(b) for a pictorial representation of the reductions discussed so far. Given these connections and the existence of a 2-approximation for vertex cover, FVS, and PFDS, we are led to the following questions.

Question 1.

What is the approximability of ρ-GraphDD, MatroidFVS, and ρ-SupmodDD? Do these admit constant factor approximations?

1.1 Results

In this section, we give an overview of our technical results that resolve Question 1 up to a constant factor gap.

1.1.1 Connections between SubmodCover and SupmodDD

We obtain a logarithmic approximation for ρ-GraphDD, MatroidFVS, and ρ-SupmodDD via a reduction to the submodular cover problem and using the Greedy algorithm for it due to Wolsey [31]. First, we recall the submodular cover problem.

Definition 4 (SubmodCover).

The submodular cover problem, denoted SubmodCover, is defined as follows:

Input: Integer-valued normalized non-decreasing submodular function h:2V0
via evaluation oracle and element costs c:V0
Goal: argmin{eFce:FV and h(F)h(V)}.

For a function f:2V, we define the marginal f(v|S):=f(S+v)f(S) for every vV and SV. We show the following result.

Theorem 5.

Let f:2V0 be an integer-valued normalized supermodular function and ρ be a rational number. Then, there exists a normalized non-decreasing submodular function h:2V0 such that for FV, we have that λf|VFρ if and only if h(F)h(V). Moreover,

  1. 1.

    if ρ is an integer, then h is integer-valued,

  2. 2.

    h(v)max{0,f(v|Vv)ρ} for all vV, and

  3. 3.

    evaluation queries for the function h can be answered in polynomial time by making polynomial number of evaluation queries to the function f.

We discuss the consequences of Theorem 5 for ρ-SupmodDD. We recall that SubmodCover admits a (1+ln(maxvh(v)))-approximation for input function h via the Greedy algorithm of Wolsey [31]. Consider ρ-SupmodDD for integer-valued ρ. By Theorem 5, we have a reduction to SubmodCover and consequently, we have a (1+ln(maxvVf(v|Vv)))-approximation. In particular, we note that ρ-GraphDD for integer-valued ρ and MatroidFVS admit O(logn)-approximation, where n is the number of vertices in the input graph.

Corollary 6.

ρ-SupmodDD for integer-valued ρ admits an (1+ln(maxvVf(v|Vv)))-approximation, where f:V0 is the input integer-valued, normalized supermodular function. Consequently, ρ-GraphDD for integer valued ρ and MatroidFVS admit O(logn)-approximations, where n is the number of vertices in the input graph.

 Remark 7.

The reduction from SupmodDD to SubmodCover is in some sense implicit in prior literature (see [29, 18] for certain special cases of supermodular functions). We note that the reduction from FVS to SubmodCover which follows from this connection does not seem to be well-known in the literature, and the authors of this paper were not aware of it until recently.

From a structural point of view we also prove that SubmodCover reduces to 1-SupmodDD, thus essentially showing the equivalence of SubmodCover and SupmodDD. We believe that it is useful to have this equivalence explicitly known given that vertex deletion problems arise naturally but seem different from covering problems on first glance.

Theorem 8.

Let h:2V0 be an integer-valued normalized non-decreasing submodular function. Then, there exists a normalized supermodular function f:2V0 such that for FV, we have that h(F)h(V) if and only if λf|VF1. Moreover,

  1. 1.

    f(v|Vv)=h(v)+1 for all vV, and

  2. 2.

    evaluation queries for the function f can be answered in polynomial time by making a constant number of evaluation queries to the function h.

1.1.2 Hardness of Approximation

A starting point for our attempt to answer Question 1 was our belief that ρ-GraphDD for integer ρ admits a (ρ+1)-approximation via the primal-dual approach suggested by Fujito for MatroidFVS [18]. This belief stems from Fujito’s work which showed a 2-approximation for vertex cover, FVS, PFDS, and MatroidFVS for “uniformly sparse” matroids and our reduction showing that ρ-GraphDD for integral ρ is a special case of MatroidFVS (see Appendix A of the full version of this work [6]). We note that the matroid that arises in the reduction is not a “uniformly sparse” matroid but has lot of similarities with it, so our initial belief was that a more careful analysis would lead to a constant factor approximation. However, to our surprise, after several unsuccessful attempts to prove a constant factor upper bound, we were able to show that for every integer ρ2, ρ-GraphDD is Ω(logn)-hard to approximate via a reduction from Set Cover.

Theorem 9.

For every integer ρ2, there is no o(logn) approximation for ρ-GraphDD assuming PNP, where n is the number of vertices in the input instance.

Thus, ρ-GraphDD exhibits a phase transition: it admits a 2-approximation for ρ1 (via Fujito’s results [18]) and becomes Ω(logn)-hard for every integer ρ2. To conclude our hardness results, we note that since GraphDD is a special case of MatroidFVS, which itself is a special case of SupmodDD, both MatroidFVS and SupmodDD are Ω(logn)-inapproximable. However, both these problems are also O(logn)-approximable via Corollary 6. Thus, we resolve the approximability of all these problems to within a small constant factor. We refer the reader to Figure 1 for an illustration of problems considered in this work and approximation-factor preserving reductions between them.

Figure 1: Reductions between problems of interest to this work. Arrow from Problem A to Problem B implies that Problem A has an approximation-preserving reduction to Problem B. Figure (a) consists of the connections between problems known prior to our work. Figure (b) showcases our results.

1.1.3 Bicriteria Approximations

The hardness result for 2-GraphDD (and ρ-GraphDD) motivates us to consider bicriteria approximation algorithms. Can we obtain constant factor approximation by relaxing the requirement of meeting the density target ρ exactly? We show that this is indeed possible. We consider an orientation based LP that was used recently to obtain a polynomial-time solvable LP to approximate FVS and PFDS [5]. We observed that this LP has an Ω(n) integrality gap when considering 2-GraphDD. Nevertheless, the LP is useful in obtaining the following bicriteria approximation.

Theorem 10.

There exists a polynomial time algorithm which takes as input a graph G=(V,E), vertex deletion costs c:V0, a target density ρ, and an error parameter ϵ(0,1/2), and returns a set SV such that:

  1. 1.

    λGS(112ϵ)ρ,

  2. 2.

    uScu(1ϵ)𝙾𝙿𝚃,

where OPT denotes the cost of an optimum solution to ρ-GraphDD on the instance (G,c).

Next, we consider ρ-SupmodDD. Unlike the case of graphs, it is not clear how to write an integer programming formulation for SupmodDD whose LP-relaxation is polynomial-time solvable. Instead, we take inspiration from the very recent work of [32] on vertex deletion to reduce treewidth. We design a combinatorial randomized algorithm that yields a bicriteria approximation for SupmodDD, where the bicriteria approximation bounds are based on a parameter cf that depends on the input supermodular function f. For a normalized supermodular function f:2V0, we define

cf:=max{uSf(u|Su)f(S):SV}.

This parameter cf was defined in a recent work on DSS to unify the analysis of the greedy peeling algorithm for DSG [9]. We note that 1cf|V| and moreover, cf=1 if and only if the function f is modular. If f is the induced edge function of a graph (i.e., f(S) is the number of edges with all its end-vertices in S for every subset S of vertices), then cf2. This follows from the observation that the sum of degrees is at most twice the number of edges in a graph. Similarly, if f is the induced edge function of a hypergraph with rank r (i.e., all hyperedges have size at most r), then cfr. We show the following bicriteria approximation for SupmodDD.

Theorem 11.

There exists a randomized polynomial time algorithm which takes as input a normalized monotone supermodular function f:2V (given by oracle access), element deletion costs c:V0, a target density ρ, and an error parameter ϵ(0,1), and returns a set SV such that:

  1. 1.

    λf|VScf(1+ϵ)ρ, and

  2. 2.

    𝔼[uScu]cf(1+1ϵ)𝙾𝙿𝚃,

where OPT denotes the cost of an optimum solution to ρ-SupmodDD on the instance (f,c).

As a consequence of Theorem 11, we obtain a bicriteria approximation for density deletion problems in graphs and r-rank hypergraphs. We note that the bicriteria guarantee that we get from this theorem for graphs is weaker than the guarantee stated in Theorem 10. We discuss another special case of SupmodDD where the supermodular function of interest has bounded cf to illustrate the significance of Theorem 11. Given a graph G=(V,E) and a parameter p1, the p-mean density of G is defined as max{uSdS(u)p/|S|:SV}, where dS(u) is the number of edges in E[S] incident to the vertex u. The p-mean density of graphs was introduced and studied by Veldt, Benson, and Kleinberg [30]. Subsequent work by Chekuri, Quanrud, and Torres [9] showed that p-mean density is a special case of the densest supermodular set problem (i.e., DSS) where the supermodular function f:2V0 of interest is given by f(S):=uSdS(u)p for every SV and moreover cf(p+1)p. We note that the natural vertex deletion problem is the p-mean density deletion problem: given a graph G=(V,E) with vertex deletion costs, find a min-cost subset of vertices to delete so that the p-mean density of the remaining graph is at most a given threshold. Since cf(p+1)p for the function f of interest here, Theorem 11 implies a bicriteria approximation for p-mean density deletion for integer-valued p. An interesting open question is to obtain better bicriteria approximation for ρ-SupmodDD – in particular, can we remove the dependence on cf?

Organization.

In Section 2, we show an approximation-preserving reduction from SetCover to ρ-GraphDD and prove Theorem 9. In Section 3, we give a bicriteria approximation for ρ-SupmodDD and prove Theorem 11. Finally, in Section 4 we show the connections between SupmodDD and Submodular Cover by proving Theorems 5 and 8. We defer the proof of Theorem 10 to the full version of this work [6].

2 Hardness of Approximation

In this section, we show Theorem 9, i.e., we show an approximation preserving reduction from SetCover to ρ-GraphDD. We recall the set cover problem and its inapproximability.

Definition 12 (SetCover).

The set cover problem, denoted SetCover, is defined as follows:

Input: Finite Universe 𝒰, Family 𝒮2𝒰 with costs c:𝒮0
Goal: argmin{ScS:𝒮 with SS=𝒰}.
Theorem 13 ([15, 14]).

For every ϵ>0, there does not exist a (1ϵ)lnn-approximation for SetCover assuming P NP, where n is the size of the input instance.

We will also need the following characterization of density via orientations. We recall that an orientation of a graph G=(V,E) assigns each edge {u,v} to either u or v. For notational convenience, we use G=(V,E) to denote an orientation of the graph G and dGin(u) to denote the indegree of a vertex uV (i.e., the number of edges assigned to u) in the oriented graph G. The following connection between density and orientations will be used in our hardness reduction. This characterization is implied by a result of Hakimi [16] and also by the dual of Charikar’s LP to solve DSG [7].

Proposition 14.

Let G=(V,E) be a graph and let ρ0 be an integer value. Then, we have that λGρ if and only if there exists an orientation G=(V,E) of the graph G such that dGin(u)ρ for every uV.

We now restate and prove Theorem 9.

See 9

Proof.

We show the theorem when the target density ρ=2 via a reduction from SetCover. At the end of the proof, we remark on how to modify the reduction to obtain hardness for all integers ρ2 as claimed in the theorem. Let (𝒮,𝒰,c:𝒮) be a SetCover instance. We will assume (without loss of generality) that all elements in 𝒰 have the same frequency f4 which is a power of 2 – we note that this assumption is not a technical requirement and is only for ease of exposition. For this instance, we construct an instance (G=(V,E),cG:V) of 2-GraphDD as follows:

  1. 1.

    Add vertices representing sets: For each set S𝒮, add a set-vertex vS to V.

  2. 2.

    Add binary trees representing elements: For each element e𝒰, add a complete binary tree with the f leaves as the set-vertices corresponding to the sets containing e. We denote this tree as 𝒯e and its root as re.

  3. 3.

    Add self-loops: For each element e𝒰, add a self-loop to the root vertex re of the tree 𝒯e. For each set S𝒮, add ρ=2 self-loops to the vertex vS.

  4. 4.

    Define cost function: We define the cost function cG:V as follows: cG(vS)=c(S) for all S𝒮 and cG(u)= for all uV{vS:S𝒮}.

Figure 2: The figure in (a) depicts the subgraph of the construction corresponding to element e𝒰. Here, f=8. The figure in (b) depicts the intermediate orientation H for the subgraph of H corresponding to an element e𝒰. The greyed-out set-vertex at the bottom represents that this vertex is in XF. The figure in (c) depicts the final orientation for the subgraph from the figure in (b) after reorientation. The highlighted edges are those that have been reoriented.

We refer the reader to Figure 2(a) for pictorial depictions of the instance constructed via the reduction above. The next claim shows the correctness of our reduction and also implies that our reduction is approximation-preserving. The approximation hardness guarantee of the theorem when the target density ρ=2 then follows by Theorem 13 and the observation that the number of vertices |V| is a constant factor of the size of the input set cover instance.

Claim 15.

The instance (G,cG) has a feasible solution to 2-GraphDD of finite cost T if and only if (𝒮,𝒰,c) has a SetCover of cost T.

Proof.

We first show the forward direction of the claim. Let XV be a feasible solution to 2-GraphDD of finite cost T. By construction, we have that X{vS:S𝒮}. Let F:={S:vSX} denote the corresponding sets. By way of contradiction, suppose F is not a set cover. Consequently, there exists an element e𝒰 not covered by F. For convenience, we use 𝒮(e):={S𝒮:eS} to denote the sets that contain the element e. We note that since the element e is not covered by F, we have that {vS:S𝒮(e)}X=. Let Ve denote the set of vertices obtained by including all vertices of the binary tree 𝒯e. Then, the following gives us a contradiction:

2λGX|E[Ve]||Ve|=(2f+1)+(2f2)2f1>2.

Here, the first inequality is because X is a feasible solution to 2-GraphDD. The second inequality is by definition of graph density. The equality is because there are (2f+1) self loop edges, (2f2) non-self loop edges, and 2f1 vertices in the tree 𝒯e by construction.

We now show the reverse direction. Let F𝒮 be a set cover. Let XF={vS:SF}. Then, we show that the graph H:=GXF has density at most 2. By Proposition 14(2), it suffices to exhibit an orientation of the graph H in which the indegree of every vertex is at most 2. We first consider the following intermediate orientation of G. For each element e𝒰, we do the following: for vertex u𝒯e{re}, we denote p(u) as the (unique) parent of u in the (rooted) tree, and we orient the edge (u,p(u)) towards the parent p(u). All self-loops are assumed to be trivially oriented. Let H:=(VH,EH) denote this intermediate orientation restricted to the graph H, and :={re:e𝒰} denote the set of all root vertices. Refer to Figure 2(b) for a pictorial depiction of orientation H. We now make three important observations regarding the indegrees in the orientation H.

Observation 16.

We have the following:

  1. 1.

    for all uVH, dHin(u)2,

  2. 2.

    for all r, dHin(r)=3, and

  3. 3.

    for each element e𝒰, there exists a set Se𝒮(e) such that dHin(p(vSe))1.

Proof.

We show each statement separately below.

  1. 1.

    We note that the statement easily follows by construction for the set vertices vS. Let e𝒰 be an arbitrary element and let u𝒯e({vS:S𝒮(e)}{re}) be a non-root internal vertex of the binary tree. Since u has exactly two child nodes and one parent node in 𝒯e, we have that dHin(u)2. We note that the inequality may be strict if any children of u belong to the set XF.

  2. 2.

    Let e𝒰. We note that re has exactly 2 children and 1 self loop, and consequently dHin(re)3 as claimed. Here, we note that bo child of re belongs to the set XF because of our simplifying assumption that f4.

  3. 3.

    Let e𝒰. Since F is a set cover, there exists a set SS(e) such that SF. Consequently, vSXF, and so dHin(p(vS))1 by construction.

We now use the orientation H and Observation 16 to construct the orientation which certifies that the graph H has density at most 2. We note that by Observation 16(1) and (2), it suffices to modify the orientation H to reduce the indegree of all root vertices in to 2 while keeping all other indegrees at most 2. Consider an arbitrary element e𝒰. Let ue be an arbitrary vertex of 𝒯e such that dHin(ue)1. We note that such a vertex exists by Observation 16(3). We consider the unique path Pe from ue to re in 𝒯e. We note that by the construction of the orientation H, every edge along this path is oriented in the direction of the path. Consider the orientation obtained by reorienting these edges in the reverse direction of the path. Refer to Figure 2(c) for a pictorial depiction of the modified orientation. We note that only the indegrees of re and ue change due to this reorientation. In particular, for all e𝒰, we have that dHin(ue)2 and dHin(re)=2. This concludes the proof. The preceding reduction can be modified to show approximation hardness for all integral ρ2. Suppose that ρ=2+α, where α0. Then, we construct the same graph as above with α additional self-loops on each vertex. The proof generalizes.

 Remark 17.

The addition of self-loops is not technically necessary for the construction; they simply make it more straightforward. Specifically, a vertex u with γ1 self-loops can be replaced by a subgraph with density exactly γ. In this subgraph, one vertex, say hu, is identified with u and its cost is defined as c(hu):=c(u). All other vertices have infinite cost. All edges incident to u are then redirected to connect to hu instead.

3 Bicriteria Approximation for GraphDD

In this section, we describe a randomized combinatorial bicriteria approximation algorithm for ρ-SupmodDD and prove Theorem 11. The algorithm is inspired by the ideas of the recent work of Włodarczyk [32]. Our algorithm is based on the following idea. Suppose that we had non-negative potentials π:V0 for the elements of the ground set such that the potential value uXπ(u) of an optimal solution XV is large, say at least αuVπ(u). Then, a natural algorithm – at least when the vertex deletion costs are uniform – would be to compute the potentials, sample an element in proportion to the potentials and delete it, define a residual instance, and repeat. This would ensure an α-approximation for the problem in expectation via a martingale argument.

Unfortunately, our hardness result suggests that we are unlikely to obtain good potentials. In fact, the hard instances seem to be the functions that have density very close to the target density. However, we leverage supermodularity to show that if the density of the input function is at least β times the target density, then we can indeed find such good vertex potentials. In order to ensure that the density of the input function is at least β times the target density, we perform a preprocessing step to prune certain elements from the ground set without changing the cost of an optimal solution. Overall, this gives us an (α,β)-bicriteria guarantee, where the values α and β are as given in Theorem 11. We also note that the cost function to delete vertices may be arbitrary – we overcome this by using the natural bang-per-buck sampling strategy, i.e. we sample a vertex u in proportion to π(u)/c(u).

The rest of the section is organized as follows. In Section 3.1 we give our preprocessing step based on the dense decomposition of supermodular functions. In Section 3.2 we describe our element potentials based on marginal gains of supermodular functions. In Section 3.3 we present our algorithm and complete the proof of Theorem 11.

3.1 Preprocessing via Dense Decomposition

We discuss the dense decomposition of a normalized non-negative supermodular set function [20, 17] and prove a lemma that will enable us to use it as a preprocessing step.

Definition 18 ([20]).

Let f:2V+ be a non-negative normalized supermodular function. A sequence (V1,ρ1),(V2,ρ2),,(Vk,ρk) is the dense decomposition of f if

  1. 1.

    V1,,Vk is a partition of V obtained iteratively as follows: for i=1,2,,k, Vi is the inclusion-wise maximal set SVj[i1]Vj that maximizes

    f(Sj[i1]Vj)f(j[i1]Vj)|S|
  2. 2.

    the values ρ1,,ρk are obtained as

    ρi:=f(j[i]Vj)f(j[i1]Vj)|Vi|i[k].

We note that the dense decomposition can also be viewed algorithmically as the output of a recursive process which computes the unique inclusion-wise maximal set S that maximizes the ratio f(S)/|S| and recurses on the contracted function f/S:2VS defined as f/S(X):=f(SX) for all XVS. It can be shown that this decomposition is unique for a supermodular f. The following lemma allows us to use the dense decomposition as an algorithmic preprocessing step in the next section. In particular, the lemma says that the dense decomposition can be used to find a set RV such that it suffices to focus on solving the ρ-SupmodDD problem on the function restriction f|R; and additionally, the ground set elements of the restricted function have large marginal gains.

Lemma 19.

Let f:2V0 be a normalized non-negative supermodular function, c:V+ be a cost function, and ρ+ be a positive real value. Moreover, let (V1,ϕ1),(V2,ϕ2),,(Vk,ϕk) be the dense decomposition of (f,V) and for ρ>ρ, let R:=i[k]:ϕi>ρVi. Then, we have that

  1. 1.

    every feasible solution to ρ-SupmodDD for the function f|R is also a feasible solution to ρ-SupmodDD for the function f,

  2. 2.

    𝙾𝙿𝚃(f)𝙾𝙿𝚃(f|R), where 𝙾𝙿𝚃(f)and 𝙾𝙿𝚃(f|R) denote the costs of optimal solutions to ρ-SupmodDD for the functions f and f|R respectively,

  3. 3.

    f(v|Rv)ρ for all vR, and

  4. 4.

    the set R can be computed in polynomial time given access to a function evaluation oracle for f.

Proof.

We show all four properties separately below.

  1. 1.

    Let XR be a feasible solution to ρ-SupmodDD for the function f|R and by way of contradiction suppose that X is not a feasible solution to ρ-SupmodDD for the function f. Thus, there exists a set SVX such that f(S)/|S|>ρ. We note that this set S cannot be contained in RX since otherwise X would not be a feasible solution to ρ-SupmodDD for f|R. Then, the following gives us the required contradiction:

    ρ<f(S)|S| f(SR)f(R)+f(SR)|SR|+|SR|
    max{f(SR)f(R)|SR|,f(SR)|SR|}
    max{max{f(RS)f(R)|S|:SVR},λf|RX)}
    ρ.

    Here, the second inequality is by supermodularity of the function f. The third inequality is by the observation that (a+b)/(c+d)max{a/c,b/d} for non-negative numbers a,b,c,d. For the final inequality, observe that λf|RXρ because X is a feasible ρ-SupmodDD for f|R. Furthermore, we have that max{f(RS)f(R)|S|:SVR}ρ by definition of the dense decomposition and R.

  2. 2.

    Let XV be an optimal ρ-SupmodDD for f w.r.t. cost function c. Then, we note that XR is a feasible ρ-SupmodDD for f|R. This can be easily observed as follows: by way of contradiction, suppose that XR is not a feasible ρ-SupmodDD for f|R. Then, there exists a set SRX such that f|R(S)/|S|>ρ. Consequently, we have that λf|VXf(S)/|S|=f|R(S)/|S|>ρ, a contradiction to X being a feasible ρ-SupmodDD for f. Then, 𝙾𝙿𝚃(f)𝙾𝙿𝚃(f|R) follows by non-negativity of c.

  3. 3.

    By way of contradiction, suppose that there exists a vertex vR such that f(v|Rv)ρ. We recall that R=i[k]:ϕi>ρVi. Let j[k] be such that vVj. For convenience, we will let Uj1:=i[j1]Vi and Uj=i[j]Vi. We note that Uj1 and Uj are contained in R and ρj:=f(Uj)f(Uj1)|Vj| by definition of the dense decomposition. Moreover, ρj>ρ by the definition of the set R, and so by supermodularity we have that f(v|Ujv)f(v|Rv)ρ<ρj. Then, the following sequence of inequalities gives us the required contradiction.

    ρj =f(Uj)f(Uj1)|Vj|
    =f(Uj)f(Ujv)+f(Ujv)f(Uj1)|Vj|
    =f(v|Ujv)+f(Ujv)f(Uj1)1+(|Vj|1)
    <ρj+f(Ujv)f(Uj1)1+(|Vj|1)
    max{ρj,f(Ujv)f(Uj1)|Vj|1}
    max{ρj,max{f(Uj1S)f(Uj1)|S|:SVUj1}}
    ρj,

    where the second inequality is by the observation that (a+b)/(c+d)max{a/c,b/d} for non-negative numbers a,b,c,d. Here, we note that |Vj|2 since otherwise, Vj={v} and so by supermodularity we have that ρj=f(Uj1+v)f(Uj1)=f(v|Uj1)f(v|Rv)<ρj, a contradiction.

  4. 4.

    It is well-known that the dense decomposition (and consequently the set R) can be computed in polynomial time (given access to the function evaluation oracle for f) via supermodular maximization. This is implicit in Fujishige’s work on principle partitions [17], and is also explicitly considered in more recent works on dense decompositions for supermodular functions [20]. We omit the details of a formal proof here for brevity.

3.2 Element Potentials via Marginal Gains

We now show that the marginal gains of elements relative to the entire ground set are good potentials for a sampling-based algorithm for ρ-SupmodDD when all the marginal gains of the input function are large enough, in particular, at least cf(1+ϵ)ρ.

Lemma 20.

Let ρ and ϵ(0,1). Let f:2V be a normalized monotone supermodular function such that f(u|Vu)cf(1+ϵ)ρ for all uV, and let XV be a ρ-SupmodDD for f. Then, we have that

uXf(u|Vu)1cf(1+1/ϵ)uVf(u|Vu).
Proof.

By supermodularity of f, we have that

uXf(Vu) (|X|1)f(V)+f(VX) and hence,
uXf(u|Vu) f(V)f(VX). (1)

By way of contradiction, suppose that the lemma is false. Then, we have the following.

uVf(u|Vu) cff(V)
=cf(f(V)f(VX)+f(VX))
cf(f(V)f(VX))+cfρ|VX|
cfuXf(u|Vu)+cfρ|VX|
<11+1/ϵuVf(u|Vu)+cfρ|VX|.

Here, the first inequality is by definition of the parameter cf. The second inequality is because X is a feasible ρ-SupmodDD for the function f. The third inequality is by (1). The final inequality is by our contradiction assumption. Then, on rearranging the terms, we obtain the following contradiction.

cfρ|VX|>(111+1/ϵ)uVf(u|Vu)=1(1+ϵ)uVf(u|Vu)cfρ|V|,

where the final inequality is because f(u|Vu)cf(1+ϵ)ρ for all uV.

3.3 Random Deletion Algorithm

We now describe our bicriteria algorithm for ρ-SupmodDD and analyze its approximation factor. Algorithm 1, Lemma 23 and Lemma 24 together complete the proof of Theorem 11.

Algorithm.

Our algorithm takes as input (1) a normalized non-negative supermodular function f:2V+, (2) element deletion costs c:V+, (3) target density ρ+, and (4) error parameter ϵ>0. The algorithm returns a set SV which starts off as the empty-set and is then constructed element-by-element. This is done iteratively as follows. Let β:=cf(1+ϵ). If the function f has density at most βρ, then the algorithm breaks and returns the current set S. Otherwise, the algorithm first computes the dense decomposition (V1,ϕ1),(V2,ϕ2),,(Vk,ϕk) of the function f, defines the set R:=i[k]:ϕi>βρVi, and redefines the function f to be the restricted function f|R:2R0 – we use DenseDecompositionPreprocess(f,ρ) to denote a subroutine that computes the set R and returns the tuple (f|R,R). Next, the algorithms samples a random element u from the (modified) set V in proportion to the ratio f(u|Vu)/c(u). The algorithm then adds the vertex u to the set S, restricts f to the ground set Vu, and repeats the previous steps. We give a formal description of the algorithm in Algorithm 1.

Algorithm 1 Bicriteria approximation algorithm for ρ-SupmodDD.

Algorithm((f:2V,c),ρ,ϵ):

  1. 1.

    S:=

  2. 2.

    while λf>cf(1+ϵ)ρ:

    1. (a)

      Redefine (f,V):= DenseDecompositionPreprocess(f,cf(1+ϵ)ρ)

    2. (b)

      u:= vertex sampled from V according to the following distribution:

      Pr(u=v):=f(v|Vv)c(v)WvV, where W:=vVf(v|Vv)c(v) is a normalizing factor
    3. (c)

      S:=S+u and f:=f|Vu

  3. 3.

    return S.

Martingales.

For the analysis of our randomized algorithm, we will require the following concepts from probability theory.

Definition 21.
  1. 1.

    A sequence of random variables P1,P2, is called a supermartingale w.r.t. the sequence X1,X2, of random variables if for each i+ it holds that (i) Pi is a function of X1,Xi, (ii) 𝔼[|Pi|]< and (iii) 𝔼[Pi+1|X1,Xi]Pi.

  2. 2.

    A random variable T is called a stopping time with respect to the sequence of random variables P1,P2, if for each i+, the event (Ti) depends only on P1,,Pi.

The following result shows that the expected value of a random variable in the supermartingale process only decreases with time. This will be crucial in analyzing the performance of Algorithm 1.

Theorem 22 (Doob’s Optional-Stopping Theorem).

Let P0,P1, be a supermartingale w.r.t. the sequence X1,X2, of random variables and be a stopping time with respect to the process P. Suppose that Pr(n)=1 for some integer n+. Then, we have that 𝔼[P]𝔼[P0].

Algorithm Analysis.

Henceforth, we consider the execution of Algorithm 1 on a fixed input instance (f,c,ρ,ϵ). Let + be the number of iterations of the while-loop – we note that is a random variable with value at most n since at every iteration of the while-loop, the size of the ground set decreases by at least 1. Throughout the analysis, we will index the (random) variables at the ith iteration of the algorithm with the subscript i for all i[]. In particular, we let Si denote the set S at the start of the ith iteration (so S1:=, and Si+1 is defined by Step 2(c)), fi:2Vi0 denote the preprocessed function f after step 2(a), and ui denote the sampled vertex u after step 2(b) of the ith iteration of the algorithm. For simplicity, we define Sj:=S, and fj to be the empty-function for all j. The next lemma shows that the density of the function after deleting the set S is at most cf(1+ϵ)ρ, i.e. S is a feasible solution to (cf(1+ϵ)ρ)-SupmodDD for the function f. The proof easily follows by considering any fixed execution of the algorithm and leveraging Lemma 19(1) while inducting on . We omit details of the proof here for brevity.

Lemma 23 (Approximate Feasibility).

λf|VScf(1+ϵ)ρ.

The next lemma shows that the expected cost of the solution returned by the algorithm is at most cf(1+1/ϵ)ρ times the cost of the optimal ρ-SupmodDD of the function f. For any restriction g of the function f, we use 𝙾𝙿𝚃(g) to denote the value of an optimal ρ-SupmodDD for g with respect to the cost function c.

Lemma 24 (Approximate Cost).

𝔼[c(S)]cf(1+1/ϵ)𝙾𝙿𝚃(f).

Proof.

For ease of exposition, we will use α:=cf(1+1/ϵ). We consider the sequence of random variables P1,P2,, where Pi:=c(Si)+α𝙾𝙿𝚃(fi) for all i+. Our strategy will be to first show that this sequence of random variables is a supermartingale, and then apply Doob’s Optional-Stopping Theorem with stopping time n to bound the expected cost of the set returned by the algorithm (note that n since with each iteration of the while-loop, the size of the ground set decreases by at least 1). Before showing that the sequence is a supermartingale, we first show that the expected cost of a vertex chosen in step 2(b) of an iteration of the algorithm is at most an α-fraction of the expected decrease in the optimum value during the iteration.

Claim 25.

𝔼[c(ui)|u1,u2,,ui1]α𝔼[𝙾𝙿𝚃(fi)𝙾𝙿𝚃(fi+1)|u1,u2,,ui1] for all i[].

Proof.

Let Xi be an optimal ρ-SupmodDD for fi. We have the following:

𝔼[c(ui)|u1,u2,,ui1] =vViPr(ui=v)c(v)
=1WvVifi(v|Vi)
αWvXifi(v|Vi)
=αvXiPr(ui=v)c(v),

where the first inequality is by Lemma 20 and the fact that f(v|Vi)cf(1+ϵ)ρ by our preprocessing (Step 2(a)) and Lemma 19(3). We now show that because Xi is an optimal solution for fi, the final expression in the above can be upper bounded by α𝔼[𝙾𝙿𝚃(fi)𝙾𝙿𝚃(fi+1)], thereby completing the proof of the claim. This can be seen as follows:

𝔼[𝙾𝙿𝚃(fi)𝙾𝙿𝚃(fi+1)] =vVi𝔼[𝙾𝙿𝚃(fi)𝙾𝙿𝚃(fi+1)|ui=v]Pr(ui=v)
vXi𝔼[𝙾𝙿𝚃(fi)𝙾𝙿𝚃(fi+1)|ui=v]Pr(ui=v)
vXi𝔼[𝙾𝙿𝚃(fi)𝙾𝙿𝚃(fi|Viv)|ui=v]Pr(ui=v)
vXi𝔼[c(Xi)c(Xiv)|ui=v]Pr(ui=v)
=vXic(v)Pr(ui=v).

Here, the second inequality is by Step 2(b) of Algorithm 1 which says that the function fi+1 is defined to be DenseDecompositionPreprocess(fi|Viv,cf(1+ϵ)ρ) and Lemma 19(2). The third inequality is because Xi is an optimal solution for fi and Xiv is a feasible solution for fi|Viv.

We now show that the sequence P1,P2, is a supermartingale w.r.t. the sequence of random variables u1,u2, chosen by the algorithm.

Claim 26.

The sequence of random variables P1,P2, is a supermartingale w.r.t. the sequence of random variables u1,u2,.

Proof.

Let i+ be arbitrary. We note that Pi has finite expectation and also is fully determined by the subsequence u1,,ui. Thus, our goal is to show that 𝔼[Pi+1|u1,,ui]Pi. This is equivalent to showing 𝔼[Pi+1Pi|u1,,ui]0. We note that this inequality indeed holds because

𝔼[Pi+1Pi|u1,,ui]=𝔼[c(ui+1)α(𝙾𝙿𝚃(fi)𝙾𝙿𝚃(fi+1))|u1,,ui]0,

where the inequality is by Claim 25.

By Claim 26, the sequence P1,P2, is a supermartingale w.r.t. the sequence u1,u2, of random variables. Consider the stopping time . By Theorem 22, we have that 𝔼[P]𝔼[P1]. The following then completes the proof of the lemma:

𝔼[c(S)]=𝔼[c(S)+α𝙾𝙿𝚃(f)]=𝔼[P]𝔼[P1]=c(S1)+α𝙾𝙿𝚃(f1)α𝙾𝙿𝚃(f).

Here, the final inequality follows by observing that f1=DenseDecompositionPreprocess(f,cf(1+ϵ)ρ) and applying Lemma 19(2).

4 SubmodCover and SupmodDD

In this section, we prove Theorems 5 and 8.

See 5

Proof.

For simplicity, we define an intermediate function g:2V0, and use it to define the function h:2V0 of interest. The functions g and h are as follows: for every XV,

g(X) :=max{f(Z)ρ|Z|:ZX}, and
h(X) :=g(V)g(VX).

The following claim shows that the function h is normalized, non-decreasing, and submodular.

Claim 27.

The function h is normalized, non-decreasing, and submodular.

Proof.

We note that h is normalized by definition, and is non-decreasing because the g is non-decreasing. To show that h is submodular, it suffices to show that the g is supermodular. Let A,BV be arbitrary. Let ZA:=argmax{f(Z)ρ|Z|:ZA} and ZB:=argmax{f(Z)ρ|Z|:ZB}. Then, we have the following:

g(A)+g(B) =f(ZA)ρ|ZA|+f(ZB)ρ|ZB|
f(ZAZB)+f(ZAZB)ρ|ZAZB|ρ|ZAZB|
=(f(ZAZB)ρ|ZAZB|)+(f(ZAZB)ρ|ZAZB|)
g(AB)+g(AB).

Here, the first inequality follows from supermodularity of f and the second inequality is by definition of g.

Let FV. We now show that λF|VFρ if and only if h(F)h(V) as claimed. We have the following sequence of equivalences.

λfVFρ maxSVF{f(S)|S|}ρ
g(VF)0 (2)
g(V)g(VF)g(V)+g()0
h(F)h(V)0.

Here, the second equivalence can be seen by the following. For the forward direction, we suppose that max{f|VF(S)|S|:SVF}ρ. By way of contradiction, suppose that g(VF)>0. By definition of the function g, there exists a set ZVF such that g(VF)=f(Z)ρ|Z|. Thus, f(Z)ρ|Z|>0. Equivalently, we have that f(Z)/|Z|>ρ, a contradiction to our hypothesis. Here we note that Z as otherwise we would have that f(Z)ρ|Z|=0 since our function f is normalized, contradicting our choice of Z. For the reverse direction, suppose that g(VF)0. By way of contradiction, suppose that there exists a non-empty set SVF such that f(S)/|S|>ρ. Then, we equivalently have that f(S)ρ|S|>0, a contradiction.

We now show that the function h satisfies properties (1)-(3) of the theorem. We note that if ρ is an integer, then h is integer-valued by definition. Moreover, we can answer evaluation queries for h using polynomial many evaluation queries to f (via supermodular maximization). Thus h satisfies properties (1) and (3). We now show property (2).

Let vV. We have the following:

h(v) =g(V)g(Vv)
=max{f(Z)ρ|Z|:ZV}max{f(Z)ρ|Z|:ZVv}
=max{0,max{f(Z)ρ|Z|:vZV}max{f(Z)ρ|Z|:ZVv}}
max{0,max{(f(Z)ρ|Z|)(f(Z)ρ|Z|):vZV and ZVv}}
max{0,max{(f(Z)ρ|Z|)(f(Zv)ρ|Zv|):vZV}}
=max{0,max{(f(Z)f(Zv):vZV}ρ}
max{0,f(V)f(Vv)ρ},

where the final inequality is because f(Vv)+f(Z)f(V)+f(Zv) for all ZV such that vZ by supermodularity of the function f.

See 8

Proof.

We consider the function f:2V0 defined as follows: for every XV,

f(X):=h(V)h(VX)+|X|.

We note that the function f is normalized, non-decreasing, integer-valued and supermodular. Moreover, we can answer evaluation queries for the function f using two queries to the evaluation oracle for h. We note that for FV, h(F)h(V) if and only if λf|VF1 can be observed by following the steps of (2) in reverse order, and so we omit the formal details here for brevity. Property (1) of the theorem can be observed as follows: for vV, we have the following:

f(v|Vv)=f(V)f(Vv) =(h(V)h()+|V|)(h(V)h(v)+|Vv|)
=h(v)+1,

where the final equality is because h is normalized.

5 Conclusion

We considered several interrelated density deletion problems motivated by the question of understanding the robustness of densest subgraph. We showed tight logarithmic approximations for these problems. We showed inapproximability of graph density deletion by reduction from set cover, and approximation algorithms by exhibiting the equivalence of supermodular density deletion and submodular cover. Motivated by the hardness results, we designed bicriteria approximations. Our bicriteria approximation for graph density deletion is LP-based, and that for supermodular density deletion is a randomized combinatorial one and relies on the notion of dense decomposition of supermodular functions. We mention two open questions. First, our bicriteria approximation for supermodular density deletion depends on the parameter cf related to the input supermodular function (see Theorem 11). Is it possible to design a bicriteria approximation without dependence on the parameter cf? Second, our hardness reduction shows that ρ-GraphDD is Ω(logn)-hard for every fixed constant integer ρ2. We were able to adapt our reduction to conclude that it is Ω(logn)-hard for every fixed constant ρ3 (not necessarily integers). Is ρ-GraphDD Ω(logn)-hard for every fixed constant ρ>1?

References

  • [1] Vineet Bafna, Piotr Berman, and Toshihiro Fujito. Constant ratio approximations of the weighted feedback vertex set problem for undirected graphs. In Algorithms and Computations, pages 142–151, 1995. doi:10.1007/BFB0015417.
  • [2] Cristina Bazgan, André Nichterlein, and Sofia Vazquez Alferez. Destroying Densest Subgraphs Is Hard. In 19th Scandinavian Symposium and Workshops on Algorithm Theory (SWAT 2024), pages 6:1–6:17, 2024. doi:10.4230/LIPICS.SWAT.2024.6.
  • [3] Ann Becker and Dan Geiger. Optimization of Pearl’s method of conditioning and greedy-like approximation algorithms for the vertex feedback set problem. Artificial Intelligence, 83:167–188, 1996. doi:10.1016/0004-3702(95)00004-6.
  • [4] Digvijay Boob, Yu Gao, Richard Peng, Saurabh Sawlani, Charalampos Tsourakakis, Di Wang, and Junxing Wang. Flowless: Extracting densest subgraphs without flow computations. In Proceedings of The Web Conference 2020, pages 573–583, 2020. doi:10.1145/3366423.3380140.
  • [5] Karthekeyan Chandrasekaran, Chandra Chekuri, Samuel Fiorini, Shubhang Kulkarni, and Stefan Weltge. Polyhedral aspects of feedback vertex set and pseudoforest deletion set. Mathematical Programming, pages 1–48, 2025.
  • [6] Karthekeyan Chandrasekaran, Chandra Chekuri, and Shubhang Kulkarni. On deleting vertices to reduce density in graphs and supermodular functions, 2025. doi:10.48550/arXiv.2503.08828.
  • [7] Moses Charikar. Greedy Approximation Algorithms for Finding Dense Components in a Graph. In Approximation Algorithms for Combinatorial Optimization, pages 84–95, 2000. doi:10.1007/3-540-44436-X_10.
  • [8] Chandra Chekuri, Aleksander Bjørn Christiansen, Jacob Holm, Ivor van der Hoog, Kent Quanrud, Eva Rotenberg, and Chris Schwiegelshohn. Adaptive out-orientations with applications. In Proceedings of the 2024 Annual ACM-SIAM Symposium on Discrete Algorithms (SODA), pages 3062–3088, 2024.
  • [9] Chandra Chekuri, Kent Quanrud, and Manuel R Torres. Densest subgraph: Supermodularity, iterative peeling, and flow. In Proceedings of the 2022 Annual ACM-SIAM Symposium on Discrete Algorithms (SODA), pages 1531–1555, 2022. doi:10.1137/1.9781611977073.64.
  • [10] Fabián A. Chudak, Michel X. Goemans, Dorit S. Hochbaum, and David P. Williamson. A primal–dual interpretation of two 2-approximation algorithms for the feedback vertex set problem in undirected graphs. Operations Research Letters, 22(4):111–118, 1998. doi:10.1016/S0167-6377(98)00021-2.
  • [11] William H Cunningham. Optimal attack and reinforcement of a network. Journal of the ACM (JACM), 32(3):549–561, 1985. doi:10.1145/3828.3829.
  • [12] Laxman Dhulipala, Quanquan C Liu, Sofya Raskhodnikova, Jessica Shi, Julian Shun, and Shangdi Yu. Differential privacy from locally adjustable graph algorithms: k-core decomposition, low out-degree ordering, and densest subgraphs. In 2022 IEEE 63rd Annual Symposium on Foundations of Computer Science (FOCS), pages 754–765, 2022. doi:10.1109/FOCS54457.2022.00077.
  • [13] Michael Dinitz, Satyen Kale, Silvio Lattanzi, and Sergei Vassilvitskii. Almost tight bounds for differentially private densest subgraph. In Proceedings of the 2025 Annual ACM-SIAM Symposium on Discrete Algorithms (SODA), pages 2908–2950, 2025. doi:10.1137/1.9781611978322.94.
  • [14] Irit Dinur and David Steurer. Analytical approach to parallel repetition. In Proceedings of the Forty-Sixth Annual ACM Symposium on Theory of Computing, STOC, pages 624–633, 2014. doi:10.1145/2591796.2591884.
  • [15] Uriel Feige. A threshold of ln n for approximating set cover. Journal of the ACM (JACM), 45(4):634–652, 1998. doi:10.1145/285055.285059.
  • [16] A. Frank. Connections in combinatorial optimization. Oxford University Press, Oxford, 2011.
  • [17] Satoru Fujishige. Lexicographically optimal base of a polymatroid with respect to a weight vector. Mathematics of Operations Research, 5(2):186–196, 1980. doi:10.1287/MOOR.5.2.186.
  • [18] Toshihiro Fujito. Approximating node-deletion problems for matroidal properties. Journal of Algorithms, 31(1):211–227, 1999. doi:10.1006/JAGM.1998.0995.
  • [19] A. V. Goldberg. Finding a maximum density subgraph. Technical report, University of California at Berkeley, USA, 1984.
  • [20] Elfarouk Harb, Kent Quanrud, and Chandra Chekuri. Faster and scalable algorithms for densest subgraph and decomposition. In Advances in Neural Information Processing Systems, 2022.
  • [21] Elfarouk Harb, Kent Quanrud, and Chandra Chekuri. Convergence to lexicographically optimal base in a (contra)polymatroid and applications to densest subgraph and tree packing. In 31st Annual European Symposium on Algorithms (ESA), pages 56:1–56:17, 2023. doi:10.4230/LIPICS.ESA.2023.56.
  • [22] Subhash Khot and Oded Regev. Vertex cover might be hard to approximate to within 2- ε. Journal of Computer and System Sciences, 74(3):335–349, 2008.
  • [23] Tommaso Lanciano, Atsushi Miyauchi, Adriano Fazzone, and Francesco Bonchi. A survey on the densest subgraph problem and its variants. ACM Comput. Surv., 56(8), April 2024. doi:10.1145/3653298.
  • [24] Audrey Lee and Ileana Streinu. Pebble game algorithms and sparse graphs. Discrete Mathematics, 308(8):1425–1437, April 2008. doi:10.1016/J.DISC.2007.07.104.
  • [25] John M. Lewis and Mihalis Yannakakis. The node-deletion problem for hereditary properties is np-complete. Journal of Computer and System Sciences, 20(2):219–230, April 1980. doi:10.1016/0022-0000(80)90060-4.
  • [26] Ta Duy Nguyen and Alina Ene. Multiplicative weights update, area convexity and random coordinate descent for densest subgraph problems. In Proceedings of the 41st International Conference on Machine Learning (ICML), 2024.
  • [27] Jean-Claude Picard and Maurice Queyranne. A network flow solution to some nonlinear 0-1 programming problems, with applications to graph theory. Networks, 12(2):141–159, 1982. doi:10.1002/NET.3230120206.
  • [28] Saurabh Sawlani and Junxing Wang. Near-optimal fully dynamic densest subgraph. In Proceedings of the 52nd Annual ACM SIGACT Symposium on Theory of Computing, pages 181–193, 2020. doi:10.1145/3357713.3384327.
  • [29] Shuichi Ueno, Yoji Kajitani, and Shin’ya Gotoh. On the nonseparating independent set problem and feedback set problem for graphs with no vertex degree exceeding three. Discrete Mathematics, 72(1):355–360, December 1988. doi:10.1016/0012-365X(88)90226-9.
  • [30] Nate Veldt, Austin R Benson, and Jon Kleinberg. The generalized mean densest subgraph problem. In Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery & Data Mining, pages 1604–1614, 2021. doi:10.1145/3447548.3467398.
  • [31] Laurence A Wolsey. An analysis of the greedy algorithm for the submodular set covering problem. Combinatorica, 2(4):385–393, 1982. doi:10.1007/BF02579435.
  • [32] Michał Włodarczyk. Losing treewidth in the presence of weights. In Proceedings of the 2025 Annual ACM-SIAM Symposium on Discrete Algorithms (SODA), pages 3743–3761, 2025.