Traffic-Oblivious Multi-Commodity Flow Network Design
Abstract
We consider the Minimum Multi-Commodity Flow Subgraph (MMCFS) problem: given a directed graph with edge capacities cap and a retention ratio , find an edge-wise minimum subgraph such that for all traffic matrices routable in using a multi-commodity flow, is routable in . This natural yet novel problem is motivated by recent research that investigates how the power consumption in backbone computer networks can be reduced by turning off connections during times of low demand without compromising the quality of service. Since the actual traffic demands are generally not known beforehand, our approach must be traffic-oblivious, i.e., work for all possible sets of simultaneously routable traffic demands in the original network.
In this paper we present the problem, relate it to other known problems in literature, and show several structural results, including a reformulation, maximum possible deviations from the optimum, and NP-hardness (as well as a certain inapproximability) already on very restricted instances. The most significant contribution is a -approximation based on a surprisingly simple LP-rounding scheme. We also give instances where this worst-case approximation ratio is met and thus prove that our analysis is tight.
Keywords and phrases:
Multi-commodity flow, Digraphs, LP-rounding, Approximation algorithmCopyright and License:
2012 ACM Subject Classification:
Mathematics of computing Network flows ; Mathematics of computing Approximation algorithms ; Networks Network design and planning algorithmsFunding:
Funded by the Deutsche Forschungsgemeinschaft (DFG, German Research Foundation) grant 461207633 (CH 897/7-2).Editors:
Ho-Lin Chen, Wing-Kai Hon, and Meng-Tsung TsaiSeries and Publisher:
Leibniz International Proceedings in Informatics, Schloss Dagstuhl – Leibniz-Zentrum für Informatik
1 Introduction
We present the (suprisingly seemingly novel) network design problem Minimum Multi-Commodity Flow Subgraph (MMCFS): given a directed flow network with edge capacities and a retention ratio , find a subnetwork of minimum size such that still allows for a multi-commodity flow (MCF)routing of any traffic demands routable in when they are scaled down by factor .
The problem arises naturally in recent research concerning power saving in backbone (Tier 1) networks of Internet service providers. There, the overall amount of traffic has distinct peaks in the evenings (when people are, e.g., streaming videos) and lows late at night and in the early mornings [40]. Clearly, the networks are built to handle the peak times. This opens up the possibility to reduce the power consumption of the network by turning off some resources – e.g., connections, line cards, or servers – during low traffic periods [44, 14].
So consider a computer network that allows for the simultaneous routing of the traffic at peak times. This traffic is comprised of a set of commodities, where each commodity is identified by a pair of nodes in the network and has a demand specifying that flow units have to be sent from to . The entirety of the demands for each pair of nodes is encoded in a traffic matrix . Commonly, one makes the simplifying assumption that during low traffic periods the traffic demands are upper bounded by a down-scaling of using a factor ; the most practically relevant scenarios concern [34]. The task now is to minimize the size of the network by deactivating connections such that the reduced network still accommodates a routing of the scaled-down demands . However, in practice the traffic matrix may change from day to day (in fact, even within sampling windows of 15 minutes) and while we can assume the capacity of the network to be large enough for all occurring traffic at any given time, is usually not known beforehand. Thus, our solution should be traffic-oblivious, i.e., independent of any specific traffic matrix.
Technically, routing in realistic scenarios is not done via fully general multi-commodity flows: while MCFwould be optimal in terms of minimizing congestion [40], it would be too complicated and temporally unstable for the routing hardware. Instead, simpler techniques like 2-segment routing are used (which, in contrast to trivial shortest path routing, routes flow along a sequence of two chained shortest subpaths) [19]. Interestingly, studies show that in realistic networks, these routing solutions are virtually identical to those achieved by MCF [40, 12]. At the same time, for a given fixed network and traffic matrix, an MCFcan be computed in polynomial time while establishing an optimal routing table for 2-segment routing (which is then deployable on router hardware) is NP-hard [25]. Thus, we describe the feasibility of solutions in our problem setting in terms of routability via MCF, and assume that realistic (simpler) routing protocols will still be able to attain effective routability.
Let us give a formal description of our problem: Given a directed graph (or digraph) with positive edge capacities and a traffic matrix , a flow from a vertex to a vertex is a function satisfying the flow conservation constraints
| A multi-commodity flow (MCF) is a set of flows satisfying | |||||
For an edge , we may use the shorthand notation . Further, we call a traffic matrix routable in an edge set if there exists an MCF for with for all . Based on this notion, we define the Minimum Multi-Commodity Flow Subgraph (MMCFS) problem as follows: given a digraph with edge capacities cap and a retention ratio , find the edge set with minimum cardinality such that for all traffic matrices that are routable in , is routable in .
Throughout this paper, when mentioning a problem’s name or the corresponding abbreviation in sans-serif typeface, e.g. MMCFS, we refer to the optimization question. To indicate that a subgraph is an optimal solution for the problem, we will denote it by the abbreviation in normal typeface, e.g. an MMCFS with .
Our contribution.
We present the MMCFS problem, which has both a natural formulation and practical applicability. After discussing related problems from literature in Section 2, we give some structural results in Section 3: We show how the MMCFS problem, even though it is traffic-oblivious in nature, can be reformulated to consider a specific single “hardest” traffic matrix. We also establish how an MCFcan be routed in an optimal MMCFS solution, and how the ratio between the values of a feasible MMCFS solution and an optimal one relates to their average edge capacities. In Section 4, we prove that MMCFS is NP-hard already with unit edge capacities. Additionally, we show that it is NP-hard (and a closely related problem cannot be approximated within a sublogarithmic factor) already on directed acyclic graphs (DAGs). Our most important contribution is given in Section 5, where we present a -approximation algorithm for MMCFS: after modelling MMCFS as an ILP, we can deduce a surprisingly simple LP-rounding scheme, whose complexity is solely shifted to the correctness proof. Moreover, we show that our analysis of this algorithm is tight.
2 Related Work
There is a rich body of work on multi-commodity flows– see e.g. [3, Ch. 17] for a primer on this topic and [38] for a recent literature review. The ability to route an MCFin an MMCFS solution not only determines the latter’s feasibility, the problem of routing an MCFalso has many close ties to several other network design problems. These, however, involve constraints unrelated to MMCFS, are usually not traffic-oblivious, and mostly focus on undirected graphs. Concerning approaches on directed graphs, Foulds [20] minimizes the cost of an MCFin a bidirected network where the use of some unidirectional arcs is prohibited to reduce congestion. Gendron et al. [22, 23] discuss a directed MCFproblem that considers costs for both the installation of an edge and the amount of flow routed over it. Further, in buy-at-bulk network design [8, 39, 13], capacity on edges must be bought as cheaply as possible such that a given traffic matrix becomes routable – with the caveat that larger amounts of capacity can be bought at a lower price per capacity unit.
In robust network design [10, 4, 9, 24], possible traffic matrices are given as an uncertainty set in the form of a polytope, and the objective is usually to minimize the cost of reserving capacity on the edges. While the dynamic routing variant considers a different MCFfor every traffic matrix in the polytope, static routing specifies a fixed unit flow for each commodity that is only scaled with the respective demand value. For directed graphs, Al-Najjar et al. [5] show that an exact algorithm for static routing would yield an -approximation for dynamic routing. Our result can be seen as a better approximation ratio for the special uncertainty set under dynamic routing, but w.r.t. minimizing the number of edges in a subgraph rather than the cost of reserved capacity.
There are also several related graph construction and subgraph minimization problems: Khuller et al. [28] give an approximation algorithm for the construction of an undirected tree with constant degree that accommodates given traffic demands between its leaves such that the maximum load on any edge is minimized. Otten et al. [34] evaluate an integer linear program (ILP)and a heuristic for a green traffic engineering problem on digraphs – however, there a specific traffic matrix is also given (and rather than finding an edge subset of minimum cardinality, they minimize the number of “line cards”, i.e., sets of 8 incident edges at each vertex). Another well-known topic in the realm of subgraph minimization problems is that of spanners [1], i.e., subgraphs that preserve the length of a shortest path within a given ratio (stretch factor) between each pair of vertices. There exists a correspondence between upper bounds on the stretch of shortest paths and the congestion of MCFs, however, this only applies to the existence of probabilistic mappings in undirected graphs [6, 36]. Nonetheless, this correspondence was used to find flow sparsifiers in undirected graphs [18]. While a MMCFS solution is similar to a flow sparsifier that preserves the congestion up to a factor , they differ in that a flow sparsifier is an entirely new (undirected) graph, not necessarily subgraph, containing a subset of the vertices of but both old and new edges [32, 30, 7].
Closely related to MMCFS are classical Directed Survivable Network Design (DSND) problems, where, given a (possibly capacitated) input digraph and a requirement function , one aims to find an edge-wise minimum subgraph of in which one can send a flow of value from to for every . On undirected graphs with unit edge capacities, there exists a 2-approximation by Jain [26], which has been adapted to directed instances, but only for a very restricted set of requirement functions [31]. The DSND problem most similar to MMCFS is the Minimum Capacity-Preserving Subgraph (MCPS) problem [15], where the requirement value for a vertex pair equals a fraction of the value of a maximum flow from to . However, in all of the aforementioned DSND approaches, each routed commodity is considered in isolation from the others, whereas in the MMCFS setting, all commodities are routed simultaneously. For example, given a digraph with edge capacities cap and a traffic matrix routable in , the scaled-down matrix is not necessarily routable in any optimal MCPS-solution of since some edges may be congested by the simultaneous routing of multiple commodities. This is especially easy to see for but even holds true when :
Observation 1.
Given an arbitrarily small and an arbitrarily large , there exists a digraph with edge capacities cap and a traffic matrix such that is routable in , but is not routable in any optimal MCPS-solution of .
Proof.
Let be a (high) edge capacity value and a number of vertices. Construct (visualized in Figure 1) as follows: Create two vertex sets and with vertices each, as well as two distinct vertices , and let . Moreover, let , , and . Edge capacities and the traffic matrix are chosen as follows:
Every non-zero demand can be routed in using the respective edge .
The only optimal MCPS-solution of is : For every vertex pair , the only --path in is the one consisting of the edge – so the edges must be in the solution. also establishes a maximum flow of sufficient value for the remaining vertex pairs. In particular, for every vertex pair , there exists a maximum --flow of value in , which is at least times the value of a maximum --flow in :
However, the scaled matrix is not routable in : many demands of value would have to be routed over the single edge , exceeding its capacity .
Lastly, we want to highlight similarities of MMCFS to the well-established NP-hard Minimum Equivalent Digraph (MED) problem [2, 37, 21, 27], where, given a digraph , one asks for the edge-wise minimum subgraph of that preserves the reachability relation of . In Section 4, we show that MED is a special case of MMCFS. Further, we observe:
Observation 2.
In a simple DAG , the unique minimum equivalent digraph (MED)of must be contained in every feasible MMCFS solution of , regardless of edge capacities and .
Proof.
In any simple DAG , the MEDis unique and consists of exactly those edges for which there is no --path in [2]. A feasible MMCFS solution must also contain these edges in order to allow for a non-zero amount of flow from to . However, in general digraphs, a feasible MMCFS solution may not always contain the MED, see Figure 2. Note that MED is not only polynomial-time solvable on DAGs, but there are also several polynomial approximation algorithms for general graphs [29, 45] with the currently best approximation ratio being 1.5 [11, 42].
3 Structural Results
We present some structural insights concerning MMCFS that give a deeper understanding of the problem. Most importantly, we give a reformulation of MMCFS that is used throughout the rest of the paper to obtain structural and algorithmic results: Recall that a feasible solution for a given MMCFS instance is defined as an edge set such that for all traffic matrices that are routable in , is routable in . Interestingly, instead of explicitly considering all routable traffic matrices , it suffices to consider the single specific traffic matrix , which forces each edge to be utilized to its full capacity but has no demands between non-adjacent vertices:
We show that an edge set is a feasible MMCFS solution iff is routable in :
Theorem 3.
Given a digraph with edge capacities cap, a retention ratio , and an edge set , the following statements are equivalent:
-
For all traffic matrices that are routable in , the scaled matrix is routable in .
-
The scaled matrix is routable in .
Proof.
is routable in by definition. If every traffic matrix routable in is also routable in when scaled down by , then so is . For the other direction, consider any arbitrary traffic matrix routable in . Let be the MCFthat routes in with the vector specifying the total flow over each edge. Using this MCF, we can construct a new traffic matrix :
Observe that when using component-wise comparison. Thus, since is routable in , so is . But if is routable in using the flows , then is also routable in using the flows constructed as follows: for each commodity and each edge , calculate the fraction of flow routed over that is used by , and route this fraction over the path chosen by . In short,
We note that a related but slightly different concept to Theorem 3 has been implicitly used previously in [35] (on undirected graphs). From this point onwards, we may refer to edges as commodities since specifies a non-zero demand precisely for the edges in . Moreover, given a flow for a commodity , we call the direct flow for . We observe that in an optimal MMCFS solution , for every commodity , the demand can always be fully satisfied by a direct flow :
Observation 4.
Let be a digraph with edge capacities cap and a traffic matrix routable in an edge set with for all edges . Then, there exists an MCF in the graph satisfying the demands such that for all edges .
Proof.
Among all MCFsthat witness the routability of in , let be one with a maximum sum of direct flow values .
We give a proof by contradiction: Assume that there exists an edge such that (if no such edge exists, and we are done). There must exist at least one alternative --path that routes at least some of the remaining demand . Further, the edge has residual capacity as otherwise we could increase (and decrease flow along accordingly), which would contradict the selection of . Thus, there exists an edge , , with . But then, we can exchange a non-zero amount of flow of commodity routed over with an equally small amount of flow of commodity routed over . This increases without decreasing any other direct flow value – again a contradiction to the selection of .
Further, for any edge set , we can compare its total edge capacity to the total flow needed to satisfy the demands . This not only gives us a necessary condition for an edge set to be a feasible MMCFS solution, but, upon closer analysis, also allows us to relate its quality as a solution to its mean capacity . The following results apply both in the case of simple and non-simple graphs, but we can give better guarantees in the former case.
Theorem 5.
Let be an optimal solution and a feasible solution for an MMCFS instance . Then, with if has no parallel edges and otherwise.
Proof.
Let , and . The commodities of all must be routed through , requiring a total flow of at least and leaving a total remaining capacity in of at most . Every commodity has to be routed within this remaining capacity since is feasible. This requires a total flow of at least ; the is due to the fact that without parallel edges each such commodity must be routed over at least two other edges in since . We thus have
| (1) |
By adding to this inequality, we obtain
Alternatively, we can rewrite inequality (1) as and obtain
Corollary 6.
Any arbitrary feasible solution for MMCFS (including the trivial one, itself), is a -approximation.
This ratio is met, e.g., on a bundle of parallel edges with capacity-1 edges and one capacity- edge (for any given and an arbitrary s.t. ).
Corollary 7.
For uniform capacities, any arbitrary feasible solution for MMCFS (including the trivial one, itself) is a -approximation.
4 Complexity
Given that even trivial MMCFS solutions satisfy an approximation guarantee according to Section 3, one might expect MMCFS to be polynomial-time solvable. However, in this section, we show that MMCFS is NP-hard already on DAGsand give a first inapproximability result. We begin by proving that MMCFS is NP-hard already with unit edge capacities using a reduction from MED that directly follows from Theorem 3:
Corollary 8.
MED is the special case of MMCFS with unit edge capacities cap and .
Proof.
An optimal solution for an MMCFS instance with unit edge capacities is an edge set of minimum cardinality such that the demands for each edge are routable in . This is equivalent to ensuring that there exists an --path in for every edge since the (unit) capacity of an edge can never be surpassed by the many flows of size at most each.
Moreover, we can show that MMCFS is NP-hard already on DAGsusing a reduction from the NP-hard decision variant of Set Cover [27], where one asks: given a universe , a family of sets with , and a parameter , is there a subfamily of cardinality such that ? The reduction is similar to that given in [15] to prove the NP-hardness of the Minimum Capacity-Preserving Subgraph problem.
Theorem 9.
For any fixed , MMCFS is NP-hard already on DAGswhere the longest path has length 3.
Proof.
Given a Set Cover instance and a fixed retention ratio , we construct an instance for the decision variant of MMCFS: is a yes-instance if and only if there exists a feasible MMCFS solution for with cardinality . We construct as follows (see Figure 3 for a visualization):
As is a DAG, its MEDis unique [2] and must be part of any feasible MMCFS solution, see Section 2. This MEDis formed by the edges ; a flow of must be routed over each of them in order to satisfy the demands . The remaining capacity for each edge is .
So consider a single set and the corresponding two-path , whose remaining capacity can thus accommodate either arbitrarily many item commodities (each one with the sufficiently small demand ) or a single set commodity (with the demand ). In the former case, we can remove at least two corresponding item edges , with , which is more than the single corresponding set edge we can remove in the latter case. Thus, for each item commodity , an optimal MMCFS solution must contain one of the corresponding set edges ; the item commodity can then be routed over the path from over to .
Given a Set Cover solution with , we can construct an MMCFS solution with cardinality . Since every item is covered by the sets in , the constructed MMCFS solution includes at least one corresponding set edge for each item, ensuring its feasibility. Conversely, since a feasible MMCFS solution with has at least one corresponding set edge for each item , the Set Cover solution also contains at least one covering set for each item. Moreover, .
Considering the optimization variants of Set Cover and MMCFS (and thus ignoring the additional input values and ), the reduction above also implies the inapproximability of the number of edges in an optimal MMCFS solution beyond the edges required for an MED: Consider an instance for the optimization variant of MMCFS that is produced by the reduction above, and an arbitrary feasible solution for this instance. Let where denotes the number of edges in an MEDof . Then, can be transformed into a feasible solution for the original Set Cover instance with objective value in linear time. Further, let be the minimum over all feasible solutions for , and recall that the size of the MMCFS instance is linear in the size of the Set Cover instance: if it was possible to approximate within a factor in , one could also approximate Set Cover within , which is NP-hard [17, 33]. This implies that any approximation algorithm for MMCFS (such as the one we present in Section 5) must leverage the existence of a comparatively high number of MED-edges in order to achieve its approximation ratio.
Observation 10.
Given an MMCFS instance and an MEDof , approximating with a ratio in is NP-hard. This already holds on DAGswhere the longest path has length 3.
5 LP-based Approximation
We present an extremely simple -approximation for MMCFS based on LP rounding. This is a clear improvement over the default approximation guarantee of Section 3, which depends the instance’s edge capacities and is thus not even polynomially bounded in or the instance size. Consider the following ILP formulation for MMCFS:
| (1.a) | |||||
| (1.b) | |||||
| (1.c) | |||||
| (1.d) | |||||
| (1.e) | |||||
A binary indicator variable determines whether edge is part of the solution subgraph or not. We minimize the sum of these variables in the objective function (1.a) to obtain a subgraph of minimum size. The non-negative variables determine the amount of flow routed over edge for commodity . Recall that specifies a demand of precisely for each edge : the flow preservation constraints (1.b) guarantee that the -variables represent proper flows that satisfy these demands. Lastly, the capacity constraints (1.c) ensure that the total sum of flow over any edge does not surpass the capacity of . This flow sum must be zero if is not part of the solution.
We obtain the relaxation of ILP (1) by replacing the integrality constraints (1.e) on by the inequalities for all . The only other type of constraint that bounds the -variables is (1.c). Since the -variables can assume fractional values in the LP relaxation, and the sum over all is minimized, this lower bound of on for all will always be met with equality. Hence, the LP relaxation is equivalent to a standard MCF-LP that minimizes the sum of edge utilizations in the objective function . Accordingly, we will refer to as the cost that routing a single unit of flow over an edge will add to this objective.
Remark 11.
The solution for all with objective value is always feasible for the relaxation of ILP (1). When the input graph is simple and all edge capacities are uniform, it is in fact optimal: the flow for commodity will always be routed completely over since all edges have the same cost and routing over an alternative path would cost more than routing it over a single edge. This implies that any arbitrary feasible solution for MMCFS on simple graphs with uniform edge capacities is an -approximation (which we already proved using a separate argument via Section 3). However, below we also use the LP relaxation to approximate MMCFS on non-simple graphs with non-uniform edge capacities.
For our approximation algorithm, we make use of the well-known fact that all standard LP solving algorithms will always return a basic optimal solution (if any solution exists) [43, p. 279]. A basic optimal solution – or equivalently, extreme point solution – is a vertex of the polyhedron defined as the convex hull of the set of feasible solutions. It does not lie on a higher-dimensional face of the polyhedron and thus cannot be expressed as a convex combination of two or more other feasible solutions [41, p. 100]. We prove that a basic optimal solution for the relaxation of ILP (1) will only have comparatively few variables with a positive value below , allowing us to round up the solution to obtain an approximation.
Lemma 12.
Let be a basic optimal solution for the relaxation of ILP (1), and the number of edges such that . There exist at most many edges with .
Proof.
Let be the many edges that are saturated by the fractional multi-commodity flow, and with the edges that are only used to a fraction less than ; in short, edges with high and low corresponding -values. We show that an optimal fractional solution with would allow us to construct a vector such that both and are still feasible solutions for the LP relaxation – thus, would not be basic.
For every edge , let denote an arbitrary alternative --path not using the edge and for all edges . Such a path must exist since at most flow is routed over , so an alternative --path is necessary to satisfy the demand . Further, the total cost of routing a unit of flow over must be lower than or equal to the cost of routing it over itself, i.e., , by the optimality of . Based on the alternative paths, we construct a matrix indexed by pairs , with . Since , the rows of must be linearly dependent, i.e., there exists a vector of coefficients with , such that (and consequently, ).
We can obtain two new feasible solutions by modifying the MCFcorresponding to the optimal basic solution as follows: for each edge , and using a small positive value , we send less flow over itself and more flow over the alternative path – or vice versa. Put formally, we argue that for a small enough , the following vector yields two feasible solutions and for the LP relaxation:
Clearly, the two solutions satisfy all demands and flow preservation constraints (1.b). Consider the capacity constraints (1.c): By construction of , the flow difference on saturated edges is 0. For all non-saturated edges it holds: if we modified the flow routed over , this flow was already non-zero before our modification, and can be chosen sufficiently small such that the flow will neither turn negative nor surpass .
It remains to show that given that . So, among the edges with , choose one with maximum cost and denote it by . We show that .
If were contained in any of the alternative paths with and , we would arrive at a contradiction even without using : Recall that . So we must have , i.e., and are parallel edges with . Since and thus , we could simply obtain two new solutions by shifting some small of flow from one to another, or vice versa, contradicting that is basic.
Hence, is only contained in alternative paths with s.t. , and
The proof implicitly gives an intuitive explanation for the distribution of LP values:
Corollary 13.
Let be a basic optimal solution for the relaxation of ILP (1). For each edge with it holds: every alternative --path (not containing ) with for all of its edges must contain an edge with .
Proof.
Assume that, for some edge with , there is a that does not contain an edge with . Then, we can follow the proof of Section 5, choosing as the alternative --path for . As a result, the matrix constructed in the proof contains a row of zeroes, allowing us to route more flow over and less flow over , or vice versa. Thus, cannot be basic, a contradiction.
We now analyze the following LP-rounding algorithm: compute a basic optimal solution for the relaxation of ILP (1) in polynomial time, and return the edge set . Based on Section 5, one could use naïve techniques to prove approximation guarantees of or for this algorithm. However, we provide the stronger bound of based on the following intuition: the algorithm either “misses” the optimal solution mainly due to edges with and we can obtain a 2-approximation via Section 5, or due to edges with and we can round up the variables for a -approximation.
Theorem 14.
Let be a basic optimal solution for the relaxation of ILP (1). The edge set is a -approximation for MMCFS. That is, rounding up is a 2-approximation when , and an -approximation when .
Proof.
The solution is clearly feasible. Let be the number of edges such that . Furthermore, let be the number of these edges whose -variable is set to a value in the interval . We know that the remaining edges have a -variable either set to or a value lower than . In particular, by Section 5, there exist at least as many edges with as there are edges with . Thus, . Hence, the optimal fractional solution has objective value
Using the minimum fractional objective value as a lower bound for the minimum integral objective value , we can then give an upper bound for the approximation ratio:
To obtain an upper bound on this ratio, we examine when its denominator is at its minimum. For , the term is always non-negative and reaches its lowest value 0 when , giving us the upper bound in this case. In contrast, for , the term is negative, and the minimum of the denominator is reached when , leading to the upper bound in this second case.
It is surprisingly hard to find instances (in particular, ones without parallel edges) where the worst-case approximation ratio given by Theorem 14 is actually met. However, we show that such instances exist for the most relevant , and hence our analysis is tight:
Lemma 15.
The ratio for the algorithm of Theorem 14 is tight for all , .
Proof.
Consider a complete bidirected graph on vertices, and let denote an arbitrarily chosen directed Hamiltonian cycle in (an example for is visualized in Figure 4). For all edges , let denote the number of edges of the unique --path in and set the capacity . is a feasible integral solution for the MMCFS instance since the demands are routable in it: A single edge is able to satisfy its own demand . Adding to this, for every , there are edges with whose unique --path in contains . The edge can accommodate all of these commodities because each of them has a demand of , summing up to a total flow of . Lastly, is not only feasible but optimal since we need at least many edges to preserve the reachability relation of in .
The algorithm from Theorem 14 would potentially choose all edges of , i.e., many: As the cost of an edge is equal to the total cost of its unique --path in , all --paths are equal in cost. Thus, one optimal solution for the relaxation of ILP (1) simply routes all commodities completely over their own edges and chooses . This solution is also basic: Let denote the flow variables of this solution. If it were not basic, there would exist two other optimal solutions with flow variables respectively, such that for some , . However, if , already routes the full demand of commodity over and we cannot increase without introducing a flow cycle, which would raise the objective value and thus be suboptimal. In contrast, if , is already and cannot be decreased.
Lemma 16.
The ratio for the algorithm of Theorem 14 is tight for all .
Proof.
We construct a family of MMCFS instances (visualized in Figure 5), each one consisting of a simple graph with edge capacities cap and the given retention ratio , that meets the approximation ratio of 2 asymptotically with increasing size. Intuitively, the edge set of each instance contains two subsets that have a size quadratic in the number of vertices and two subsets of linear size; our algorithm chooses both quadratic-size subsets even though one could be replaced by a linear-size subset.
Let be comprised of five disjoint vertex sets , , , , with vertices each (respectively named with the corresponding lowercase letter and indexed by ) and a distinct vertex . Further, let and be sufficiently low and high positive values, respectively. Then, with
is a DAG, thus its MEDis unique [2] and must be part of any feasible solution for the MMCFS instance , see Section 2. The MEDof consists of the edges and in particular can satisfy all demands with . The “remaining” capacity of for each of the edges in (and for edges in ) is also sufficient to satisfy the demands of the commodities , and to almost satisfy the demands for the commodities : it only leaves a demand of for every such . Thus, there exists a feasible solution for that contains all of the edges from and additionally, to satisfy the remaining demands (of size each), the edges from . This feasible solution gives an upper bound on the objective value of the optimal integral solution: .
In contrast, the algorithm from Theorem 14 also includes all edges of the MED in its solution, but must additionally choose (at least) the edges from , resulting in an objective value . This is because every optimal solution for the relaxation of ILP (1) routes the commodities over the edges themselves: they have a lower cost than the alternative --path of length 2 over the edges in .
For increasing , the ratio of the algorithmic solution’s objective value to the optimum is
Combining Theorem 14 with Sections 5 and 5, we obtain:
Corollary 17.
The approximation ratio for the algorithm given by Theorem 14 is tight for all and all with .
Remark 18.
There are instances where the integrality gap of ILP (1) is – e.g. trees, where the unique optimal integral solution contains every edge while the unique optimal fractional solution chooses for all edges of the input graph. Interestingly, the approximation ratio of our algorithm (for ) equals the integrality gap exactly, but on completely different instances and not on trees (where the algorithm always finds the optimal solution).
6 Conclusion and Open Questions
We introduced the practically motivated Minimum Multi-Commodity Flow Subgraph (MMCFS) problem and paved the way for further research by giving several structural results, most importantly a reformulation of this traffic-oblivious problem that only needs to consider a single specific traffic matrix. Further, we showed that MMCFS is NP-hard (and a closely related problem cannot be approximated within a sublogarithmic factor) already on DAGs. Lastly, we gave an extremely simple LP-rounding scheme for MMCFS with a tight approximation guarantee of .
Considering seemingly related problems (see Section 2), one observes that an approximation ratio of 2 (which we attain for the practically most relevant cases of [34]) often arises as a seemingly “natural” limit for such ratios. Yet, it remains an open question whether there exists an approximation algorithm for MMCFS with a better quality guarantee, and whether there is a non-trivial lower bound on the approximation guarantee for any such algorithm (assuming ). Further, it might be of interest to explore several generalizations of MMCFS: this includes the non-traffic-oblivious variant where a specific traffic matrix is part of the input (which is also NP-hard via Theorem 3), and the variant where, given an additional cost function on the edges, one asks for a subgraph of minimum cost.
References
- [1] Abu Reyan Ahmed, Greg Bodwin, Faryad Darabi Sahneh, Keaton Hamm, Mohammad Javad Latifi Jebelli, Stephen G. Kobourov, and Richard Spence. Graph spanners: A tutorial review. Comput. Sci. Rev., 37:100253, 2020. doi:10.1016/J.COSREV.2020.100253.
- [2] Alfred V. Aho, Michael R. Garey, and Jeffrey D. Ullman. The transitive reduction of a directed graph. SIAM J. Comput., 1(2):131–137, 1972. doi:10.1137/0201008.
- [3] Ravindra K. Ahuja, Thomas L. Magnanti, and James B. Orlin. Network flows – Theory, algorithms and applications. Prentice Hall, 1993.
- [4] Yacine Al-Najjar, Walid Ben-Ameur, and Jérémie Leguay. On the approximability of robust network design. Theor. Comput. Sci., 860:41–50, 2021. doi:10.1016/J.TCS.2021.01.026.
- [5] Yacine Al-Najjar, Walid Ben-Ameur, and Jérémie Leguay. Approximability of robust network design: The directed case. In Proc. STACS 2022, volume 219 of LIPIcs, pages 6:1–6:16. Schloss Dagstuhl – Leibniz-Zentrum für Informatik, 2022. doi:10.4230/LIPICS.STACS.2022.6.
- [6] Reid Andersen and Uriel Feige. Interchanging distance and capacity in probabilistic mappings. CoRR, abs/0907.3631, 2009. arXiv:0907.3631.
- [7] Alexandr Andoni, Anupam Gupta, and Robert Krauthgamer. Towards (1 + )-approximate flow sparsifiers. In Proc. SODA 2014, pages 279–293. SIAM, 2014. doi:10.1137/1.9781611973402.20.
- [8] Spyridon Antonakopoulos. Approximating directed buy-at-bulk network design. In Proc. WAOA 2010, volume 6534 of LNCS, pages 13–24. Springer, 2010. doi:10.1007/978-3-642-18318-8_2.
- [9] Yossi Azar, Edith Cohen, Amos Fiat, Haim Kaplan, and Harald Räcke. Optimal oblivious routing in polynomial time. J. Comput. Syst. Sci., 69(3):383–394, 2004. doi:10.1016/J.JCSS.2004.04.010.
- [10] Walid Ben-Ameur and Hervé Kerivin. Routing of uncertain traffic demands. Optimization and Engineering, 6:283–313, 2005. doi:10.1145/777313.777314.
- [11] Piotr Berman, Bhaskar DasGupta, and Marek Karpinski. Approximating transitive reductions for directed networks. In Proc. WADS 2009, volume 5664 of LNCS, pages 74–85. Springer, 2009. doi:10.1007/978-3-642-03367-4_7.
- [12] Randeep Bhatia, Fang Hao, Murali S. Kodialam, and T. V. Lakshman. Optimized network traffic engineering using segment routing. In Proc. INFOCOM 2015, pages 657–665. IEEE, 2015. doi:10.1109/INFOCOM.2015.7218434.
- [13] Chandra Chekuri, Mohammad Taghi Hajiaghayi, Guy Kortsarz, and Mohammad R. Salavatipour. Approximation algorithms for nonuniform buy-at-bulk network design. SIAM J. Comput., 39(5):1772–1798, 2010. doi:10.1137/090750317.
- [14] Luca Chiaraviglio, Marco Mellia, and Fabio Neri. Reducing power consumption in backbone networks. In Proc. ICC 2009, pages 1–6. IEEE, 2009. doi:10.1109/ICC.2009.5199404.
- [15] Markus Chimani and Max Ilsen. Capacity-preserving subgraphs of directed flow networks. In Proc. IWOCA 2023, volume 13889 of LNCS, pages 160–172. Springer, 2023. doi:10.1007/978-3-031-34347-6_14.
- [16] Markus Chimani and Max Ilsen. Traffic-oblivious multi-commodity flow network design. CoRR, abs/2504.16744, 2025. doi:10.48550/arXiv.2504.16744.
- [17] Irit Dinur and David Steurer. Analytical approach to parallel repetition. In Proc. STOC 2014, pages 624–633. ACM, 2014. doi:10.1145/2591796.2591884.
- [18] Matthias Englert, Anupam Gupta, Robert Krauthgamer, Harald Räcke, Inbal Talgam-Cohen, and Kunal Talwar. Vertex sparsifiers: New results from old techniques. SIAM J. Comput., 43(4):1239–1262, 2014. doi:10.1137/130908440.
- [19] Clarence Filsfils, Stefano Previdi, Les Ginsberg, Bruno Decraene, Stephane Litkowski, and Rob Shakir. Segment routing architecture. RFC, 8402:1–32, 2018. doi:10.17487/RFC8402.
- [20] Les R. Foulds. A multi-commodity flow network design problem. Transportation Research Part B: Methodological, 15(4):273–283, 1981. doi:10.1016/0191-2615(81)90013-8.
- [21] Michael R. Garey and David S. Johnson. Computers and Intractability: A Guide to the Theory of NP-Completeness. W. H. Freeman, 1979.
- [22] Bernard Gendron, Teodor Gabriel Crainic, and Antonio Frangioni. Multicommodity Capacitated Network Design, pages 1–19. Springer US, Boston, MA, 1999. doi:10.1007/978-1-4615-5087-7_1.
- [23] Bernard Gendron and Mathieu Larose. Branch-and-price-and-cut for large-scale multicommodity capacitated fixed-charge network design. EURO J. Comput. Optim., 2(1-2):55–75, 2014. doi:10.1007/S13675-014-0020-9.
- [24] Mohammad Taghi Hajiaghayi, Robert D. Kleinberg, Harald Räcke, and Tom Leighton. Oblivious routing on node-capacitated and directed graphs. ACM Trans. Algorithms, 3(4):51, 2007. doi:10.1145/1290672.1290688.
- [25] Renaud Hartert, Pierre Schaus, Stefano Vissicchio, and Olivier Bonaventure. Solving segment routing problems with hybrid constraint programming techniques. In Proc. CP 2015, volume 9255 of LNCS, pages 592–608. Springer, 2015. doi:10.1007/978-3-319-23219-5_41.
- [26] Kamal Jain. A factor 2 approximation algorithm for the generalized steiner network problem. Comb., 21(1):39–60, 2001. doi:10.1007/s004930170004.
- [27] Richard M. Karp. Reducibility among combinatorial problems. In Proc. COCO 1972, The IBM Research Symposia Series, pages 85–103. Plenum Press, New York, 1972. doi:10.1007/978-1-4684-2001-2_9.
- [28] Samir Khuller, Balaji Raghavachari, and Neal E. Young. Designing multi-commodity flow trees. Inf. Process. Lett., 50(1):49–55, 1994. doi:10.1016/0020-0190(94)90044-2.
- [29] Samir Khuller, Balaji Raghavachari, and Neal E. Young. Approximating the minimum equivalent digraph. SIAM J. Comput., 24(4):859–872, 1995. doi:10.1137/S0097539793256685.
- [30] Frank Thomson Leighton and Ankur Moitra. Extensions and limits to vertex sparsification. In Proc. STOC 2010, pages 47–56. ACM, 2010. doi:10.1145/1806689.1806698.
- [31] Vardges Melkonian and Éva Tardos. Algorithms for a network design problem with crossing supermodular demands. Networks, 43(4):256–265, 2004. doi:10.1002/NET.20005.
- [32] Ankur Moitra. Approximation algorithms for multicommodity-type problems with guarantees independent of the graph size. In Proc. FOCS 2009, pages 3–12. IEEE Computer Society, 2009. doi:10.1109/FOCS.2009.28.
- [33] Dana Moshkovitz. The projection games conjecture and the np-hardness of ln n-approximating set-cover. Theory Comput., 11:221–235, 2015. doi:10.4086/toc.2015.v011a007.
- [34] Daniel Otten, Max Ilsen, Markus Chimani, and Nils Aschenbruck. Green traffic engineering by line card minimization. In Proc. LCN 2023, pages 1–8. IEEE, 2023. doi:10.1109/LCN58197.2023.10223344.
- [35] Harald Räcke. Minimizing congestion in general networks. In Proc. FOCS 2002, pages 43–52. IEEE Computer Society, 2002. doi:10.1109/SFCS.2002.1181881.
- [36] Harald Räcke. Optimal hierarchical decompositions for congestion minimization in networks. In Proc. STOC 2008, pages 255–264. ACM, 2008. doi:10.1145/1374376.1374415.
- [37] Sartaj Sahni. Computationally related problems. SIAM J. Comput., 3(4):262–279, 1974. doi:10.1137/0203021.
- [38] Khodakaram Salimifard and Sara Bigharaz. The multicommodity network flow problem: state of the art classification, applications, and solution methods. Oper. Res., 22(1):1–47, 2022. doi:10.1007/S12351-020-00564-8.
- [39] F. Sibel Salman, Joseph Cheriyan, R. Ravi, and S. Subramanian. Buy-at-bulk network design: Approximating the single-sink edge installation problem. In Proc. SODA 1997, pages 619–628. ACM/SIAM, 1997. URL: http://dl.acm.org/citation.cfm?id=314161.314397.
- [40] Timmy Schüller, Nils Aschenbruck, Markus Chimani, Martin Horneffer, and Stefan Schnitter. Traffic engineering using segment routing and considering requirements of a carrier IP network. IEEE/ACM Trans. Netw., 26(4):1851–1864, 2018. doi:10.1109/TNET.2018.2854610.
- [41] Vijay V. Vazirani. Approximation algorithms. Springer, 2001. URL: http://www.springer.com/computer/theoretical+computer+science/book/978-3-540-65367-7.
- [42] Adrian Vetta. Approximating the minimum strongly connected subgraph via a matching lower bound. In Proc. SODA 2001, pages 417–426. ACM/SIAM, 2001. URL: http://dl.acm.org/citation.cfm?id=365411.365493.
- [43] David P. Williamson and David B. Shmoys. The Design of Approximation Algorithms. Cambridge University Press, 2011. URL: http://www.cambridge.org/de/knowledge/isbn/item5759340/.
- [44] Mingui Zhang, Cheng Yi, Bin Liu, and Beichuan Zhang. GreenTE: Power-aware traffic engineering. In Proc. ICNP 2010, pages 21–30. IEEE Computer Society, 2010. doi:10.1109/ICNP.2010.5762751.
- [45] Liang Zhao, Hiroshi Nagamochi, and Toshihide Ibaraki. A linear time 5/3-approximation for the minimum strongly-connected spanning subgraph problem. Inf. Process. Lett., 86(2):63–70, 2003. doi:10.1016/S0020-0190(02)00476-3.
