Traffic-Oblivious Multi-Commodity Flow Network Design

Chimani, Markus; Ilsen, Max

doi:10.4230/LIPIcs.ISAAC.2025.19

Markus Chimani

Theoretical Computer Science, Osnabrück University, Germany

Max Ilsen (corresponding author)

Theoretical Computer Science, Osnabrück University, Germany

Abstract

We consider the Minimum Multi-Commodity Flow Subgraph (MMCFS) problem: given a directed graph $G$ with edge capacities cap and a retention ratio $\alpha\in(0,1)$ , find an edge-wise minimum subgraph $G^{\prime}\subseteq G$ such that for all traffic matrices $T$ routable in $G$ using a multi-commodity flow, $\alpha\cdot T$ is routable in $G^{\prime}$ . This natural yet novel problem is motivated by recent research that investigates how the power consumption in backbone computer networks can be reduced by turning off connections during times of low demand without compromising the quality of service. Since the actual traffic demands are generally not known beforehand, our approach must be traffic-oblivious, i.e., work for all possible sets of simultaneously routable traffic demands in the original network.

In this paper we present the problem, relate it to other known problems in literature, and show several structural results, including a reformulation, maximum possible deviations from the optimum, and NP-hardness (as well as a certain inapproximability) already on very restricted instances. The most significant contribution is a $\max(\nicefrac{{1}}{{\alpha}},2)$ -approximation based on a surprisingly simple LP-rounding scheme. We also give instances where this worst-case approximation ratio is met and thus prove that our analysis is tight.

Keywords and phrases:

Multi-commodity flow, Digraphs, LP-rounding, Approximation algorithm

2012 ACM Subject Classification:

Mathematics of computing → Network flows; Mathematics of computing → Approximation algorithms; Networks → Network design and planning algorithms

Related Version:

Previous Version: https://arxiv.org/abs/2504.16744v1 [16]

Funding:

Funded by the Deutsche Forschungsgemeinschaft (DFG, German Research Foundation) grant 461207633 (CH 897/7-2).

DOI:

10.4230/LIPIcs.ISAAC.2025.19

Event:

36th International Symposium on Algorithms and Computation (ISAAC 2025)

Editors:

Ho-Lin Chen, Wing-Kai Hon, and Meng-Tsung Tsai

Series and Publisher:

Leibniz International Proceedings in Informatics, Schloss Dagstuhl – Leibniz-Zentrum für Informatik

1 Introduction

We present the (surprisingly seemingly novel) network design problem Minimum Multi-Commodity Flow Subgraph (MMCFS): given a directed flow network $G$ with edge capacities and a retention ratio $\alpha\in(0,1)$, find a subnetwork $G^{\prime}\subseteq G$ of minimum size such that $G^{\prime}$ still allows for a multi-commodity flow (MCF) routing of any traffic demands routable in $G$ when they are scaled down by factor $\alpha$.

The problem arises naturally in recent research concerning power saving in backbone (Tier 1) networks of Internet service providers. There, the overall amount of traffic has distinct peaks in the evenings (when people are, e.g., streaming videos) and lows late at night and in the early mornings [40]. Clearly, the networks are built to handle the peak times. This opens up the possibility to reduce the power consumption of the network by turning off some resources – e.g., connections, line cards, or servers – during low traffic periods [44, 14].

So consider a computer network that allows for the simultaneous routing of the traffic at peak times. This traffic is comprised of a set of commodities, where each commodity is identified by a pair of nodes $(s,t)$ in the network and has a demand $d$ specifying that $d$ flow units have to be sent from $s$ to $t$ . The entirety of the demands for each pair of nodes is encoded in a traffic matrix $T$ . Commonly, one makes the simplifying assumption that during low traffic periods the traffic demands are upper bounded by a down-scaling of $T$ using a factor $\alpha\in(0,1)$ ; the most practically relevant scenarios concern $\alpha\geq\frac{1}{2}$ [34]. The task now is to minimize the size of the network by deactivating connections such that the reduced network still accommodates a routing of the scaled-down demands $\alpha\cdot T$ . However, in practice the traffic matrix $T$ may change from day to day (in fact, even within sampling windows of 15 minutes) and while we can assume the capacity of the network to be large enough for all occurring traffic at any given time, $T$ is usually not known beforehand. Thus, our solution should be traffic-oblivious, i.e., independent of any specific traffic matrix.

Technically, routing in realistic scenarios is not done via fully general multi-commodity flows: while MCF would be optimal in terms of minimizing congestion [40], it would be too complicated and temporally unstable for the routing hardware. Instead, simpler techniques like 2-segment routing are used (which, in contrast to trivial shortest path routing, routes flow along a sequence of two chained shortest subpaths) [19]. Interestingly, studies show that in realistic networks, these routing solutions are virtually identical to those achieved by MCF [40, 12]. At the same time, for a given fixed network and traffic matrix, an MCF can be computed in polynomial time while establishing an optimal routing table for 2-segment routing (which is then deployable on router hardware) is NP-hard [25]. Thus, we describe the feasibility of solutions in our problem setting in terms of routability via MCF, and assume that realistic (simpler) routing protocols will still be able to attain effective routability.

Let us give a formal description of our problem: Given a directed graph (or digraph) $G=(V,E)$ with positive edge capacities $\textit{cap}\colon E\nobreak\ \to\nobreak\ \mathbb{Q}$ and a traffic matrix $T$ , a flow $f_{s,t}\colon E\nobreak\ \to\nobreak\ \mathbb{Q}$ from a vertex $s\in V$ to a vertex $t\in V$ is a function satisfying the flow conservation constraints

$$\sum_{uv\in E}f_{s,t}(uv)-\sum_{vu\in E}f_{s,t}(vu)\;=\;\begin{cases}-T(s,t)&\text{if }v=s\\ T(s,t)&\text{if }v=t\\ 0&\text{else}\end{cases}\qquad\forall v\in V.$$

A multi-commodity flow (MCF) is a set of flows $\mathcal{F}=\{f_{s,t}\mid(s,t)\in V^{2}\}$ satisfying

$$\sum_{(s,t)\in V^{2}}f_{s,t}(uv)\leq\textit{cap}(uv)\qquad\forall uv\in E.$$

For an edge $e=st\in E$ , we may use the shorthand notation $T(e)\coloneqq T(s,t)$ . Further, we call a traffic matrix $T$ routable in an edge set $A\subseteq E$ if there exists an MCF $\mathcal{F}$ for $(G,\textit{cap},T)$ with $\sum_{f\in\mathcal{F}}f(e)=0$ for all $e\notin A$ . Based on this notion, we define the Minimum Multi-Commodity Flow Subgraph (MMCFS) problem as follows: given a digraph $G=(V,E)$ with edge capacities cap and a retention ratio $\alpha\in(0,1)$ , find the edge set $A\subseteq E$ with minimum cardinality such that for all traffic matrices $T$ that are routable in $E$ , $\alpha\cdot T$ is routable in $A$ .
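The two constraint families above (flow conservation per commodity and the joint capacity bound) can be checked mechanically for a given candidate routing. The following is a minimal sketch, not part of the original paper: the function name `is_valid_mcf`, the dictionary-based encoding of `cap`, `T`, and the flows, and the non-negativity requirement on flow values are assumptions made for illustration. It only verifies a given set of flows; it does not compute one.

```python
def is_valid_mcf(vertices, cap, T, flows, allowed=None):
    """Return True iff `flows` is an MCF for (G, cap, T) using only `allowed` edges.

    vertices: iterable of vertex names
    cap:      dict edge (u, v) -> positive capacity
    T:        dict commodity (s, t) -> demand; missing pairs have demand 0
    flows:    dict commodity (s, t) -> dict edge -> flow value
    allowed:  optional edge set A; defaults to all edges of G
    """
    edges = set(cap)
    allowed = edges if allowed is None else set(allowed)
    total = dict.fromkeys(edges, 0)

    for f in flows.values():
        for e, value in f.items():
            # Assumption: flow values are non-negative and live on existing edges.
            if e not in edges or value < 0:
                return False
            # Routability in A: no flow may use an edge outside A.
            if value > 0 and e not in allowed:
                return False
            total[e] += value

    # Flow conservation: inflow minus outflow is -T(s,t) at s, T(s,t) at t, else 0.
    for s, t in set(T) | set(flows):
        f = flows.get((s, t), {})
        demand = T.get((s, t), 0)
        for v in vertices:
            inflow = sum(x for (a, b), x in f.items() if b == v)
            outflow = sum(x for (a, b), x in f.items() if a == v)
            expected = (demand if v == t else 0) - (demand if v == s else 0)
            if inflow - outflow != expected:
                return False

    # Joint capacity constraint over all commodities.
    return all(total[e] <= cap[e] for e in edges)


# Hypothetical example: a triangle with unit capacities and one commodity.
V = ["a", "b", "c"]
cap = {("a", "b"): 1, ("b", "c"): 1, ("a", "c"): 1}
T = {("a", "c"): 2}
flow = {("a", "c"): {("a", "b"): 1, ("b", "c"): 1, ("a", "c"): 1}}
```

In this example, `flow` witnesses that $T$ is routable in $E$, while the halved demand is routable in the edge set $\{ab, bc\}$ alone, so that edge set passes the check for this particular $T$ with $\alpha=\frac{1}{2}$.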

Throughout this paper, when mentioning a problem’s name or the corresponding abbreviation in sans-serif typeface, e.g. MMCFS, we refer to the optimization question. To indicate that a subgraph is an optimal solution for the problem, we will denote it by the abbreviation in normal typeface, e.g. an MMCFS $(V,A)$ with $A\subseteq E$ .

Our contribution.

We present the MMCFS problem, which has both a natural formulation and practical applicability. After discussing related problems from literature in Section 2, we give some structural results in Section 3: We show how the MMCFS problem, even though it is traffic-oblivious in nature, can be reformulated to consider a specific single “hardest” traffic matrix. We also establish how an MCF can be routed in an optimal MMCFS solution, and how the ratio between the values of a feasible MMCFS solution and an optimal one relates to their average edge capacities. In Section 4, we prove that MMCFS is NP-hard already with unit edge capacities. Additionally, we show that it is NP-hard (and a closely related problem cannot be approximated within a sublogarithmic factor) already on directed acyclic graphs (DAGs). Our most important contribution is given in Section 5, where we present a $\max(\nicefrac{{1}}{{\alpha}},2)$-approximation algorithm for MMCFS: after modelling MMCFS as an ILP, we can deduce a surprisingly simple LP-rounding scheme, whose complexity is solely shifted to the correctness proof. Moreover, we show that our analysis of this algorithm is tight.

2 Related Work

There is a rich body of work on multi-commodity flows – see e.g. [3, Ch. 17] for a primer on this topic and [38] for a recent literature review. The ability to route an MCF in an MMCFS solution not only determines the latter’s feasibility, the problem of routing an MCF also has many close ties to several other network design problems. These, however, involve constraints unrelated to MMCFS, are usually not traffic-oblivious, and mostly focus on undirected graphs. Concerning approaches on directed graphs, Foulds [20] minimizes the cost of an MCF in a bidirected network where the use of some unidirectional arcs is prohibited to reduce congestion. Gendron et al. [22, 23] discuss a directed MCF problem that considers costs for both the installation of an edge and the amount of flow routed over it. Further, in buy-at-bulk network design [8, 39, 13], capacity on edges must be bought as cheaply as possible such that a given traffic matrix becomes routable – with the caveat that larger amounts of capacity can be bought at a lower price per capacity unit.

In robust network design [10, 4, 9, 24], possible traffic matrices are given as an uncertainty set in the form of a polytope, and the objective is usually to minimize the cost of reserving capacity on the edges. While the dynamic routing variant considers a different MCF for every traffic matrix in the polytope, static routing specifies a fixed unit flow for each commodity that is only scaled with the respective demand value. For directed graphs, Al-Najjar et al. [5] show that an exact algorithm for static routing would yield an $\mathcal{O}(|V|)$-approximation for dynamic routing. Our result can be seen as a better approximation ratio for the special uncertainty set $\{\alpha\cdot T\mid\alpha\in(0,1),\nobreak\ T\text{ is routable in }G\}$ under dynamic routing, but w.r.t. minimizing the number of edges in a subgraph rather than the cost of reserved capacity.

There are also several related graph construction and subgraph minimization problems: Khuller et al. [28] give an approximation algorithm for the construction of an undirected tree with constant degree that accommodates given traffic demands between its leaves such that the maximum load on any edge is minimized. Otten et al. [34] evaluate an integer linear program (ILP) and a heuristic for a green traffic engineering problem on digraphs – however, there a specific traffic matrix is also given (and rather than finding an edge subset of minimum cardinality, they minimize the number of “line cards”, i.e., sets of 8 incident edges at each vertex). Another well-known topic in the realm of subgraph minimization problems is that of spanners [1], i.e., subgraphs that preserve the length of a shortest path within a given ratio (stretch factor) between each pair of vertices. There exists a correspondence between upper bounds on the stretch of shortest paths and the congestion of MCFs; however, this only applies to the existence of probabilistic mappings in undirected graphs [6, 36]. Nonetheless, this correspondence was used to find flow sparsifiers in undirected graphs $G$ [18]. While an MMCFS solution is similar to a flow sparsifier that preserves the congestion up to a factor $\frac{1}{\alpha}$, they differ in that a flow sparsifier is an entirely new (undirected) graph, not necessarily a subgraph, containing a subset of the vertices of $G$ but both old and new edges [32, 30, 7].

Closely related to MMCFS are classical Directed Survivable Network Design (DSND) problems, where, given a (possibly capacitated) input digraph $G=(V,E)$ and a requirement function $r\colon V^{2}\to\mathbb{N}$ , one aims to find an edge-wise minimum subgraph of $G$ in which one can send a flow of value $r(s,t)$ from $s$ to $t$ for every $(s,t)\in V^{2}$ . On undirected graphs with unit edge capacities, there exists a 2-approximation by Jain [26], which has been adapted to directed instances, but only for a very restricted set of requirement functions [31]. The DSND problem most similar to MMCFS is the Minimum Capacity-Preserving Subgraph (MCPS) problem [15], where the requirement value for a vertex pair $(s,t)$ equals a fraction $\beta$ of the value of a maximum flow from $s$ to $t$ . However, in all of the aforementioned DSND approaches, each routed commodity is considered in isolation from the others, whereas in the MMCFS setting, all commodities are routed simultaneously. For example, given a digraph $G=(V,E)$ with edge capacities cap and a traffic matrix $T$ routable in $E$ , the scaled-down matrix $\alpha\cdot T$ is not necessarily routable in any optimal MCPS-solution of $(G,\textit{cap},\beta)$ since some edges may be congested by the simultaneous routing of multiple commodities. This is especially easy to see for $\alpha\geq\beta$ but even holds true when $\alpha\ll\beta$ :

Figure 1: Digraph $G$ constructed in the proof of Observation 1. Edges of the unique optimal MCPS-solution $E^{\prime}$ of $(G,\textit{cap},\beta)$ are solid (black), and the remaining edges $E_{XY}$ are dashed (orange).

Observation 1.

Given an arbitrarily small $\alpha\in(0,1)$ and an arbitrarily large $\beta\in(0,1)$ , there exists a digraph $G=(V,E)$ with edge capacities cap and a traffic matrix $T$ such that $T$ is routable in $E$ , but $\alpha\cdot T$ is not routable in any optimal MCPS-solution $E^{\prime}$ of $(G,\textit{cap},\beta)$ .

Proof.

Let $C\coloneqq\left\lceil\frac{2\beta}{1-\beta}\right\rceil$ be a (high) edge capacity value and $k>\sqrt{\frac{C}{\alpha}}$ a number of vertices. Construct $G=(V,E)$ (visualized in Figure 1) as follows: Create two vertex sets $X$ and $Y$ with $k$ vertices each, as well as two distinct vertices $z_{1},z_{2}$ , and let $V\coloneqq X\cup Y\cup\{z_{1},z_{2}\}$ . Moreover, let $E_{XY}\coloneqq X\times Y$ , $E^{\prime}\coloneqq(X\times\{z_{1}\})\cup\{z_{1}z_{2}\}\cup(\{z_{2}\}\times Y)$ , and $E\coloneqq E_{XY}\cup E^{\prime}$ . Edge capacities and the traffic matrix are chosen as follows:

$$\textit{cap}(uv)=\begin{cases}1&\text{if }uv\in E_{XY},\\ C&\text{otherwise;}\end{cases}\qquad T(u,v)=\begin{cases}1&\text{if }uv\in E_{XY},\\ 0&\text{otherwise.}\end{cases}$$

Every non-zero demand $T(u,v)$ can be routed in $(G,\textit{cap})$ using the respective edge $uv\in E_{XY}$ .

The only optimal MCPS-solution of $(G,\textit{cap},\beta)$ is $E^{\prime}$ : For every vertex pair $(u,v)\in E^{\prime}$ , the only $u$ - $v$ -path in $G$ is the one consisting of the edge $u v$ – so the edges $E^{\prime}$ must be in the solution. $E^{\prime}$ also establishes a maximum flow of sufficient value for the remaining vertex pairs. In particular, for every vertex pair $(u,v)\in X\times Y$ , there exists a maximum $u$ - $v$ -flow of value $C$ in $E^{\prime}$ , which is at least $\beta$ times the value $(C+1)$ of a maximum $u$ - $v$ -flow in $E$ :

$$C=\left\lceil\frac{2\beta}{1-\beta}\right\rceil\geq\beta\cdot\frac{2\cdot(1+\beta-\beta)}{1-\beta}=\beta\left(\frac{2\beta}{1-\beta}+2\right)\geq\beta\left(\left\lceil\frac{2\beta}{1-\beta}\right\rceil+1\right)=\beta\cdot(C+1)$$

However, the scaled matrix $\alpha\cdot T$ is not routable in $E^{\prime}$ : $|X|\cdot|Y|=k^{2}>\frac{C}{\alpha}$ many demands of value $\alpha$ would have to be routed over the single edge $z_{1}z_{2}$ , exceeding its capacity $C$ . $\hfill\blacktriangleleft$
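The construction in this proof is easy to instantiate for concrete values. The sketch below is an illustration only: the function names, the vertex labels, and the choice of the smallest admissible integer $k$ are assumptions, not taken from the paper. It builds the capacities and the traffic matrix exactly as defined above and evaluates the two inequalities the proof relies on, namely $C\geq\beta\cdot(C+1)$ and $\alpha\cdot k^{2}>C$.

```python
from fractions import Fraction
from math import ceil, floor, isqrt


def observation1_instance(alpha, beta):
    """Build (C, k, cap, T, E_prime) following the construction in the proof."""
    alpha, beta = Fraction(alpha), Fraction(beta)
    C = ceil(2 * beta / (1 - beta))
    # Assumption: take the smallest integer k with k^2 > C / alpha.
    k = isqrt(floor(C / alpha)) + 1
    X = [f"x{i}" for i in range(k)]
    Y = [f"y{i}" for i in range(k)]
    E_XY = {(x, y) for x in X for y in Y}
    E_prime = {(x, "z1") for x in X} | {("z1", "z2")} | {("z2", y) for y in Y}
    cap = {e: 1 for e in E_XY}
    cap.update({e: C for e in E_prime})
    T = {e: 1 for e in E_XY}  # unit demand for every pair in X x Y
    return C, k, cap, T, E_prime


def check_observation1(alpha, beta):
    """Return (E' preserves a beta-fraction of each X-Y max flow, z1z2 is overloaded)."""
    alpha, beta = Fraction(alpha), Fraction(beta)
    C, k, cap, T, E_prime = observation1_instance(alpha, beta)
    mcps_ok = C >= beta * (C + 1)
    # In E', every scaled demand has to cross the single edge z1z2.
    load = alpha * sum(T.values())
    return mcps_ok, load > cap[("z1", "z2")]
```

For instance, $\alpha=\frac{1}{10}$ and $\beta=\frac{9}{10}$ give $C=18$ and $k=14$, so the edge $z_{1}z_{2}$ would have to carry $\frac{196}{10}>18$ units of flow.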

Lastly, we want to highlight similarities of MMCFS to the well-established NP-hard Minimum Equivalent Digraph (MED) problem [2, 37, 21, 27], where, given a digraph $G$ , one asks for the edge-wise minimum subgraph of $G$ that preserves the reachability relation of $G$ . In Section 4, we show that MED is a special case of MMCFS. Further, we observe:

Observation 2.

In a simple DAG $G$, the unique minimum equivalent digraph (MED) of $G$ must be contained in every feasible MMCFS solution of $G$, regardless of edge capacities and $\alpha$.

Proof.

In any simple DAG $G$, the MED is unique and consists of exactly those edges $st$ for which there is no $s$-$t$-path in $G-st$ [2]. A feasible MMCFS solution must also contain these edges $st$ in order to allow for a non-zero amount of flow from $s$ to $t$. $\hfill\blacktriangleleft$

However, in general digraphs, a feasible MMCFS solution may not always contain the MED, see Figure 2. Note that MED is not only polynomial-time solvable on DAGs, but there are also several polynomial approximation algorithms for general graphs [29, 45] with the currently best approximation ratio being 1.5 [11, 42].
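The characterization used in the proof of Observation 2 (an edge $st$ belongs to the MED of a simple DAG iff there is no $s$-$t$-path in $G-st$) translates directly into a small procedure. The following sketch is an illustration under that assumption; the name `dag_med` and the edge-set encoding are hypothetical, and the straightforward per-edge search is chosen for readability rather than efficiency. It is only meaningful for simple DAGs.

```python
def dag_med(edges):
    """Return the edges st of a simple DAG for which G - st has no s-t-path."""
    edges = set(edges)
    succ = {}
    for u, v in edges:
        succ.setdefault(u, set()).add(v)

    def reachable_without(s, t, skipped):
        # Depth-first search from s to t that ignores the edge `skipped`.
        stack, seen = [s], {s}
        while stack:
            u = stack.pop()
            for v in succ.get(u, ()):
                if (u, v) == skipped or v in seen:
                    continue
                if v == t:
                    return True
                seen.add(v)
                stack.append(v)
        return False

    return {(s, t) for (s, t) in edges if not reachable_without(s, t, (s, t))}
```

On the triangle DAG with edges $ab$, $bc$, $ac$, the call `dag_med({("a", "b"), ("b", "c"), ("a", "c")})` drops the shortcut $ac$ and keeps the other two edges, which by Observation 2 must appear in every feasible MMCFS solution of that graph.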

Figure 2: An MMCFS instance with an optimal solution (given by the solid edges) that does not contain the MED. All edge capacities are 1, and $\alpha=\frac{1}{2}$. The MED, which is unique in this example and drawn in black, is not a feasible MMCFS solution: for $T$ with $T(s,t)=\textit{cap}(st)$ if $e=st\in E$ and 0 otherwise, the dashed edge would have to accommodate a flow of $\frac{3}{2}$ to satisfy all demands, but it only has a capacity of 1.

3 Structural Results

We present some structural insights concerning MMCFS that give a deeper understanding of the problem. Most importantly, we give a reformulation of MMCFS that is used throughout the rest of the paper to obtain structural and algorithmic results: Recall that a feasible solution $A$ for a given MMCFS instance $(G=(V,E),\textit{cap},\alpha)$ is defined as an edge set $A\subseteq E$ such that for all traffic matrices $T$ that are routable in $E$ , $\alpha\cdot T$ is routable in $A$ . Interestingly, instead of explicitly considering all routable traffic matrices $T$ , it suffices to consider the single specific traffic matrix $\mathbb{T}_{E}$ , which forces each edge to be utilized to its full capacity but has no demands between non-adjacent vertices:

\mathbb{T}_{E}(s,t)\coloneqq\begin{cases*}\textit{cap}(e)&if $e=st\in E$,\\ 0&otherwise.\end{cases*}

We show that an edge set $A\subseteq E$ is a feasible MMCFS solution iff $\alpha\cdot\mathbb{T}_{E}$ is routable in $A$ :

Theorem 3.

Given a digraph $G=(V,E)$ with edge capacities cap, a retention ratio $\alpha\in(0,1)$ , and an edge set $A\subseteq E$ , the following statements are equivalent:

$\blacksquare$ For all traffic matrices $T$ that are routable in $E$, the scaled matrix $\alpha\cdot T$ is routable in $A$.

$\blacksquare$ The scaled matrix $\alpha\cdot\mathbb{T}_{E}$ is routable in $A$.

Proof.

$\mathbb{T}_{E}$ is routable in $E$ by definition. If every traffic matrix routable in $E$ is also routable in $A$ when scaled down by $\alpha$, then so is $\alpha\cdot\mathbb{T}_{E}$. For the other direction, consider any arbitrary traffic matrix $T$ routable in $E$. Let $\{f^{T}_{s,t}\mid(s,t)\in V^{2}\}$ be the MCF that routes $T$ in $E$ with the vector $\mathbf{f}^{T}\coloneqq\sum_{(s,t)\in V^{2}}f^{T}_{s,t}$ specifying the total flow over each edge. Using this MCF, we can construct a new traffic matrix $T^{\prime}$:

T^{\prime}(s,t)\coloneqq\begin{cases*}\mathbf{f}^{T}(st)&if $e=st\in E$,\\ 0&otherwise.\end{cases*}

Observe that $T^{\prime}\leq\mathbb{T}_{E}$ when using component-wise comparison. Thus, since $\alpha\cdot\mathbb{T}_{E}$ is routable in $A$ , so is $\alpha\cdot T^{\prime}$ . But if $\alpha\cdot T^{\prime}$ is routable in $A$ using the flows $\{f^{\alpha\cdot T^{\prime}}_{u,v}\mid(u,v)\in V^{2}\}$ , then $\alpha\cdot T$ is also routable in $A$ using the flows $\{f^{\alpha\cdot T}_{s,t}\mid(s,t)\in V^{2}\}$ constructed as follows: for each commodity $st\in E$ and each edge $uv\in E$ , calculate the fraction of flow routed over $u v$ that is used by $f^{T}_{s,t}$ , and route this fraction over the path chosen by $f^{\alpha\cdot T^{\prime}}_{u,v}$ . In short,

$$f^{\alpha\cdot T}_{s,t}(e)\coloneqq\sum_{uv\in E\colon\mathbf{f}^{T}(uv)>0}\frac{f^{T}_{s,t}(uv)}{\mathbf{f}^{T}(uv)}\cdot f^{\alpha\cdot T^{\prime}}_{u,v}(e).$$

$\hfill\blacktriangleleft$
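The flow composition at the end of this proof can be made concrete with a few lines of code. The sketch below is illustrative only: the function name `compose_flows`, the dictionary encoding, and the example instance are assumptions, and the graph is assumed to be simple so that an edge $uv$ can double as the commodity $(u,v)$. Given an MCF for $T$ in $E$ and an MCF for $\alpha\cdot T^{\prime}$ in $A$, it evaluates the displayed formula for every commodity of $T$.

```python
from fractions import Fraction


def compose_flows(flows_T, flows_scaled_Tprime, edges):
    """Combine an MCF for T (in E) with an MCF for alpha*T' (in A) into flows for alpha*T.

    flows_T:             dict commodity (s, t) -> dict edge -> flow value
    flows_scaled_Tprime: dict edge-commodity (u, v) -> dict edge -> flow value
    edges:               iterable of all edges of G
    """
    edges = list(edges)
    # Total flow f^T(uv) over each edge in the routing of T.
    total = {e: sum(f.get(e, 0) for f in flows_T.values()) for e in edges}
    result = {}
    for commodity, f in flows_T.items():
        routed = {e: Fraction(0) for e in edges}
        for uv in edges:
            if total[uv] == 0:
                continue
            # Share of the flow over uv that belongs to this commodity.
            share = Fraction(f.get(uv, 0)) / total[uv]
            for e, value in flows_scaled_Tprime.get(uv, {}).items():
                routed[e] += share * value
        result[commodity] = routed
    return result


# Hypothetical example with alpha = 1/2 and A = {ab, bc}.
ab, bc, ac = ("a", "b"), ("b", "c"), ("a", "c")
flows_T = {ac: {ab: 1, bc: 1, ac: 1}, ab: {ab: 1}}          # routes T(a,c)=2, T(a,b)=1
flows_half = {ab: {ab: Fraction(1)},                          # routes alpha*T' in A
              bc: {bc: Fraction(1, 2)},
              ac: {ab: Fraction(1, 2), bc: Fraction(1, 2)}}
composed = compose_flows(flows_T, flows_half, [ab, bc, ac])
```

In this example the composed flow for commodity $(a,c)$ sends one unit along $ab$ and $bc$, and the one for $(a,b)$ sends half a unit along $ab$, which are exactly the demands of $T$ scaled by $\alpha=\frac{1}{2}$, with no flow on the removed edge $ac$.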

We note that a related but slightly different concept to Theorem 3 has been implicitly used previously in [35] (on undirected graphs). From this point onwards, we may refer to edges as commodities since $\mathbb{T}_{E}$ specifies a non-zero demand precisely for the edges in $E$ . Moreover, given a flow $f_{\tilde{e}}$ for a commodity $\tilde{e}\in E$ , we call $f_{\tilde{e}}(\tilde{e})$ the direct flow for $\tilde{e}$ . We observe that in an optimal MMCFS solution $A$ , for every commodity $\tilde{e}\in A$ , the demand $\mathbb{T}_{E}(\tilde{e})$ can always be fully satisfied by a direct flow $f_{\tilde{e}}(\tilde{e})$ :

Observation 4.

Let $G=(V,E)$ be a digraph with edge capacities cap and $T$ a traffic matrix routable in an edge set $A\subseteq E$ with $T(e)\leq\textit{cap}(e)$ for all edges $e\in E$ . Then, there exists an MCF $\mathcal{F}=\{f_{s,t}\mid(s,t)\in V^{2}\}$ in the graph $G^{\prime}\coloneqq(V,A)$ satisfying the demands $T$ such that $f_{\tilde{e}}(\tilde{e})=T(\tilde{e})$ for all edges $\tilde{e}\in A$ .

Proof.

Among all MCFs that witness the routability of $T$ in $A$, let $\mathcal{F}^{\prime}=\{f^{\prime}_{s,t}\mid(s,t)\in V^{2}\}$ be one with a maximum sum of direct flow values $\sum_{\tilde{e}\in E}f^{\prime}_{\tilde{e}}(\tilde{e})$.

We give a proof by contradiction: Assume that there exists an edge $e^{\prime}=uv\in A$ such that $f^{\prime}_{e^{\prime}}(e^{\prime})<T(e^{\prime})$ (if no such edge exists, $\mathcal{F}=\mathcal{F}^{\prime}$ and we are done). There must exist at least one alternative $u$-$v$-path $P$ that routes at least some of the remaining demand $T(e^{\prime})-f^{\prime}_{e^{\prime}}(e^{\prime})$. Further, the edge $e^{\prime}$ has residual capacity $\textit{cap}(e^{\prime})-\sum_{(s,t)\in V^{2}}f^{\prime}_{s,t}(e^{\prime})=0$ as otherwise we could increase $f^{\prime}_{e^{\prime}}(e^{\prime})$ (and decrease flow along $P$ accordingly), which would contradict the selection of $\mathcal{F}^{\prime}$. Since $e^{\prime}$ is saturated while $f^{\prime}_{e^{\prime}}(e^{\prime})<T(e^{\prime})\leq\textit{cap}(e^{\prime})$, there exists an edge $e^{\prime\prime}\in E$, $e^{\prime\prime}\neq e^{\prime}$, with $f^{\prime}_{e^{\prime\prime}}(e^{\prime})>0$. But then, we can exchange a non-zero amount $\varepsilon>0$ of flow of commodity $e^{\prime}$ routed over $P$ with an equally small amount of flow of commodity $e^{\prime\prime}$ routed over $e^{\prime}$. This increases $f^{\prime}_{e^{\prime}}(e^{\prime})$ without decreasing any other direct flow value – again a contradiction to the selection of $\mathcal{F}^{\prime}$. $\hfill\blacktriangleleft$

Further, for any edge set $A$, we can compare its total edge capacity $\sum_{e\in A}\textit{cap}(e)$ to the total flow needed to satisfy the demands $\mathbb{T}_{E}$. This not only gives us a necessary condition for an edge set $A$ to be a feasible MMCFS solution, but, upon closer analysis, also allows us to relate its quality as a solution to its mean capacity $\overline{\textit{cap}_{A}}\coloneqq\frac{1}{|A|}\cdot\sum_{e\in A}\textit{cap}(e)$. The following results apply both in the case of simple and non-simple graphs, but we can give better guarantees in the former case.

Theorem 5.

Let $O$ be an optimal solution and $A\neq O$ a feasible solution for an MMCFS instance $(G,\textit{cap},\alpha)$. Then, $\frac{|A|}{|O|}\leq\min\left\{\left(1+\frac{1-\alpha}{\theta\alpha}\right)\cdot\frac{\overline{\textit{cap}_{O}}}{\overline{\textit{cap}_{A}}},\;1+\frac{1-\alpha}{\theta\alpha}\cdot\frac{\overline{\textit{cap}_{O}}}{\overline{\textit{cap}_{A\setminus O}}}\right\}$ with $\theta=2$ if $G$ has no parallel edges and $\theta=1$ otherwise.

Proof.

Let $X\coloneqq A\setminus O$ , and $Y\coloneqq A\cap O$ . The commodities of all $\tilde{e}\in O$ must be routed through $O$ , requiring a total flow of at least $\alpha\sum_{e\in O}\textit{cap}(e)$ and leaving a total remaining capacity in $O$ of at most $(1-\alpha)\sum_{e\in O}\textit{cap}(e)$ . Every commodity $\tilde{e}^{\prime}\in X$ has to be routed within this remaining capacity since $O$ is feasible. This requires a total flow of at least $\theta\alpha\sum_{e\in X}\textit{cap}(e)$ ; the $\theta$ is due to the fact that without parallel edges each such commodity $\tilde{e}^{\prime}$ must be routed over at least two other edges in $O$ since $\tilde{e}^{\prime}\not\in O$ . We thus have

$$\theta\alpha\sum_{e\in X}\textit{cap}(e)\leq(1-\alpha)\sum_{e\in O}\textit{cap}(e).\tag{1}$$

By adding $\theta\alpha\sum_{e\in Y}\textit{cap}(e)\leq\theta\alpha\sum_{e\in O}\textit{cap}(e)$ to this inequality, we obtain

$$\begin{aligned}\theta\alpha\sum_{e\in A}\textit{cap}(e)&\leq(1-\alpha)\sum_{e\in O}\textit{cap}(e)+\theta\alpha\sum_{e\in O}\textit{cap}(e)\\ \theta\alpha\cdot|A|\cdot\overline{\textit{cap}_{A}}&\leq(1-\alpha+\theta\alpha)\cdot|O|\cdot\overline{\textit{cap}_{O}}\\ \frac{|A|}{|O|}&\leq\left(1+\frac{1-\alpha}{\theta\alpha}\right)\cdot\frac{\overline{\textit{cap}_{O}}}{\overline{\textit{cap}_{A}}}.\end{aligned}$$

Alternatively, we can rewrite inequality (1) as $\theta\alpha\cdot|X|\cdot\overline{\textit{cap}_{X}}\leq(1-\alpha)\cdot|O|\cdot\overline{\textit{cap}_{O}}$ and obtain

$$\frac{|A|}{|O|}\leq\frac{|O|+|X|}{|O|}=1+\frac{|X|}{|O|}\leq 1+\frac{1-\alpha}{\theta\alpha}\cdot\frac{\overline{\textit{cap}_{O}}}{\overline{\textit{cap}_{X}}}.$$

$\hfill\blacktriangleleft$

Corollary 6.

Any arbitrary feasible solution for MMCFS (including the trivial one, $E$ itself) is a $(1+\frac{1-\alpha}{\theta\alpha}\cdot\frac{\max_{e\in E}\textit{cap}(e)}{\min_{e\in E}\textit{cap}(e)})$-approximation.

This ratio is met, e.g., on a bundle of parallel edges with $\ell\coloneqq(\frac{1}{\alpha}-1)\cdot k$ capacity-1 edges and one capacity- $k$ edge (for any given $\alpha\in\mathbb{Q}_{(0,1)}$ and an arbitrary $k\in\mathbb{Q}$ s.t. $\ell\in\mathbb{N}$ ).

Corollary 7.

For uniform capacities, any arbitrary feasible solution for MMCFS (including the trivial one, $E$ itself) is a $(1+\frac{1-\alpha}{\theta\alpha})$ -approximation.
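As a quick numerical sanity check of Corollary 6 and the tight bundle example given after it, the following sketch evaluates the bound and compares it to the ratio $\frac{|E|}{|O|}$ on a bundle of parallel edges. The function names are hypothetical, and the feasibility test relies on an assumption specific to this example: in a bundle of parallel edges all commodities share the same endpoints, so a kept edge set is feasible iff its total capacity is at least $\alpha$ times the total capacity of the bundle.

```python
from fractions import Fraction


def trivial_bound(alpha, caps, theta):
    """Approximation guarantee of Corollary 6 for capacities `caps`."""
    alpha = Fraction(alpha)
    spread = Fraction(max(caps)) / Fraction(min(caps))
    return 1 + (1 - alpha) / (theta * alpha) * spread


def bundle_feasible(alpha, caps, kept_caps):
    """Feasibility of keeping `kept_caps` out of a bundle of parallel edges `caps`."""
    return Fraction(alpha) * sum(caps) <= sum(kept_caps)


# Tight example: l = (1/alpha - 1) * k capacity-1 edges plus one capacity-k edge.
alpha, k = Fraction(1, 4), 2
l = int((1 / alpha - 1) * k)
caps = [1] * l + [k]
```

With these values, $\ell=6$, the single capacity-$k$ edge is a feasible solution, and the trivial solution $E$ has $7$ edges, which matches `trivial_bound(alpha, caps, theta=1)` (here $\theta=1$ because the bundle consists of parallel edges).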

4 Complexity

Given that even trivial MMCFS solutions satisfy an approximation guarantee according to Section 3, one might expect MMCFS to be polynomial-time solvable. However, in this section, we show that MMCFS is NP-hard already on DAGs and give a first inapproximability result. We begin by proving that MMCFS is NP-hard already with unit edge capacities using a reduction from MED that directly follows from Theorem 3:

Corollary 8.

MED is the special case of MMCFS with unit edge capacities cap and $\alpha\leq\frac{1}{|E|}$ .

Proof.

An optimal solution $A\subseteq E$ for an MMCFS instance $(G,\textit{cap},\alpha)$ with unit edge capacities is an edge set of minimum cardinality such that the demands $\alpha\cdot\mathbb{T}_{E}(s,t)=\alpha\cdot\textit{cap}(st)\leq\frac{1}{|E|}$ for each edge $st\in E$ are routable in $A$ . This is equivalent to ensuring that there exists an $s$ - $t$ -path in $A$ for every edge $st\in E$ since the (unit) capacity of an edge can never be surpassed by the $|E|$ many flows of size at most $\frac{1}{|E|}$ each. $\hfill\blacktriangleleft$

Moreover, we can show that MMCFS is NP-hard already on DAGs using a reduction from the NP-hard decision variant of Set Cover [27], where one asks: given a universe $U$, a family of sets $\mathcal{S}=\{S_{i}\subseteq U\}_{1\leq i\leq k}$ with $k\in\mathcal{O}(\text{poly}(|U|))$, and a parameter $\varphi$, is there a subfamily $\mathcal{C}\subseteq\mathcal{S}$ of cardinality $|\mathcal{C}|\leq\varphi$ such that $\bigcup_{S\in\mathcal{C}}S=U$? The reduction is similar to that given in [15] to prove the NP-hardness of the Minimum Capacity-Preserving Subgraph problem.

Theorem 9.

For any fixed $\alpha\in(0,1)$, MMCFS is NP-hard already on DAGs where the longest path has length 3.

Proof.

Given a Set Cover instance $(U,\mathcal{S},\varphi)$ and a fixed retention ratio $\alpha\in(0,1)$ , we construct an instance $I=(G=(V,E),\textit{cap},\alpha,\psi)$ for the decision variant of MMCFS: $I$ is a yes-instance if and only if there exists a feasible MMCFS solution $E^{\prime}\subseteq E$ for $(G,\textit{cap},\alpha)$ with cardinality $|E^{\prime}|\leq\psi$ . We construct $I$ as follows (see Figure 3 for a visualization):

$$\begin{aligned}
V&\coloneqq V_{U}\cup V_{\mathcal{S}}\cup V^{t}_{\mathcal{S}}\cup\{t\}\quad\text{and}\quad E\coloneqq E_{U}\cup E_{\mathcal{S}}\cup E_{1},\\
V_{U}&\coloneqq\{v_{u},v^{\prime}_{u}\mid u\in U\},\qquad V_{\mathcal{S}}\coloneqq\{v_{S}\mid S\in\mathcal{S}\},\qquad V^{t}_{\mathcal{S}}\coloneqq\{z_{S}\mid S\in\mathcal{S}\},\\
E_{U}&\coloneqq V_{U}\times\{t\},\qquad E_{\mathcal{S}}\coloneqq V_{\mathcal{S}}\times\{t\},\\
E_{1}&\coloneqq\{v_{u}v_{S},v^{\prime}_{u}v_{S}\mid S\in\mathcal{S},u\in S\}\cup\{v_{S}z_{S},z_{S}t\mid S\in\mathcal{S}\},\\
\textit{cap}(e)&\coloneqq\begin{cases}1&\text{if }e\in E_{1},\\ \frac{1-\alpha}{\alpha}&\text{if }e\in E_{\mathcal{S}},\\ \varepsilon&\text{if }e\in E_{U},\end{cases}\quad\text{with }\varepsilon\leq\min\left(\frac{1-\alpha}{\alpha^{2}\cdot|U|},\frac{1-\alpha}{\alpha}\right),\\
\psi&\coloneqq|E_{1}|+\varphi=2\cdot\sum_{S\in\mathcal{S}}|S|+2\cdot|\mathcal{S}|+\varphi.
\end{aligned}$$

As $G$ is a DAG, its MED is unique [2] and must be part of any feasible MMCFS solution, see Observation 2. This MED is formed by the edges $e\in E_{1}$; a flow of $\alpha$ must be routed over each of them in order to satisfy the demands $\alpha\cdot\mathbb{T}_{E}(e)=\alpha\cdot\textit{cap}(e)=\alpha$. The remaining capacity for each edge $e\in E_{1}$ is $1-\alpha$.

So consider a single set $S\in\mathcal{S}$ and the corresponding two-path $\{v_{S}z_{S},z_{S}t\}\subseteq E_{1}$, whose remaining capacity can thus accommodate either arbitrarily many item commodities $\tilde{e}_{U}\in E_{U}$ (each one with the sufficiently small demand $\alpha\cdot\mathbb{T}_{E}(\tilde{e}_{U})=\alpha\cdot\varepsilon$) or a single set commodity $\tilde{e}_{\mathcal{S}}\in E_{\mathcal{S}}$ (with the demand $\alpha\cdot\mathbb{T}_{E}(\tilde{e}_{\mathcal{S}})=\alpha\cdot\frac{1-\alpha}{\alpha}=1-\alpha$). In the former case, we can remove at least two corresponding item edges $v_{u}t$, $v^{\prime}_{u}t$ with $u\in S$, which is more than the single corresponding set edge $v_{S}t$ we can remove in the latter case. Thus, for each item commodity $v_{u}t\in E_{U}$, an optimal MMCFS solution must contain one of the corresponding set edges $\{v_{S}t\mid S\ni u\}\subseteq E_{\mathcal{S}}$; the item commodity can then be routed over the path from $v_{u}$ over $v_{S}$ to $t$.

Given a Set Cover solution $\mathcal{C}$ with $|\mathcal{C}|\leq\varphi$ , we can construct an MMCFS solution $E^{\prime}=E_{1}\cup\{v_{S}t\in E_{\mathcal{S}}\mid S\in\mathcal{C}\}$ with cardinality $|E^{\prime}|=|E_{1}|+|\mathcal{C}|\leq|E_{1}|+\varphi=\psi$ . Since every item is covered by the sets in $\mathcal{C}$ , the constructed MMCFS solution includes at least one corresponding set edge for each item, ensuring its feasibility. Conversely, since a feasible MMCFS solution $E^{\prime}$ with $|E^{\prime}|\leq\psi$ has at least one corresponding set edge for each item $u\in U$ , the Set Cover solution $\mathcal{C}=\{S\mid v_{S}t\in E_{\mathcal{S}}\cap E^{\prime}\}$ also contains at least one covering set for each item. Moreover, $|\mathcal{C}|=|E^{\prime}|-|E_{1}|\leq\psi-|E_{1}|=\varphi$ . $\hfill\blacktriangleleft$
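To make the reduction tangible, the following sketch builds the MMCFS instance from a Set Cover instance as defined in this proof. It is an illustration only: the function name, the tuple-based vertex labels, and the choice of $\varepsilon$ as exactly the upper bound $\min\left(\frac{1-\alpha}{\alpha^{2}\cdot|U|},\frac{1-\alpha}{\alpha}\right)$ are assumptions, and the code merely constructs the instance without solving it.

```python
from fractions import Fraction


def build_mmcfs_instance(universe, sets, phi, alpha):
    """Return (cap, psi, (E_U, E_S, E_1)) for the Set Cover instance (universe, sets, phi)."""
    alpha = Fraction(alpha)
    ratio = (1 - alpha) / alpha
    # Assumption: pick epsilon equal to its upper bound from the construction.
    eps = min((1 - alpha) / (alpha ** 2 * len(universe)), ratio)

    E_U, E_S, E_1 = set(), set(), set()
    for u in universe:
        for copy in (0, 1):                       # item vertices v_u and v'_u
            E_U.add((("item", u, copy), "t"))
    for i, S in enumerate(sets):
        E_S.add((("set", i), "t"))                # set edge v_S t
        E_1.add((("set", i), ("z", i)))           # two-path v_S -> z_S -> t
        E_1.add((("z", i), "t"))
        for u in S:
            for copy in (0, 1):
                E_1.add((("item", u, copy), ("set", i)))

    cap = {e: Fraction(1) for e in E_1}
    cap.update({e: ratio for e in E_S})
    cap.update({e: eps for e in E_U})
    psi = len(E_1) + phi
    return cap, psi, (E_U, E_S, E_1)
```

For the Set Cover instance of Figure 3, i.e., `build_mmcfs_instance("abcd", [{"a"}, {"a", "b", "c"}, {"a", "c", "d"}], phi, alpha)`, this yields $|E_{1}|=2\cdot 7+2\cdot 3=20$ MED edges, $8$ item edges, and $3$ set edges.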

Figure 3: MMCFS instance constructed from a Set Cover instance with universe $U=\{a,b,c,d\}$ and family of subsets $\mathcal{S}=\{\{a\},\ \{a,b,c\},\ \{a,c,d\}\}$. An optimal solution contains the (solid black) MED as well as one (blue) corresponding set edge for each $u\in U$. Item edges are orange and dashed.

Considering the optimization variants of Set Cover and MMCFS (and thus ignoring the additional input values $\varphi$ and $\psi$), the reduction above also implies the inapproximability of the number of edges in an optimal MMCFS solution beyond the edges required for an MED: Consider an instance $I=(G=(V,E),\textit{cap},\alpha)$ for the optimization variant of MMCFS that is produced by the reduction above, and an arbitrary feasible solution $A\subseteq E$ for this instance. Let $\mu(I,A)\coloneqq|A|-\textit{med}(G)$ where $\textit{med}(G)$ denotes the number of edges in an MED of $G$. Then, $A$ can be transformed into a feasible solution for the original Set Cover instance with objective value $\mu(I,A)$ in linear time. Further, let $\mu(I)$ be the minimum $\mu(I,A^{\prime})$ over all feasible solutions $A^{\prime}$ for $I$, and recall that the size $|E|$ of the MMCFS instance $I$ is linear in the size $N\in\mathcal{O}(\text{poly}(|U|))$ of the Set Cover instance: if it were possible to approximate $\mu(I)$ within a factor in $o(\log|E|)=o(\log|U|)$, one could also approximate Set Cover within $o(\log|U|)$, which is NP-hard [17, 33]. This implies that any approximation algorithm for MMCFS (such as the one we present in Section 5) must leverage the existence of a comparatively high number of MED-edges in order to achieve its approximation ratio.

Observation 10.

Given an MMCFS instance $I=(G=(V,E),\textit{cap},\alpha)$ and an MED of $G$, approximating $\mu(I)$ with a ratio in $o(\log|E|)$ is NP-hard. This already holds on DAGs where the longest path has length 3.

5 LP-based Approximation

We present an extremely simple $\max(\nicefrac{{1}}{{\alpha}},2)$-approximation for MMCFS based on LP rounding. This is a clear improvement over the default approximation guarantee of Section 3, which depends on the instance’s edge capacities and is thus not even polynomially bounded in $\alpha^{-1}$ or the instance size. Consider the following ILP formulation for MMCFS:


\begin{align}
\min\quad & \sum_{e\in E}x_{e} && \tag{1.a}\\
\text{s.t.}\quad & \sum_{u\colon uv\in E}f_{\tilde{e}}(uv)-\sum_{u\colon vu\in E}f_{\tilde{e}}(vu)=\begin{cases}-\alpha\,\mathbb{T}_{E}(\tilde{e})&\text{if } v=s\\ \alpha\,\mathbb{T}_{E}(\tilde{e})&\text{if } v=t\\ 0&\text{else}\end{cases} && \forall v\in V,\ \tilde{e}=st\in E \tag{1.b}\\
& \sum_{\tilde{e}\in E}f_{\tilde{e}}(e)\leq x_{e}\cdot\textit{cap}(e) && \forall e\in E \tag{1.c}\\
& f_{\tilde{e}}(e)\geq 0 && \forall e\in E,\ \tilde{e}\in E \tag{1.d}\\
& x_{e}\in\{0,1\} && \forall e\in E \tag{1.e}
\end{align}

A binary indicator variable $x_{e}$ determines whether edge $e\in E$ is part of the solution subgraph or not. We minimize the sum of these variables in the objective function (1.a) to obtain a subgraph of minimum size. The non-negative variables $f_{\tilde{e}}(e)$ determine the amount of flow routed over edge $e$ for commodity $\tilde{e}\in E$ . Recall that $\mathbb{T}_{E}$ specifies a demand of $\textit{cap}(\tilde{e})$ precisely for each edge $\tilde{e}$ : the flow preservation constraints (1.b) guarantee that the $f$ -variables represent proper flows that satisfy these demands. Lastly, the capacity constraints (1.c) ensure that the total sum of flow over any edge $e\in E$ does not surpass the capacity of $e$ . This flow sum $\mathbf{f}(e)\coloneqq\sum_{\tilde{e}\in E}f_{\tilde{e}}(e)$ must be zero if $e$ is not part of the solution.

We obtain the relaxation of ILP (1) by replacing the integrality constraints (1.e) on $x_{e}$ by the inequalities $0\leq x_{e}\leq 1$ for all $e\in E$ . The only other type of constraint that bounds the $x_{e}$ -variables is (1.c). Since the $x_{e}$ -variables can assume fractional values in the LP relaxation, and the sum over all $x_{e}$ is minimized, this lower bound of $\frac{\mathbf{f}(e)}{\textit{cap}(e)}$ on $x_{e}$ for all $e\in E$ will always be met with equality. Hence, the LP relaxation is equivalent to a standard MCF-LP that minimizes the sum of edge utilizations in the objective function $\sum_{e\in E}\frac{\mathbf{f}(e)}{\textit{cap}(e)}$ . Accordingly, we will refer to $\textit{cost}(e)\coloneqq\frac{1}{\textit{cap}(e)}$ as the cost that routing a single unit of flow over an edge $e$ will add to this objective.
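Spelled out, substituting $x_{e}=\frac{\mathbf{f}(e)}{\textit{cap}(e)}$ into the relaxation gives the following LP. This is a sketch of the equivalence described above, assuming $\textit{cap}(e)>0$ for all $e\in E$; the bound $\mathbf{f}(e)\leq\textit{cap}(e)$ is what remains of $x_{e}\leq 1$.

```latex
\begin{aligned}
\min\quad & \sum_{e\in E}\textit{cost}(e)\cdot\mathbf{f}(e)
           = \sum_{e\in E}\frac{\mathbf{f}(e)}{\textit{cap}(e)} \\
\text{s.t.}\quad & \text{flow preservation (1.b) and non-negativity (1.d)},\\
& \mathbf{f}(e)=\sum_{\tilde{e}\in E}f_{\tilde{e}}(e)\leq\textit{cap}(e)
  \qquad\forall e\in E.
\end{aligned}
```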

$\blacktriangleright$ Remark 11.

The solution $x_{e}=\alpha$ for all $e\in E$ with objective value $\alpha|E|$ is always feasible for the relaxation of ILP (1). When the input graph is simple and all edge capacities are uniform, it is in fact optimal: the flow $f_{\tilde{e}}$ for commodity $\tilde{e}$ will always be routed completely over $\tilde{e}$ since all edges have the same cost and routing $f_{\tilde{e}}$ over an alternative path would cost more than routing it over a single edge. This implies that any arbitrary feasible solution for MMCFS on simple graphs with uniform edge capacities is an $(\nicefrac{{1}}{{\alpha}})$ -approximation (which we already proved using a separate argument via Section 3). However, below we also use the LP relaxation to approximate MMCFS on non-simple graphs with non-uniform edge capacities.
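The feasibility claim of the remark can be checked mechanically on small instances. The following is a minimal sketch, assuming a simple digraph whose edges are given as `(s, t)` tuples; the function names and the dictionary layout `flow[(commodity, edge)]` are illustrative choices, not part of the paper.

```python
from fractions import Fraction

def own_edge_solution(edges, cap, alpha):
    """Route alpha*cap(e) of commodity e over e itself and set x_e = alpha."""
    flow = {(e, e): alpha * cap[e] for e in edges}  # key: (commodity, edge)
    x = {e: alpha for e in edges}
    return flow, x

def is_feasible(edges, cap, alpha, flow, x):
    """Check (1.b)-(1.d) and 0 <= x_e <= 1 for the LP relaxation."""
    nodes = {v for e in edges for v in e}
    for com in edges:
        s, t = com
        for v in nodes:
            inflow = sum(flow.get((com, e), 0) for e in edges if e[1] == v)
            outflow = sum(flow.get((com, e), 0) for e in edges if e[0] == v)
            # Right-hand side of (1.b): -alpha*T(com) at s, +alpha*T(com) at t.
            want = alpha * cap[com] * ((v == t) - (v == s))
            if inflow - outflow != want:
                return False
    for e in edges:
        load = sum(flow.get((com, e), 0) for com in edges)
        if load > x[e] * cap[e] or not 0 <= x[e] <= 1:
            return False
    return all(f >= 0 for f in flow.values())

edges = [("a", "b"), ("b", "c"), ("a", "c")]
cap = {e: Fraction(2) for e in edges}
alpha = Fraction(1, 3)
flow, x = own_edge_solution(edges, cap, alpha)
print(is_feasible(edges, cap, alpha, flow, x), sum(x.values()))
```

The objective value `sum(x.values())` equals $\alpha|E|$, matching the remark; the checker says nothing about optimality.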

For our approximation algorithm, we make use of the well-known fact that all standard LP solving algorithms will always return a basic optimal solution (if any solution exists) [43, p. 279]. A basic optimal solution – or equivalently, extreme point solution – is a vertex of the polyhedron defined as the convex hull of the set of feasible solutions. It does not lie on a higher-dimensional face of the polyhedron and thus cannot be expressed as a convex combination of two or more other feasible solutions [41, p. 100]. We prove that a basic optimal solution for the relaxation of ILP (1) will only have comparatively few variables with a positive value below $\alpha$ , allowing us to round up the solution to obtain an approximation.

Lemma 12.

Let $x^{*}$ be a basic optimal solution for the relaxation of ILP (1), and $h$ the number of edges $e$ such that $x^{*}_{e}=1$ . There exist at most $h$ many edges $e^{\prime}$ with $x^{*}_{e^{\prime}}\in(0,\alpha)$ .

Proof.

Let $H\coloneqq\{e\in E\mid x^{*}_{e}=1\}$ be the $h$ many edges that are saturated by the fractional multi-commodity flow, and $L\coloneqq\{e\in E\mid x^{*}_{e}\in(0,\alpha)\}$ with $\ell\coloneqq|L|$ the edges that are only used to a fraction less than $\alpha$ ; in short, edges with high and low corresponding $x^{*}$ -values. We show that an optimal fractional solution $x^{*}$ with $\ell>h$ would allow us to construct a vector $p\in\mathbb{Q}^{|E|}$ such that both $(x^{*}+p)$ and $(x^{*}-p)$ are still feasible solutions for the LP relaxation – thus, $x^{*}$ would not be basic.

For every edge $e=st\in L$, let $P_{e}$ denote an arbitrary alternative $s$-$t$-path that does not use the edge $e$ and satisfies $f_{e}(e^{\prime})>0$ for all edges $e^{\prime}\in P_{e}$. Such a path must exist since at most $x^{*}_{e}\cdot\textit{cap}(e)<\alpha\cdot\textit{cap}(e)$ flow is routed over $e$, so an alternative $s$-$t$-path is necessary to satisfy the demand $\alpha\cdot\mathbb{T}_{E}(s,t)=\alpha\cdot\textit{cap}(e)$. Further, the total cost of routing a unit of flow over $P_{e}$ must be lower than or equal to the cost of routing it over $e$ itself, i.e., $\sum_{e^{\prime}\in P_{e}}\frac{1}{\textit{cap}(e^{\prime})}\leq\frac{1}{\textit{cap}(e)}$, by the optimality of $x^{*}$. Based on the alternative paths, we construct a matrix $M\in\{0,1\}^{\ell\times h}$ indexed by pairs $(e,e^{\prime\prime})\in L\times H$, with $M(e,e^{\prime\prime})\coloneqq 1$ if $e^{\prime\prime}\in P_{e}$, and $0$ otherwise. Since $\ell>h$, the $\ell$ rows of $M$ must be linearly dependent, i.e., there exists a vector of coefficients $q\in\mathbb{Q}^{\ell}$ with $q\neq\mathbf{0}$, such that $q^{\top}\cdot M=\mathbf{0}$ (and consequently, $-q^{\top}\cdot M=\mathbf{0}$).

We can obtain two new feasible solutions by modifying the MCF corresponding to the optimal basic solution $x^{*}$ as follows: for each edge $e\in L$, and using a small positive value $\varepsilon\in\mathbb{Q}$, we send $\varepsilon\cdot q(e)$ less flow over $e$ itself and $\varepsilon\cdot q(e)$ more flow over the alternative path $P_{e}$ – or vice versa. Put formally, we argue that for a small enough $\varepsilon>0$, the following vector $p\in\mathbb{Q}^{|E|}$ yields two feasible solutions $(x^{*}+p)$ and $(x^{*}-p)$ for the LP relaxation:

$$p(e^{\prime})=\begin{cases}\varepsilon\cdot q(e^{\prime})-\varepsilon\cdot\sum_{e\in L\colon e^{\prime}\in P_{e}}q(e)&\text{if } e^{\prime}\in L,\\ -\varepsilon\cdot\sum_{e\in L\colon e^{\prime}\in P_{e}}q(e)&\text{otherwise.}\end{cases}$$

Clearly, the two solutions satisfy all demands and flow preservation constraints (1.b). Consider the capacity constraints (1.c): By construction of $q$ , the flow difference on saturated edges is 0. For all non-saturated edges $e^{\prime}$ it holds: if we modified the flow routed over $e^{\prime}$ , this flow was already non-zero before our modification, and $\varepsilon$ can be chosen sufficiently small such that the flow will neither turn negative nor surpass $\textit{cap}(e^{\prime})$ .

It remains to show that $p\neq\mathbf{0}$ given that $q\neq\mathbf{0}$ . So, among the edges $e\in L$ with $q(e)\neq 0$ , choose one with maximum cost $\frac{1}{\textit{cap}(e)}$ and denote it by $e_{\max}$ . We show that $p(e_{\max})\neq 0$ .

If $e_{\max}$ were contained in any of the alternative paths $P_{e}$ with $e\in L$ and $q(e)\neq 0$, we would arrive at a contradiction even without using $p$: Recall that $\sum_{e^{\prime}\in P_{e}}\frac{1}{\textit{cap}(e^{\prime})}\leq\frac{1}{\textit{cap}(e)}$. So we must have $|P_{e}|=1$, i.e., $e$ and $e_{\max}$ are parallel edges with $\textit{cap}(e)=\textit{cap}(e_{\max})$. Since $e,e_{\max}\in L$ and thus $x^{*}_{e},x^{*}_{e_{\max}}\in(0,\alpha)$, we could simply obtain two new solutions by shifting some small $\varepsilon>0$ of flow from one to the other, or vice versa, contradicting that $x^{*}$ is basic.

Hence, $e_{\max}$ is only contained in alternative paths $P_{e}$ with $e\in L$ s.t. $q(e)=0$ , and

$$p(e_{\max})=\varepsilon\cdot q(e_{\max})-\varepsilon\cdot\sum_{e\in L\colon e_{\max}\in P_{e}}q(e)=\varepsilon\cdot q(e_{\max})-0\neq 0.$$

$\hfill\blacktriangleleft$
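The linear-dependence step of the proof is constructive: a coefficient vector $q\neq\mathbf{0}$ with $q^{\top}\cdot M=\mathbf{0}$ can be computed by Gaussian elimination. The sketch below is one standard way to do this with exact rational arithmetic; the function name and the example matrix are hypothetical, and the proof itself only needs that such a $q$ exists.

```python
from fractions import Fraction

def left_null_vector(M):
    """Return q != 0 with q^T M = 0 for a matrix with more rows than columns."""
    rows, cols = len(M), len(M[0])
    assert rows > cols, "the argument needs l > h"
    # Augment every row with a unit vector that records the row combination.
    A = [[Fraction(v) for v in M[i]]
         + [Fraction(int(i == j)) for j in range(rows)]
         for i in range(rows)]
    pivot_row = 0
    for c in range(cols):
        # Look for a row at or below pivot_row with a non-zero entry in column c.
        p = next((r for r in range(pivot_row, rows) if A[r][c] != 0), None)
        if p is None:
            continue
        A[pivot_row], A[p] = A[p], A[pivot_row]
        for r in range(pivot_row + 1, rows):
            factor = A[r][c] / A[pivot_row][c]
            A[r] = [a - factor * b for a, b in zip(A[r], A[pivot_row])]
        pivot_row += 1
    # At most `cols` pivots exist, so row `pivot_row` is zero in the M part;
    # its recorded combination is the sought vector q.
    return A[pivot_row][cols:]

# Three low edges (rows) whose alternative paths use two saturated edges (columns).
M = [[1, 0], [0, 1], [1, 1]]
q = left_null_vector(M)
print(q, [sum(q[i] * M[i][j] for i in range(3)) for j in range(2)])
```

Scaling the returned `q` by a small $\varepsilon$ gives exactly the flow shifts used to define $p$.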

The proof implicitly gives an intuitive explanation for the distribution of LP values:

Corollary 13.

Let $x^{*}$ be a basic optimal solution for the relaxation of ILP (1). For each edge $e=st$ with $x^{*}_{e}\in(0,\alpha)$ it holds: every alternative $s$ - $t$ -path $P_{e}$ (not containing $e$ ) with $f_{e}(e^{\prime})>0$ for all of its edges $e^{\prime}\in P_{e}$ must contain an edge $e^{\prime\prime}$ with $x^{*}_{e^{\prime\prime}}=1$ .

Proof.

Assume that, for some edge $e=st$ with $x^{*}_{e}\in(0,\alpha)$, there is a $P_{e}$ that does not contain an edge $e^{\prime\prime}$ with $x^{*}_{e^{\prime\prime}}=1$. Then, we can follow the proof of Lemma 12, choosing $P_{e}$ as the alternative $s$-$t$-path for $e$. As a result, the matrix $M$ constructed in the proof contains a row of zeroes, allowing us to route $\varepsilon$ more flow over $e$ and $\varepsilon$ less flow over $P_{e}$, or vice versa. Thus, $x^{*}$ cannot be basic, a contradiction. $\hfill\blacktriangleleft$

We now analyze the following LP-rounding algorithm: compute a basic optimal solution $x^{*}$ for the relaxation of ILP (1) in polynomial time, and return the edge set $\{e\in E\mid x^{*}_{e}>0\}$. Based on Lemma 12, one could use naïve techniques to prove approximation guarantees of $(2+\frac{1}{\alpha})$ or $\frac{2}{\alpha}$ for this algorithm. However, we provide the stronger bound of $\max(\nicefrac{{1}}{{\alpha}},2)$ based on the following intuition: the algorithm either “misses” the optimal solution mainly due to edges with $x^{*}_{e}\in(0,\alpha)$ and we can obtain a 2-approximation via Lemma 12, or due to edges with $x^{*}_{e}\in[\alpha,1)$ and we can round up the variables for a $\frac{1}{\alpha}$-approximation.

Theorem 14.

Let $x^{*}$ be a basic optimal solution for the relaxation of ILP (1). The edge set $A\coloneqq\{e\in E\mid x^{*}_{e}>0\}$ is a $\max(\nicefrac{{1}}{{\alpha}},2)$ -approximation for MMCFS. That is, rounding up $x^{*}$ is a 2-approximation when $\alpha\geq\frac{1}{2}$ , and an $\nicefrac{{1}}{{\alpha}}$ -approximation when $\alpha<\frac{1}{2}$ .

Proof.

The solution $A$ is clearly feasible. Let $z\coloneqq|A|$ be the number of edges $e$ such that $x^{*}_{e}>0$. Furthermore, let $\Delta=|\{e\in E\mid x^{*}_{e}\in\mathopen{[}\alpha,1)\}|$ be the number of these $z$ edges whose $x^{*}$-variable is set to a value in the interval $\mathopen{[}\alpha,1)$. We know that the remaining $(z-\Delta)$ edges have an $x^{*}$-variable either set to $1$ or a value lower than $\alpha$. In particular, by Lemma 12, there exist at least as many edges $e^{\prime}$ with $x^{*}_{e^{\prime}}=1$ as there are edges $e^{\prime\prime}$ with $x^{*}_{e^{\prime\prime}}\in(0,\alpha)$. Thus, $|\{e\in E\mid x^{*}_{e}=1\}|\geq\frac{z-\Delta}{2}$. Hence, the optimal fractional solution $x^{*}$ has objective value

$$\begin{aligned}z^{*}&\geq|\{e\in E\mid x^{*}_{e}=1\}|+\alpha\cdot|\{e\in E\mid x^{*}_{e}\in[\alpha,1)\}|\\ &\geq\frac{z-\Delta}{2}+\alpha\cdot\Delta=\left(\frac{1}{2}+\left(\alpha-\frac{1}{2}\right)\cdot\frac{\Delta}{z}\right)\cdot z.\end{aligned}$$

Using the minimum fractional objective value $z^{*}$ as a lower bound for the minimum integral objective value $z^{*}_{\mathit{int}}$ , we can then give an upper bound for the approximation ratio:

$$r\coloneqq\frac{z}{z^{*}_{\mathit{int}}}\leq\frac{z}{z^{*}}\leq\frac{1}{\frac{1}{2}+(\alpha-\frac{1}{2})\cdot\frac{\Delta}{z}}$$

To obtain an upper bound on this ratio, we examine when its denominator is at its minimum. For $\alpha\geq\frac{1}{2}$ , the term $(\alpha-\frac{1}{2})\cdot\frac{\Delta}{z}$ is always non-negative and reaches its lowest value 0 when $\Delta=0$ , giving us the upper bound $2\geq r$ in this case. In contrast, for $\alpha<\frac{1}{2}$ , the term $(\alpha-\frac{1}{2})\cdot\frac{\Delta}{z}$ is negative, and the minimum of the denominator is reached when $\Delta=z$ , leading to the upper bound $\frac{1}{\alpha}\geq r$ in this second case. $\hfill\blacktriangleleft$
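The case distinction can be double-checked numerically by maximising the bound over all admissible $\Delta\in\{0,\dots,z\}$. This is a small sanity check with exact fractions, not a replacement for the proof; the function names are illustrative.

```python
from fractions import Fraction

def ratio_bound(alpha, delta, z):
    """Upper bound 1 / (1/2 + (alpha - 1/2) * delta / z) on the ratio r."""
    half = Fraction(1, 2)
    return 1 / (half + (alpha - half) * Fraction(delta, z))

def worst_case(alpha, z):
    """Largest value of the bound over all delta in {0, ..., z}."""
    return max(ratio_bound(alpha, d, z) for d in range(z + 1))

for alpha in (Fraction(1, 4), Fraction(1, 2), Fraction(3, 4)):
    # The worst case should coincide with max(1/alpha, 2).
    print(alpha, worst_case(alpha, 10), max(1 / alpha, Fraction(2)))
```

For $\alpha<\frac{1}{2}$ the maximum is attained at $\Delta=z$, for $\alpha>\frac{1}{2}$ at $\Delta=0$, and for $\alpha=\frac{1}{2}$ the bound is constant.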

It is surprisingly hard to find instances (in particular, ones without parallel edges) where the worst-case approximation ratio given by Theorem 14 is actually met. However, we show that such instances exist for the most relevant $\alpha\in(0,1)$ , and hence our analysis is tight:

Lemma 15.

The ratio $\nicefrac{{1}}{{\alpha}}$ for the algorithm of Theorem 14 is tight for all $\alpha=\frac{1}{q}$ , $q\in\mathbb{N}_{>1}$ .

Proof.

Consider a complete bidirected graph $G=(V,E)$ on $q+1$ vertices, and let $H\subset G$ denote an arbitrarily chosen directed Hamiltonian cycle in $G$ (an example for $q=4$ is visualized in Figure 4). For all edges $st\in E$, let $\eta(st)$ denote the number of edges of the unique $s$-$t$-path in $H$ and set the capacity $\textit{cap}(st)\coloneqq\frac{1}{\eta(st)}$. $E(H)$ is a feasible integral solution for the MMCFS instance $(G,\textit{cap},\alpha)$ since the demands $\alpha\cdot\mathbb{T}_{E}$ are routable in it: A single edge $e\in E(H)$ is able to satisfy its own demand $\alpha\cdot\textit{cap}(e)=\frac{1}{q}\cdot 1$. Adding to this, for every $i\in\{2,\dots,q\}$, there are $i$ edges $uv\in E\setminus E(H)$ with $\eta(uv)=i$ whose unique $u$-$v$-path in $H$ contains $e$. The edge $e$ can accommodate all of these commodities $uv$ because each of them has a demand of $\alpha\cdot\textit{cap}(uv)=\frac{1}{q}\cdot\frac{1}{i}$, summing up to a total flow of $\sum_{i=1}^{q}i\cdot\frac{1}{i}\cdot\frac{1}{q}=1\leq\textit{cap}(e)$. Lastly, $E(H)$ is not only feasible but optimal since we need at least $|E(H)|$ many edges to preserve the reachability relation of $G$ in any subgraph.

The algorithm from Theorem 14 would potentially choose all edges of $G$ , i.e., $|E|=(|V|-1)\cdot|V|=q\cdot|E(H)|=\frac{1}{\alpha}\cdot|E(H)|$ many: As the cost $\frac{1}{\textit{cap}(uv)}=\eta(uv)$ of an edge $uv\in E\setminus E(H)$ is equal to the total cost of its unique $u$ - $v$ -path in $H$ , all $u$ - $v$ -paths are equal in cost. Thus, one optimal solution for the relaxation of ILP (1) simply routes all commodities $\tilde{e}\in E$ completely over their own edges $\tilde{e}$ and chooses $x^{*}_{\tilde{e}}=\alpha$ . This solution is also basic: Let $f$ denote the flow variables of this solution. If it were not basic, there would exist two other optimal solutions with flow variables $f^{\prime},f^{\prime\prime}$ respectively, such that for some $\tilde{e},e\in E$ , $f^{\prime}_{\tilde{e}}(e)<f_{\tilde{e}}(e)<f^{\prime\prime}_{\tilde{e}}(e)$ . However, if $e=\tilde{e}$ , $f$ already routes the full demand of commodity $\tilde{e}$ over $e$ and we cannot increase $f_{\tilde{e}}(e)$ without introducing a flow cycle, which would raise the objective value and thus be suboptimal. In contrast, if $e\neq\tilde{e}$ , $f_{\tilde{e}}(e)$ is already $0$ and cannot be decreased. $\hfill\blacktriangleleft$
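The construction is easy to reproduce for concrete $q$. The sketch below assumes the vertices are numbered $0,\dots,q$ and fixes $H$ as the cycle $0\to 1\to\dots\to q\to 0$ (any Hamiltonian cycle would do); it then routes every demand $\alpha\cdot\textit{cap}(st)$ along $H$ and reports the resulting load on each cycle edge. Function names are hypothetical.

```python
from fractions import Fraction

def cycle_instance(q):
    """Complete bidirected graph on q+1 vertices with cap(st) = 1/eta(st)."""
    n = q + 1
    edges = [(s, t) for s in range(n) for t in range(n) if s != t]
    # eta(st): number of edges of the s-t-path in H = 0 -> 1 -> ... -> q -> 0.
    eta = {(s, t): (t - s) % n for s, t in edges}
    cap = {e: Fraction(1, eta[e]) for e in edges}
    return n, edges, eta, cap

def cycle_loads(q):
    """Load on each edge of H when all demands alpha*cap(st) are routed in H."""
    n, edges, eta, cap = cycle_instance(q)
    alpha = Fraction(1, q)
    load = {(v, (v + 1) % n): Fraction(0) for v in range(n)}
    for s, t in edges:
        v = s
        while v != t:  # walk along H from s to t
            load[(v, (v + 1) % n)] += alpha * cap[(s, t)]
            v = (v + 1) % n
    return load

q = 4
n, edges, eta, cap = cycle_instance(q)
print(len(edges), n, set(cycle_loads(q).values()))
```

Every cycle edge ends up with load exactly $1$, its capacity, and the edge counts satisfy $|E|=q\cdot|E(H)|$, which is the gap of $\nicefrac{{1}}{{\alpha}}$ stated in the lemma.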

Figure 4: MMCFS instance for $\alpha=\frac{1}{4}$, constructed as a member of the family of instances in the proof of Lemma 15. The Hamiltonian cycle $H$ is drawn in black. The value $\eta(st)$, denoting the distance from $s$ to $t$ in $H$ for an edge $st$, is 2, 3, or 4 for blue, green, and orange edges, respectively.

Figure 5: Family of MMCFS instances constructed in the proof of Lemma 16. Edges in $E_{v}$ are drawn in green (and dashed), edges in $E_{XZ}$ are orange, edges in $E_{1}$ are black, and edges in $E_{B}$ are blue. Recall that edges $E_{1}$ and $E_{B}$ (black and blue) form the unique MED, which is contained in every feasible solution. There exists a feasible MMCFS solution $E_{1}\cup E_{B}\cup E_{v}$ without any (orange) edges from $E_{XZ}$, but the algorithm from Theorem 14 will include all edges of $E_{XZ}$ in its solution.

Lemma 16.

The ratio $2$ for the algorithm of Theorem 14 is tight for all $\alpha>\frac{1}{2}$ .

Proof.

We construct a family of MMCFS instances (visualized in Figure 5), each one consisting of a simple graph $G=(V,E)$ with edge capacities cap and the given retention ratio $\alpha$ , that meets the approximation ratio of 2 asymptotically with increasing size. Intuitively, the edge set of each instance contains two subsets that have a size quadratic in the number of vertices and two subsets of linear size; our algorithm chooses both quadratic-size subsets even though one could be replaced by a linear-size subset.

Let $V$ be comprised of five disjoint vertex sets $X$ , $Y$ , $Z$ , $U$ , $W$ with $k\in\mathbb{N}$ vertices each (respectively named with the corresponding lowercase letter and indexed by $i\in\{1,\dots,k\}$ ) and a distinct vertex $v$ . Further, let $\varepsilon\leq\frac{1-\alpha}{k\cdot\alpha}$ and $B\geq\frac{k\cdot\varepsilon}{1-\alpha}$ be sufficiently low and high positive values, respectively. Then, $E\coloneqq E_{v}\cup E_{XZ}\cup E_{1}\cup E_{B}$ with

$$\begin{aligned}E_{v}&\coloneqq(X\times\{v\})\cup(\{v\}\times Z),&E_{XZ}&\coloneqq X\times Z,\\ E_{1}&\coloneqq(X\times Y)\cup\{x_{i}u_{i},u_{i}v,vw_{i},w_{i}z_{i}\mid i\in\{1,\dots,k\}\},&E_{B}&\coloneqq\{y_{i}z_{i}\mid i\in\{1,\dots,k\}\},\end{aligned}$$

$$\textit{cap}(e)\coloneqq\begin{cases}\frac{1-\alpha}{\alpha}&\text{if } e\in E_{v},\\ \frac{1-\alpha+\varepsilon}{\alpha}&\text{if } e\in E_{XZ},\\ 1&\text{if } e\in E_{1},\\ B&\text{if } e\in E_{B}.\end{cases}$$

$G$ is a DAG, thus its MED is unique [2] and must be part of any feasible solution for the MMCFS instance $(G,\textit{cap},\alpha)$, see Section 2. The MED of $G$ consists of the edges $E_{1}\cup E_{B}$ and in particular can satisfy all demands $\alpha\cdot\mathbb{T}_{E}(e)=\alpha\cdot\textit{cap}(e)$ with $e\in E_{1}\cup E_{B}$. The “remaining” capacity of $1-\alpha$ for each of the edges in $E_{1}$ (and $(1-\alpha)\cdot B$ for edges in $E_{B}$) is also sufficient to satisfy the demands $\alpha\cdot\mathbb{T}_{E}(e_{v})$ of the commodities $e_{v}\in E_{v}$, and to almost satisfy the demands $\alpha\cdot\mathbb{T}_{E}(e_{XZ})=1-\alpha+\varepsilon$ for the commodities $e_{XZ}\in E_{XZ}$: it only leaves a demand of $\varepsilon$ for every such $e_{XZ}$. Thus, there exists a feasible solution $E_{1}\cup E_{B}\cup E_{v}$ for $(G,\textit{cap},\alpha)$ that contains all of the $k^{2}+5k$ edges from $E_{1}\cup E_{B}$ and additionally, to satisfy the remaining demands (of size $\varepsilon$ each), the $2k$ edges from $E_{v}$. This feasible solution gives an upper bound on the objective value $z^{*}_{\mathit{int}}$ of the optimal integral solution: $z^{*}_{\mathit{int}}\leq k^{2}+7k$.

In contrast, the algorithm from Theorem 14 also includes all $k^{2}+5k$ edges of the MED $E_{1}\cup E_{B}$ in its solution, but must additionally choose (at least) the $k^{2}$ edges from $E_{XZ}$ , resulting in an objective value $z\geq 2k^{2}+5k$ . This is because every optimal solution for the relaxation of ILP (1) routes the commodities $st=e_{XZ}\in E_{XZ}$ over the edges $e_{XZ}$ themselves: they have a lower cost than the alternative $s$ - $t$ -path of length 2 over the edges in $E_{v}$ .

For increasing $k$ , the ratio of the algorithmic solution’s objective value to the optimum is

$$\lim_{k\to\infty}\frac{z}{z^{*}_{\mathit{int}}}\geq\lim_{k\to\infty}\frac{2k^{2}+5k}{k^{2}+7k}=2.$$

$\hfill\blacktriangleleft$
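To see how quickly the ratio approaches 2, one can tabulate the edge counts of the construction for growing $k$. The sketch only evaluates the closed-form set sizes from the definitions of $E_{v}$, $E_{XZ}$, $E_{1}$, and $E_{B}$; it does not solve the LP, and the function names are illustrative.

```python
from fractions import Fraction

def edge_counts(k):
    """Sizes of the four edge sets for parameter k."""
    return {
        "E_v": 2 * k,           # X x {v} and {v} x Z
        "E_XZ": k * k,          # X x Z
        "E_1": k * k + 4 * k,   # X x Y plus four edges per index i
        "E_B": k,               # one edge y_i z_i per index i
    }

def ratio_lower_bound(k):
    """(2k^2 + 5k) / (k^2 + 7k): rounded solution size over a feasible one."""
    c = edge_counts(k)
    rounded = c["E_1"] + c["E_B"] + c["E_XZ"]   # lower bound on z
    feasible = c["E_1"] + c["E_B"] + c["E_v"]   # upper bound on z*_int
    return Fraction(rounded, feasible)

for k in (1, 10, 100, 1000):
    print(k, float(ratio_lower_bound(k)))
```

The values increase towards 2 but stay strictly below it for every finite $k$, which is why the lemma states tightness only asymptotically.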

Combining Theorem 14 with Lemmas 15 and 16, we obtain:

Corollary 17.

The approximation ratio $\max(\nicefrac{{1}}{{\alpha}},2)$ for the algorithm given by Theorem 14 is tight for all $\alpha>\frac{1}{2}$ and all $\alpha=\frac{1}{q}$ with $q\in\mathbb{N}_{>1}$ .

$\blacktriangleright$ Remark 18.

There are instances where the integrality gap of ILP (1) is $\frac{1}{\alpha}$ – e.g. trees, where the unique optimal integral solution contains every edge while the unique optimal fractional solution chooses $x^{*}_{e}=\alpha$ for all edges $e$ of the input graph. Interestingly, the approximation ratio of our algorithm (for $\alpha\leq\frac{1}{2}$ ) equals the integrality gap exactly, but on completely different instances and not on trees (where the algorithm always finds the optimal solution).

6 Conclusion and Open Questions

We introduced the practically motivated Minimum Multi-Commodity Flow Subgraph (MMCFS) problem and paved the way for further research by giving several structural results, most importantly a reformulation of this traffic-oblivious problem that only needs to consider a single specific traffic matrix. Further, we showed that MMCFS is NP-hard (and a closely related problem cannot be approximated within a sublogarithmic factor) already on DAGs. Lastly, we gave an extremely simple LP-rounding scheme for MMCFS with a tight approximation guarantee of $\max(\nicefrac{{1}}{{\alpha}},2)$ .

Considering seemingly related problems (see Section 2), one observes that an approximation ratio of 2 (which we attain for the practically most relevant cases of $\alpha\geq\frac{1}{2}$ [34]) often arises as a seemingly “natural” limit for such ratios. Yet, it remains an open question whether there exists an approximation algorithm for MMCFS with a better quality guarantee, and whether there is a non-trivial lower bound on the approximation guarantee for any such algorithm (assuming $\text{P}\neq\text{NP}$ ). Further, it might be of interest to explore several generalizations of MMCFS: this includes the non-traffic-oblivious variant where a specific traffic matrix is part of the input (which is also NP-hard via Theorem 3), and the variant where, given an additional cost function on the edges, one asks for a subgraph of minimum cost.

References

[1] Abu Reyan Ahmed, Greg Bodwin, Faryad Darabi Sahneh, Keaton Hamm, Mohammad Javad Latifi Jebelli, Stephen G. Kobourov, and Richard Spence. Graph spanners: A tutorial review. Comput. Sci. Rev., 37:100253, 2020. doi:10.1016/J.COSREV.2020.100253.
[2] Alfred V. Aho, Michael R. Garey, and Jeffrey D. Ullman. The transitive reduction of a directed graph. SIAM J. Comput., 1(2):131–137, 1972. doi:10.1137/0201008.
[3] Ravindra K. Ahuja, Thomas L. Magnanti, and James B. Orlin. Network flows – Theory, algorithms and applications. Prentice Hall, 1993.
[4] Yacine Al-Najjar, Walid Ben-Ameur, and Jérémie Leguay. On the approximability of robust network design. Theor. Comput. Sci., 860:41–50, 2021. doi:10.1016/J.TCS.2021.01.026.
[5] Yacine Al-Najjar, Walid Ben-Ameur, and Jérémie Leguay. Approximability of robust network design: The directed case. In Proc. STACS 2022, volume 219 of LIPIcs, pages 6:1–6:16. Schloss Dagstuhl – Leibniz-Zentrum für Informatik, 2022. doi:10.4230/LIPICS.STACS.2022.6.
[6] Reid Andersen and Uriel Feige. Interchanging distance and capacity in probabilistic mappings. CoRR, abs/0907.3631, 2009. arXiv:0907.3631.
[7] Alexandr Andoni, Anupam Gupta, and Robert Krauthgamer. Towards (1 + $\epsilon$ )-approximate flow sparsifiers. In Proc. SODA 2014, pages 279–293. SIAM, 2014. doi:10.1137/1.9781611973402.20.
[8] Spyridon Antonakopoulos. Approximating directed buy-at-bulk network design. In Proc. WAOA 2010, volume 6534 of LNCS, pages 13–24. Springer, 2010. doi:10.1007/978-3-642-18318-8_2.
[9] Yossi Azar, Edith Cohen, Amos Fiat, Haim Kaplan, and Harald Räcke. Optimal oblivious routing in polynomial time. J. Comput. Syst. Sci., 69(3):383–394, 2004. doi:10.1016/J.JCSS.2004.04.010.
[10] Walid Ben-Ameur and Hervé Kerivin. Routing of uncertain traffic demands. Optimization and Engineering, 6:283–313, 2005. doi:10.1145/777313.777314.
[11] Piotr Berman, Bhaskar DasGupta, and Marek Karpinski. Approximating transitive reductions for directed networks. In Proc. WADS 2009, volume 5664 of LNCS, pages 74–85. Springer, 2009. doi:10.1007/978-3-642-03367-4_7.
[12] Randeep Bhatia, Fang Hao, Murali S. Kodialam, and T. V. Lakshman. Optimized network traffic engineering using segment routing. In Proc. INFOCOM 2015, pages 657–665. IEEE, 2015. doi:10.1109/INFOCOM.2015.7218434.
[13] Chandra Chekuri, Mohammad Taghi Hajiaghayi, Guy Kortsarz, and Mohammad R. Salavatipour. Approximation algorithms for nonuniform buy-at-bulk network design. SIAM J. Comput., 39(5):1772–1798, 2010. doi:10.1137/090750317.
[14] Luca Chiaraviglio, Marco Mellia, and Fabio Neri. Reducing power consumption in backbone networks. In Proc. ICC 2009, pages 1–6. IEEE, 2009. doi:10.1109/ICC.2009.5199404.
[15] Markus Chimani and Max Ilsen. Capacity-preserving subgraphs of directed flow networks. In Proc. IWOCA 2023, volume 13889 of LNCS, pages 160–172. Springer, 2023. doi:10.1007/978-3-031-34347-6_14.
[16] Markus Chimani and Max Ilsen. Traffic-oblivious multi-commodity flow network design. CoRR, abs/2504.16744, 2025. doi:10.48550/arXiv.2504.16744.
[17] Irit Dinur and David Steurer. Analytical approach to parallel repetition. In Proc. STOC 2014, pages 624–633. ACM, 2014. doi:10.1145/2591796.2591884.
[18] Matthias Englert, Anupam Gupta, Robert Krauthgamer, Harald Räcke, Inbal Talgam-Cohen, and Kunal Talwar. Vertex sparsifiers: New results from old techniques. SIAM J. Comput., 43(4):1239–1262, 2014. doi:10.1137/130908440.
[19] Clarence Filsfils, Stefano Previdi, Les Ginsberg, Bruno Decraene, Stephane Litkowski, and Rob Shakir. Segment routing architecture. RFC, 8402:1–32, 2018. doi:10.17487/RFC8402.
[20] Les R. Foulds. A multi-commodity flow network design problem. Transportation Research Part B: Methodological, 15(4):273–283, 1981. doi:10.1016/0191-2615(81)90013-8.
[21] Michael R. Garey and David S. Johnson. Computers and Intractability: A Guide to the Theory of NP-Completeness. W. H. Freeman, 1979.
[22] Bernard Gendron, Teodor Gabriel Crainic, and Antonio Frangioni. Multicommodity Capacitated Network Design, pages 1–19. Springer US, Boston, MA, 1999. doi:10.1007/978-1-4615-5087-7_1.
[23] Bernard Gendron and Mathieu Larose. Branch-and-price-and-cut for large-scale multicommodity capacitated fixed-charge network design. EURO J. Comput. Optim., 2(1-2):55–75, 2014. doi:10.1007/S13675-014-0020-9.
[24] Mohammad Taghi Hajiaghayi, Robert D. Kleinberg, Harald Räcke, and Tom Leighton. Oblivious routing on node-capacitated and directed graphs. ACM Trans. Algorithms, 3(4):51, 2007. doi:10.1145/1290672.1290688.
[25] Renaud Hartert, Pierre Schaus, Stefano Vissicchio, and Olivier Bonaventure. Solving segment routing problems with hybrid constraint programming techniques. In Proc. CP 2015, volume 9255 of LNCS, pages 592–608. Springer, 2015. doi:10.1007/978-3-319-23219-5_41.
[26] Kamal Jain. A factor 2 approximation algorithm for the generalized steiner network problem. Comb., 21(1):39–60, 2001. doi:10.1007/s004930170004.
[27] Richard M. Karp. Reducibility among combinatorial problems. In Proc. COCO 1972, The IBM Research Symposia Series, pages 85–103. Plenum Press, New York, 1972. doi:10.1007/978-1-4684-2001-2_9.
[28] Samir Khuller, Balaji Raghavachari, and Neal E. Young. Designing multi-commodity flow trees. Inf. Process. Lett., 50(1):49–55, 1994. doi:10.1016/0020-0190(94)90044-2.
[29] Samir Khuller, Balaji Raghavachari, and Neal E. Young. Approximating the minimum equivalent digraph. SIAM J. Comput., 24(4):859–872, 1995. doi:10.1137/S0097539793256685.
[30] Frank Thomson Leighton and Ankur Moitra. Extensions and limits to vertex sparsification. In Proc. STOC 2010, pages 47–56. ACM, 2010. doi:10.1145/1806689.1806698.
[31] Vardges Melkonian and Éva Tardos. Algorithms for a network design problem with crossing supermodular demands. Networks, 43(4):256–265, 2004. doi:10.1002/NET.20005.
[32] Ankur Moitra. Approximation algorithms for multicommodity-type problems with guarantees independent of the graph size. In Proc. FOCS 2009, pages 3–12. IEEE Computer Society, 2009. doi:10.1109/FOCS.2009.28.
[33] Dana Moshkovitz. The projection games conjecture and the np-hardness of ln n-approximating set-cover. Theory Comput., 11:221–235, 2015. doi:10.4086/toc.2015.v011a007.
[34] Daniel Otten, Max Ilsen, Markus Chimani, and Nils Aschenbruck. Green traffic engineering by line card minimization. In Proc. LCN 2023, pages 1–8. IEEE, 2023. doi:10.1109/LCN58197.2023.10223344.
[35] Harald Räcke. Minimizing congestion in general networks. In Proc. FOCS 2002, pages 43–52. IEEE Computer Society, 2002. doi:10.1109/SFCS.2002.1181881.
[36] Harald Räcke. Optimal hierarchical decompositions for congestion minimization in networks. In Proc. STOC 2008, pages 255–264. ACM, 2008. doi:10.1145/1374376.1374415.
[37] Sartaj Sahni. Computationally related problems. SIAM J. Comput., 3(4):262–279, 1974. doi:10.1137/0203021.
[38] Khodakaram Salimifard and Sara Bigharaz. The multicommodity network flow problem: state of the art classification, applications, and solution methods. Oper. Res., 22(1):1–47, 2022. doi:10.1007/S12351-020-00564-8.
[39] F. Sibel Salman, Joseph Cheriyan, R. Ravi, and S. Subramanian. Buy-at-bulk network design: Approximating the single-sink edge installation problem. In Proc. SODA 1997, pages 619–628. ACM/SIAM, 1997. URL: http://dl.acm.org/citation.cfm?id=314161.314397.
[40] Timmy Schüller, Nils Aschenbruck, Markus Chimani, Martin Horneffer, and Stefan Schnitter. Traffic engineering using segment routing and considering requirements of a carrier IP network. IEEE/ACM Trans. Netw., 26(4):1851–1864, 2018. doi:10.1109/TNET.2018.2854610.
[41] Vijay V. Vazirani. Approximation algorithms. Springer, 2001. URL: http://www.springer.com/computer/theoretical+computer+science/book/978-3-540-65367-7.
[42] Adrian Vetta. Approximating the minimum strongly connected subgraph via a matching lower bound. In Proc. SODA 2001, pages 417–426. ACM/SIAM, 2001. URL: http://dl.acm.org/citation.cfm?id=365411.365493.
[43] David P. Williamson and David B. Shmoys. The Design of Approximation Algorithms. Cambridge University Press, 2011. URL: http://www.cambridge.org/de/knowledge/isbn/item5759340/.
[44] Mingui Zhang, Cheng Yi, Bin Liu, and Beichuan Zhang. GreenTE: Power-aware traffic engineering. In Proc. ICNP 2010, pages 21–30. IEEE Computer Society, 2010. doi:10.1109/ICNP.2010.5762751.
[45] Liang Zhao, Hiroshi Nagamochi, and Toshihide Ibaraki. A linear time 5/3-approximation for the minimum strongly-connected spanning subgraph problem. Inf. Process. Lett., 86(2):63–70, 2003. doi:10.1016/S0020-0190(02)00476-3.
