3.415-Approximation for Coflow Scheduling via Iterated Rounding

Rohwedder, Lars; Schnaars, Leander

doi:10.4230/LIPIcs.ICALP.2025.128

Lars Rohwedder

University of Southern Denmark, Odense, Denmark

Leander Schnaars

Technical University of Munich, Germany

Abstract

We provide an algorithm giving a $\frac{140}{41}$-approximation ($\frac{140}{41}<3.415$) for Coflow Scheduling and a $4.36$-approximation for Coflow Scheduling with release dates. This improves upon the best known $4$- and $5$-approximations, respectively, and addresses an open question posed by Agarwal, Rajakrishnan, Narayan, Agarwal, Shmoys, and Vahdat [1], Fukunaga [9], and others. We additionally show that in an asymptotic setting, the algorithm achieves a $(2+\epsilon)$-approximation, which is essentially optimal under $\mathbb{P}\neq\mathbb{NP}$. The improvements are achieved through a novel edge allocation scheme based on iterated LP rounding, together with a framework that enables establishing strong bounds for combinations of several edge allocation algorithms.

Keywords and phrases:

Coflow Scheduling, Approximation Algorithms, Iterated Rounding

Category:

Track A: Algorithms, Complexity and Games

Funding:

Lars Rohwedder: Supported by Dutch Research Council (NWO) project “The Twilight Zone of Efficiency: Optimality of Quasi-Polynomial Time Algorithms” [grant number OCEN.W.21.268].

Leander Schnaars: Supported by the Deutsche Forschungsgemeinschaft (DFG, German Research Foundation) – GRK 2201/2 – Projektnummer 277991500.

2012 ACM Subject Classification:

Theory of computation $\rightarrow$ Scheduling algorithms

Editors:

Keren Censor-Hillel, Fabrizio Grandoni, Joël Ouaknine, and Gabriele Puppis

Series and Publisher:

Leibniz International Proceedings in Informatics, Schloss Dagstuhl – Leibniz-Zentrum für Informatik

1 Introduction

Coflow Scheduling models the problem of data exchange between various nodes in a shared network. It has proven indispensable in improving the performance of common data exchange and distributed computing frameworks such as MapReduce [7], Spark [27], and Hadoop [25]. These routines form an integral part of large scale computations commonly found in applications such as bioinformatics, deep learning, and large language models [11, 21, 12]. The problem has enjoyed attention from both the theory community and the application side, with many works bridging theory and practice.

Formally, a coflow instance is given by some bipartite set of vertices $V:=U_{1}\cup U_{2}$ and a set of coflows $E_{1},\dotsc,E_{n}$ , where each coflow $E_{j}$ is a subset of bipartite edges on $V$ , possibly containing duplicates. Additionally, each coflow $E_{j}$ has some associated weight $\omega_{j}\in\mathbb{R}^{+}$ . This models for example a set of input and output ports in a shared network, where each edge inside a coflow represents some data transmission requirement. During each discrete step in time, we are allowed to schedule a set of edges on the graph for which no vertex has more than one adjacent edge, so a matching. This represents the requirement that ports only send to and receive from one other port during each discrete step in time. A coflow finishes at time-step $t$ if all of its edges have been scheduled on or before time $t$ , with at least one edge being scheduled during $t$ . We call $C^{*}_{j}$ the finishing time of coflow $E_{j}$ and wish to minimize the weighted sum of completion times $\sum_{j\in[n]}\omega_{j}C^{*}_{j}$ .
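To make the objective concrete, the following Python sketch checks that a schedule uses a matching in every time slot and evaluates the weighted sum of completion times. The list-based encoding (one list of `(u, v)` pairs per coflow, with `u` in $U_{1}$ and `v` in $U_{2}$, and one parallel list of time slots per coflow) and the function names are illustrative assumptions, not notation from the paper.

```python
def is_valid_schedule(coflows, schedule):
    """Return True if every time slot holds a matching.

    coflows[j][i]  = (u, v), the i-th edge of coflow j (u in U1, v in U2)
    schedule[j][i] = time slot (1, 2, 3, ...) assigned to that edge
    """
    used_left, used_right = set(), set()
    for edges, slots in zip(coflows, schedule):
        for (u, v), t in zip(edges, slots):
            # A vertex may carry at most one edge per time slot.
            if t < 1 or (t, u) in used_left or (t, v) in used_right:
                return False
            used_left.add((t, u))
            used_right.add((t, v))
    return True


def weighted_completion_cost(weights, schedule):
    """Weighted sum of finishing times; a coflow finishes with its last edge."""
    return sum(w * max(slots) for w, slots in zip(weights, schedule))


# Tiny made-up instance: two coflows on U1 = {a, b}, U2 = {x, y}.
coflows = [[("a", "x"), ("b", "y")], [("a", "y"), ("a", "x")]]
weights = [2, 1]
schedule = [[1, 1], [2, 3]]
```

On this instance the first coflow finishes at time 1 and the second at time 3, so the cost is $2\cdot 1+1\cdot 3=5$.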

Coflow Scheduling was first introduced by Chowdhury et al. [6], though the closely related problem of scheduling on network switches has been studied earlier under different names by various authors [5, 13]. Coflow Scheduling can be seen as an extension with a combinatorial structure of a problem called Concurrent Open Shop Scheduling (COSS) with preemption. In COSS, there is a set of machines and jobs, where every job has some demand on each machine which can be fulfilled concurrently, and jobs finish when they are completed on every machine. For COSS, several $2$ -approximation algorithms are known [10, 19, 20] and the problem is known to be $\mathbb{NP}$ -hard to approximate within $2-\epsilon$ , for any $\epsilon>0$ [23]. This hardness result extends to Coflow Scheduling. On the side of approximation algorithms for Coflow Scheduling, there is still a gap to the lower bound. For the case of no release dates, multiple authors have given $4$ -approximation algorithms [1, 2, 9], which extend to $5$ -approximations in the case of release dates. Fukunaga [9] shows that in the case of release dates the integrality gap of the linear program used in the algorithm is at most $4$ , though his proof is non-constructive. Several authors have claimed $(2+\epsilon)$ -approximations, but all were later shown to be incorrect, see [15, 2] for discussion. For the setting in which the simultaneously schedulable flows have to be independent sets of a matroid instead of matchings, Im et al. [14] provided an algorithm with a $(2+\epsilon)$ -approximation guarantee. Khuller et al. [17] showed a framework which provides guarantees in an online setting using offline approximation algorithms, leading to a $12$ -approximation for online Coflow Scheduling. Extensions to general graphs [16] and so called path-based Coflow Scheduling [8] have been studied.

Whether the $4$ - and respectively $5$ -approximation can be improved has been a major open question raised by most previous works [1, 9], especially in light of the $2-\epsilon$ lower bound. We address this question and show that both bounds can be beaten, with a tight result in an asymptotic setting.

1.1 Our Contribution

We present the first polynomial time algorithm which achieves a better than $4$ -approximation for Coflow Scheduling without release dates and the first algorithm which achieves a better than $5$ -approximation with release dates. More specifically, we show the following theorems.

Theorem 1.

There is a polynomial time algorithm achieving a $\frac{140}{41}(<3.415)$ -approximation for Coflow Scheduling without release dates.

Theorem 2.

There is a polynomial time algorithm achieving a $4.36$ -approximation for Coflow Scheduling with release dates.

Using a more technical construction, the guarantee of Theorem 1 can be slightly improved; details can be found in the full version. We additionally prove that in a certain asymptotic setting, roughly when most coflows have large finishing times in any optimum solution, we can achieve a $(2+\epsilon)$-approximation, which is optimal. This result holds even in the case with release dates.

Theorem 3.

For any $\epsilon>0$ , there exists an $\hat{\epsilon}>0$ such that there is a $(2+\epsilon)$ -approximation algorithm for all coflow instances $\mathcal{I}$ fulfilling

\sum_{j\in[|\mathcal{I}|]}\omega_{j}\quad\leq\quad\hat{\epsilon}\cdot\mathrm{OPT}(\mathcal{I}).

Note that using a framework by Khuller et al. [17], any improvement in the approximation ratio for Coflow Scheduling without release dates directly gives an improvement for the best known approximation for the online setting of Coflow Scheduling. The following table provides an overview over the state of the art of approximation algorithms for various variants of Coflow Scheduling and our respective improvements.

Table 1: Best known previous approximations and our results. The sources marked [*] are [2, 1, 9].

Case	Best Known	This Work
No Release Dates	$4$ [*]	$3.415$
Release Dates	$5$ [*]	$4.36$
Release Dates (integrality gap)	$4$ [9]	$3.893$
Asymptotic + No Release Dates	$4$ [*]	$2+\epsilon$
Asymptotic + Release Dates	$5$ [*]	$2+\epsilon$
Online	$12$ [17]	$11.415$

The main technical contributions are a novel allocation and rounding scheme for the individual edges of each coflow, inspired by techniques used in proving the Beck-Fiala Theorem from discrepancy theory and a framework for establishing the approximation ratio for combinations of several such edge scheduling algorithms.
Our algorithm follows a two phase approach. This has been done either implicitly or explicitly in most approaches found in the literature. In the first phase, for each coflow and its associated flows, deadlines are determined through an LP based approach combined with a randomized rounding procedure. These deadlines are equipped with a special structure which we exploit in the algorithm. They specifically provide a $2$ -approximation cost guarantee with respect to some optimum solution. In the second phase, the goal is to find a valid allocation for the coflows, or more precisely for the individual edges of each coflow, to time-slots. In previous works, this was achieved by a simple greedy allocation procedure, which in the case without release dates achieves a deadline violation of at most a factor $2$ for each coflow, yielding a $4$ -approximation. We use a combination of two algorithms, both the greedy allocation rule and a novel iterated rounding scheme. The greedy allocation performs well for small deadlines, but converges to a $2$ -approximation for larger deadlines, while the rounding procedure can schedule large deadlines arbitrarily well, but has worse results for small ones. Note that these factors only capture the completion time delay and do not take into account the additional loss of factor $2$ from the deadline construction. By running both algorithms and picking the better result, we are able to improve upon the factor $4$ .
To show the improved approximation ratio, we establish a general framework which can be used to bound the coflow scheduling approximation ratio for any collection of edge allocation algorithms.

1.2 Organization

Section 2 introduces important notation and discusses results from LP and graph theory which form an integral part of several subroutines. Section 3 provides a complete overview of the entire algorithm and the most important theorems for Coflow Scheduling without release dates. Full proofs and additional details can be found in Section 4. The extension to release dates is described in Section 5. Section 6 proves the $(2+\epsilon)$ guarantee in the asymptotic case. The full version of this paper contains additional details and discusses further related theory, such as results regarding the structure and complexity of the LP used, the extension of the algorithm to high edge multiplicities, the improved LP integrality gap, and a slight improvement over the approximation guarantee from Theorem 7.

2 Preliminaries

For $n\in\mathbb{N}$ we define $[n]:=\{1,\dotsc,n\}$ . Graphs $G=(V,E)$ are defined by a set of vertices $V$ and edges $E\subseteq V\times V$ . We slightly abuse notation and allow $E$ to contain multiple copies of the same edge. We use $\Delta(G)$ to denote the maximum degree of a graph. For some set of edges $E$ , $\Delta(E)$ refers to the maximum degree of the canonical graph induced by this edge set.

A coflow instance is given by a collection of bipartite edge multi-sets $E_{1},\dotsc,E_{n}$ on some set of vertices, together with weights $\omega_{1},\dotsc,\omega_{n}\in\mathbb{R}_{+}$ . We usually only refer to the edge sets and let the vertices of the underlying graph be implicitly described by them. We define $E:=\cup_{j\in[n]}E_{j}$ . In the case of release dates, for every coflow $j\in[n]$ there is a release date $r_{j}\in\mathbb{N}$ . A release date $r_{j}$ means that edges from coflow $E_{j}$ can be scheduled the earliest in time slot $r_{j}+1$ . We use the term flow to refer to a collection of identical edges within one coflow, so essentially an edge with its multiplicity. In the main body of this work we assume that edge multiplicities are encoded in this explicit way, meaning that multiplicities are represented by multiple copies. We discuss the case where the multiplicities are instead encoded as an integer in the full version of this paper. A valid schedule is a mapping of all edges to time slots, such that in every time slot the assigned edges form a matching. In the case of release dates, no edge from any coflow is allowed to be scheduled in a time slot smaller than or equal to the respective release date. The finishing time of a coflow is the latest time slot to which one of its edges is assigned. We call $C^{*}_{j}$ the finishing time of coflow $E_{j}$ and wish to minimize the weighted sum of completion times $\sum_{j\in[n]}\omega_{j}C^{*}_{j}$ .

As we use an iterated LP rounding scheme, we give a brief overview of the most important relevant theory here. For more details see for example [3, 24]. Let $A\in\mathbb{R}^{m\times n}$ be a matrix and $b\in\mathbb{R}^{m},c\in\mathbb{R}^{n}$ be vectors. We consider the polytope $\mathcal{P}=\{x\in[0,1]^{n}\ |\ Ax\leq b\}$ and the associated LP $\min_{x\in\mathcal{P}}c^{T}x$. From standard LP theory we know that there always exists an LP optimum solution at a vertex of $\mathcal{P}$ which can be found in strongly polynomial time, as long as all values in $A$ are polynomially bounded [26]. One key fact we use is that if additionally $m<n$, one can find such a vertex solution in which at least $n-m$ entries are in $\{0,1\}$. This fact can be extended to the case where every entry $x_{i}$ is constrained to some interval $[0,n_{i}]$, for $n_{i}\in\mathbb{N}$.

The Coflow Scheduling problem is closely related to several well studied graph and coloring problems. As we use some of these results directly and implicitly in our work, we briefly review them here. Given some set of edges $E$ on a bipartite graph $G=(V,E)$ , the question whether they can be partitioned into some number of matchings $k$ is equivalent to asking whether there is a proper $k$ -edge-coloring of $E$ . Clearly at least $\Delta(G)$ , i.e. the maximum degree of the graph, colors are needed. Kőnig’s Theorem, and to an extent also Vizing’s Theorem, give a strong result for bipartite graphs:

Theorem 4 ([18]).

Any bipartite graph $G$ can be properly edge-colored with $\Delta(G)$ colors.

As a set of matchings is equivalent to a set of scheduled flows in the coflow setting, the question whether a set of edges can be scheduled during some collection of time points is equivalently answered by this. Any set of flows for which the induced graph has maximum degree $d$ can be scheduled within $d$ time slots. This result is essential to our analysis, as it shows that degree bounds for some set of selected flows are sufficient to ensure their schedulability.
Note that the proof of Theorem 4 can be done in a constructive way, leading to a polynomial time algorithm producing such an allocation. This algorithm can be extended to work even for edges with possibly superpolynomial multiplicities, for details see [22].
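The constructive side of Theorem 4 can be illustrated with the classical alternating-path argument. The sketch below colors the edges of a bipartite multigraph one at a time and swaps two colors along an alternating path whenever the endpoints of the new edge have no common free color. It is a textbook-style illustration under the explicit-multiplicity encoding, not the procedure of [22] for superpolynomial multiplicities; the function name and data layout are assumptions.

```python
from collections import Counter, defaultdict


def konig_edge_coloring(edges):
    """Properly colour a bipartite multigraph with Delta colours.

    edges: list of (u, v) with u on the left side and v on the right side;
    parallel edges are allowed. Returns one colour in range(Delta) per edge.
    """
    deg_l, deg_r = Counter(), Counter()
    for u, v in edges:
        deg_l[u] += 1
        deg_r[v] += 1
    delta = max(list(deg_l.values()) + list(deg_r.values()), default=0)

    color = [None] * len(edges)
    at = defaultdict(dict)  # at[(side, vertex)][c] = index of the edge coloured c there
    for e, (u, v) in enumerate(edges):
        left, right = ("L", u), ("R", v)
        a = next(c for c in range(delta) if c not in at[left])   # free at u
        b = next(c for c in range(delta) if c not in at[right])  # free at v
        if a != b:
            # Collect the path starting at v that alternates colours a, b, a, ...
            # In a bipartite graph this path never reaches u.
            path, node, c = [], right, a
            while c in at[node]:
                f = at[node][c]
                path.append(f)
                fu, fv = edges[f]
                node = ("L", fu) if node[0] == "R" else ("R", fv)
                c = a if c == b else b
            # Swap a and b along the path, which frees colour a at v.
            for f in path:
                fu, fv = edges[f]
                del at[("L", fu)][color[f]]
                del at[("R", fv)][color[f]]
            for f in path:
                color[f] = a if color[f] == b else b
                fu, fv = edges[f]
                at[("L", fu)][color[f]] = f
                at[("R", fv)][color[f]] = f
        color[e] = a
        at[left][a] = e
        at[right][a] = e
    return color


example_edges = [("a", "x"), ("a", "y"), ("b", "x"), ("b", "y"), ("a", "x")]
example_colors = konig_edge_coloring(example_edges)
```

Reading each color class as one time slot turns the coloring into a schedule of length $\Delta$ for the given edge set.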

3 Algorithmic Framework

This section provides a complete overview of the algorithmic framework used to establish the improved approximation ratios. We focus on the case without release dates here; the necessary modifications for release dates are described in Section 5.

3.1 Coflow Deadlines

Given some coflow instance, we aim to determine a deadline for each of its coflows. These deadlines might not necessarily be strict in the sense that constructed schedules have to adhere to them, but they are rather used to both guide edge allocation procedures and to then bound their resulting costs. Most existing coflow approximation algorithms use a similar strategy.
We take a structural approach, where we first define structure which we want our deadlines to obey and then describe how such deadlines can be found. We capture the structural constraints in the following LP. It has been implicitly used in analysis by [14, 9] and others. Let $C_{1}\leq C_{2}\leq\dotsc\leq C_{n}$ be deadlines for the coflows and for easier notation define $C_{0}:=0$ .

\begin{aligned}
\sum_{s\in[n]}x_{s,e}\quad&=\quad 1&&\forall e\in E&\qquad(I)\\
\sum_{e:v\in e}x_{s,e}\quad&\leq\quad C_{s}-C_{s-1}&&\forall s\in[n],\forall v\in V&\qquad(II)\\
x_{s,e}\quad&=\quad 0&&\forall j\in[n],\forall e\in E_{j},\forall s>j&\qquad(III)\\
x_{s,e}\quad&\geq\quad 0
\end{aligned}

(LP I)

Instead of enforcing the necessary constraints for Coflow Scheduling for each individual time slot, LP I groups the slots into blocks in between the coflow deadlines. The variable $x_{s,e}$ describes the assignment of edge $e$ to block $s$ . Constraint $(I)$ ensures that every edge from every coflow is fully scheduled. The block between $C_{s-1}$ and $C_{s}$ has size $C_{s}-C_{s-1}$ , so constraint $(II)$ ensures that in every such block for every vertex the amount of adjacent edges does not exceed the block size. Constraint $(III)$ forces edges to be zero for blocks after the respective deadline. The step from individual time slot degree bounds to block degree bounds is justified by Kőnig’s Theorem (Theorem 4), as it guarantees that any bipartite graph with some maximum degree $\Delta$ can be decomposed into $\Delta$ matchings. This is equivalent to saying that any set of bipartite edges $E$ for which the induced graph has maximum degree $\Delta$ can be scheduled in $\Delta$ time slots.

Assuming the deadlines $C_{1}\leq C_{2}\leq\dotsc\leq C_{n}$ are set as the coflow finishing times from some optimal schedule for the underlying coflow instance, the edge assignment directly induces a feasible point inside LP I. For each edge $e$ which is scheduled in some time slot $t$ in the optimal schedule, set $x_{s,e}:=1$ , for $s$ such that $t\in(C_{s-1},C_{s}]$ . Set all other variables to $0$ . As by definition of a valid solution, each edge is scheduled before its coflow’s deadline, constraint $(I)$ is fulfilled and constraint $(III)$ cannot be violated. As the edges in each time slot form a matching, constraint $(II)$ is also fulfilled.
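The mapping from a schedule to a point of LP I can be sketched in a few lines of Python. The code assigns every edge to the block containing its time slot and then checks constraints $(II)$ and $(III)$; constraint $(I)$ holds automatically because each edge lands in exactly one block. The encoding of coflows and schedules as parallel lists and the name `satisfies_lp1` are assumptions made for this illustration.

```python
from bisect import bisect_left
from collections import defaultdict


def block_of(t, deadlines):
    """1-based block index s with t in (C_{s-1}, C_s], where C_0 = 0."""
    return bisect_left(deadlines, t) + 1


def satisfies_lp1(coflows, schedule, deadlines):
    """Check whether the integral point induced by a schedule is feasible for LP I.

    coflows[j-1]  = list of (u, v) edges of coflow j, ordered by deadline
    schedule[j-1] = time slots of those edges
    deadlines     = sorted list [C_1, ..., C_n]
    """
    load = defaultdict(int)
    for j, (edges, slots) in enumerate(zip(coflows, schedule), start=1):
        for (u, v), t in zip(edges, slots):
            s = block_of(t, deadlines)
            if s > j:  # constraint (III): no edge of coflow j after block j
                return False
            load[(s, "L", u)] += 1
            load[(s, "R", v)] += 1
    starts = [0] + list(deadlines[:-1])
    # Constraint (II): the load of a vertex in block s is at most C_s - C_{s-1}.
    return all(cnt <= deadlines[s - 1] - starts[s - 1]
               for (s, _side, _vertex), cnt in load.items())


coflows = [[("a", "x"), ("b", "y")], [("a", "y"), ("a", "x")]]
schedule = [[1, 1], [2, 3]]
```

With deadlines $(1,3)$ the check succeeds on this made-up instance, while with deadlines $(1,2)$ it fails because the last edge of the second coflow is scheduled after its deadline.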

Conversely, an integral solution to the LP corresponds to a valid solution for Coflow Scheduling. However, in order to be able to solve the LP in polynomial time, we cannot enforce integrality. Hence the constraints only enforce that the variable assignment corresponds to a fractional matching. There are instances and deadlines for which LP I is feasible, but for which no feasible integral point and therefore also no feasible integral schedule exists. In fact, determining whether an integral point exists is an $\mathbb{NP}$-hard problem; for details we refer to the full version.

Finding some set of deadlines for which LP I is feasible is easy, as one can simply choose large enough values to guarantee feasibility. However, for the purpose of constructing good approximation algorithms for Coflow Scheduling, we require that the deadlines fulfill some cost guarantees with respect to an optimal coflow schedule.
There is an LP based approach which returns deadlines for which LP I is feasible and certain strong guarantees hold. This technique has been used by [14, 9] and others. They use a randomized rounding scheme on another LP formulation to determine integral deadlines $C^{\prime}_{1},\dotsc,C^{\prime}_{n}$ . We slightly modify their algorithm and leave out the final step in which they round up the deadlines and obtain $C_{1},\dotsc,C_{n}$ . These deadlines are thus potentially fractional. By slightly modifying their proof, the following bound can be shown.

Lemma 5 ([14]).

There is a polynomial time randomized algorithm determining deadlines $C_{1},\dotsc,C_{n}$ for which LP I is feasible and for which the following cost bound holds:

\sum_{j\in[n]}\omega_{j}\mathbb{E}[C_{j}]\quad\leq\quad 2\cdot\mathrm{OPT}\ -\ \sum_{j\in[n]}\omega_{j}

More details about the procedure used by [14] to determine such deadlines can be found in the full version. They only implicitly work with LP I, so we provide additional details on the connection. Note that the procedure can be de-randomized to obtain a fully deterministic algorithm.

The multiplicative factor of $2$ in Lemma 5 is optimal assuming $\mathbb{P}\neq\mathbb{NP}$ . This follows from the factor $(2-\epsilon)$ -approximation hardness of Concurrent Open Shop Scheduling, as Coflow Scheduling can be seen as a generalization of this problem [23].

3.2 Integral Edge Assignments with Guarantees

Let $C_{1},\dotsc,C_{n}$ be deadlines for which LP I is feasible and Lemma 5 holds. Using the result of the lemma, we immediately obtain that if we are able to find an allocation such that all edges from each coflow are scheduled by their respective deadlines, we have achieved a $2$ -approximation for Coflow Scheduling. In the same way, if for some $\alpha\geq 1$ we are able to schedule each coflow $j$ by time $\alpha\cdot C_{j}$ , we obtain a $2\cdot\alpha$ approximation algorithm.

We analyze two edge allocation algorithms which provide different guarantees for the finishing times of the coflows. The first algorithm $\mathrm{Greedy}$ is a simple greedy allocation scheme. This procedure was used by previous authors to derive $4$ -approximation algorithms for Coflow Scheduling. Let $\mathrm{Greedy}(C_{j})$ denote the finishing time of coflow $E_{j}$ in the schedule produced by $\mathrm{Greedy}$ .

Lemma 6.

For given deadlines $C_{1},\dotsc,C_{n}$ for which LP I is feasible there is an algorithm $\mathrm{Greedy}$ returning a valid coflow schedule such that the following holds.

\mathrm{Greedy}(C_{j})\quad\leq\quad 2C_{j}-1

The second algorithm $\mathrm{CBF}^{\tau}$ is a novel allocation scheme using a form of iterated rounding inspired by the Beck-Fiala Theorem from discrepancy theory [4]. The algorithm is parameterized by $\tau\in\mathbb{N}_{\geq 2}$ and allocates the coflow deadlines to blocks, where each block’s size is some integer multiple of $\tau$ . An edge assignment is then determined which only slightly violates the size of each block. This leads to the following completion time guarantees.

Theorem 7.

For given deadlines $C_{1},\dotsc,C_{n}$ for which LP I is feasible, weights $\omega_{1},\dotsc,\omega_{n}$ , and a parameter $\tau\in\mathbb{N}_{\geq 2}$ , there is an algorithm $\mathrm{CBF}^{\tau}$ returning a valid coflow schedule such that the following holds.

\sum_{j\in[n]}\omega_{j}\cdot\mathrm{CBF}^{\tau}(C_{j})\quad\leq\quad\sum_{j\in[n]}\omega_{j}\left(\frac{\tau+2}{\tau}C_{j}+\frac{\tau}{2}+2.5-\frac{2}{\tau}\right)

Note that the guarantees of the two algorithms are quite different. $\mathrm{Greedy}$ achieves a rather strong guarantee for small deadlines, while for large deadlines $\mathrm{CBF}^{\tau}$ gives good guarantees through an appropriate choice of $\tau$. In fact, under certain assumptions on the provided deadlines, $\mathrm{CBF}^{\tau}$ can achieve approximations arbitrarily close to the optimum of $1$, see Section 6.
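The trade-off between the two guarantees can be explored numerically. The sketch below evaluates the per-deadline bound of Lemma 6 and the per-deadline term inside the weighted sum of Theorem 7 with exact rational arithmetic. Note that Theorem 7 only bounds the weighted sum, so comparing the terms pointwise is merely an illustration; the function names are assumptions.

```python
from fractions import Fraction


def greedy_bound(x):
    """Bound of Lemma 6 for a deadline x: 2x - 1."""
    return 2 * x - 1


def cbf_bound(tau, x):
    """Term of Theorem 7 for a deadline x: (tau+2)/tau * x + tau/2 + 2.5 - 2/tau."""
    tau = Fraction(tau)
    return (tau + 2) / tau * x + tau / 2 + Fraction(5, 2) - 2 / tau


# For tau = 6 the two bounds cross between x = 9 and x = 10.
small = (greedy_bound(9), cbf_bound(6, 9))
large = (greedy_bound(10), cbf_bound(6, 10))
```

For $\tau=6$ the term simplifies to $\frac{4}{3}x+\frac{31}{6}$, which is smaller than $2x-1$ exactly when $x>9.25$.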

Our coflow algorithm aims to achieve guarantees for both small and large deadlines by combining the two algorithms. The procedure is straightforward. It obtains the deadlines $C_{1},\dotsc,C_{n}$ through the procedure explained in Section 3.1 and then runs both $\mathrm{Greedy}$ and $\mathrm{CBF}^{6}$ independently, returning the schedule with lower cost. Analyzing the cost of the returned solution requires some care, as we need a uniform bound over any possible input instance.

3.3 Combining Algorithmic Guarantees

By the definition of the coflow algorithm, for each possible instance $\mathcal{I}$ , its cost $C_{\mathrm{ALG}}(\mathcal{I})$ is given as the minimum of the costs $C_{G}(\mathcal{I})$ and $C_{\mathrm{CBF}}(\mathcal{I})$ of the $\mathrm{Greedy}$ and respectively $\mathrm{CBF}^{6}$ algorithm.
We show a general proof framework for such algorithms which provide deadline guarantees, which gives sharp bounds for taking minimums over several algorithms’ costs. The derivation is not difficult, but the framework offers a surprisingly simple and strong method to establish bounds for large classes of algorithms. It only requires bounds for the delay guarantees of the algorithms, which are usually relatively simple to establish.
For this purpose, let $C_{1},\dotsc,C_{n}$ be deadlines for which LP I is feasible and let $\mathrm{ALG}_{1},\dotsc,\mathrm{ALG}_{k}$ be algorithms producing valid coflow schedules from such deadlines, with $\mathrm{ALG}_{i}(C_{j})$ being the finishing time of coflow $E_{j}$ in the schedule produced by $\mathrm{ALG}_{i}$. For $i\in[k]$, let $f_{i}$ be a function capturing a bound on the maximum weighted deadline delay of $\mathrm{ALG}_{i}$. This means that $f_{i}$ is such that $\sum_{j\in[n]}\omega_{j}\cdot\mathrm{ALG}_{i}(C_{j})\leq\sum_{j\in[n]}\omega_{j}\cdot f_{i}(C_{j})$. Such functions might stem from bounds on individual deadlines like in the case of $\mathrm{Greedy}$, but can also come from bounds which are already given as a weighted sum over all deadlines like for $\mathrm{CBF}^{\tau}$.

Lemma 8.

Let $\lambda_{1},\dotsc,\lambda_{k}\geq 0$ with $\sum_{i\in[k]}\lambda_{i}=1$ and $\alpha\in\mathbb{R}^{+}$ . If for all $x\geq 1$

\sum_{i\in[k]}\lambda_{i}f_{i}(x)\quad\leq\quad\alpha(x+1),

then for all coflow instances $\mathcal{I}$ :

C_{\mathrm{ALG}}(\mathcal{I})\quad=\quad\min\{C_{\mathrm{ALG}_{1}}(\mathcal{I}),\dotsc,C_{\mathrm{ALG}_{k}}(\mathcal{I})\}\quad\leq\quad 2\alpha\cdot\mathrm{OPT}(\mathcal{I})

Proof.

Let $\lambda_{1},\dotsc,\lambda_{k}\geq 0$ be fixed constants with $\sum_{i\in[k]}\lambda_{i}=1$. Define $g(x):=\lambda_{1}f_{1}(x)+\lambda_{2}f_{2}(x)+\cdots+\lambda_{k}f_{k}(x)$. We define a randomized algorithm $\mathrm{RALG}$ which picks a single index $i\in[k]$ with probability $\lambda_{i}$ and runs algorithm $\mathrm{ALG}_{i}$. With these definitions, for the cost $C_{\mathrm{ALG}}$ of the combined algorithm $\mathrm{ALG}$ we obtain:

C_{\mathrm{ALG}}\quad=\quad\min\{C_{\mathrm{ALG}_{1}},\dotsc,C_{\mathrm{ALG}_{k}}\}\quad\leq\quad\sum_{i\in[k]}\lambda_{i}\cdot C_{\mathrm{ALG}_{i}}\quad=\quad\mathbb{E}[C_{\mathrm{RALG}}]

For the expected cost of $\mathrm{RALG}$ we have

\begin{aligned}
\mathbb{E}[C_{\mathrm{RALG}}]\quad&=\quad\mathbb{E}\Big[\sum_{j\in[n]}\omega_{j}\mathrm{RALG}(C_{j})\Big]\quad=\quad\sum_{j\in[n]}\omega_{j}\mathbb{E}[\mathrm{RALG}(C_{j})]\\
&\leq\quad\sum_{j\in[n]}\omega_{j}\left(\lambda_{1}f_{1}(C_{j})+\cdots+\lambda_{k}f_{k}(C_{j})\right)\quad=\quad\sum_{j\in[n]}\omega_{j}g(C_{j}).
\end{aligned}

So by establishing suitable bounds for $g(x)$ , we can show approximation bounds for $\mathrm{ALG}$ . Assume that there exists some $\alpha\in\mathbb{R}_{+}$ such that $g(x)\leq\alpha(x+1)$ . Then we can further bound

\sum_{j\in[n]}\omega_{j}g(C_{j})\quad\leq\quad\sum_{j\in[n]}\omega_{j}(\alpha(C_{j}+1))\quad\leq\quad\alpha\cdot\sum_{j\in[n]}\omega_{j}C_{j}\quad+\quad\alpha\cdot\sum_{j\in[n]}\omega_{j}.

By using the deadlines $C_{1},\dotsc,C_{n}$ provided by Lemma 5, for which $\sum_{j\in[n]}\omega_{j}C_{j}\leq 2\cdot\mathrm{OPT}-\sum_{j\in[n]}\omega_{j}$ holds in expectation (or deterministically after de-randomization), this yields the desired bound:

C_{\mathrm{ALG}}\quad\leq\quad 2\alpha\cdot\mathrm{OPT}

$\hfill\blacktriangleleft$
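The final step of the proof of Lemma 8 can be spelled out as follows. Substituting the cost bound of Lemma 5 into the previous inequality makes the two sums over the weights cancel; the display below is a worked restatement of that substitution, with the deadline bound understood in expectation over the randomized deadline construction.

```latex
\begin{aligned}
C_{\mathrm{ALG}}
  \;&\leq\; \alpha\cdot\sum_{j\in[n]}\omega_{j}C_{j} \;+\; \alpha\cdot\sum_{j\in[n]}\omega_{j} \\
  \;&\leq\; \alpha\cdot\Big(2\cdot\mathrm{OPT}-\sum_{j\in[n]}\omega_{j}\Big) \;+\; \alpha\cdot\sum_{j\in[n]}\omega_{j} \\
  \;&=\; 2\alpha\cdot\mathrm{OPT}.
\end{aligned}
```

This also explains why the hypothesis of Lemma 8 is stated as $\alpha(x+1)$ rather than $\alpha x$: the additive $\alpha$ per unit of weight is paid for by the term $-\sum_{j\in[n]}\omega_{j}$ in Lemma 5.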

3.4 Main Theorem

Using the previous lemmata and theorems, we prove Theorem 1. The proof follows by application of the framework from Lemma 8 to selected edge allocation algorithms.

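Theorem 1 (restated).

There is a polynomial time algorithm achieving a $\frac{140}{41}$-approximation ($\frac{140}{41}<3.415$) for Coflow Scheduling without release dates.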

Proof.

For completeness, we restate the algorithm which has implicitly been described earlier. Given some coflow instance $\mathcal{I}$ , we first determine deadlines using the procedure described in Section 3.1. We then apply the two edge allocation algorithms $\mathrm{Greedy}$ and $\mathrm{CBF}^{6}$ to the deadlines to obtain two feasible coflow schedules and return the schedule with lower total cost.

Calling this algorithm $\mathrm{ALG}$ , its cost $C_{\mathrm{ALG}}(\mathcal{I})$ is thus given as $\min\{C_{\mathrm{Greedy}}(\mathcal{I}),C_{\mathrm{CBF}^{6}}(\mathcal{I})\}$ . We aim to use Lemma 8 to bound the approximation ratio of $\mathrm{ALG}$ . For this purpose, let $f_{G}$ be a function capturing an upper bound on the deadline delay of $\mathrm{Greedy}$ in the sense required by Lemma 8 and respectively $f_{\mathrm{CBF}}$ for $\mathrm{CBF}^{6}$ . From Lemma 6 and Theorem 7 we obtain that

f_{G}(x)\quad=\quad 2x-1\qquad\text{ and }\qquad f_{\mathrm{CBF}}(x)\quad=\quad\frac{4}{3}x+\frac{31}{6}

holds. Let $\lambda_{1}:=23/41$ and $\lambda_{2}:=18/41$ . This yields:

\lambda_{1}f_{G}(x)+\lambda_{2}f_{\mathrm{CBF}}(x)\quad=\quad\left(2\cdot\tfrac{23}{41}+\tfrac{4}{3}\cdot\tfrac{18}{41}\right)x+\left(\tfrac{31}{6}\cdot\tfrac{18}{41}-\tfrac{23}{41}\right)\quad=\quad\tfrac{70}{41}(x+1)

So the requirements of Lemma 8 are fulfilled with $\alpha=\tfrac{70}{41}$ , which implies that $\mathrm{ALG}$ is a $2\cdot\tfrac{70}{41}=\tfrac{140}{41}<3.415$ -approximation algorithm for Coflow Scheduling without release dates. $\hfill\blacktriangleleft$
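The arithmetic behind the choice $\lambda_{1}=23/41$ can be checked with exact fractions. The sketch below derives the weight by requiring that slope and intercept of the convex combination coincide, which is one natural way to reach the form $\alpha(x+1)$ demanded by Lemma 8. The function names are assumptions made for this illustration.

```python
from fractions import Fraction


def f_greedy(x):
    return 2 * x - 1


def f_cbf(x):
    return Fraction(4, 3) * x + Fraction(31, 6)


def balanced_lambda():
    """Weight on Greedy for which slope and intercept of the mix are equal.

    The mix lam * f_greedy + (1 - lam) * f_cbf has
      slope     = 4/3  + (2/3)  * lam
      intercept = 31/6 - (37/6) * lam
    and setting both equal gives a linear equation in lam.
    """
    return (Fraction(31, 6) - Fraction(4, 3)) / (Fraction(2, 3) + Fraction(37, 6))


lam = balanced_lambda()                           # weight on Greedy
alpha = 2 * lam + Fraction(4, 3) * (1 - lam)      # common slope and intercept


def g(x):
    return lam * f_greedy(x) + (1 - lam) * f_cbf(x)


ratio = 2 * alpha                                 # factor 2 from the deadlines
```

The computation reproduces $\lambda_{1}=\tfrac{23}{41}$, $\alpha=\tfrac{70}{41}$, and the ratio $\tfrac{140}{41}$.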

4 Integral Edge Assignments with Guarantees

In this section we introduce and analyze algorithms which allocate edges of coflows to time-slots. They work on coflow deadlines fulfilling certain structural properties and their goal is to provide feasible schedules together with guarantees on the average delay each coflow experiences. Similar strategies are also used in most of the previous $4$ -approximations for Coflow Scheduling. We introduce the algorithms $\mathrm{Greedy}$ and $\mathrm{CBF}^{\tau}$ and show their guarantees in Lemma 6 and Theorem 7.

4.1 Greedy Scheduling

We start by introducing and analyzing a greedy allocation algorithm $\mathrm{Greedy}$ , which is one of the edge allocation procedures used in previous works to achieve a $4$ -approximation. Let $C_{1}\leq C_{2}\leq\cdots\leq C_{n}$ be deadlines for which LP I is feasible. $\mathrm{Greedy}$ schedules all coflows consecutively, starting with $E_{1}$ up to $E_{n}$ . Each edge is simply scheduled in a work-conserving way, meaning that it is scheduled in the earliest possible time-slot in which both its vertices are free. By doing this for all edges, a schedule is obtained. For $j\in[n]$ , let $\mathrm{Greedy}(C_{j})$ be the finishing time of coflow $E_{j}$ in the schedule obtained from this procedure.

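Lemma 6 (restated).

For given deadlines $C_{1},\dotsc,C_{n}$ for which LP I is feasible there is an algorithm $\mathrm{Greedy}$ returning a valid coflow schedule such that the following holds.

\mathrm{Greedy}(C_{j})\quad\leq\quad 2C_{j}-1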

Proof.

Consider some fixed $j\in[n]$ and let $e=(u,v)\in E_{j}$ be an edge on which coflow $j$ finishes. As LP I is feasible for the deadlines, summing constraint $(II)$ over the blocks $1,\dotsc,j$ shows that at most $C_{j}$ edges from $\cup_{i\leq j}E_{i}$ can be adjacent to $u$, and likewise for $v$. Excluding $e$ itself, this implies that at most $\lfloor C_{j}-1\rfloor$ edges in $(\cup_{i\leq j}E_{i})\setminus\{e\}$ can be adjacent to each of $u$ and $v$. So these edges can block at most $2(\lfloor C_{j}-1\rfloor)\leq 2C_{j}-2$ time slots, which implies that $\mathrm{Greedy}$ schedules $e$ at the latest in slot $2C_{j}-1$. $\hfill\blacktriangleleft$

$\mathrm{Greedy}$ provides a strict deadline guarantee for each coflow, meaning that in the schedule produced every coflow finishes the latest at the provided bound. This also implies that the same guarantee holds when taking weighted sums over the finishing times. Note that $2C_{j}-1=(2-\frac{1}{C_{j}})C_{j}$ , so for small $C_{j}$ this gives a tangible improvement over the factor $2$ .
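The work-conserving rule of $\mathrm{Greedy}$ can be sketched directly. The code below processes the coflows in the given order (assumed to be sorted by deadline) and places each edge in the earliest slot in which both endpoints are free. The list-based encoding and the function name are illustrative assumptions.

```python
from collections import defaultdict


def greedy_schedule(coflows):
    """Earliest-fit allocation of edges to time slots.

    coflows: list of coflows in order of nondecreasing deadline, each a list
    of (u, v) edges with u on the left side and v on the right side.
    Returns schedule[j][i], the slot of the i-th edge of the j-th coflow.
    """
    busy_left = defaultdict(set)   # slots already used at each left vertex
    busy_right = defaultdict(set)  # slots already used at each right vertex
    schedule = []
    for edges in coflows:
        slots = []
        for u, v in edges:
            t = 1
            while t in busy_left[u] or t in busy_right[v]:
                t += 1
            busy_left[u].add(t)
            busy_right[v].add(t)
            slots.append(t)
        schedule.append(slots)
    return schedule


coflows = [[("a", "x"), ("b", "y")], [("a", "y"), ("a", "x")]]
schedule = greedy_schedule(coflows)
finish = [max(slots) for slots in schedule]
```

On this made-up instance the coflows finish at times 1 and 3. LP I is feasible for the deadlines $(1,3)$, and both finishing times respect the bound $2C_{j}-1$ of Lemma 6.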

4.2 Iterated Rounding using Beck-Fiala

This section gives a proof of Theorem 7. We start by providing a full description of the algorithm, then we give a preliminary analysis and subsequently strengthen the guarantee through further refinements.

Procedure Idea

Given some deadlines $C_{1}\leq C_{2}\leq\dotsc\leq C_{n}$ for which LP I is feasible, the core idea is to round these deadlines to the next integer multiple of some parameter $\tau\in\mathbb{N}_{\geq 2}$ and to then form blocks between consecutive rounded deadlines. With these blocks and associated deadlines, we show that it is possible to allocate all edges to blocks while only violating the block size by a small additive constant. Given such an allocation, using the guarantee provided by Kőnig’s Theorem (Theorem 4) there exists a feasible schedule containing all assigned edges within the maximum vertex load of each block. Through the rounding and the increase in blocks’ sizes the finishing times of the coflows are delayed with respect to their deadlines. We are however able to show strong bounds on this delay.
We call this algorithm $\mathrm{CBF}^{\tau}$ due to its close association with the proof of the Beck-Fiala Theorem [4].
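The scheduling step inside a single block can be illustrated by the classical alternating-path argument behind Kőnig's Theorem. The sketch below is one standard way to realise it and is not claimed to be the authors' implementation; the function name `konig_edge_coloring` and the vertex tagging are assumptions. Colors play the role of time slots within a block.

```python
def konig_edge_coloring(edges):
    """Color the edges (u, v) of a bipartite multigraph with max-degree many
    colors so that edges sharing an endpoint receive distinct colors."""
    def ends(j):
        u, v = edges[j]
        return ("in", u), ("out", v)

    degree = {}
    for j in range(len(edges)):
        for x in ends(j):
            degree[x] = degree.get(x, 0) + 1
    delta = max(degree.values(), default=0)
    used = {x: {} for x in degree}  # used[x][c] = edge colored c at vertex x
    color = [None] * len(edges)

    for i in range(len(edges)):
        left, right = ends(i)
        a = next(c for c in range(delta) if c not in used[left])
        if a in used[right]:
            b = next(c for c in range(delta) if c not in used[right])
            # Walk the path from `right` whose colors alternate a, b, a, ...
            path, x, c = [], right, a
            while c in used[x]:
                j = used[x][c]
                path.append(j)
                p, q = ends(j)
                x = q if x == p else p
                c = b if c == a else a
            # Swap a and b along the path; afterwards a is free at both ends.
            for j in path:
                for y in ends(j):
                    del used[y][color[j]]
            for j in path:
                color[j] = b if color[j] == a else a
                for y in ends(j):
                    used[y][color[j]] = j
        color[i] = a
        for y in ends(i):
            used[y][a] = i
    return color, delta


slots, block_length = konig_edge_coloring([(0, 0), (0, 1), (1, 0), (1, 1)])
```

The number of colors equals the maximum vertex load, so a block whose load constraints are exceeded by an additive constant grows by exactly that constant.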

Edge-to-Block Allocation LP

We use a rounding technique inspired by the proof of the Beck-Fiala Theorem. Let $\tau\in\mathbb{N}_{\geq 2}$ be a fixed constant. For $j\in[n]$ let $\bar{C}_{j}$ be the deadline $C_{j}$ rounded up to the next integer multiple of $\tau$ . The blocks’ sizes are defined by the distance between two non-equal consecutive deadlines. So for $\bar{C}_{j}\neq\bar{C}_{j-1}$ , block $j$ has size $\bar{C}_{j}-\bar{C}_{j-1}$ . We define the following LP, which models the allocation of coflow edges to blocks. One can assume without loss of generality that all rounded deadlines are distinct and that there are $n$ of them, as coflows whose rounded deadlines are equal can be joined in this step.

\displaystyle\begin{aligned} \sum_{b\in[n]}x_{e,b}\quad&=\quad 1&&\forall e\in E&\qquad(I)\\ \sum_{e:v\in e}x_{e,b}\quad&\leq\quad\bar{C}_{b}-\bar{C}_{b-1}&&\forall v\in V,\forall b\in[n]&\qquad(II)\\ x_{e,b}\quad&=\quad 0\quad&&\forall j\in[n],\forall e\in E_{j},\forall b>j\\ x_{e,b}\quad&\geq\quad 0\end{aligned}

(LP CBF)

The structure of LP CBF is identical to LP I, though the special form of the rounded deadlines induces some additional properties. By definition $\bar{C}_{j}\geq C_{j}$ for all $j\in[n]$ . Additionally, with $\bar{C}_{0}=0$ , for each $b\in[n]$ we have $\bar{C}_{b}-\bar{C}_{b-1}=k_{b}\cdot\tau$ , for some $k_{b}\in\mathbb{N}$ . We identify each edge and vertex in the coflow instance with the respective variable in the LP and use both terms interchangeably.
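The rounding step and the resulting block sizes $k_{b}\cdot\tau$ can be sketched as follows. The helper name `rounded_blocks` and the sample deadlines are hypothetical; coflows with equal rounded deadlines are merged into one block, as described above.

```python
import math


def rounded_blocks(deadlines, tau):
    """Round each deadline up to the next multiple of tau and return the
    rounded deadlines together with the sizes of the resulting blocks."""
    rounded = [tau * math.ceil(c / tau) for c in deadlines]
    distinct = sorted(set(rounded))            # merge equal rounded deadlines
    starts = [0] + distinct[:-1]               # uses \bar{C}_0 = 0
    sizes = [hi - lo for lo, hi in zip(starts, distinct)]
    return rounded, sizes


rounded, sizes = rounded_blocks([1, 3, 4.5, 13], tau=4)
# rounded == [4, 4, 8, 16]; sizes == [4, 4, 8], i.e. k_b = 1, 1, 2
```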

We claim that feasibility of LP I for $C_{1},\dotsc,C_{n}$ directly implies feasibility of LP CBF for $\bar{C}_{1},\dotsc,\bar{C}_{n}$ . For LP I, increasing the value of any deadline without violating the total order can only increase the feasible region, as the equal zero constraints get less restrictive and any possible excess assignment can be shifted between the two adjacent blocks whose size changes. Therefore, as all deadlines can only increase, LP CBF also has to be feasible.

LP Rounding

We describe a procedure which finds an integral edge-to-block assignment violating the block size constraint $(II)$ by a constant amount. To achieve this, we start with an initial solution to the LP and then successively refine and resolve the LP, until we have obtained an integral solution fulfilling certain strong properties.

From now on we assume that we started with deadlines $C_{1},\dotsc,C_{n}$ for which LP I is feasible, so we know that for $\bar{C}_{1},\dotsc,\bar{C}_{n}$ LP CBF is feasible. After obtaining an initial solution to LP CBF, we take two steps. We first fix all integral edges and remove their respective variables from the LP and modify the right hand side of $(II)$ accordingly and then we delete all constraints from $(II)$ with at most $k-1$ fractional variables remaining, for some fixed number $k\in\mathbb{N}$ . This corresponds to dropping the degree constraint on the respective vertex if at most $k-1$ adjacent edges are still fractional. Let $\mathcal{E}_{b}$ be the set of all fractional edges contained in block $b$ and let $\mathcal{V}_{b}$ be the vertices in block $b$ with at least $k$ fractional adjacent edges. Let $S_{v,b}$ be the set of already fixed edges adjacent to $v$ in block $b$ . This gives rise to the following resulting LP:

\displaystyle\begin{aligned} \sum_{b\in[n]}x_{e,b}\quad&=\quad 1&&\forall e\in\bigcup_{b\in[n]}\ \mathcal{E}_{b}&\quad(I)\\ \sum_{e\in\mathcal{E}_{b}:v\in e}x_{e,b}\quad&\leq\quad k_{b}\cdot\tau-|S_{v,b}|\quad&&\forall b\in[n],v\in\mathcal{V}_{b}&\quad(II)\\ x_{e,b}\quad&\geq\quad 0&&\forall b\in[n],e\in\mathcal{E}_{b}\end{aligned}

If we can show that this LP always contains strictly more variables than it contains constraints in $(I)$ and $(II)$ , by considering a basic feasible solution, we obtain at least one more integral variable, so repeating the fixing variables and removing constraints step leads to at least one more fixed variable. Therefore in a polynomial number of steps the procedure must terminate and we obtain an integral solution. The step in which we drop constraints means that this integral solution is most likely not feasible for the original LP, but we later show that the amount of violation cannot be very large, which yields the desired approximation behaviour.

Constraints and Variables

We want to show that the number of constraints is strictly smaller than the number of variables. The total number of variables in the LP is equal to $\sum_{b\in[n]}|\mathcal{E}_{b}|$ . The number of constraints in $(I)\cup(II)$ is equal to $|\bigcup_{b\in[n]}\mathcal{E}_{b}|+\sum_{b\in[n]}|\mathcal{V}_{b}|$ . We show two bounds which enable us to establish the desired inequality.

Lemma 9.

For all $b\in[n]:$ $\quad|\mathcal{V}_{b}|\quad\leq\quad\frac{2}{k}|\mathcal{E}_{b}|$

Proof.

As by definition each vertex in $\mathcal{V}_{b}$ has at least $k$ fractional adjacent edges, we obtain the following.

|\mathcal{V}_{b}|\cdot k\quad\leq\quad|\{(v,e)\ |\ v\in\mathcal{V}_{b},\ e\in\delta_{E}(v)\cap\mathcal{E}_{b}\}|

Each edge contains exactly two vertices, so we additionally have the following inequality:

|\{(v,e)\ |\ v\in\mathcal{V}_{b},\ e\in\delta_{E}(v)\cap\mathcal{E}_{b}\}|\quad\leq\quad 2\cdot|\mathcal{E}_{b}|

Combining the two inequalities gives the result. $\hfill\blacktriangleleft$

Lemma 10.

We have $\quad|\bigcup_{b\in[n]}\mathcal{E}_{b}|\quad\leq\quad\frac{1}{2}\sum_{b\in[n]}|\mathcal{E}_{b}|$

Proof.

For arbitrary sets, the bound is only true without the factor $\tfrac{1}{2}$ on the right hand side. Equality is reached exactly when all elements are unique. In our case, whenever there is a fractional edge, due to constraint (I), at least one other variable associated to this edge in another block has to be fractional as well. Hence, these contribute at least twice to the right hand side and only once to the left hand side, which gives the inequality. $\hfill\blacktriangleleft$

Combining the two lemmata, we obtain:

|\mathrm{Cons}|\;=\;|\bigcup_{b\in[n]}\mathcal{E}_{b}|+\sum_{b\in[n]}|\mathcal{V}_{b}|\;\leq\;\frac{1}{2}\sum_{b\in[n]}|\mathcal{E}_{b}|+\frac{2}{k}\sum_{b\in[n]}|\mathcal{E}_{b}|\;=\;\left(\frac{1}{2}+\frac{2}{k}\right)|\mathrm{Vars}|

So for all $k>4$ a strict inequality follows. For $k=4$ we have the inequality $|\mathrm{Cons}|\leq|\mathrm{Vars}|$ . We can however still achieve a strict inequality for this case by slightly modifying the LP. In its current form, the LP is given without an objective function. We can thus remove one constraint from $(II)$ and shift it to the objective function instead. This reduces the number of constraints by one without changing the number of variables. If $b,v\in\mathcal{V}_{b}$ are the parameters corresponding to the chosen inequality, the added objective function is $\min\sum_{e\in\mathcal{E}_{b}:v\in e}x_{e,b}$ . From the minimization objective it follows that feasible optimal points of the modified LP are feasible for the original LP, as the removed constraint cannot be violated.
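The counting argument can be checked on a concrete fractional support. The sketch below assumes the support is given as a set of (edge index, block) pairs in which every fractional edge appears in at least two blocks, as guaranteed by constraint $(I)$; the function name and the star instance are hypothetical.

```python
from fractions import Fraction


def count_constraints_and_variables(edges, fractional, k):
    """edges[i] = (u, v); fractional = set of (i, b) pairs whose variable is
    still strictly between 0 and 1; k = threshold for keeping constraints."""
    edges_in_block = {}  # b -> set of fractional edge indices (E_b)
    for i, b in fractional:
        edges_in_block.setdefault(b, set()).add(i)
    num_vars = sum(len(ids) for ids in edges_in_block.values())
    num_cons = len({i for i, _ in fractional})  # constraints (I)
    for ids in edges_in_block.values():
        load = {}  # vertex -> number of fractional adjacent edges in block
        for i in ids:
            u, v = edges[i]
            for x in (("in", u), ("out", v)):
                load[x] = load.get(x, 0) + 1
        num_cons += sum(1 for d in load.values() if d >= k)  # constraints (II)
    return num_cons, num_vars


# Hypothetical support: a star of four edges, each split over blocks 1 and 2.
star = [(0, 0), (0, 1), (0, 2), (0, 3)]
support = {(i, b) for i in range(4) for b in (1, 2)}
cons, nvars = count_constraints_and_variables(star, support, k=4)
holds = cons <= (Fraction(1, 2) + Fraction(2, 4)) * nvars
```

Here the count yields 6 constraints against 8 variables, so a basic feasible solution must contain at least one further integral variable.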

Delay Bound

Looking at the integral assignment, we can show that the violation of constraints of LP CBF is small. Note that the following statement only requires integrality of the deadlines and not the special structure of the rounded deadlines.

Lemma 11.

Given integral deadlines $C_{1},\dotsc,C_{n}$ for which LP CBF is feasible and a parameter $\tau\in\mathbb{N}_{\geq 2}$ , in polynomial time we can find an integral point such that:

a)

All constraints $(II)$ in LP CBF are exceeded by at most $2$ .
b)

All other constraints in LP CBF are fulfilled.

Proof.

At all times during the procedure, the intermediate LP solutions fulfill the constraints not in $(II)$ , so b) follows. Recall that we never change variables once they are integral and that we only remove a constraint from $(II)$ if the number of fractional variables in its sum is at most $k-1$ . This shows that each constraint in $(II)$ can be violated by an additive term of at most $k-1$ . In fact, as the remaining fractional variables have a strictly positive sum, the sum over the already fixed integral variables at the time of removal is at most the block size minus one, which implies that the violation is at most $k-2$ . For the choice $k=4$ , a) follows. $\hfill\blacktriangleleft$

Using this result, we can show an upper bound on the maximum delay for each deadline when applying $\mathrm{CBF}^{\tau}$ . In total, we obtain the following lemma, which at this point is slightly weaker than required for Theorem 7, as there is an additive constant of $\tau+2$ instead of $\tau/2+2.5-\frac{2}{\tau}$ . Nevertheless, the guarantee in this form would already be sufficient to gain a significant improvement over the factor $4$ -approximation. Like in the case of $\mathrm{Greedy}$ , $\mathrm{CBF}^{\tau}(C_{j})$ denotes the finishing time of coflow $E_{j}$ in the schedule created by $\mathrm{CBF}^{\tau}$ .

Lemma 12.

For given deadlines $C_{1},\dotsc,C_{n}$ for which LP I is feasible and a parameter $\tau\in\mathbb{N}_{\geq 2}$ , there is an algorithm $\mathrm{CBF}^{\tau}$ returning a valid coflow schedule such that the following holds for all $j\in[n]$ .

\mathrm{CBF}^{\tau}(C_{j})\quad\leq\quad\frac{\tau+2}{\tau}C_{j}+\tau+2

Proof.

The algorithm is given by setting the deadlines to the next integer multiple of $\tau$ and then doing the iterated rounding procedure. Given the edge-to-block assignments, a valid schedule can be obtained using Kőnig’s Theorem (Theorem 4).
Consider some fixed $C_{j}$ and let $k\in\mathbb{N}$ and $a\in[0,\tau)$ such that $C_{j}=k\cdot\tau+a$ . Assume for now that $a>0$ , then $\bar{C}_{j}=(k+1)\tau$ . By definition, the deadline $\bar{C}_{j}$ forms the $j$ -th block. From Lemma 11 it follows that each block’s size increases by at most $2$ , so the latest possible time at which coflow $j$ finishes is $(k+1)\tau+2j$ . As each block has size at least $\tau$ , we have $j\leq(k+1)$ , so $(k+1)\tau+2j\leq(k+1)(\tau+2)$ . Bounding this yields:

(k+1)(\tau+2)\;\leq\;(\tau+2)k+\frac{\tau+2}{\tau}\cdot a+\tau+2\;=\;\frac{\tau+2}{\tau}C_{j}+\tau+2

For $a=0$ , the argument simplifies. One has $\bar{C}_{j}=k\cdot\tau$ and hence a finishing time upper bound of $k(\tau+2)$ . In this case a stronger bound of $\frac{\tau+2}{\tau}C_{j}$ follows. $\hfill\blacktriangleleft$
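As a quick numerical sanity check of Lemma 12, the sketch below compares the worst-case finishing time used in the proof with the stated bound. The helper names are hypothetical, and exact rational arithmetic is used to avoid rounding issues.

```python
import math
from fractions import Fraction


def cbf_worst_case(c, tau):
    """Worst case from the proof: the rounded deadline plus two extra slots
    for each of the at most ceil(c / tau) blocks up to that deadline."""
    blocks = math.ceil(Fraction(c) / tau)
    return blocks * (tau + 2)


def lemma12_bound(c, tau):
    """Right-hand side of Lemma 12."""
    return Fraction(tau + 2, tau) * Fraction(c) + tau + 2


# Example with tau = 4 and C_j = 5: the deadline is rounded to 8 (two blocks).
worst = cbf_worst_case(5, 4)
bound = lemma12_bound(5, 4)
```

For $\tau=4$ and $C_{j}=5$ this gives a worst case of $12$ against a bound of $13.5$.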

Reducing the Additive Constant

The additive constant $\tau$ in Lemma 12 assumes the worst case for each deadline, meaning that every deadline gets shifted from the very start to the very end of a block. We show that an averaging argument can be used to reduce the average amount of shift to $\tau/2+\frac{1}{2}-\frac{2}{\tau}$ . This requires that we show the bound across the weighted sum over all deadlines, unlike the previous proofs which established hard upper bounds for each individual deadline.

For $\lambda\in\mathbb{N}$ , we consider a variant $\mathrm{CBF}^{\tau}_{\lambda}$ of $\mathrm{CBF}^{\tau}$ where an additional first block of fixed size $\lambda$ is inserted. This is equivalent to rounding the deadlines to the next larger term in the sequence $\{\lambda+i\cdot\tau\}_{i\in\mathbb{N}}$ . Note that by simple modification of the arguments, the feasibility statements for LP CBF still apply. Lemma 11 is thus also applicable.

This change to the deadline rounding step can change the finishing time of deadlines in our procedure. On the one hand, the delay of some deadlines might increase, as the last time slot of their respective blocks gets increased. On the other hand, the delay of some deadlines might decrease, as they now get included in an earlier block. We show the following.

See Theorem 7.

Proof.

The algorithm tries all $\lambda\in\{0,2,\dotsc,\tau-1,\tau+1\}$ and returns the solution with lowest total cost. We can upper bound this cost by instead considering a uniformly random $\lambda\in\{0,2,\dotsc,\tau-1,\tau+1\}$ and calculating the expected cost. Consider some fixed $C_{j}$ and let $k\in\mathbb{N},b\in\{0,1,\dotsc,\tau-1\},a\in[0,1)$ such that $C_{j}=k\cdot\tau+b+a$ . We assume for now that both $a\neq 0$ and $b\neq 0$ . We write $\bar{C}_{j}(\lambda)$ to denote the smallest term in the sequence $\{\lambda+i\cdot\tau\}_{i\in\mathbb{N}}$ which is greater than or equal to $C_{j}$ . The term $\bar{C}_{j}(\lambda)$ essentially captures the last time slot of the block to which $C_{j}$ will be assigned. In the base case $\lambda=0$ , this is equivalent to the rounding arguments used in Lemma 12. For $\lambda>0$ , compared to the base case, the index of the block to which $C_{j}$ is assigned might increase or stay the same, depending on the values of $\lambda$ and $b$ . If $\lambda\leq b$ , $C_{j}$ gets assigned to the same block, but an additional block is inserted at the start, while for $\lambda>b$ , $C_{j}$ moves one block earlier, so the index stays the same. Considering the different cases one by one, we obtain:
In case $\lambda=0$ , we have $\bar{C}_{j}(\lambda)=(k+1)\tau$ and by the same arguments as used in the proof of Lemma 12 we obtain

\mathrm{CBF}^{\tau}_{\lambda}(C_{j})\;\leq\;(k+1)(\tau+2)\;=\;\frac{\tau+2}{\tau}C_{j}+\tau+2-\frac{\tau+2}{\tau}(a+b).

In the case $\lambda\in\{2,\dotsc,b\}$ , we have $\bar{C}_{j}(\lambda)=(k+1)\tau+\lambda$ and the index of $C_{j}$ ’s block increases by one. This gives:

\mathrm{CBF}^{\tau}_{\lambda}(C_{j})\;\leq\;(k+1)(\tau+2)+(\lambda+2)\;=\;\frac{\tau+2}{\tau}C_{j}+\tau+2-\frac{\tau+2}{\tau}(a+b)+(\lambda+2)

In the case $\lambda\in\{b+1,\dotsc,\tau-1\}$ , we have $\bar{C}_{j}(\lambda)=k\tau+\lambda$ and the block index stays the same, thus:

\mathrm{CBF}^{\tau}_{\lambda}(C_{j})\;\leq\;k(\tau+2)+\lambda+2\;=\;\frac{\tau+2}{\tau}C_{j}+\tau+2-\frac{\tau+2}{\tau}(a+b)-(\tau-\lambda)

In the case $\lambda=\tau+1$ , as we assumed $a,b\neq 0$ , we have $\bar{C}_{j}(\lambda)=(k+1)\tau+1$ and the block index stays the same, so we have:

\mathrm{CBF}^{\tau}_{\lambda}(C_{j})\;\leq\;(k+1)(\tau+2)+1\;=\;\frac{\tau+2}{\tau}C_{j}+\tau+2-\frac{\tau+2}{\tau}(a+b)+1

Every right hand side contains the same term $\hat{C}_{j}:=\frac{\tau+2}{\tau}C_{j}+\tau+2-\frac{\tau+2}{\tau}(a+b)$ , which is independent of $\lambda$ , so we define a random variable $D_{j}(\lambda):=\mathrm{CBF}^{\tau}_{\lambda}(C_{j})-\hat{C}_{j}$ which captures the respective remaining terms. Define $L_{1}=\{2,\dotsc,b\}$ and $L_{2}=\{b+1,\dotsc,\tau-1\}$ . Then we have

\displaystyle\begin{aligned}
\mathbb{E}[D_{j}]\;&\leq\;\frac{b-1}{\tau}\cdot\mathbb{E}[2+\lambda\mid\lambda\in L_{1}]\;-\;\frac{\tau-1-b}{\tau}\cdot\mathbb{E}[\tau-\lambda\mid\lambda\in L_{2}]\;+\;\frac{1}{\tau}\cdot\mathbb{E}[1\mid\lambda\in\{\tau+1\}]\\
&=\;\frac{b-1}{\tau}\left(\frac{1}{b-1}\sum_{\lambda\in L_{1}}(2+\lambda)\right)\;-\;\frac{\tau-1-b}{\tau}\left(\frac{1}{\tau-1-b}\sum_{\lambda\in L_{2}}(\tau-\lambda)\right)\;+\;\frac{1}{\tau}\\
&=\;\frac{1}{\tau}\left(2(b-1)+\frac{b(b+1)}{2}-1-\frac{1}{2}(\tau-b)(\tau-1-b)\right)\;+\;\frac{1}{\tau}\\
&=\;\frac{2b}{\tau}-\frac{\tau}{2}-\frac{2}{\tau}+\frac{1}{2}+b.
\end{aligned}

So overall, we obtain

\displaystyle\begin{aligned}
\mathbb{E}[\mathrm{CBF}^{\tau}_{\lambda}(C_{j})]\;&\leq\;\hat{C}_{j}+\mathbb{E}[D_{j}]\\
&=\;\frac{\tau+2}{\tau}C_{j}+\tau+2-\frac{\tau+2}{\tau}(a+b)+\left(\frac{2b}{\tau}-\frac{\tau}{2}-\frac{2}{\tau}+\frac{1}{2}+b\right)\\
&=\;\frac{\tau+2}{\tau}C_{j}+\frac{\tau}{2}+2.5-\frac{2}{\tau}-(1+\tfrac{2}{\tau})a.
\end{aligned}

As $a\in(0,1)$ , the result follows. For $a=0$ or $b=0$ , by checking all the cases one obtains that in each case, the bound stays the same or improves, so the theorem follows. $\hfill\blacktriangleleft$
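The closed form for the expected offset can be verified by enumerating the candidate values of $\lambda$ directly. The following sketch encodes the four cases from the proof for $a,b\neq 0$; all function names are hypothetical.

```python
from fractions import Fraction


def offsets(tau):
    """The tau candidate first-block sizes {0, 2, ..., tau-1, tau+1}."""
    return [0] + list(range(2, tau)) + [tau + 1]


def delay_term(lam, b, tau):
    """Upper bound on D_j(lambda) from the four cases of the proof."""
    if lam == 0:
        return 0
    if lam == tau + 1:
        return 1
    if lam <= b:                 # lambda in L_1 = {2, ..., b}
        return lam + 2
    return -(tau - lam)          # lambda in L_2 = {b+1, ..., tau-1}


def expected_delay(b, tau):
    """Average of the case bounds over a uniformly random lambda."""
    lams = offsets(tau)
    return Fraction(sum(delay_term(lam, b, tau) for lam in lams), len(lams))


def closed_form(b, tau):
    """The expression 2b/tau - tau/2 - 2/tau + 1/2 + b derived above."""
    return (Fraction(2 * b, tau) - Fraction(tau, 2) - Fraction(2, tau)
            + Fraction(1, 2) + b)


matches = all(expected_delay(b, tau) == closed_form(b, tau)
              for tau in range(2, 21) for b in range(1, tau))
```

The check uses exact fractions, so agreement is not affected by floating-point error.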

5 Coflow Scheduling with Release Dates

In this section we show how the scheduling framework can be extended to work for the case with release dates. Theorem 2 gives an extension of the guarantee provided by Theorem 1 to the case with release dates, though this comes at the cost of a worse approximation ratio.

See Theorem 2.

The overall proof structure is very similar to the case without release dates, just with tweaks at every step to account for the additional constraints. We provide the general outline here and omit some minor details which follow from modifications to the original arguments.

Coflow Deadlines

Like in the case of no release dates, we want to obtain deadlines for the coflows which obey some structural constraints. LP I does not contain release dates, but with some minor modifications we obtain a suitable LP. For this purpose, for some $\kappa\in[2n]$ , define a sequence $D_{1}\leq D_{2}\leq\dotsc\leq D_{\kappa}$ containing exactly all deadlines and release dates. For some edge $e\in E$ , let $r(e)$ be the index of the release date associated to $e$ in the chain of points in $D$ and respectively $d(e)$ the index of the deadline. Then we construct the following LP.

\displaystyle\begin{aligned} \sum_{s\in[\kappa]}x_{s,e}&&=&\quad 1&&\forall e\in E\\ \sum_{e:v\in e}x_{s,e}&&\leq&\quad D_{s}-D_{s-1}\quad&&\forall s\in[\kappa],\forall v\in V\\ x_{s,e}&&=&\quad 0&&\forall j\in[n],\forall e\in E_{j},\forall s\not\in\{r(e)+1,\dotsc,d(e)\}\quad\\ x_{s,e}&&\geq&\quad 0\end{aligned}

(LP R)

The structure of LP $R$ is very similar to LP I, just with added block separators for each release date and modification of the constraints to prevent edges from being scheduled in blocks before their respective release dates.

The same procedure by [14] used in Section 3.1 can be employed to obtain deadlines $C_{1},\dotsc,C_{n}$ for which LP $R$ is feasible and for which the same cost bound from Lemma 5 holds.

Edge Allocation

Given such a set of deadlines for which LP $R$ is feasible, we again describe two algorithms $\mathrm{Greedy}_{R}$ and $\mathrm{CBF}^{\tau}_{R}$ which provide feasible allocations for all edges. Their guarantees are slightly worse due to the added release date constraints.
Like previously, $\mathrm{Greedy}_{R}(C_{j})$ and $\mathrm{CBF}^{\tau}_{R}(C_{j})$ will be used to denote the finishing time of coflow $E_{j}$ in the schedule provided by the respective algorithm.

Lemma 13.

For given deadlines $C_{1},\dotsc,C_{n}$ for which LP $R$ is feasible there is an algorithm $\mathrm{Greedy}_{R}$ returning a valid coflow schedule such that the following holds for all $j\in[n]$ .

\mathrm{Greedy}_{R}(C_{j},r_{j})\quad\leq\quad r_{j}+2C_{j}-1

Proof.

The proof is essentially identical to the one of Lemma 6, just on a shifted interval. No edge of $E_{j}$ can be scheduled before $r_{j}$ . For the following $2\lceil C_{j}\rceil$ time slots the same vertex allocation argument applies, which leads to an upper bound of $r_{j}+2C_{j}-1$ . $\hfill\blacktriangleleft$

For the allocation procedure $\mathrm{CBF}^{\tau}_{R}$ , the guarantee worsens by an additive $\tau+2$ .

Lemma 14.

For given deadlines $C_{1},\dotsc,C_{n}$ for which LP $R$ is feasible, weights $\omega_{1},\dotsc,\omega_{n}$ , and a parameter $\tau\in\mathbb{N}_{\geq 2}$ , there is an algorithm $\mathrm{CBF}_{R}^{\tau}$ returning a valid coflow schedule such that the following holds.

\sum_{j\in[n]}\omega_{j}\cdot\mathrm{CBF}_{R}^{\tau}(C_{j})\quad\leq\quad\sum_{j\in[n]}\omega_{j}\left(\frac{\tau+2}{\tau}C_{j}+\frac{3}{2}\tau+4.5-\frac{2}{\tau}\right)

Proof.

The overall algorithm is almost identical to the one for the case of no release dates from Section 4.2. The main change is a different initial rounding.

Note that in order to obtain good guarantees, we need to ensure that the number of resulting blocks after rounding is not too large and that we have control over the blocks’ sizes. We therefore have to round both release dates and deadlines. Simply rounding both to the next multiple of $\tau$ would not suffice, as it could lead to infeasible LPs. For example, if for some $k\in\mathbb{N}$ , $r_{j}=k\cdot\tau+1$ and $C_{j}=(k+1)\cdot\tau-\epsilon$ , then rounding them in that way would lead to $r_{j}=C_{j}=(k+1)\cdot\tau$ . We instead round the release dates up to the next multiple of $\tau$ and the deadlines to the second next multiple of $\tau$ , meaning that we round them to the next multiple and then add an additional $\tau$ .
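The difference between the two rounding rules can be seen on the example above. The sketch below uses hypothetical helper names and the concrete values $\tau=4$, $k=1$, $\epsilon=0.1$.

```python
import math


def round_up(x, tau):
    """Next multiple of tau that is at least x."""
    return tau * math.ceil(x / tau)


def round_release_and_deadline(r, c, tau):
    """Release date to the next multiple of tau, deadline to the second next."""
    return round_up(r, tau), round_up(c, tau) + tau


# r_j = k*tau + 1 = 5 and C_j = (k+1)*tau - eps = 7.9 with tau = 4.
naive = (round_up(5, 4), round_up(7.9, 4))        # both collapse to 8
shifted = round_release_and_deadline(5, 7.9, 4)   # leaves a full block of size tau
```

With the naive rule the window between release date and deadline becomes empty, while the shifted rule always leaves at least one block of size $\tau$ for the coflow.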

In the case without release dates, it is not hard to show that rounding up the deadlines cannot make the resulting LP infeasible, as the feasible region only increases. In the present case, a bit more care is needed, as the rounding of the release dates could lead to parts of the feasible region becoming infeasible. A feasible point for the original LP can be transformed to one in this LP by interpreting the assignment as a time-continuous one and essentially shifting the allocation by $\tau$ . This means that if an edge was scheduled at some point in time $t$ , we now treat it as if it was scheduled at time $t+\tau$ . More details about these transformations and interpretations can be found in [9].

Given the feasibility of the LP for the rounded release dates and deadlines, the same iterated rounding approach from Section 4.2 can be used to obtain an integral feasible schedule which violates the respective degree bound constraints by at most $2$ . As the procedure never changes variables once they are integral, and as increasing the blocks’ sizes only moves assigned edges to later time slots, feasibility for the release date constraints is preserved.

For some given deadline $C_{j}$ , let $k\in\mathbb{N}$ and $a\in[0,\tau)$ such that $C_{j}=k\cdot\tau+a$ . Then $C_{j}$ gets rounded to $(k+2)\cdot\tau$ . There are at most $k+2$ blocks up to and including the block formed by $C_{j}$ , whose sizes all increase by at most two. This yields the following bound.

\mathrm{CBF}^{\tau}_{R}(C_{j})\;\leq\;(k+2)\tau+2(k+2)\;=\;(k+2)(\tau+2)\;\leq\;\frac{\tau+2}{\tau}C_{j}+2\tau+4

Like in the case of no release dates, this bound is slightly weaker than the one stated in the lemma. Using the same averaging strategy as employed before reduces the additive constant by $\tau/2-\tfrac{1}{2}+\frac{2}{\tau}$ , leading to the result. $\hfill\blacktriangleleft$

Framework and Approximation Bound

The algorithm for Coflow Scheduling with release dates again works by obtaining deadlines and then running several edge allocation algorithms on these and returning the cheapest solution among them. To bound the cost, a framework very similar to the one described in Lemma 8 is used, though an additional bound on the distance to the optimum cost is needed due to the presence of the $r_{j}$ summand in the guarantee provided by $\mathrm{Greedy}_{R}$ . As in any optimal solution the finishing time $\mathrm{OPT}_{j}$ of coflow $E_{j}$ has to be after $r_{j}$ , we have $r_{j}\leq\mathrm{OPT}_{j}-1$ .

In the following lemma, like in Lemma 8, let $f_{1},\dotsc,f_{k}$ be some functions capturing the edge allocation guarantees provided by some collection of algorithms $\mathrm{ALG}_{1},\dotsc,\mathrm{ALG}_{k}$ . In this case the functions additionally depend on a parameter $r_{j}\in\mathbb{R}_{\geq 0}$ , which, like in the guarantee for $\mathrm{Greedy}_{R}$ , captures the dependency on release dates.

Lemma 15.

Let $\lambda_{1},\dotsc,\lambda_{k}\geq 0$ with $\sum_{i\in[k]}\lambda_{i}=1$ and $a,b\in\mathbb{R}$ . If for all possible pairs $x\geq 1,r_{x}\in[0,x-1]$

\sum_{i\in[k]}\lambda_{i}f_{i}(x,r_{x})\quad\leq\quad a(x+1)+b(r_{x}+1),

then for all coflow instances $\mathcal{I}$ :

C_{\mathrm{ALG}}(\mathcal{I})\;=\;\min\{C_{\mathrm{ALG}_{1}}(\mathcal{I}),\dotsc,C_{\mathrm{ALG}_{k}}(\mathcal{I})\}\;\leq\;(2a+b)\cdot\mathrm{OPT}(\mathcal{I})

Proof.

Let $g(x)=\sum_{i\in[k]}\lambda_{i}f_{i}(x,r_{x})$ . Using the exact same argument as in the proof of Lemma 8 and inserting the upper bound on $g$ , we arrive at

C_{\mathrm{ALG}}\quad\leq\quad\sum_{j\in[n]}\omega_{j}g(C_{j})\quad\leq\quad a\sum_{j\in[n]}\omega_{j}C_{j}\ +\ a\sum_{j\in[n]}\omega_{j}\ +\ b\sum_{j\in[n]}\omega_{j}(r_{j}+1)

Inserting $\sum_{j\in[n]}\omega_{j}C_{j}\leq 2\cdot\mathrm{OPT}-\sum_{j\in[n]}\omega_{j}$ for the first sum and $r_{j}\leq\mathrm{OPT}_{j}-1$ into the last sum, and using $\sum_{j\in[n]}\omega_{j}\mathrm{OPT}_{j}=\mathrm{OPT}$ , we obtain

C_{\mathrm{ALG}}\quad\leq\quad(2a+b)\cdot\mathrm{OPT}

$\hfill\blacktriangleleft$

We can now apply this modified framework to show Theorem 2.

See Theorem 2.

Proof.

We use algorithms $\mathrm{Greedy}_{R}$ and $\mathrm{CBF}_{R}^{4}$ . For their edge allocation guarantees we have

f_{G}(C_{j},r_{j})\;\leq\;r_{j}+2C_{j}-1\qquad\text{and}\qquad f_{\mathrm{CBF}}(C_{j},r_{j})\;\leq\;\frac{3}{2}C_{j}+10.

For $\lambda_{1}=0.68$ and $\lambda_{2}=0.32$ we obtain

\lambda_{1}f_{G}(x,r_{x})+\lambda_{2}f_{\mathrm{CBF}}(x,r_{x})\quad\leq\quad 1.84(x+1)\;+\;0.68(r_{x}+1),

which by application of Lemma 15 gives a $4.36$ -approximation. $\hfill\blacktriangleleft$
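The arithmetic behind the constants can be reproduced with exact fractions. The sketch below takes the two guarantees at their upper bounds and checks the premise of Lemma 15 for $a=1.84$ and $b=0.68$; the function names are hypothetical.

```python
from fractions import Fraction as F


def f_greedy(x, r):
    """Upper bound on the guarantee of Greedy_R."""
    return r + 2 * x - 1


def f_cbf(x, r):
    """Upper bound on the guarantee of CBF_R^4 (independent of r)."""
    return F(3, 2) * x + 10


lam1, lam2 = F(68, 100), F(32, 100)
a, b = F(184, 100), F(68, 100)


def gap(x, r):
    """Right-hand side minus left-hand side of the premise of Lemma 15."""
    combined = lam1 * f_greedy(x, r) + lam2 * f_cbf(x, r)
    return a * (x + 1) + b * (r + 1) - combined


ratio = 2 * a + b  # approximation factor given by Lemma 15
```

With these weights the premise holds with equality for all $x$ and $r_{x}$, and the resulting factor is $2a+b=4.36$.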

6 Asymptotic $2+\epsilon$ Approximation

Theorem 1 establishes that there is an algorithm returning a $3.415$ -approximation for any given coflow input instance. In this section we show a stronger, asymptotically optimal, approximation for input instances with a certain structure. This result does not depend on the approximation framework but rather follows directly from the bounds established in Section 5. Note that to show $(2-\epsilon)$ -approximation hardness, the authors of [23] construct a sequence of instances for which the ratio between the sum over all weights and the optimum grows arbitrarily large, which shows that the asymptotic result in Theorem 3 is essentially optimal.

See Theorem 3.

Proof.

For a given instance $\mathcal{I}$ , using Lemma 5 we can obtain deadlines $C_{1},\dotsc,C_{n}$ feasible for LP I for which the (weakened) bound $\sum_{j\in[n]}\omega_{j}C_{j}\;\leq\;2\cdot\mathrm{OPT}$ holds.
Using the bound derived in Section 5, we know that there exists an algorithm $\mathrm{CBF}_{R}^{\tau}$ which can find a feasible coflow schedule for these deadlines such that for every $j\in[n]$ : $\mathrm{CBF}_{R}^{\tau}(C_{j})\leq\tfrac{\tau+2}{\tau}C_{j}+2\tau+4$ . Applying the algorithm to the deadlines yields:

\displaystyle\begin{aligned}
\sum_{j\in[n]}\omega_{j}\cdot\mathrm{CBF}^{\tau}_{R}(C_{j})\;&\leq\;\sum_{j\in[n]}\omega_{j}\left(\frac{\tau+2}{\tau}C_{j}+2\tau+4\right)\\
&=\;\left(1+\frac{2}{\tau}\right)\sum_{j\in[n]}\omega_{j}C_{j}\;+\;(2\tau+4)\sum_{j\in[n]}\omega_{j}\\
&\leq\;\left(2+\frac{4}{\tau}+\hat{\epsilon}(2\tau+4)\right)\mathrm{OPT}
\end{aligned}

So for any $\epsilon$ by appropriate choice of $\tau$ large enough and respectively $\hat{\epsilon}$ small enough, the result follows. $\hfill\blacktriangleleft$
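One possible way to pick the parameters is sketched below: half of the slack $\epsilon$ is spent on the term $4/\tau$ and the other half on the weight-sum term. The function name, the even split, and the callable `additive` (which returns the $\tau$-dependent constant multiplying the weight sum, passed in by the caller) are assumptions made for illustration.

```python
import math
from fractions import Fraction


def choose_parameters(eps, additive):
    """Pick tau and eps_hat with 2 + 4/tau + eps_hat * additive(tau) <= 2 + eps."""
    eps = Fraction(eps)
    tau = max(2, math.ceil(8 / eps))         # ensures 4 / tau <= eps / 2
    eps_hat = eps / (2 * additive(tau))      # ensures eps_hat * additive <= eps / 2
    factor = 2 + Fraction(4, tau) + eps_hat * additive(tau)
    return tau, eps_hat, factor


# Assumed additive term 2*tau + 4; any valid upper bound works the same way.
tau, eps_hat, factor = choose_parameters(Fraction(1, 10), lambda t: 2 * t + 4)
```

The required $\hat{\epsilon}$ shrinks roughly quadratically in $\epsilon$, which is why the guarantee is only asymptotic.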

Note that while the requirements in Theorem 3 are rather technical, it implies several strong results for natural classes of coflow instances, such as instances where all coflows have large maximum degree.

Corollary 16.

For any $\epsilon>0$ , there is $D\in\mathbb{N}$ such that there is a $(2+\epsilon)$ -approximation algorithm for Coflow Scheduling without release dates for instances $\mathcal{I}$ fulfilling

\forall E_{j}\in\mathcal{I}:\quad\Delta(E_{j})\hskip 5.0pt\geq\hskip 5.0ptD.

Proof.

In any schedule, the finishing time of a coflow is lower bounded by its maximum degree. Therefore the optimum cost will be at least $D$ times the sum of the weights. So for $D$ large enough such that $\hat{\epsilon}\cdot D\geq 1$ , Theorem 3 yields the result. $\hfill\blacktriangleleft$

References

[1] Saksham Agarwal, Shijin Rajakrishnan, Akshay Narayan, Rachit Agarwal, David Shmoys, and Amin Vahdat. Sincronia: near-optimal network design for coflows. In Proceedings of SIGCOMM, pages 16–29. ACM, 2018. doi:10.1145/3230543.3230569.
[2] Saba Ahmadi, Samir Khuller, Manish Purohit, and Sheng Yang. On Scheduling Coflows. In Proceedings of IPCO, volume 10328, pages 13–24. Springer International Publishing, 2017. doi:10.1007/978-3-319-59250-3_2.
[3] Nikhil Bansal. New Developments in Iterated Rounding. In Proceedings of FSTTCS, volume 29 of LIPIcs, pages 1–10, Dagstuhl, Germany, 2014. Schloss Dagstuhl – Leibniz-Zentrum für Informatik. doi:10.4230/LIPIcs.FSTTCS.2014.1.
[4] József Beck and Tibor Fiala. “Integer-making” theorems. Discrete Applied Mathematics, 3(1):1–8, 1981. doi:10.1016/0166-218x(81)90022-6.
[5] Maurizio Bonuccelli, Inder Gopal, and Chak-Kuen Wong. Incremental time-slot assignment in SS/TDMA satellite systems. IEEE Transactions on Communications, 39(7):1147–1156, 1991. doi:10.1109/26.87220.
[6] Mosharaf Chowdhury, Yuan Zhong, and Ion Stoica. Efficient coflow scheduling with Varys. In Proceedings of SIGCOMM, pages 443–454. ACM, 2014. doi:10.1145/2619239.2626315.
[7] Jeffrey Dean and Sanjay Ghemawat. MapReduce: simplified data processing on large clusters. Communications of the ACM, 51(1):107–113, 2008. doi:10.1145/1327452.1327492.
[8] Alexander Eckl, Luisa Peter, Maximilian Schiffer, and Susanne Albers. Minimization of Weighted Completion Times in Path-based Coflow Scheduling. arXiv:1911.13085 [cs], 2020. doi:10.48550/arXiv.1911.13085.
[9] Takuro Fukunaga. Integrality Gap of Time-Indexed Linear Programming Relaxation for Coflow Scheduling. In Proceedings of APPROX/RANDOM, volume 245 of LIPIcs, pages 36:1–36:13, Dagstuhl, Germany, 2022. Schloss Dagstuhl – Leibniz-Zentrum für Informatik. doi:10.4230/LIPIcs.APPROX/RANDOM.2022.36.
[10] Naveen Garg, Amit Kumar, and Vinayaka Pandit. Order Scheduling Models: Hardness and Algorithms. In Proceedings of FSTTCS, volume 4855, pages 96–107. Springer Berlin Heidelberg, 2007. doi:10.1007/978-3-540-77050-3_8.
[11] Runxin Guo, Yi Zhao, Quan Zou, Xiaodong Fang, and Shaoliang Peng. Bioinformatics applications on Apache Spark. GigaScience, 2018. doi:10.1093/gigascience/giy098.
[12] Anand Gupta, Hardeo Kumar Thakur, Ritvik Shrivastava, Pulkit Kumar, and Sreyashi Nag. A big data analysis framework using Apache Spark and deep learning. In Proceedings of ICDMW, pages 9–16. IEEE, 2017. doi:10.1109/icdmw.2017.9.
[13] Pankaj Gupta. Scheduling in input queued switches: A survey. Technical document, Stanford University, 1996.
[14] Sungjin Im, Benjamin Moseley, Kirk Pruhs, and Manish Purohit. Matroid Coflow Scheduling. In Proceedings of ICALP, volume 132 of LIPIcs, pages 145:1–145:14. Schloss Dagstuhl – Leibniz-Zentrum für Informatik, 2019. doi:10.4230/LIPIcs.ICALP.2019.145.
[15] Sungjin Im and Manish Purohit. A tight approximation for co-flow scheduling for minimizing total weighted completion time, 2018. arXiv:1707.04331.
[16] Hamidreza Jahanjou, Erez Kantor, and Rajmohan Rajaraman. Asymptotically Optimal Approximation Algorithms for Coflow Scheduling. In Proceedings of SPAA. ACM, 2017. doi:10.1145/3087556.3087567.
[17] Samir Khuller, Jingling Li, Pascal Sturmfels, Kevin Sun, and Prayaag Venkat. Select and permute: An improved online framework for scheduling to minimize weighted completion time. Theoretical Computer Science, 795:420–431, 2019. doi:10.1016/j.tcs.2019.07.026.
[18] Dénes König. Über Graphen und ihre Anwendung auf Determinantentheorie und Mengenlehre. Mathematische Annalen, 77:453–465, 1916. doi:10.1007/BF01456961.
[19] Joseph Y.-T. Leung, Haibing Li, and Michael Pinedo. Scheduling orders for multiple product types to minimize total weighted completion time. Discrete Applied Mathematics, 155(8):945–970, 2007. doi:10.1016/j.dam.2006.09.012.
[20] Monaldo Mastrolilli, Maurice Queyranne, Andreas S. Schulz, Ola Svensson, and Nelson A. Uhan. Minimizing the sum of weighted completion times in a concurrent open shop. Operations Research Letters, 38(5):390–395, 2010. doi:10.1016/j.orl.2010.04.011.
[21] Ali Mostafaeipour, Amir Jahangard Rafsanjani, Mohammad Ahmadi, and Joshuva Arockia Dhanraj. Investigating the performance of Hadoop and Spark platforms on machine learning algorithms. The Journal of Supercomputing, 77(2):1273–1300, 2020. doi:10.1007/s11227-020-03328-5.
[22] Zhen Qiu, Cliff Stein, and Yuan Zhong. Minimizing the total weighted completion time of coflows in datacenter networks. In Proceedings of SPAA, pages 294–303. ACM, 2015. doi:10.1145/2755573.2755592.
[23] Sushant Sachdeva and Rishi Saket. Optimal inapproximability for scheduling problems via structural hardness for hypergraph vertex cover. In 2013 IEEE Conference on Computational Complexity, pages 219–229. IEEE, 2013. doi:10.1109/ccc.2013.30.
[24] Alexander Schrijver. Theory of linear and integer programming. John Wiley & Sons, Inc., USA, 1986.
[25] Konstantin Shvachko, Hairong Kuang, Sanjay Radia, and Robert Chansler. The Hadoop distributed file system. In Proceedings of IEEE MSST, pages 1–10, 2010. doi:10.1109/MSST.2010.5496972.
[26] Éva Tardos. A strongly polynomial algorithm to solve combinatorial linear programs. Operations Research, 34(2):250–256, 1986. doi:10.1287/opre.34.2.250.
[27] Matei Zaharia, Mosharaf Chowdhury, Michael J. Franklin, Scott Shenker, and Ion Stoica. Spark: cluster computing with working sets. In Proceedings of the 2nd USENIX Conference on Hot Topics in Cloud Computing, page 10, USA, 2010. USENIX Association. URL: https://www.usenix.org/conference/hotcloud-10/spark-cluster-computing-working-sets.

