Scheduling on Identical Machines with Setup Time and Unknown Execution Time
Abstract
In this study, we investigate a scheduling problem on identical machines in which jobs require initial setup before execution. We assume that an algorithm can dynamically form a batch (i.e., a collection of jobs to be processed together) from the remaining jobs. The setup time is modeled as a known monotone function of the set of jobs within a batch, while the execution time of each job remains unknown until completion. This uncertainty poses significant challenges for minimizing the makespan. We address these challenges by considering two scenarios: each job batch must be assigned to a single machine, or a batch may be distributed across multiple machines. For both scenarios, we analyze settings with and without preemption. Across these four settings, we design online algorithms that achieve asymptotically optimal competitive ratios with respect to both the number of jobs and the number of machines.
Keywords and phrases:
Online scheduling, Competitive analysis, Makespan minimization, Identical machines schedulingCopyright and License:
2012 ACM Subject Classification:
Theory of computation Online algorithmsFunding:
This work was partially supported by the joint project of Kyoto University and Toyota Motor Corporation, titled “Advanced Mathematical Science for Mobility Society”, JST ERATO Grant Number JPMJER2301, JST PRESTO Grant Number JPMJPR2122, and JSPS KAKENHI Grant Numbers JP17K12646, JP19K22841, JP20H00609, JP20H05967, JP20K19739, JP21K17708, JP21H03397, and JP25K00137.Editors:
Pat Morin and Eunjin OhSeries and Publisher:
Leibniz International Proceedings in Informatics, Schloss Dagstuhl – Leibniz-Zentrum für Informatik
1 Introduction
Efficient job allocation across multiple workers or machines is crucial for optimizing productivity in industrial environments. For example, consider distributing computational jobs across multiple virtual machines. The processing time for a batch of jobs consists of two components: the setup time, which includes configuring the environment or installing necessary software or libraries, and the execution time, which represents the duration of performing the actual tasks. While setup times are typically known in advance, the execution time of each job is often unpredictable until the job is completed. This uncertainty complicates the design of scheduling algorithms aimed at minimizing the makespan, which is the time when all jobs are completed.
In this paper, we introduce a scheduling problem involving jobs on identical machines with known setup times and unknown execution times, referred to as the unknown execution time scheduling (UETS) problem. We assume that an algorithm can dynamically form a batch – a collection of jobs that are processed together – from the remaining jobs. The setup time for a batch is defined as a set function over subsets of jobs, representing the total time required to prepare the jobs in the batch for processing. We explore two primary scenarios: one called sUETS, in which each constructed batch must be assigned to a single machine, and the other called mUETS, in which a batch can be distributed across multiple machines. Additionally, we examine two settings based on whether preemption is allowed. In the non-preemptive setting, once a batch process is started, it must be run without interruption until completion. In the preemptive setting, the processing of a batch can be interrupted. After an interruption, only uncompleted jobs are regrouped into batches and the process is resumed from the setup phase (completed jobs do not need to be processed again). Note that, the non-preemptive setting generally requires longer processing times compared to the preemptive one, because preemption allows for more flexible handling of incomplete batches and may reduce redundant processing.
Our objective is to design online algorithms for these scheduling problems. We analyze the performance of an online algorithm by the competitive ratio, which is the worst-case ratio between the makespan achieved by the online algorithm and that of an optimal offline algorithm. We assume that offline algorithms have complete knowledge of each job’s execution time in advance. We refer to the schedule produced by the optimal offline algorithm as the optimal schedule and its makespan as the optimal makespan. We refer to the makespan of the schedule produced by an algorithm simply as the algorithm’s makespan. An online algorithm is said to be -competitive if its makespan is at most times the optimal makespan for any instance. We will design online algorithms with asymptotically optimal competitive ratios with respect to both the number of machines and the number of jobs.
1.1 Our results
We study the competitive ratios for the scheduling problem with setup time and unknown execution times. We first show that any algorithm designed for jobs with a release time of can be adapted to handle arbitrary release times at the expense of only a constant factor increase in the competitive ratio. We formally state this in Theorem 2. Thus, we may assume that the release time of each job is , i.e., jobs can start processing immediately at time . A summary of our results is provided in Table 1.
For the non-preemptive sUETS problem, we present two algorithms. First, we construct an algorithm with a competitive ratio of (Theorem 3). Second, we design an algorithm that is -competitive (Theorem 5). By combining these two algorithms, we obtain an -competitive algorithm (Corollary 6). Moreover, we prove lower bounds showing that every online algorithm for this problem must have a competitive ratio of at least (Theorem 11) and (Theorem 10). For the preemptive sUETS problem, we design an algorithm with a competitive ratio of (Theorem 8), and we demonstrate that the competitive ratios of and are best possible (Theorem 11).
Turning to the non-preemptive mUETS problem, we first construct an -competitive algorithm (Theorem 12). By integrating this with the -competitive algorithm for the non-preemptive sUETS problem (Theorem 5), we derive an -competitive algorithm for the non-preemptive mUETS problem (Corollary 13). We further prove that and are optimal (Theorem 17). Finally, for the preemptive mUETS problem, we establish that the best possible competitive ratios are and (Theorems 8, 15, and 18).
Due to space limitations, some proofs are omitted in this version. The complete proofs are available in a full version of the paper.
1.2 Related work
Classical scheduling problems typically assume that the setup time is for every job batch. In this case, the well-known list scheduling algorithm achieves -competitive, which is proven to be optimal [16, 17, 18, 32]. This algorithm assigns unprocessed jobs to any available machine in the order they appear on the job list, disregarding execution times.
When execution times are unknown until job completion, the problem falls under the category of non-clairvoyant scheduling [26]. Recent studies have explored non-clairvoyant scheduling in input prediction models [3, 21]. Shmoys et al. [32] introduced a technique to transform a -competitive algorithm for a scheduling problem without release times into a -competitive algorithm for the same problem with release times. For comprehensive overviews of online scheduling, see surveys [31, 29] and the book by Pinedo [28].
Scheduling problems that incorporate both setup times and execution times arise in various applications, such as cloud computing (where virtual machines must be initialized based on job types) and and production systems (where machines require reconfiguration, such as changing molds or colors) [30, 25, 20, 13, 9, 1]. For example, in plastic production systems, attaching a specific mold to a machine constitutes the setup time. If consecutive jobs use the same mold, no additional setup time is required. The total setup time for a batch can be precomputed as the minimum time needed for attaching, exchanging, and removing molds.
Gambosi and Nicosia [13] studied an online version of scheduling with setup times in the one-by-one model. Mäcker et al. [24] investigated non-clairvoyant scheduling with setup times and proposed an -competitive algorithm for minimizing maximum flow time on a single machine. Dogeas et al. [10] considered a scenario where the execution time of each job is only revealed after an obligatory test with a known duration. This test time can be viewed as a kind of setup time, although it differs in that the execution time becomes apparent.
Goko et al. [15] introduced a scheduling model with a metric state space that involves setup time and unknown execution time settings together. Their model includes a scheduling problem faced by repair companies where each worker needs to visit customers’ houses, do the repair jobs, and then return to the office. The setup time corresponds to the shortest tour length for the customers’ houses, and the execution time corresponds to the duration of the repair jobs. Their model also includes the online dial-a-ride problem [2, 22, 11, 23, 7, 6, 4, 5], in which taxis are offered to pick up and drop passengers for transportation jobs. Our preemptive mUETS problem can be seen as an abstraction of their model, and hence, a similar approach can be used to solve it. However, the other problems (i.e., preemptive/non-preemptive sUETS and non-preemptive mUETS) are different and require other approaches.
The setting in which all jobs have an execution time of and are released at time , with only the setup time considered as processing time, can be viewed as an offline load balancing problem. This problem is NP-hard even if the setup time is additive and the number of machines is two, as this special case corresponds to the PARTITION problem [14]. For the additive setup time, the list scheduling algorithm works as a -approximation algorithm. Moreover, this problem admits a polynomial time approximation scheme (PTAS) [19]. Svitkina and Fleischer [33] presented an -approximation algorithm for the submodular setup times, and demonstrated that this is best possible. Furthermore, Nagano and Kishimoto [27] provided a -approximation algorithm for the subadditive setup times, where is an optimal partition and is the curvature of at .
2 Preliminaries
For a positive integer , we write to denote the set . We are given a set of jobs and identical machines. We assume that all jobs are given and released at time . As we will show in Section 2.2, this assumption only increases a constant factor in the competitive ratios. We denote the set of machines by . We assume that , as the optimal scheduling is clear otherwise. The jobs are executed in batches, and each batch’s processing time consists of two components: the setup time and the execution time.
The setup time is the total time one machine takes to set up jobs in the batch. For a batch , the setup time for is represented as . The execution time for each job , denoted by , is unknown until its process is completed. We write to denote the total execution time of a batch , i.e., . Consequently, the overall processing time of a batch on a single machine is given by . We assume that the setup time for every is available in advance, while the execution time of each job is unknown until its completion.
We also deal with a situation where one batch can be assigned to multiple machines. We assume that when a batch is assigned to machines, each of the machines incurs a setup time , and the jobs are executed on a first-come, first-served basis. For example, when a batch is processed in a production system, we assume that each machine forms a queue of the attachments needed for jobs in , and proceeds as follows: (i) dequeues and installs the first attachment in the queue, (ii) repeatedly executes an unprocessed job that is compatible with the current attachment as long as such a job exists, and (iii) removes the current attachment and returns to step (i) if the attachment queue is not empty. The minimum processing time for batch across machines falls within the range of and .
At each time, we can assign a batch of jobs that have not yet been completed and are not assigned anywhere else. In the sUETS and mUETS problems, a batch can be allocated to a single machine and multiple machines, respectively. Our objective is to minimize the makespan, which is the time at which all the jobs are completed.
We assume that the setup time function is monotone subadditive. Monotonicity means that for any . Subadditivity means that for any disjoint subsets and from , the sum of their setup times is greater than or equal to the setup time of their union, i.e., . Note that this assumption of subadditivity does not lose generality. To observe this, for any (not necessarily subadditive) function , define the function . This new function is monotone and subadditive by construction – it represents the minimum total setup time achievable if one were allowed to split the batch into sub-batches and incur the setup cost separately for each. Importantly, replacing with in our analysis does not change the fundamental nature of the scheduling problem, and thus we assume without loss of generality that the setup time function is monotone and subadditive.
We explore two settings based on whether an allocated batch can be preempted. In the non-preemptive setting, once a batch is assigned, it must be processed to completion without interruption. Conversely, in the preemptive setting, the processing of a batch can be halted and canceled. Suppose that a batch is assigned to machines and is preempted after units of time from the start of its processing. In this case, we assume that the machines will become available to process another batch after at most units of time. This assumption ensures that the processing time does not increase even if preemption occurs during the setup phase. Furthermore, at the moment of preemption, let denote the set of jobs that have been completed. Then, the following inequality holds: . This condition ensures that the total time spent up to the point of preemption is consistent with the setup and execution times for the completed jobs.
Note that, in any setting, we can assume that an optimal schedule does not perform preemptions and assigns only one batch to each machine at the beginning because the setup time is subadditive. Thus, an optimal schedule can be represented as a partition of jobs and the optimal makespan is independently of the settings. Based on this observation, we derive a lower bound for the optimal makespan, as formalized in the following lemma.
Lemma 1.
The optimal makespan is at least
Proof.
Let be an optimal schedule. Then, the optimal makespan is at least
Additionally, we have
by monotone subadditivity of the setup time. Therefore, we have the lower bound.
2.1 Examples of setup times
This subsection illustrates several examples of setup times raised in practical applications. In addition, we discuss the approximability of the partition (load balancing) problem
| (1) |
for each class of setup times.
Constant setup times.
One of the simplest examples of the setup time family is the constant setup time, where the setup time is a certain constant independent of the batch. Formally, the setup time is represented as for and . It is not difficult to see that any partition of the jobs is optimal for the problem (1).
Type-specific setup times.
In production systems with attachments, jobs are partitioned based on their types. Let be the set of types and is the partition of jobs with types , i.e., and for all distinct . Let be the setup time for type . Then, the setup time for is defined as . Additionally, we say that a type-specific setup time is unweighted if for all , i.e., for . We can obtain a PTAS for the problem (1) with type-specific setup times by treating each as a single job and applying a PTAS for the load balancing problem [19]. Moreover, for the unweighted case, the problem (1) is solvable in polynomial time because an optimal solution can be obtained by simply balancing the number of types assigned to each machine.
Library-based setup times.
In cloud computing environments, where each job requires the installation of multiple libraries, and each library takes a fixed amount of time to install. The setup time needed to process a batch of jobs is the total installation time of the required libraries. Let be the set of libraries, and let be the set of required libraries for job . Let be the installation time for library . Then, the setup time for is defined as . Note that if every job requires a unique library (i.e., ), this is a type-specific setup time. This setup time is a monotone submodular function; more specifically, it is a weighted coverage function. Hence, we can apply the -approximation algorithm for the uniform submodular load balancing problem [33] to solve the problem (1).
TSP-based setup times.
For applications like repair companies, the setup time is defined by the optimal value of a traveling salesman problem (TSP) instance. Let be a metric space with an origin . The origin corresponds to the location of the company. Each job is associated with a point . Then, the setup time for is defined as the optimal value of the TSP on . Note that this class of setup times is a generalization of type-specific setup times. Indeed, for a star metric where for all and for all distinct , the setup time is for when for each job . For TSP-based setup times, the problem (1) can be viewed as the -TSP. It is known that the -TSP admits a -approximation algorithm [12, 8].
2.2 Reduction of the Problem with Release Time
In this subsection, we show that our assumption that all jobs have a release time of (i.e., each job is available for processing from time ) does not lose generality. Suppose instead that each job has a release time, and its existence is hidden until its release time. Let be a -competitive algorithm for a UETS problem in a certain setting without release times. Then, we demonstrate that can be transformed into a -competitive algorithm for the UETS problem with release times of the corresponding setting. For this end, we use the IGNORE strategy, which appeared in the paper of Shmoys et al. [32] and is named by Ascheuer et al. [2].
The IGNORE strategy keeps the machines remain idle until a set of jobs appears. Then, IGNORE immediately decides a schedule for the jobs in following the algorithm and assigns job batches to machines. We refer to this schedule as a subschedule for . All jobs that arrive during the process of the subschedule are temporarily ignored until the subschedule for is completed. After all the machines complete the jobs in , IGNORE decides a subschedule for all new jobs that arrived during the previous process. If there are no such jobs, all the machines become idle. The IGNORE strategy repeats this procedure.
Theorem 2.
The strategy with a -competitive algorithm is -competitive for the UETS problem with release time.
Proof.
Fix an instance, and let be the optimal makespan. For a subset of jobs , we denote as the makespan of the algorithm, assuming that jobs in are given at time . Note that by the monotonicity of the optimal makespan with respect to the jobs.
If the instance contains no job, then is clearly optimal. Thus, we assume that the instance contains at least one job. Let be the last release time of the jobs. Then, as the last released job can only be processed from .
Suppose that the machines are idle at time . Let be the set of jobs processed in the last subschedule by the IGNORE strategy. Then, the makespan of IGNORE is
On the other hand, suppose that some machines are not idle at time . Then is in the second last subschedule of the algorithm, and the last subschedule starts right after the second-to-last subschedule ends. Let and be the sets of jobs processed in the last and the second-to-last subschedules, respectively. Then, the makespan of IGNORE is
Therefore, the competitive ratio of IGNORE is at most .
It should be noted that this reduction remains valid even when the competitive ratio of depends on the number of machines or the number of jobs , provided it is monotonically nondecreasing with respect to . From this theorem, the optimum competitive ratio is the same up to a constant factor regardless of the release time.
3 Single Machine Batch Allocation
In this section, we analyze the sUETS problem where each batch can be allocated only on a single machine. Recall that is the number of jobs and is the number of machines.
3.1 Algorithms
We first observe that the algorithm of processing the batch consisting of all jobs by a single machine without preemption is -competitive, where is the number of machines. Note that this algorithm is feasible for any settings of sUETS and mUETS since no cancellation is performed.
Theorem 3.
The algorithm of processing the batch consisting of all jobs by a single machine is -competitive for the non-preemptive sUETS problem.
Proof.
As the makespan of the algorithm is and the optimal makespan is at least by Lemma 1, the competitive ratio is at most
where the second inequality is implied by the subadditivity of the setup time .
Second, we analyze the competitive ratio of the list scheduling algorithm.
Theorem 4.
The algorithm that greedily assigns a batch consisting of a single unprocessed job to any available single machine is -competitive for the non-preemptive sUETS problem.
Proof.
The makespan of the algorithm is at most
where the inequality holds by Lemma 1. Thus, the competitive ratio is at most .
Next, we provide an -competitive algorithm, which is an improved version of the list scheduling algorithm. The idea is to reduce the total setup time by grouping jobs. The algorithm first divides the jobs into batches in such a way that this partition minimizes the maximum setup time, under the condition that each batch contains at most jobs. Then, it processes the batches according to the list scheduling algorithm, which greedily allocates an unprocessed batch to any available machine. The formal description of this algorithm is summarized in Algorithm 1.
Theorem 5.
Algorithm 1 is -competitive for the non-preemptive sUETS problem.
Proof.
Let be the optimal schedule and let denote the optimal makespan . Define , and let be a -partition of the jobs that minimizes subject to for all .
We first observe that . This is because by dividing each with into sub-batches of almost equal size, we can obtain a partition that refines . Here, we have for all . Thus, it holds that .
Then, the makespan of the algorithm is at most
where the first inequality follows from the subadditivity of the setup time and the second inequality from Lemma 1. Hence, Algorithm 1 is -competitive.
By combining Theorems 3 and 5, we obtain the following corollary.
Corollary 6.
There exists an -competitive algorithm for the non-preemptive sUETS problem.
Proof.
If , then the algorithm that processes the batch by a single machine is -competitive by Theorem 3. On the other hand, if , the competitive ratio of Algorithm 1 is by Theorem 5. Thus, an -competitive algorithm exists for either case.
Remark 7.
Obtaining the partition at line 2 in Algorithm 1 is computationally hard. Thus, we mention the competitive ratio when we can utilize an -approximation algorithm for the problem (1). Let be an -partition of the jobs that is obtained by the approximation algorithm. Let . Then, we can easily construct a -partition that is a refinement of and satisfies for all . By using instead of and employing this partition at line 2, Algorithm 1 is -competitive for the non-preemptive sUETS problem. By combining this with Theorem 4, we can obtain an -competitive algorithm. Moreover, by combining this with Theorem 3, we can also obtain an -competitive algorithm.
In the rest of this subsection, we provide an -competitive algorithm for the preemptive sUETS problem. For a given instance of the sUETS problem, let be an integer such that . Note that . The execution of our algorithm consists of phases. Without loss of generality, we may assume that , since otherwise the competitive ratio of the algorithm in Theorem 3 is .
The intuition of our algorithm is as follows. Our algorithm repeatedly completes a -fraction of the jobs in each phase. Consequently, all the jobs can be processed in iterations. This increases the cost of the setup time by a factor of . Additionally, we ensure that at least machines work at any point of the algorithm. This guarantees that the cost in terms of execution time will only increase by a factor of at most . By these properties, the competitive ratio of the algorithm is .
Let us explain our algorithm more precisely. Initially, it computes an -partition of the jobs that minimizes the maximum setup time . In the first phase, is allocated to machine for each . Then, each machine processes the assigned batch until either the machine completes the batch or the number of uncompleted machines becomes less than or equal to . At that time, the machines still processing will preempt their batches. In the th phase (), the algorithm computes an -partition of the remaining jobs such that, for any , and for some . We can always obtain such a partition by dividing each batch left in the previous phase into sub-batches. Thereafter, each machine processes the assigned batch until either the batch is completed or the number of uncompleted machines becomes less than or equal to . At the end of th phase, the number of remaining jobs is at most . In the st phase, the algorithm assigns up to one remaining job to each machine. A formal description of the algorithm is provided in Algorithm 2.
Theorem 8.
Algorithm 2 is -competitive for the preemptive sUETS problem.
Proof.
Let be an integer such that . If , then the competitive ratio is by Theorem 3. Thus, we assume .
Let be the optimal makespan and let . In addition, let be the time length of phase . By the definition of the algorithm, the jobs in are completed in phase . Thus, the total execution time of jobs in phase is at most because at most jobs are partially executed. Taking into account the setup time of at most and the time for preemption of at most , the length of phase is
where the second inequality is by Lemma 1 and the third is by . In addition, by Lemma 1, the time length of phase is
Thus, the makespan of Algorithm 2 is
Hence, the competitive ratio of Algorithm 2 is .
Remark 9.
Similar to Remark 7, suppose that we take an -partition of the jobs that is an -approximation of the problem (1) at line 6 in Algorithm 2. Then, the competitive ratio of Algorithm 2 becomes -competitive for the preemptive sUETS problem. Moreover, by setting to be the integer that satisfies , we can slightly improve the competitive ratio to .
3.2 Lower bounds
In this subsection, we show that the algorithms provided in Section 3.1 are asymptotically optimal with respect to both the number of machines and the number of jobs.
We first provide a lower bound for the non-preemptive setting.
Theorem 10.
The competitive ratio of the non-preemptive sUETS problem is at least and even for constant setup times.
Proof.
Let and for any non-empty batch (i.e., constant setup times). We will set the execution time for at most jobs (referred to as heavy jobs) and for the other jobs (referred to as light jobs) depending on the behavior of a given online algorithm. It is not difficult to see that the optimal makespan is at most .
We fix an online algorithm. Let us consider its behavior when every job is light. Suppose that every batch assigned to machines consists of at most jobs. Then, the algorithm assigns at least batches to the machines in total. Consequently, some machines must process at least batches, which requires setups. Thus, the makespan is at least in this case, and we have done.
Conversely, suppose that the algorithm allocates a batch consisting of more than jobs in the instance where every job is light. In another instance where jobs in the first such batch are set to be heavy, the algorithm’s behavior is the same until the batch is assigned. Thus, the algorithm assigns all the heavy jobs to one machine, and the makespan is at least . Therefore, in both cases, the competitive ratio is at least .
Next, we provide a lower bound for the preemptive setting.
Theorem 11.
The competitive ratio of the preemptive sUETS problem is at least and even for constant setup times.
4 Multiple Machines Batch Allocation
In this section, we explore the mUETS problem, where a batch can be allocated to multiple machines.
4.1 Algorithms
We first provide an -competitive algorithm for the non-preemptive mUETS problem. The algorithm computes a -partition of that minimizes maximum setup time. Then, it allocates each batch to machines. The algorithm is formally described in Algorithm 3.
Theorem 12.
Algorithm 3 is -competitive for the non-preemptive mUETS problem.
Proof.
Let be an optimal schedule and let be the optimal makespan. In addition, let and be a -partition of that minimizes . We analyze the algorithm that allocates each batch to machines. The makespan of the algorithm is at most . By the choice of and the subadditivity of the setup time , we have
| (2) | ||||
where we denote for . Therefore, by Lemma 1 and (2), the makespan is at most
which means that the competitive ratio of the algorithm is .
By combining Theorems 12 and 5, we can obtain the following corollary.
Corollary 13.
There is an -competitive algorithm for the non-preemptive mUETS problem.
Proof.
If , then the algorithm in Theorem 12 is -competitive. On the other hand, if , the competitive ratio of Theorem 5 is . Thus, in either case, there is an -competitive algorithm.
Remark 14.
To account for the computational issue, suppose that we only have an -partition of the jobs that is an -approximation for the problem (1). Let . Then, the schedule that assigns batch to machines for each is -competitive for the non-preemptive mUETS problem. By combining this with Remark 7, we can obtain an -competitive algorithm for the non-preemptive mUETS problem.
For the preemptive mUETS problem, Algorithm 2 is also -competitive, and this is asymptotically best possible as we will see in Theorem 18. Thus, even when we are allowed to assign a job batch to more than one machine, we cannot improve the competitive ratio with respect to the number of jobs. In contrast, we show that the competitive ratio with respect to the number of machines can be exponentially improved by allowing batches to be assigned to multiple machines.
Let be an integer such that and let be . Here, and . We have from the assumption that .
Our algorithm for the preemptive mUETS is similar to Algorithm 2 for the preemptive sUETS. However, instead of splitting the uncompleted batch into several smaller batches of similar size, the algorithm increases the number of machines assigned to that batch. This ensures that the competitive ratio of the algorithm does not depend on the number of jobs .
Our algorithm computes an -partition of the jobs that minimizes the maximum setup time . In the first phase, each batch is allocated to one machine. Then, each batch is processed until it is completed, or the number of uncompleted batches becomes less than or equal to . In the th phase (), each uncompleted batch is assigned to machines, and it is processed until it is completed, or the number of uncompleted batches becomes . If the number of uncompleted batches becomes smaller than because multiple batches are completed simultaneously, then break the tie arbitrarily and defer the rest batches to the next phase. Note that this allocation is feasible because . At the end of the th phase, all the batches are completed since by . Our algorithm is formally described in Algorithm 4. It should be noted that this algorithm is based on a similar idea to that of [15, Algorithm 1].
Theorem 15.
Algorithm 4 is -competitive for the preemptive mUETS problem.
Proof.
Let be the optimal makespan and let . For each , let be the set of indices of batches that have been completed in the th phase. By the definition of the algorithm, we have
Let be the time length of phase . We first bound it for . As every batch in is not finished in phase and the preemption takes time at most , we have
Note that if and for . Thus, we have
| (3) |
for every .
Next, we bound the time length of phase . As the batch for each is processed by machines, we have
| (4) |
Hence, by (3) and (4), the makespan of the algorithm is
Therefore, the competitive ratio of Algorithm 4 is .
Remark 16.
Suppose that we take an -partition of the jobs that is an -approximation for the problem (1) at line 2 in Algorithm 2. Then, the competitive ratio of Algorithm 4 becomes -competitive for the preemptive mUETS problem. Moreover, by setting to be the integer that satisfies , we can slightly improve the competitive ratio to .
4.2 Lower Bounds
In this subsection, we show the asymptotic optimality of our algorithms. We first provide lower bounds for the non-preemptive case.
Theorem 17.
The competitive ratio of the non-preemptive mUETS is at least and even for unweighted type-specific setup times.
Next, we provide lower bounds for the preemptive case.
Theorem 18.
The competitive ratio of the preemptive mUETS problem is at least and even for unweighted type-specific setup times.
5 Conclusion and Discussion
In this paper, we introduced the UETS problem and studied it in terms of competitive analysis. We obtained tight bounds of the competitive ratio in each setting. In the following, we discuss the application of our results to variant settings.
When a batch is assigned to machines, we have assumed that every machine incurs a setup time . However, for instance, in the production system example, unnecessary reassignment of attachments can be skipped. Additionally, if a machine processes multiple batches, the setup time incurred from the second or subsequent batch may be reduced. Our algorithmic results remain valid even for these cases as long as an optimal schedule assigns only one batch to each machine separately. This is because, for algorithms, a decrease in setup time only contributes to decreasing the makespan.
In the preemptive setting, we assumed that a job completed at the time of preemption does not need to be executed again. We can also consider a setting in which, if a batch process is preempted, the entire batch must be restarted from scratch. This means that even completed jobs in the batch must be executed again. Algorithm 4 also works in this setting, and it is -competitive because its analysis does not use the information of which jobs are completed. However, Algorithm 2 may not achieve the competitive ratio of as the proof does not work for this setting. Nevertheless, if we can obtain information on the execution time of completed jobs, we can reestablish an -competitive algorithm by processing the jobs that are completed once but need to be reprocessed between phases.
References
- [1] Ali Allahverdi, Chi To Ng, TC Edwin Cheng, and Mikhail Y Kovalyov. A survey of scheduling problems with setup times or costs. European journal of operational research, 187(3):985–1032, 2008. doi:10.1016/j.ejor.2006.06.060.
- [2] Norbert Ascheuer, Sven O. Krumke, and Jörg Rambau. Online Dial-a-Ride Problems: Minimizing the Completion Time. In Proceedings of the Symposium on Theoretical Aspects of Computer Science, volume 1770, pages 639–650, 2000. doi:10.1007/3-540-46541-3_53.
- [3] Evripidis Bampis, Alexander V. Kononov, Giorgio Lucarelli, and Fanny Pascual. Non-clairvoyant makespan minimization scheduling with predictions. In Satoru Iwata and Naonori Kakimura, editors, Proceedings of the International Symposium on Algorithms and Computation, volume 283 of LIPIcs, pages 9:1–9:15. Schloss Dagstuhl – Leibniz-Zentrum für Informatik, 2023. doi:10.4230/LIPIcs.ISAAC.2023.9.
- [4] Alexander Birx. Competitive analysis of the online dial-a-ride problem. PhD thesis, Technische Universität Darmstadt, 2020.
- [5] Alexander Birx and Yann Disser. Tight analysis of the smartstart algorithm for online dial-a-ride on the line. SIAM Journal on Discrete Mathematics, 34(2):1409–1443, 2020. doi:10.1137/19M1268513.
- [6] Alexander Birx, Yann Disser, and Kevin Schewior. Improved bounds for open online dial-a-ride on the line. In Proceedings of Approximation, Randomization, and Combinatorial Optimization. Algorithms and Techniques (APPROX/RANDOM), volume 145, 2019. doi:10.4230/LIPIcs.APPROX-RANDOM.2019.21.
- [7] Vincenzo Bonifaci, Maarten Lipmann, and Leen Stougie. Online multi-server dial-a-ride problems. SPOR-Report : reports in statistics, probability and operations research. Technische Universiteit Eindhoven, 2006.
- [8] Nicos Christofides. Worst-case analysis of a new heuristic for the travelling salesman problem. Operations Research Forum, 3(1):20, 2022. doi:10.1007/s43069-021-00101-z.
- [9] Srikrishnan Divakaran and Michael Saks. An online scheduling problem with job set-ups. DIM ACS Technical Report: 2000, 34, 2000.
- [10] Konstantinos Dogeas, Thomas Erlebach, and Ya-Chun Liang. Scheduling with Obligatory Tests. In Proceedings of the Annual European Symposium on Algorithms, pages 48:1–48:14, 2024. doi:10.4230/LIPIcs.ESA.2024.48.
- [11] Esteban Feuerstein and Leen Stougie. On-line single-server dial-a-ride problems. Theoretical Computer Science, 268(1):91–105, 2001. doi:10.1016/S0304-3975(00)00261-9.
- [12] Greg N. Frederickson, Matthew S. Hecht, and Chul E. Kim. Approximation Algorithms for Some Routing Problems. SIAM Journal on Computing, 7(2):178–193, 1978. doi:10.1137/0207017.
- [13] Giorgio Gambosi and Gaia Nicosia. On-line scheduling with setup costs. Information Processing Letters, 73(1–2):61–68, 2000. doi:10.1016/S0020-0190(99)00152-0.
- [14] M. R. Garey and David S. Johnson. Computers and Intractability: A Guide to the Theory of NP-Completeness, volume 174. W. H. Freeman, 1979.
- [15] Hiromichi Goko, Akitoshi Kawamura, Yasushi Kawase, Kazuhisa Makino, and Hanna Sumita. Online scheduling on identical machines with a metric state space. In Proceedings of the Symposium on Theoretical Aspects of Computer Science, pages 32:1–32:21, 2022. doi:10.4230/LIPIcs.STACS.2022.32.
- [16] Ronald L. Graham. Bounds for Certain Multiprocessing Anomalies. Bell System Technical Journal, 45(9):1563–1581, 1966. doi:10.1002/j.1538-7305.1966.tb01709.x.
- [17] Dan Gusfield. Bounds for naive multiple machine scheduling with release times and deadlines. Journal of Algorithms, 5(1):1–6, 1984. doi:10.1016/0196-6774(84)90035-X.
- [18] Leslie A. Hall and David B. Shmoys. Approximation schemes for constrained scheduling problems. In Proceedings of Annual Symposium on Foundations of Computer Science, pages 134–139, 1989. doi:10.1109/SFCS.1989.63468.
- [19] Dorit S Hochbaum and David B Shmoys. Using dual approximation algorithms for scheduling problems theoretical and practical results. Journal of the ACM, 34(1):144–162, 1987. doi:10.1145/7531.7535.
- [20] Hongtao Hu, K. K.H. Ng, and Yichen Qin. Robust Parallel Machine Scheduling Problem with Uncertainties and Sequence-Dependent Setup Time. Scientific Programming, 2016. doi:10.1155/2016/5127253.
- [21] Sungjin Im, Ravi Kumar, Mahshid Montazer Qaem, and Manish Purohit. Non-clairvoyant scheduling with predictions. ACM Transactions on Parallel Computing, 10(4):1–26, 2023. doi:10.1145/3593969.
- [22] Sven O. Krumke. Online optimization: Competitive analysis and beyond. PhD thesis, Technische Universität Berlin, 2001.
- [23] Maarten Lipmann, Xiwen Lu, Willem E. de Paepe, Rene A. Sitters, and Leen Stougie. On-Line Dial-a-Ride Problems Under a Restricted Information Model. Algorithmica, 40(4):319–329, 2004. doi:10.1007/s00453-004-1116-z.
- [24] Alexander Mäcker, Manuel Malatyali, Friedhelm Meyer auf der Heide, and Sören Riechers. Non-clairvoyant scheduling to minimize max flow time on a machine with setup times. In Proceedings of the International Workshop on Approximation and Online Algorithms, pages 207–222. Springer, 2017. doi:10.1007/978-3-319-89441-6_16.
- [25] David M Miller, Hui-Chuan Chen, Jessica Matson, and Qiang Liu. A hybrid genetic algorithm for the single machine scheduling problem. Journal of Heuristics, 5(4):437–454, 1999. doi:10.1023/A:1009684406579.
- [26] Rajeev Motwani, Steven Phillips, and Eric Torng. Nonclairvoyant scheduling. Theoretical computer science, 130(1):17–47, 1994. doi:10.1016/0304-3975(94)90151-1.
- [27] Kiyohito Nagano and Akihiro Kishimoto. Subadditive load balancing, 2019. doi:10.48550/arXiv.1908.09135.
- [28] Michael L. Pinedo. Scheduling: Theory, Algorithms, and Systems. Springer Publishing Company, Incorporated, 2008. doi:10.1007/978-3-031-05921-6.
- [29] Kirk Pruhs, Jirí Sgall, and Eric Torng. Online scheduling. In Joseph Y.-T. Leung, editor, Handbook of Scheduling - Algorithms, Models, and Performance Analysis. Chapman and Hall/CRC, 2004. doi:10.1201/9780203489802.CH15.
- [30] Vinod K Sahney. Single-server, two-machine sequencing with switching time. Operations Research, 20(1):24–36, 1972. doi:10.1287/opre.20.1.24.
- [31] JiŘí Sgall. On-line scheduling, pages 196–231. Springer Berlin Heidelberg, 1998.
- [32] David B. Shmoys, Joel Wein, and David P. Williamson. Scheduling parallel machines on-line. SIAM Journal on Computing, 24(6):1313–1331, 1995. doi:10.1137/S0097539793248317.
- [33] Zoya Svitkina and Lisa Fleischer. Submodular approximation: Sampling-based algorithms and lower bounds. SIAM Journal on Computing, 40(6):1715–1737, 2011. doi:10.1137/100783352.
