Abstract 1 Introduction 2 Model and Problem Formulation 3 Solution Approach 4 Experiments 5 Conclusion References

Speed-Aware Network Design: A Parametric Optimization Approach

Ugo Rosolia ORCID Amazon Science & Tech, Luxembourg, Luxembourg Marc Bataillou Almagro Amazon Science & Tech, Luxembourg, Luxembourg George Iosifidis Delft University of Technology, The Netherlands Martin Gross Amazon Science & Tech, Luxembourg, Luxembourg Georgios Paschos ORCID Amazon Science & Tech, Luxembourg, Luxembourg
Abstract

Network design problems have been studied from the 1950s, as they can be used in a wide range of real-world applications, e.g., design of communication and transportation networks. In classical network design problems, the objective is to minimize the cost of routing the demand flow through a graph. In this paper, we introduce a generalized version of such a problem, where the objective is to tradeoff routing costs and delivery speed; we introduce the concept of speed-coverage, which is defined as the number of unique items that can be sent to destinations in less than 1-day. Speed-coverage is a function of both the network design and the inventory stored at origin nodes, e.g., an item can be delivered in 1-day if it is in-stock at an origin that can reach a destination within 24 hours. Modeling inventory is inherently complex, since inventory coverage is described by an integer function with a large number of points (exponential to the number of origin sites), each one to be evaluated using historical data. To bypass this complexity, we first leverage a parametric optimization approach, which converts the non-linear joint routing and speed-coverage optimization problem into an equivalent mixed-integer linear program. Then, we propose a sampling strategy to avoid evaluating all the points of the speed-coverage function. The proposed method is evaluated on a series of numerical tests with representative scenarios and network sizes. We show that when considering the routing costs and monetary gains resulting from speed-coverage, our approach outperforms the baseline by 8.36% on average.

Keywords and phrases:
Network Design, Transportation Networks, Mixed-Integer Programming, Speed-Coverage, Parametric Optimization
Copyright and License:
[Uncaptioned image] © Ugo Rosolia, Marc Bataillou Almagro, George Iosifidis, Martin Gross, and Georgios Paschos; licensed under Creative Commons License CC-BY 4.0
2012 ACM Subject Classification:
Applied computing Transportation
Editors:
Jonas Sauer and Marie Schmidt

1 Introduction

1.1 Background & Motivation

Expedite delivery services are becoming increasingly important for e-commerce supply chains such as Amazon, Alibaba and Walmart as they improve directly customer experience and can indirectly contribute to attaining key sustainability goals. Indeed, the option of expedite delivery increases naturally the range of items customers are willing to purchase from e-commerce platforms instead of visiting physical retail (brick-and-mortar) shops, and this has been found to reduce transportation-based carbon emissions in many scenarios [23, 21]. These reasons contribute to a growing volume of research aiming to improve expedite delivery by means of last-mile routing optimization [17], vehicle dispatch scheduling [16], or innovative crowd-shipping models [8], among others.

Refer to caption
Figure 1: Middle-mile network design problem where destination nodes – i.e., distribution centers – are connected to origin nodes – i.e., warehouse where items are stored. Note that origin nodes have partially overlapping inventory, e.g., the inventory stored at Origin 2 and Origin 3 is also stored at Origin 1. Thus, connecting with speed-paths Origin 1 to destination nodes allows us to offer expedite delivery services for all items.

Nevertheless, the above works overlook the role of the middle-mile network connecting warehouses (origins) to distribution centers (destinations) in the efficacy and feasibility of such expedite delivery services. As the demand grows, items have to be stored at out-of-city warehouses, and therefore, the possibility of offering expedite delivery for these items is shaped by the network connections. This situation creates an unavoidable tension between cost and speed when designing the transportation network. To reduce costs we need to minimize the number of trucks used to ship orders to customers. Such a goal can be achieved by leveraging intermediate consolidation hubs between origin and destination nodes. Consider the example from Figure 1, where we have nine commodities – one for each origin-destination pair – requiring one-third of truck’s capacity. By utilizing intermediate consolidation hubs, the network configuration from Figure 1 requires only seven trucks to route packages. In contrast, serving these same nine commodities through speed-paths – i.e., paths directly connecting origins to destinations – would require nine trucks, one for each origin-destination pair. To tackle the tradeoff between volume consolidation and opening speed-paths, many studies formulate the middle-mile network design problem as a minimum-cost flow problem with maximum path-length constraints, or aim to balance transit times with path costs (see Sec. 1.2).

Yet, a key factor that has been largely overlooked is the effect of the network connectivity on the share of inventory for which we can offer expedite deliver services. In this paper, we propose to measure such an effect by evaluating the network’s speed-coverage, which we define as the number of unique items that can be delivered in 1-day. Speed-coverage is a function of (a) the network topology which determines the transit time from origins to destinations and (b) the inventory stored at origin nodes. We recognize an inherent tradeoff: connecting an origin to a destination via a short route (speed-path) increases the speed-coverage, but may increase routing cost by making volume consolidation more difficult. Additionally, the benefit of speed-paths to speed-coverage exhibits diminishing returns. The more speed-paths we introduce, the less their differential impact to the overall speed-coverage objective. In this work, we formulate a problem that captures exactly these tradeoffs and then propose a technique to solve for the jointly optimal routing costs and speed-coverage for the general case when origins have partially overlapping inventory. To the best of our knowledge, this paper is the first to introduce a mathematical formulation of these tradeoffs, and a scalable methodology for solving the optimization problem.

1.2 Literature Review

As the importance of, and demand for, expedite delivery services grows, the design of speed-aware middle-mile networks becomes an increasing priority for service providers. Prior works on network design that cater for delivery speed include the hub-network design problem with time-definite delivery. In this setting, the objective is to decide hub locations together with routing paths, and speed deadlines are captured through path-eligibility constraints, see e.g., [7]. Similarly, [27] optimizes the location of hubs, assignment of demand centers to hubs, and the vehicle routing. Expanding on these ideas, [25] considers additionally hub capacity constraints; [3] studies a single-hub overnight delivery system and optimizes routing subject to timing constraints; and [28] focuses on express air-service and decides which routes to operate with the company-owned cargo planes and how much capacity to purchase on commercial flights. Finally, [14] studies the middle-mile consolidation problem with delivery deadlines while accounting for consolidation delays. Similar models have been studied in the context of scheduled service network design that optimizes small (less-than-truckload) inter-city shipments with timing constraints for the delivery or intermediate hops, see [15] and references therein. All the above works model the delivery time requirements through path-length constraints, e.g., a subset of origin-destination nodes are forced to be connected via paths that have transit time smaller than a predefined threshold. In contrast to this binary approach of enforcing time constraints, our work explicitly optimizes the tradeoff between speed and cost, providing a flexible framework for network design decisions.

Time-expanded graphs are one of the most common tools for capturing such timing restrictions but increase the problem’s dimension and compound their solution, cf. [19]. This modeling approach augments the dimension of the problem and therefore its complexity. For this reason, several approaches based on decomposition methods [26, 11], adaptive discretization techniques [5], and model condensation strategies [19] have been proposed. The key difference of our approach from these prior works is that we explicitly consider the overlap of inventory at origins to determine the number of unique items that are eligible for a 1-day delivery option. We do not assume to have an analytical model that captures how the delivery speed affects the inventory, but instead we provide a practical methodology for directly incorporating data-sets and look-up tables in the optimization problem. A key element of our strategy is a parametric linear model [2] which allows to express the inventory function in a compact and tractable form. Parametric optimization has been particularly successful for a range of applications, see [22], but, to the best of the authors knowledge has not been employed for expedite delivery optimization in middle-mile networks.

1.3 Methodology & Contributions

Jointly optimizing network design decisions and speed-coverage is a challenging problem. First of all, the network design problem, even without the speed-coverage objective, is already NP-hard to solve optimally when one has to (a) use unsplittable paths, (b) add integer link capacities (trucks), and (c) consider a large number of multi-hop path variables, [9, 10]. Secondly, the inventory effect of each new speed-path depends on which other origins are connected to a desination. In other words, speed-path assignment decisions are coupled across all origins serving a certain destination, but also across origins that use intersecting (at one or more edges) paths due to the edge capacities. Third, the inventory coverage function – i.e., the function mapping a set of origins to the number of unique items stored at these nodes – does not admit a convenient analytical expression. In fact, this function depends on the inventory overlap at origins, and in practice can be calculated using inventory datasets and look-up tables. This function format, unfortunately, does not facilitate – actually prohibits – its direct inclusion in the network optimization program, as it renders the problem highly nonlinear – see equation (4) from Section 2 for further details.

Our method relies on multi-parametric optimization [12, 22] to create a continuous (concave) approximation of the inventory function. This uses as input the available inventory data-points – i.e., the number of unique items stored at a set of origin nodes – and creates a continuous piecewise affine interpolation over the convex combination of the input data-points. We prove that this approximation is exact at the input data-points, and hence can serve as a meaningful proxy for maximizing this metric of interest. This, in turn, enables the inclusion of the inventory function in the network design problem, without inflating it with new discrete variables. The result of this new joint formulation is a speed-aware middle-mile network, where the designer can tradeoff network costs with the number of unique items that can be delivered to customers in 1-day. Finally, we take an extra step to reduce the dimension of this problem through a sampling process, namely a low-complexity practical algorithm for selecting the approximation data-points used to approximate inventory overlap at origin nodes.

In summary, the main contributions of this paper are as follows: (i) We introduce the speed-aware network design problem of jointly optimizing network costs and speed-coverage, which we define as the share of unique items for which we can offer a 1-day delivery option. (ii) We introduce a parametric-based modeling approach for enabling this joint optimization, overcoming the lack of a tractable analytical expressions for the number of unique items stored at a set of origin nodes. (iii) We further propose a simple and practical sampling algorithm for reducing the size of the approximation problem, trading off interpolation accuracy in a systematic fashion. (iv) The proposed joint model and approximation strategy are evaluated numerically in extensive tests on representatives scenarios.

1.4 Paper Organization & Notation

The rest of this paper is organized as follows. Section 2 introduces the speed-aware minimum cost Multicommodity Capacitated Network Design problem under study. Section 3 describes the proposed reformulation based on a parametric interpolation, and the related approximation strategy; and Section 4 presents a series of numerical experiments with representative scenarios and different network sizes. We conclude and present future works in Section 5.

Throughout the paper we define the sets of (non-negative) reals and integers as (+n)n and (+n)n, respectively. Vectors are denoted by boldface lowercase letters and sets by uppercase italic letters, e.g., 𝒗n and 𝒞. Given a set of vectors 𝒞={𝒗1,𝒗2}, we denote the cardinality of 𝒞 as |𝒞| and its convex hull as Cvx(𝒞). Finally, we define the vectors of zeros 𝟎nn, ones 𝟏nn, and the unit base vector of zeros having one only in the ith component as 𝒆ni=[0,,0,1,0,,0]n.

2 Model and Problem Formulation

2.1 Minimum cost MCND-U

We first introduce the standard minimum cost Multicommodity Capacitated Network Design with Unsplittable demands (MCND-U) problem, which aims to assign one path to each commodity and open trucks on links to transport orders at minimum cost. Our network is modeled with a directed graph 𝒢=(𝒱,), where 𝒱=𝒪𝒟 is the set of nodes, consisting of the set 𝒪 of nO origins, the set 𝒟 of nD destinations and the set of nH other interim nodes (or hubs), and is the set of directed links. We are given a set 𝒦 of K commodities, where each commodity k=(ok,dk) originates from an origin ok𝒪 towards a destination dk𝒟, and has volume vk0. We are also given a set of active network paths 𝒫, where 𝒫k is used to denote the active paths of commodity k and 𝒫ke𝒫k the active commodity k paths that traverse link e. Our goal is to assign exactly one path to each commodity and we do so using the path selection vector x=(xp{0,1},p𝒫).111We do not need to index variables x with commodities because each path p can only be used by a single commodity, the one that corresponds to the origin-destination pair of the path. Furthermore, the links have capacities that depend on our choice of opened trucks, where opening a truck costs ce on link e and adds capacity Ve+n. We use the truck deployment vector y=(ye𝐙+,e) to decide how many trucks are opened on each link. The minimum cost MCND-U problem is to assign paths to commodities with x and open trucks with y in order to transport the volumes at a minimum cost:

1:minx,y eceye (1)
subject to k𝒦p𝒫kevkxpVeye e,
p𝒫kxp=1 k𝒦,
xp{0,1} k𝒦,p𝒫k,
ye𝐙+ e.

Problem 1 is the standard MCND-U problem studied in the literature, for instance see [13], which is known to be NP–hard [9, 10] – See Section 1.3 for details.

2.2 Modeling network speed

In this work we augment the MCND-U problem with a novel speed model, that captures the number of customer orders that can be delivered within a day, a service that is called Next Day Delivery (NDD), see [4]. Let us consider a destination node d, which serves a number of customers in a given area. Whenever a customer in that area makes an order, we say that the order is covered (with NDD service) if the ordered item is stored in any of the origins that are connected to d with a short path, where a path is said to be short if its total transit time is less than an input parameter; if path p is short we will write wp=1, else it is long and we write wp=0. Notice, we require two conditions for NDD: (i) the ordered item is in the storage of some origin node, and (ii) that origin node is connected to d with a path that is short. First, let us study how we measure NDD coverage in a scenario where (ii) is always satisfied, i.e., for now, we consider networks where all paths are short. We introduce the coverage function Id which counts the number of covered orders at a destination d. It is typically impossible to derive the analytical form of Id, but it is reasonable to estimate Id from available large datasets; using these datasets, one can compute an estimate of the number of covered unique NDD orders.

Refer to caption
Figure 2: Venn diagram of two origin nodes storing items that are being ordered; although the sum of individually unique ordered items stored in the two nodes is 50+60=110, the actual total unique items are only 90 due to overlap between the two inventories.

For example, see Figure 2. Assuming both origins of Figure 2 are connected to d with short paths, Id would evaluate to 90 items in this example, and we may imagine that in more complicated scenarios, the computation of Id would boil down into counting unique orders in large datasets of requests and inventories. The model, however, becomes more interesting when we study networks that include long paths, where condition (ii) is not always satisfied. In such a case, depending on our path assignment decisions x, an origin-destination pair (o,d) may not be able to transport items eligible for NDD. For instance, if (o,d) commodity is assigned a long path, the contribution of o-inventory to Id should not be counted because although the inventory is available, its delivery takes more than one day. We should then adjust the coverage function Id to reflect this. A naive approach would be to adjust it to Id(x) to denote the dependence on path assignment, but instead we use an alternative approach that reduces significantly the dimensions, and hence the complexity of computing this function: we use the dependent speed variables

z=(zod,(o,d)𝒦), (2)

where zod=1 if the path selected to connect o to d is short. Since we have only one assigned path per commodity (i.e., it holds p𝒫kxp=1) it follows that

zod=p𝒫kwpxp,k=(o,d),

where recall that wp=1 if p is short, and 0 otherwise. Using variables z we can write down the form of Id for the toy scenario of Figure 2:

Id(z)=60z1d(1z2d)+50z2d(1z1d)+90z1dz2d. (3)

More broadly, the function will have the form:

Id(z)=s𝒫𝒮(𝒪d)βsiszidjs(1zjd), (4)

where 𝒫𝒮(.) is the power set, 𝒪d is the set of origins that form a commodity with d, and βs is the unique item count of subset s of origins – we may obtain parameters βs by performing counting operations on our large datasets. Note, that this function would need to have many more terms if we were to use path assignment variables x (one additive term for each element in the powerset of active paths), hence we have effectively reduced the number of times we need to count parameters in large datasets by an exponential factor. Also, observe that the form of (4) demonstrates well how deeply intertwined is the speed objective Id(z) with the cost optimization of the MCND-U problem (1), both depending on our choice of x. Some paths may offer better consolidation by mixing volume in consolidation hubs, but because they are longer, they result in smaller NDD coverage. In the next subsection we augment the standard MCND-U problem with our newly introduced speed model.

2.3 Speed-aware MCND-U

The NDD coverage impacts customer satisfaction and shapes long-term revenues. We model this effect with a customer conversion factor γ>0; in practice, this factor can be estimated by analyzing customer behaviors via A/B testing. Therefore, our focus in this paper is to minimize the transportation costs and maximize long-term revenues from NDD coverage, which is formalized in the following optimization program:

2:min𝒙,𝒚,𝒛 eceyeγd𝒟Id(𝒛) (5)
subject to k𝒦p𝒫kevkxpVeye e,
p𝒫kxp=1 k𝒦,
zod=p𝒫kwpxp, k=(o,d)𝒦,
xp{0,1} k𝒦,p𝒫k,
ye𝐙+ e,
zod{0,1} o𝒪,d𝒟.

Problem 2 is extremely difficult to solve. Evidently, it is a generalization of the NP–hard MCND-U, but a major additional complexity factor is function Id. This coverage function is known to be submodular [4], i.e., the more short paths we assign (by switching 𝒛 to 𝒛), the smaller is the benefit Id(𝒛)Id(𝒛) per added short path, due to overlapping inventory between origins. The domain of function Id(𝒛) has 2nO points, each one of which requires a full computation on the large dataset. In fact, it is known that even finding the maximum point of such a function is an NP–hard problem [20, 18], let alone considering it inside a broader optimization problem as in 2. Last, observe that the objective of 2 is non-linear, see for example (4). From previous studies in the space of maximum coverage problem [1] we know that the non-linear components of (4) would make even the continuous relaxation of 2 NP–hard. For realistic e-commerce scenarios with multiple origins serving each destination, and a network with many destinations, solving 2 or even obtaining a lower bound via its continuous relaxation is intractable.

In this paper, we overcome this challenge by leveraging multi-parametric programming [12, 22] to create an interpolation (or continuous extension) of integer functions Id,d𝒟, which in turn allows us to propose a solution methodology for the speed-aware MCND-U.

3 Solution Approach

Our solution methodology is based on replacing the term Id(𝒛) in (5) with a continuous extension, by interpolating integer points using parametric optimization. As we explain below, this interpolation is carefully designed to lead to a concave function, which eventually allows us to reformulate the speed-aware MCND-U as a Mixed-Integer Linear Programming (MILP). Finally, to reduce the dimensions and make the resulting problem tractable, we utilize an additional approximation technique to eliminate many of the integer points, and only consider a subset of them.

Refer to caption
(a) Dataset of speed-path assignments and unique items stored at origin nodes.
Refer to caption
(b) Piecewise linear interpolation of the speed-path assignement from Figure 3(a).
Figure 3: Illustration of the approximation strategy for the example from Figure 2, where a destination d is connected to Origin 1 and Origin 2.

3.1 Parametric interpolation of unique inventory

We will interpolate between the integer points of function Id, which are denoted with z and introduced in (2). For reasons that will become clear below when we present our approximation technique, we will introduce additional notation zdi to enumerate the vector pointing to the ith integer point of Id(𝒛). Hence,

𝒵d={zd1,zd2,,zdncomb}, (6)

is another way to express the domain of Id(z) and we have ncomb=2nO. See for example Figure 3(a).

For any vector zdi, we can use inventory data to compute the number of unique items Id(zdi) for which we can offer NDD at d𝒟. Then, for a vector zd in the convex hull of 𝒵d, we can use the following optimization problem to interpolate Id(zdi):

I~d(zd)=max𝜶𝒅+ncomb i=1ncombαdiId(zdi) (7)
subject to i=1ncombαdizdi=zd,
i=1ncombαdi=1.

Notice that problem (7) is a linear parametric program and therefore I~d is a concave piecewise linear function [6, Chapter 6]; we may think of I~d as an extension of Id, referred to in the literature as the concave closure [24]. We will leverage the concavity of the extended function I~d to reformulate (5) as a MILP. Note, that submodular functions may also be approximated by their convex closure, which is achieved by replacing the min operator with the max in (7). In this work, we have used the concave closure as this allows us to reformulate the speed-aware network design problem (5) as mixed-integer linear minimization problem, whereas using the convex closure would have resulted in a min-max mixed integer optimization problem.

We highlight that the interpolation from (7) is exact at all integer points zd𝒵d, since for each integer point zdj𝒵d we must have that zdj=i=1ncombαdizdi, hence αdj=1 and I~d(zdj)=αdjId(zdj)=Id(zdj). Note that I~d is exact at the integer points only when each zdi𝒵d is an extreme point, i.e., it cannot be expressed as a convex combination of the vectors in {𝒵dzdi}. This condition holds in our case, and yields the following proposition:

Proposition 1.

Let the function I~d be defined by the parametric program (7). For all integer points zd𝒵d we have that

Id(𝒛d)=I~d(𝒛d).

Proof.

Consider the optimization (7) evaluated at zdi, i.e., I~(zdi). By definition we have that zdi are vertices of a hypercube and hence zdiCvx(𝒵dzdi) for all zdi𝒵d. Thus, we have that setting αdi=1 is the unique feasible solution to I~d(zdi). Thus, we conclude that I~d(zdi)=αdiId(zdi)=Id(zd).

In Section 3.2 we combine the parametric extension of Id with the speed-aware MCND-U to obtain a MILP.

3.2 Parametric speed-aware MCND-U

Next, we reformulate the speed-aware MCND-U presented in (5) replacing the term Id(𝒛) with its parametric interpolation I~d(𝒛) from (7):

min𝒙,𝒚,{zd,𝜶d}d𝒟 eceyeγd𝒟i=1ncombαdiId(𝒛di) (8)
subject to k𝒦p𝒫kevkxpVeye e,
p𝒫kxp=1 k𝒦,
zod=p𝒫kwpxp, k=(o,d)𝒦,
i=1ncombαdizdi=zd,i=1ncombαdi=1 d𝒟,
xp{0,1} k𝒦,p𝒫k,
ye𝐙+ e,
zod{0,1} o𝒪,d𝒟,
𝜶d+ncomb d𝒟.

The above MILP is equivalent to the original problem (5), as the interpolation from (7) is exact at integer points, as discussed in Proposition 1. The main advantage of our reformulation is that it can be solved with off-the-shelf solvers. Unfortunately, for each destination d𝒟 there are up to 2ncomb pre-computed vectors zdi that make the reformulation intractable for real-world problems. Thus, in the next section, we propose an algorithm to select only a subset of assignments to reduce the problem dimensionality.

3.3 Approximation Strategy

Estimating the coverage function Id(z) at all integer points z𝒵d from (6) and then solving (8) are both intractable due to the very large number of such points. In this section, we provide heuristics to select a subset of such vectors 𝒵~d𝒵d, to simplify (8). While this approximation introduces some loss in accuracy, we aim to select points that maintain a reasonable balance between computational efficiency and solution quality. Dropping vectors from 𝒵d means that certain paths will always be chosen to be long, and hence, our heuristics attempt to drop the origins that are expected to have the smallest contributions to coverage.

The first step is to rank all origins in terms of individual coverage achieved by taking the short path from this origin and long paths from all others, denoted with Id(𝟏o) for origin o𝒪. Calculating individual coverage is a cheap operation (only nO calculations per destination), and intuitively helps us prioritize origins with large inventory capabilities – notice, however, that due to inventory overlap, choosing the top origins in this regard does not guarantee maximal coverage. Below we use the notation 𝒯d(κ)𝒪 to denote the top κ origins with individual coverage. Note, that κ is a user-defined hyper-parameter, which controls the complexity/performance tradeoff of our heuristic.

Our strategy employs four sets of vectors, 𝒵~d={0,1,2,3} that contain a subset of all possible speed variable assignments. We use the terms “short” and “long” to refer to the speed variable assignments: when we say an origin has a “short” path to a destination (zod=1), it means that origin o can offer next-day delivery to destination d. Conversely, a “long” path (zod=0) indicates that origin o cannot provide next-day delivery to destination d. Based on this terminology we define the following sets:

  1. 1.

    0: all short/long combinations of origins in 𝒯d(κ) (2κ cases).

  2. 2.

    1: all origins in 𝒯d(κ) are short, and one of the remaining origins short (noκ cases).

  3. 3.

    2: all origins in 𝒯d(κ) are long, and one of the remaining origins short (noκ cases).

  4. 4.

    3: all origins in 𝒯d(i),i=κ+1,,no are short and the rest long (noκ cases).

Note that our heuristic strategy has reduced significantly the amount of vectors for which we need to calculate coverage and add to (8), from 2nO down to 2κ+3(nOκ), and we can use κ to control how small this number is, trading off with the achieved accuracy of the optimization. We provide an illustrative example for κ=2 and no=5:

0={ z~d1=[00000],z~d2=[10000],
z~d3=[01000],z~d4=[11000]}.
1={z~d5=[11100],z~d6=[11010],z~d7=[11001]}.
2={z~d8=[00100],z~d9=[00010],z~d10=[00001]}.
3={z~d11=[11100],z~d12=[11110],z~d13=[11111]}.
Algorithm 1 Heuristic to select subset 𝒵~d.

Given 𝒵~d, we use the below program to approximate coverage:

I~d(zd)=max𝜶𝒅 iαdiId(z~di) (9)
subject to iαdiz~di=zd,
iαdi=1.

In the next section, we empirically analyze the effect of our approximation technique on run-time and solution quality.

4 Experiments

We evaluate the proposed approach using a range of representative scenarios. First, we examine how optimizing jointly for speed and costs affects the network topology. Afterwards, we analyze the effect of the approximation from Algorithm 1 on the solution quality. To perform these analyses, we randomly generate networks with nO origins, nD destinations, and 5 intermediate nodes that can be used to consolidate volume. For each origin-destination pair, the demand is randomly generated together with the travel times and transportation costs that are proportional to the traveled distance – the location of all nodes is randomly selected from a uniform distribution. To serve one origin-destination pair, the optimizer can select either a direct path connecting the two nodes, or a path going through one of the five intermediate nodes. We assume that overall there are nit=50nO unique items that we can offer to customers. For each origin i, we generate a random vector vi{0,1}nit, where each jth entry indicates if item j is stored at origin i. If the travel time from an origin o to a destination d is less than max_tt=8 hours, we assume that we can offer NND for all items stored at origin o𝒪.

Table 1: Experimental results for five randomly generated networks. For each network, we run the proposed method for γ{0,0.1,1} to show the effect of revenues from NDD on the network topology.
nO nD Costs Avg. Items App. Avg. Items γ Directs Sol. Time %Gap
10 10 689.1 0 342.8 0 20 58.4 0.06
10 10 698.5 383.7 383.7 0.1 26 10.9 0
10 10 731.9 404 404 1 30 2.6 0.02
10 20 1419.2 0 370.75 - 58 7200 1.85
10 20 1435.5 403.55 403.55 0.1 72 7200 2.51
10 20 1490.4 411.6 411.6 1 80 7200 0.19
20 10 1392.2 0 770.4 0 63 1196.1 0.1
20 10 1413.6 806.5 811.2 0.1 73 1799.2 0.1
20 10 1501.8 839.4 839.4 1 98 438.8 0.1
50 10 3528.2 2010 2032.2 0 149 7200 7.55
50 10 3489.8 2025.3 2037.3 0.1 146 7200 3.65
50 10 3653.4 2092.7 2092.7 1 178 7200 0.63
100 100 62204.2 0 3651.1 0 2254 7200 2.82
100 100 65258.6 4154.37 4172.14 0.1 3335 7200 8.43
100 100 66314.6 4211.57 4211.57 1 3383 7200 0.58
Table 2: Experimental results for five randomly generated network. For each networks, we run the proposed method for γ=0.1 and κ{1,5,10}. As baseline we compute the cost optimal network. Thus, for the baseline (in blue) we do not have approximated revenues and a value for the parameter κ that is not used to in the optimization.
nO nD Costs App. Rev. Rev. Cost - (App. Rev.) Cost - Rev. κ %Gap
10 10 689.1 - 342.8 689.1 346.3 - 0
10 10 698.5 383.7 383.7 314.8 314.8 1 0
10 10 698.5 383.7 383.7 314.8 314.8 5 0
10 10 698.5 383.7 383.7 314.8 314.8 10 0
10 20 1419.2 - 370.7 1419.2 677.7 - 1.8
10 20 1459.7 405.9 407 647.9 645.7 1 3.4
10 20 1434.1 397.8 400.7 638.5 632.7 5 2.8
10 20 1435.5 403.5 403.5 628.4 628.4 10 2.5
20 10 1392.2 - 770.4 1392.2 621.8 - 0.1
20 10 1422.9 807.3 815.7 615.6 607.2 1 0.1
20 10 1422.9 807.3 815.7 615.6 607.2 5 0.1
20 10 1413.6 806.5 811.2 607.1 602.4 10 0.1
50 10 3375.8 - 1845.2 3375.8 1530.6 - 1.8
50 10 3528.2 2010 2032.2 1518.2 1496.0 1 7.5
50 10 3499.0 2031.5 2039.1 1467.5 1459.9 5 3.8
50 10 3489.8 2025.3 2037.3 1464.4 1452.4 10 3.6
100 100 62204.2 - 3651.1 62204.2 25693.2 - 2.8
100 100 65911.4 4159.3 4175.7 24317.9 24154.2 1 11.2
100 100 65531.7 4156.8 4173.1 23963.2 23800.5 5 9.5
100 100 65258.6 4154.3 4172.1 23714.9 23537.2 10 8.4

4.1 Trading-off transportation costs and speed

In this section, we demonstrate through empirical experiments how the parameter γ controls the trade-off between transportation costs and speed-coverage in the network design. Higher values of γ result in networks that provide speed-paths for a broader range of unique items, while lower values prioritize cost minimization by encouraging consolidation through intermediate hubs. Table 1 shows results for five randomly generated networks. For each network, we solve three optimization problems: in the first problem, we optimize only for transportation costs – i.e., we set γ=0; in the second problem we set γ=0.1; and in the third we use γ=1. The solver terminates either when the gap is below 0.1% or the solution time exceeds two hours. As expected, for larger values of γ both transportation costs and the average number of unique items eligible for NDD increase, i.e., the optimizer decides to open more expensive connections to gain revenues from NDD. Note that the number of direct paths (not crossing any intermediate hub) increases with γ, as they have lower transit time and thus are more likely to offer NDD. In all experiments, we set the parameter κ from Algorithm 1 equal to 10, meaning that the approximation from (9) is exact for the unique items delivered by the top 10 origins for each destination – see Section 4.2 for an empirical analysis on the effect of κ on the solution. Indeed, we notice from Table 1 that for nO=10 the approximate average number of unique items eligible for NDD (column Avg. Items App.) matches the exact number (column Avg. Items). On the other hand, for nO=100, the proposed strategy only computes a lower bound to the average number of unique items eligible for NDD at a destination d𝒟.

Table 3: Experimental results showing how the computational time changes as a function of the user defined parameter κ{1,5,10}.
nO nD Cost-(App. Rev) Cost-Rev Pre. Time Sol Time Gap(%) κ napp
20 10 1392.2 621.8 0.08 1184.9 0.1 - 0
20 10 615.6 607.2 0.08 577.8 0.1 1 59
20 10 615.6 607.2 0.11 261.2 0.1 5 89
20 10 607.1 602.4 1.59 3851.9 0.1 10 1081
20 10 602.8 601.2 6.52 3146.9 0.1 12 4153
20 10 599.0 599.0 36.0 6871.8 0.1 14 16441
20 10 599.0 599.0 334.7 6978.5 0.1 16 65593

4.2 The effect of the proposed approximation

In this section, we test how the parameter κ from Algorithm 1 affects the solution quality. Table 2 shows results for five different networks and parameter κ{1,5,10}. The table reports also results for the baseline cost-optimal network (in blue) that is obtained minimizing only the transportation costs, i.e., the optimizer does not approximate revenues from NDD and therefore the parameter κ is not required to run the baseline. For all cases we report the total cost defined as the difference between the transportation costs and revenues from speed-paths222As in Problem 5, revenues from speed-paths are computed by summing all unique items eligible for NDD at each destination and using the conversion factor γ. (column Cost-Rev.). For κ=1, in Algorithm 1 we do not consider different combinations of origins offering NDD at a destination, but we simply sort origins by number of unique items eligible for NDD and we use the heuristic described in Section 3.3. This strategy allows us to prune the origins combinations used to construct the approximation from (9). On the other hand for κ{5,10}, after sorting the origins by the number of unique items eligible for NDD, we consider all possible combination for the top κ origins and for the remaining we leverage the heuristic from Algorithm 1. As discussed in Section 3.3, this approximation allows us to reduce the number of constraints needed to build function (7). However when κ is smaller than the number of origins nO, the number of unique items computed using (9) is only an approximation. This fact is shown in Table 2, where we compare the true revenues (column Rev.) and the approximated revenues (column App. Rev.) obtained by multiplying the number of unique items eligible for NDD by the coefficient γ. Finally, we notice that the total cost (column Cost-Rev.) defined as the difference between the transportation cost and the revenues from NDD decreases for larger value of κ. Note that for κ=1, the approximation from (9) is constructed using napp=59 data points, while for κ=10 the approximation is constructed with napp=1081, i.e., for κ=10 we increase by 94.5% the number of data points used compared with the approximation with κ=1. From the experiments in Table 2, we notice that using 94.5% more data points in the approximation from (9) improves on average the network cost by 1.79%, i.e., as expected a more accurate approximation of revenues from NDD allows us to design a better network by trading off transportation costs and from NDD services. It is also interesting to notice that even for κ=1, our approach is able to reduce the overall cost – i.e., the cost defined as the difference between transportation cost and revenues from speed-paths – by 8.36% compared to the baseline (in blue).

Finally, we investigate the effect of the parameter κ on the computation time. Table 3 shows both solver times and pre-processing times needed to compute the data points napp used to construct the approximation from (9). We notice that increasing κ leads to higher pre-processing and solver time, as more data points are used for approximating the number of unique items eligible for NDD. However, we notice that increasing κ from 10 to 16 results in only a 0.57% cost improvement, while the computation cost increases by 1.9x. This result highlights the effectiveness of the approximation from Algorithm 1, where we ranked origins by the number of unique items eligible for NDD and considered all possible combination for a subset of them.

5 Conclusion

In this paper, we consider the problem of jointly optimizing transportation costs and the share of inventory with a Next Day Delivery (NDD) option. Compared to previous methods, we account for overlapping inventory at origin nodes and how it affects the selection of speed-paths, i.e., origin-destination connections with a short travel time that enable NDD. The inventory is modeled with a coverage function, that requires the computation of a very large number of integer points. To tackle this complexity, we present an approach based on parametric optimization to construct a continuous extension of the inventory coverage function. Such an approximation requires solving a parametric linear program where the number of constraints increase exponentially with the number of origin nodes. To mitigate this issue, we present a sampling algorithm to tradeoff the accuracy of our approximation with computational complexity. The efficacy of the proposed approach is demonstrated on randomly generated networks, where we show that our strategy beats the baseline approach that computes a cost optimal network without optimizing for revenues from NDD.

References

  • [1] Alexander A Ageev and Maxim I Sviridenko. Pipage rounding: A new method of constructing algorithms with proven performance guarantee. Journal of Combinatorial Optimization, 8:307–328, 2004. doi:10.1023/B:JOCO.0000038913.96607.C2.
  • [2] Bernd Bank, Jürgen Guddat, Diethard Klatte, Bernd Kummer, and Klaus Tammer. Non-linear parametric optimization, volume 58. Walter de Gruyter GmbH & Co KG, 1982.
  • [3] Cynthia Barnhart and Rina R Schneur. Air network design for express shipment service. Operations Research, 44(6):852–863, 1996. doi:10.1287/OPRE.44.6.852.
  • [4] Konstantinos Benidis, Georgios Paschos, Martin Gross, and George Iosifidis. Middle-mile optimization for next-day delivery. arXiv preprint arXiv:2310.18388, 2023.
  • [5] Natashia L Boland and Martin WP Savelsbergh. Perspectives on integer programming for time-dependent models. Top, 27(2):147–173, 2019.
  • [6] Francesco Borrelli, Alberto Bemporad, and Manfred Morari. Predictive control for linear and hybrid systems. Cambridge University Press, 2017.
  • [7] James F Campbell. Hub location for time definite transportation. Computers & Operations Research, 36(12):3107–3116, 2009. doi:10.1016/J.COR.2009.01.009.
  • [8] Valentina Carbone, Aurélien Rouquet, and Christine Roussat. The rise of crowd logistics: a new way to co-create logistics value. Journal of Business Logistics, 38(4):238–252, 2017.
  • [9] Sunil Chopra, Itzhak Gilboa, and S Trilochan Sastry. Source sink flows with capacity installation in batches. Discrete Applied Mathematics, 85(3):165–192, 1998. doi:10.1016/S0166-218X(98)00024-9.
  • [10] Bernard Fortz, Luís Gouveia, and Martim Joyce-Moniz. Models for the piecewise linear unsplittable multicommodity flow problems. European Journal of Operational Research, 261(1):30–42, 2017. doi:10.1016/J.EJOR.2017.01.051.
  • [11] Ioannis Fragkos, Jean-François Cordeau, and Raf Jans. Decomposition methods for large-scale network expansion problems. Transportation Research Part B: Methodological, 144:60–80, 2021.
  • [12] Tomas Gal and Josef Nedoma. Multiparametric linear programming. Management Science, 18(7):406–422, 1972.
  • [13] Bernard Gendron, Teodor Gabriel Crainic, and Antonio Frangioni. Multicommodity capacitated network design. In Telecommunications network planning, pages 1–19. Springer, 1999.
  • [14] Lacy M Greening, Mathieu Dahan, and Alan L Erera. Lead-time-constrained middle-mile consolidation network design with fixed origins and destinations. Transportation Research Part B: Methodological, 174:102782, 2023.
  • [15] Mike Hewitt and Fabien Lehuede. New formulations for the scheduled service network design problem. Transportation Research Part B: Methodological, 172:117–133, 2023.
  • [16] Mathias A Klapp, Alan L Erera, and Alejandro Toriello. The one-dimensional dynamic dispatch waves problem. Transportation Science, 52(2):402–415, 2018. doi:10.1287/TRSC.2016.0682.
  • [17] Vienna Klein and Claudius Steinhardt. Dynamic demand management and online tour planning for same-day delivery. European Journal of Operational Research, 307(2):860–886, 2022. doi:10.1016/J.EJOR.2022.09.011.
  • [18] Andreas Krause and Daniel Golovin. Submodular function maximization. Tractability, 3(71-104):3, 2014.
  • [19] Cristiana L Lara, Jochen Koenemann, Yisu Nie, and Cid C de Souza. Scalable timing-aware network design via lagrangian decomposition. European Journal of Operational Research, 309(1):152–169, 2023. doi:10.1016/J.EJOR.2023.01.018.
  • [20] George L Nemhauser, Laurence A Wolsey, and Marshall L Fisher. An analysis of approximations for maximizing submodular set functions—i. Mathematical programming, 14:265–294, 1978. doi:10.1007/BF01588971.
  • [21] Henrik Pålsson, Fredrik Pettersson, and Lena Winslott Hiselius. Energy consumption in e-commerce versus conventional trade channels-insights into packaging, the last mile, unsold products and product returns. Journal of cleaner production, 164:765–778, 2017.
  • [22] Iosif Pappas, Dustin Kenefake, Baris Burnak, Styliani Avraamidou, Hari S Ganesh, Justin Katz, Nikolaos A Diangelakis, and Efstratios N Pistikopoulos. Multiparametric programming in process systems engineering: Recent developments and path forward. Frontiers in Chemical Engineering, 2:620168, 2021.
  • [23] Sadegh Shahmohammadi, Zoran JN Steinmann, Lau Tambjerg, Patricia van Loon, JM Henry King, and Mark AJ Huijbregts. Comparative greenhouse gas footprinting of online versus traditional shopping for fast-moving consumer goods: A stochastic approach. Environmental science & technology, 54(6):3499–3509, 2020.
  • [24] Jan Vondrák, Chandra Chekuri, and Rico Zenklusen. Submodular function maximization via the multilinear relaxation and contention resolution schemes. In Proceedings of the forty-third annual ACM symposium on Theory of computing, pages 783–792, 2011. doi:10.1145/1993636.1993740.
  • [25] Haotian Wu, Ian Herszterg, Martin Savelsbergh, and Yixiao Huang. Service network design for same-day delivery with hub capacity constraints. Transportation Science, 57(1):273–287, 2023. doi:10.1287/TRSC.2022.1155.
  • [26] Yu Yao, Xiaoning Zhu, Hongyu Dong, Shengnan Wu, Hailong Wu, Lu Carol Tong, and Xuesong Zhou. ADMM-based problem decomposition scheme for vehicle routing problem with time windows. Transportation Research Part B: Methodological, 129:156–174, 2019.
  • [27] Baris Yildiz, Hande Yaman, and Oya Ekin Karasan. Hub location, routing, and route dimensioning: Strategic and tactical intermodal transportation hub network design. Transportation Science, 55(6):1351–1369, 2021. doi:10.1287/TRSC.2021.1070.
  • [28] Barış Yıldız and Martin Savelsbergh. Optimizing package express operations in china. European Journal of Operational Research, 300(1):320–335, 2022. doi:10.1016/J.EJOR.2021.09.035.