Approximating Prize-Collecting Variants of TSP

Alimi, Morteza; Mömke, Tobias; Ruderer, Michael

doi:10.4230/LIPIcs.MFCS.2025.7

Approximating Prize-Collecting Variants of TSP

Morteza Alimi

University of Augsburg, Germany Tobias Mömke

University of Augsburg, Germany Michael Ruderer

University of Augsburg, Germany

Abstract

We present an approximation algorithm for the Prize-collecting Ordered Traveling Salesman Problem (PCOTSP), which simultaneously generalizes the Prize-collecting TSP and the Ordered TSP. The Prize-collecting TSP is well-studied and has a long history, with the current best approximation factor slightly below $1.6$ , shown by Blauth, Klein and Nägele [IPCO 2024]. The best approximation ratio for Ordered TSP is $\frac{3}{2}+\frac{1}{e}$ , presented by Böhm, Friggstad, Mömke, Spoerhase [SODA 2025] and Armbruster, Mnich, Nägele [Approx 2024]. The former also present a factor 2.2131 approximation algorithm for Multi-Path-TSP.

We present a $2.097$ -approximation algorithm for PCOTSP, which is, to the best of our knowledge, the first result for this problem. Key ideas in our approach are to sample a set of trees and then to probabilistically pick up some vertices, and to use the pruning ideas of Blauth, Klein, Nägele [IPCO 2024] on the sampled vertices. While the sampling probability of vertices for our problem is lower than for PCTSP, intuitively leaving less spare penalty to spend, we leverage the cycle structure induced by the sampled trees together with a simple combinatorial algorithm to bring the approximation factor below 2.1.

Our techniques extend to Prize-collecting Multi-Path TSP, building on results from Böhm, Friggstad, Mömke, Spoerhase [SODA 2025], leading to a $2.41$ -approximation.

Keywords and phrases:

Approximation Algorithms, TSP

Copyright and License:

2012 ACM Subject Classification:

Theory of computation

\rightarrow

Approximation algorithms analysis

Related Version:

Previous Version: https://arxiv.org/abs/2411.14994

Funding:

Partially supported by DFG Grant 439522729 (Heisenberg-Grant).

DOI:

10.4230/LIPIcs.MFCS.2025.7

Event:

50th International Symposium on Mathematical Foundations of Computer Science (MFCS 2025)

Editors:

Paweł Gawrychowski, Filip Mazowiecki, and Michał Skrzypczak

Series and Publisher:

Leibniz International Proceedings in Informatics, Schloss Dagstuhl – Leibniz-Zentrum für Informatik

1 Introduction

The metric Traveling Salesman Problem (TSP), which asks for a shortest closed tour (i.e., a Hamiltonian cycle) in a metric space $(V,c),\ c:V\times V\to\mathbb{Q}_{+}$ on an $n$ element vertex set $V$ visiting each vertex exactly once¹¹1Note that the problem can alternatively be defined as having a weighted graph as input, and seeking to find a shortest closed walk (or shortest Eulerian multi-subgraph) that visits each vertex at least once. is one of the most well-studied problems in combinatorial optimization in its various incarnations. Christofides [15] and Serdjukov [23] gave a simple $\frac{3}{2}$ approximation algorithm for symmetric (undirected) TSP; an approximation factor slightly below $\frac{3}{2}$ was provided by Karlin, Klein and Oveis Gharan [20].

The path versions of TSP have also been subject to intense study. This line of study led to a surprising outcome: Traub, Vygen, and Zenklusen [25] show that for any $\epsilon>0$ , there is a reduction from path-TSP to TSP which only loses $\epsilon$ in the approximation factor. In their book, Traub and Vygen [24] simplify the reduction and use Multi-Path-TSP (see below) as a building block. The book gives a comprehensive overview of the aforementioned results and more, including asymmetric (directed) variants of TSP.

A more general and more practical version of the problem can be defined by allowing the tour to omit some cities by paying some additional penalty. This prize collecting paradigm has been intensely studied for various combinatorial optimization problems; see, e.g., [1, 2] for some recent results regarding Prize-collecting Steiner Forest and Steiner Tree Problems. As regards the Prize-collecting Traveling Salesman Problem (PCTSP), Bienstock et al. [7] give a $2.5$ -approximation for this problem. Goemans and Williamson [18] give a factor 2 approximation. The first group to break the 2 barrier was Archer et al. [3], who achieved a factor of $1.979$ . Goemans [17] observed that by carefully combining the two aforementioned algorithms, one can achieve an approximation factor of $1.91$ . Blauth and Nägele [9] gave a factor $1.774$ -approximation. Blauth, Klein, and Nägele [8] achieved a factor of $1.599$ .

Another important generalization of TSP is attained by enforcing some ordering on a subset of vertices. Formally, in the metric Ordered Traveling Salesman Problem (OTSP), we are given a metric graph $G=(V,E),c\colon E\to\mathbb{R}_{+}$ together with $k$ terminals $O=\{o_{1},\ldots,o_{k}\}$ , and the objective is to find a shortest tour that visits the terminals in order. OTSP is closely related to vehicle routing problems with pickup and delivery and the dial-a-ride problem; see [22, 6, 14, 16, 19]. Any $\alpha$ -approximation for TSP can be utilized to give an $(\alpha+1)$ -approximation algorithm for OTSP by finding an $\alpha$ -approximate tour on $V\setminus O$ and combining it with the cycle on $O$ [10]. Furthermore, there is a combinatorial $(2.5-2/k)$ -approximation algorithm [11]. A substantial improvement in the approximation factor to $\frac{3}{2}+\frac{1}{e}$ was achieved by [13] and [4].

In Multi-Path Traveling Salesman Problem (Multi-Path-TSP), in addition to a weighted graph, we are given a list of $2k$ terminals $\mathcal{T}=\{(s_{1},t_{1}),\ldots,(s_{k},t_{k})\}$ . The objective is to find $k$ paths $P_{i},\ 1\leq i\leq k$ of minimum total length, covering all vertices of the graph (see also Section 16.4 of [24]). Böhm, Friggstad, Mömke and Spoerhase [13] give a factor $2.2131$ approximation algorithm for the problem.

1.1 Our Contribution and Overview of Techniques

As mentioned above, the prize-collecting setting is a standard setting for studying algorithmic problems. Theoretically, this is closely related to the Lagrangian relaxation paradigm, where constraints are allowed to be violated by paying some penalty. It is also natural from a practical perspective, where it is often possible to refuse to cover some entity if it incurs too high a cost, paying them some predefined penalty instead. In this article, we study OTSP and Multi-Path-TSP in this setting, which can also be viewed as generalizations of PCTSP.

Definition 1.

In the Prize-collecting Ordered Traveling Salesman Problem (PCOTSP) we are given a metric weighted complete graph with penalties for vertices $G=(V,E),\ c\colon E\to\mathbb{R}_{+},\pi:V\to\mathbb{R}_{+}$ , together with $k$ terminals $O=\{o_{1},\ldots,o_{k}\}$ . The objective is to find a tour $C$ traversing $o_{1},\ldots,o_{k}$ in order, that minimizes $c(C)+\sum_{v\not\in V(C)}\pi(v)$ .

Definition 2.

In the Prize-collecting Multi-Path Traveling Salesman Problem (PC-Multi-Path-TSP), we are given a metric weighted complete graph with vertex penalties $G=(V,E),\ c\colon E\to\mathbb{R}_{+},\pi\colon V\to\mathbb{R}_{+}$ , and a set of $k$ terminal pairs $\mathcal{T}=\{(s_{1},t_{1}),\ldots,(s_{k},t_{k})\}$ . The objective is to find $k$ paths $P_{i},1\leq i\leq k$ , such that $P_{i}$ is a path with endpoints $s_{i},t_{i}$ , minimizing $\sum_{i=1}^{k}c(P_{i})+\sum_{v\not\in(\cup V(P_{i}))}\pi(v)$ .²²2 Similar to Path-TSP, we consider the $k$ terminal pairs in Multi-Path TSP as part of what defines the problem which cannot be relaxed. This is also consistent with standard practical applications where e.g. the start and finish depots cannot be dropped. Hence in PC-Multi-Path-TSP and its special case, PCOTSP, we assume the penalties for terminals are $\infty$ . The techniques in this paper do not seem to extend directly to the case where terminals are allowed to be dropped.

Theorem 3.

There is a $2.097$ -approximation algorithm for PCOTSP.

Theorem 4.

There is a $2.41$ -approximation algorithm for PC-Multi-Path-TSP.

In the remainder of this section we provide an overview of our algorithms and techniques for PCOTSP and PC-Multi-Path-TSP. The details of the algorithms and the proofs of correctness are covered in the following sections.

In Section 2, we specify the LP relaxation (OLP) of PCOTSP. Our algorithm first solves the LP, and then it samples a tree $T_{i}$ for each of the $k$ parts of the LP solution, in the manner of Lemma 2 in [8], and Theorem 4 in [13], which are both based on a result of Bang-Jensen, Frank and Jackson [5] and Post and Swamy [21]³³3Post and Swamy showed that the Bang-Jensen, Frank, Jackson decomposition can be computed in polynomial time. (see Lemma 6). The expected cost of each tree is at most the corresponding part of the LP solution, and its expected coverage is at least that of the corresponding $y$ values. Now a key idea in our algorithm is to even out the penalty ratios paid by the algorithm. This is implemented in two ways.

1.

Probabilistically picking up some of the vertices left out of the sampled trees (inevitably increasing the connection cost). This is achieved through a pick-up threshold, which determines the vertices which should be picked up (with a suitable probability), based on the LP solution.
2.

Utilizing the spare penalty ratios of sampled vertices to bring down the cost of parity correction.

The second point is realized through an adaptation of the pruning idea from [8] to probabilistically prune away portions of the resultant structure with low connectivity. This allows us to assign weights to the edges of the tree, which, when combined with a suitable multiple of the LP solution, gives a point in the $Q$ -join polytope, where $Q$ is the set of odd degree vertices of the current structure. This allows for cheaper parity correction, because the tree edges with higher assigned weights, which are associated with lower value cuts, are pruned with higher probability, and thus the expected cost incurred by them remains low.

The prominent difference between our problem and the standard PCTSP, which necessitates finding new ideas for pruning, is the sampling probability for vertices. In PCTSP, the sampling probability for each vertex $v$ is equal to $y_{v}$ , hence the expected penalty ratio for $v$ is one, and intuitively, there is lots of spare penalty for $v$ to utilize for pruning. But in PCOTSP, the sampling probability of a vertex $v$ is lower (bounded from below by $1-e^{-y_{v}}$ ); hence there is little spare penalty ratio left (and for fewer vertices) to utilize for pruning. We obtain an improved adjustment of the pruning by combining our algorithm with a version of the classical algorithm for OTSP [10], which combines the cycle on terminals with a $\frac{3}{2}$ -approximation for TSP on the remaining vertices. If the length of the cycle on terminals is low, this algorithm already gives a good approximation ratio. If the length of the cycle is large, this fact can be utilized to improve the analysis for parity correction, because we can show that there is no need to assign (nonzero) weights to the cycle edges.

We believe that this view of evening out the penalty ratio of vertices vis-a-vis the optimal LP solution is a useful conceptual tool for approaching prize collecting versions of TSP or other combinatorial optimization problems. In the case of PCOTSP, it is partly realized through the idea of probabilistically including in the solution those vertices which currently have a too high penalty ratio (based on the desired approximation factor). Squeezing out the slack penalty ratio (again, based on the target approximation factor), is achieved through pruning. Here, the specific structure of the OTSP, and the partially constructed solution proves useful; we can show that if the current cycle through the terminals is small, combining the cycle on terminals with the best PCTSP approximation for the rest of vertices gives a good factor, while a large cycle improves the cost of pruning.

In the case of Multi-Path-TSP, Böhm, Friggstad, Mömke, and Spoerhase [13] propose two algorithms, and show that a careful combination of them leads to a good approximation. The first one, which doubles all edges except those on $s_{i},t_{i}$ paths (and is good when the sum of $s_{i},t_{i}$ distances is large) can be adapted to our setting by probabilistically picking up vertices with high penalty ratio, as opposed to picking up all remaining vertices in [13]. We replace their second algorithm, which finds a minimum length forest in which each terminal appears in exactly one of the components and then adds direct $s_{i},t_{i}$ edges with an algorithm that uses Lemma 6 to sample a tree from the related PCTSP, and then adds direct $s_{i},t_{i}$ edges.

2 Prize-collecting Ordered TSP

In this section we describe our algorithm for PCOTSP. Some of the technical proofs will be presented in the following sections.

2.1 Preliminaries and Definitions

In the OTSP, a tour can be decomposed into $k$ paths, between $o_{i}$ and $o_{i+1}$ .⁴⁴4For notational convenience, we identify $o_{k+1}$ with $o_{1}$ . We can take the polyhedron determined by the following inequalities as the relaxation of one such path between two vertices $s$ and $t$ .⁵⁵5 Similar to the LP of [4]; see also [12, 8]. For each vertex $v$ , the variable $y_{v}$ indicates its fractional degree in the stroll.

\begin{array}[]{lrlll}&y_{s}=y_{t}&=\frac{1}{2}&\\[2.0pt] &x(\delta(v))&=2y_{v}&\forall v\in V\\[2.0pt] &x(\delta(S))&\geq 1&\forall S\subseteq V\setminus\{t\},s\in S\\[2.0pt] &x(\delta(S))&\geq 2y_{v}&\forall v\in S\subseteq V\setminus\{s,t\}\\[2.0pt] &x,y&\geq 0&\end{array}\quad\quad\textbf{($s$-$t$-stroll relaxation)}

Now, the PCOTSP can be modeled as the following linear program. For $i=1,\ldots k$ we define $x_{i}$ and $y_{i}$ to be the vectors $(x_{i,e})_{e\in E}$ and $(y_{i,v})_{v\in V}$ , respectively. For each $i$ , the vector $(x_{i},y_{i})$ is constrained to be a feasible point in the $o_{i}$ - $o_{i+1}$ -stroll relaxation, the sum $y_{v}=\sum_{i=1}^{k}y_{i,v}$ over all fractional degrees of a vertex $v$ is an indicator of to what degree $v$ is used in the solution. When $v$ is not fully used, i.e., when $y_{v}<1$ , the LP has to pay a fractional penalty of $\pi_{v}(1-y_{v})$ . We will usually refer to the pair $(o_{i},o_{i+1})$ as $(s_{i},t_{i})$ , to emphasize that $(x_{i},y_{i})$ is a fractional tour/stroll from $o_{i}$ to $o_{i+1}$ .

\begin{array}[]{llll}\text{minimize}&\lx@intercol\displaystyle\sum\limits_{e% \in E}\sum\limits_{i=1}^{k}c_{e}x_{i,e}+\displaystyle\sum\limits_{v\in V}\pi_{% v}(1-y_{v})\hfil\lx@intercol\\[12.0pt] \text{subject to \quad}&\displaystyle y_{v}=\sum\limits_{i=1}^{k}y_{i,v}&% \forall v\in V\\[12.0pt] &(x_{i},y_{i})\text{ lies in the }s_{i}\text{-}t_{i}\text{-stroll relaxation}&% \forall i\in[k]\end{array}\quad\quad\quad\text{(OLP)}

Note that every feasible solution to (OLP) has $y_{o}=1$ for all terminals $o$ . Similar to $y_{v}$ , we will use $x_{e}$ as a shorthand for $\sum_{i=1}^{k}x_{i,e}$ . It follows immediately from the LP constraints that $x(\delta(S))\geq 2y_{v}$ for any $v\in S\subseteq V\setminus O$ . By contracting all terminals $o\in O$ into a single vertex $r$ , we therefore obtain the relaxation of the normal PCTSP from [8], which also involves a root, which (without loss of generality; see, e.g., [3]) is required to be in the tour. Given a solution $(x,y)$ to (OLP) and some threshold $\rho\in[0,1]$ , we define $V_{\rho}\coloneqq\{v\in V\mid y_{v}\geq\rho\}$ .

2.2 A simple algorithm

We consider the following simple algorithm for PCOTSP, inspired by [10]. First, directly connect the terminals $o_{1},\ldots,o_{k}$ in order, creating a simple cycle $\hat{C}$ . Then, compute a solution to the PCTSP on the same instance. Since all terminals have infinite penalty, every tour $T$ obtained in this way includes all terminals. We obtain an ordered tour of cost $c(T)+c(\hat{C})$ by following the original cycle $\hat{C}$ and grafting in the tour at an arbitrary terminal $o\in O$ .

Using an ${\hat{\alpha}}$ -approximation algorithm to solve the PCTSP, we know that the sum of tour- and penalty costs for this solution is at most ${\hat{\alpha}}\cdot\text{opt}_{\text{PCTSP}}\leq{\hat{\alpha}}\cdot\text{opt}_% {\text{PCOTSP}}$ . Since $c(\hat{C})\leq\text{opt}_{\text{PCOTSP}}$ , this immediately implies a $(1+{\hat{\alpha}})$ -approximation algorithm for PCOTSP. One can see that this algorithm performs even better if we can guarantee that $\hat{C}$ is small. To be precise, for any $\alpha\geq{\hat{\alpha}}$ we obtain an $\alpha$ -approximation provided that $c(\hat{C})\leq(\alpha-{\hat{\alpha}})\text{opt}_{\text{PCOTSP}}$ . We therefore may assume $c(\hat{C})\geq(\alpha-{\hat{\alpha}})\text{opt}_{\text{PCOTSP}}$ in the analysis of our main algorithm. The currently best value for ${\hat{\alpha}}$ is the approximation guarantee of (roughly) 1.599 obtained by [8].

2.3 Overview of our main algorithm

Fix $\alpha=2.097$ . We first solve (OLP)⁶⁶6The separation oracle for the LP boils down to separating subtour elimination constraints. Hence the LP can be solved in polynomial time using the ellipsoid algorithm., to get an optimal solution $(x^{\ast},y^{\ast})$ . Using the following Lemma 5, we then split off the vertices $v$ for which $y_{v}\leq\theta$ , for a parameter $\theta$ to be determined later, to get a solution $(\hat{x},\hat{y})$ for the LP were the remaining vertices have a certain minimum connectivity to the terminals. Vertices that have been split off will not be used in our final tour. Instead, we pay the full penalty for them. We state the following lemma for PCOTSP, as the proof is identical to the one for PCTSP.

Lemma 5 (Splitting off [8]).

Let $(x,y)$ be a feasible solution to the PCOTSP relaxation and let $S\subseteq V\setminus O$ . Then we can efficiently compute another feasible solution $(x^{\prime},y^{\prime})$ in which $y^{\prime}_{v}=0$ for all $v\in S$ , but $y^{\prime}_{v}=y_{v}$ for all $v\not\in S$ , and $c(x^{\prime})\leq c(x)$ .

Lemma 5 ensures that we can remove vertices from the support of $x$ , without increasing the cost of $x$ . Since we only split off vertices for which $y^{\ast}_{v}$ is relatively low, we can afford to pay the additional penalties if we set $\theta=1-\frac{1}{\alpha}$ (we will prove this in Section 3.1). We proceed by sampling a set of trees based on $(\hat{x},\hat{y})$ .

Lemma 6 ([8], following [5]).

Suppose $(x,y)$ is a feasible point in the $s$ - $t$ -stroll relaxation. In polynomial time we can find a set of trees $\mathcal{T}$ and weights $(\mu(T))_{T\in\mathcal{T}}$ such that (i) $\sum_{T\in\mathcal{T}}\mu(T)=1$ ; and (ii) $\sum_{T\in\mathcal{T}:e\in E(T)}\mu(T)\leq x_{e}\quad\forall e\in E$ ; and (iii) $\sum_{T\in\mathcal{T}:v\in V(T)}\mu(T)\geq y_{v},\quad\forall v\in V\setminus% \{s,t\}$ ; and (iv) all trees span $s$ and $t$ .

Lemma 6 can be used to sample a random tree of expected cost at most $c(x)$ which contains each vertex $v$ with probability at least $y_{v}$ and is guaranteed to connect $s$ to $t$ . It is straightforward to see that this can be achieved by choosing each tree $T\in\mathcal{T}$ with probability $\mu(T)$ .

For each component $(\hat{x}_{i},\hat{y}_{i})$ of our solution, we apply Lemma 6 and sample a tree $T_{i}$ as we have described. Define $T:={\dot{\bigcup}}_{i=1}^{k}T_{i}$ . Note that $T$ is no longer a tree, and it might even have repeated edges.

2.3.1 Pruning and Picking up

To get some intuition on the the following steps, suppose that we had to pay the penalty for a vertex $v$ if and only if $v$ has not been sampled in $T$ . In Lemma 10, we will show that the probability for this is at most $e^{-y^{\ast}_{v}}$ , which gives us a bound of $e^{-y_{v}^{*}}\pi_{v}$ on the expected penalty cost of $v$ . The LP however pays a $(1-y^{\ast}_{v})$ -fraction of $\pi_{v}$ . We thus define the penalty ratio $\rho_{v}$ of a vertex $v$ as $\rho_{v}=\frac{e^{-y^{\ast}_{v}}}{1-y^{\ast}_{v}}$ and define $\sigma_{0}$ as unique solution to the equation $\alpha(1-{\sigma_{0}})=e^{-{\sigma_{0}}}$ . With this, if $y_{v}^{\ast}={\sigma_{0}}$ , we have $\rho_{v}=\alpha$ , i.e., our expected penalty for $v$ is at most $\alpha$ times the LP penalty.

Note that in the PCTSP regime, e.g., in the result of [8], the probability that a vertex $v$ is in the (one) sampled tree is exactly $y^{\ast}_{v}$ , which means that $\rho_{v}=1<\alpha$ for every $v$ . But in our setting, the probability of a vertex not being sampled is [bounded from above by] $e^{-y^{\ast}_{v}}$ , which intuitively means that the spare penalties are much more constrained, and the gains would be more meager. Nonetheless, we show how to utilize the specific structure of our problem to gain almost as much from pruning as in the setting of PCTSP.

We will now describe how we deal with vertices whose $y^{\ast}$ -value is smaller or larger than ${\sigma_{0}}$ . First, we consider those vertices for which $y^{\ast}_{v}<\sigma_{0}$ (and thus $\rho_{v}<\alpha$ ). These vertices $v$ have some spare penalty ratio, which we utilize to prune $v$ with certain probability.

For our pruning step, we need the following definition from [8]:

Definition 7.

For a fixed LP solution $(x,y)$ , a tree $T$ , and a threshold $\gamma\in[0,1]$ , we define $\text{core}(T,\gamma)$ as the inclusion-wise minimal subtree of $T$ that spans all vertices in $V(T)\cap V_{\gamma}$ .

Algorithmically, $\text{core}(T,\gamma)$ can be obtained by iteratively removing leaves $v$ with $y_{v}<\gamma$ . To prune a tree $T_{i}$ simply means to replace it by $T_{i}^{\prime}=\text{core}(T_{i},\gamma)$ . We emphasize that for pruning $T_{i}$ , we do not consider the local penalty values $y_{i,v}$ , but the global values $y_{v}$ . Furthermore, our construction ensures that $s_{i}$ as well as $t_{i}$ are part of the pruned tree $T_{i}^{\prime}$ (remember that $y^{\ast}_{s_{i}}=y^{\ast}_{t_{i}}=1$ ). We will draw the global pruning threshold $\gamma$ according to a suitable distribution $D_{\gamma}$ which is defined by the following cumulative probability function: $F_{\gamma}(y)=\text{Pr}\bigl{[}\gamma\leq y\bigr{]}\coloneqq\frac{1-\alpha(1-y% )}{1-e^{-y}}$ . We will prove in Section 3.1 that this ensures that the slack in the penalty ratios is fully utilized.

We continue with those vertices for which $y_{v}>\sigma_{0}$ and describe our pickup step. For vertices $v$ with $y_{v}>\sigma_{0}$ , the penalty ratio is greater than $\alpha$ . In fact, their penalty ratio becomes arbitrarily high as $y^{\ast}_{v}$ approaches $1$ . Intuitively, this means that the probability of these vertices to not be sampled is too high. We call these vertices critical and define $V_{\sigma_{0}}=\{v\in V\mid y^{\ast}_{v}\geq{\sigma_{0}}\}$ . We pick up these critical vertices with certain probability. That means, we will probabilistically select some unsampled critical vertices and connect them to our solution. We first describe how these vertices are selected and then give the details on how we connect them.

Note that by our observation on $\rho_{v}$ , the chance for picking up a fixed unsampled critical vertex $v$ should increase with the value of $y^{*}_{v}$ . Our strategy is therefore the following: As we did for our pruning threshold, we will draw a global pickup threshold $\sigma\in[\sigma_{0},1]$ from a distribution $D_{\sigma}$ . We then pick up all critical vertices whose $y$ -value is above $\sigma$ , i.e., all vertices in $V_{\sigma}\setminus V(T)$ .

The distribution $D_{\sigma}$ is determined by the cumulative probability function $F_{\sigma}$ which we state in the following. Choosing $\sigma$ according to this function will ensure that for any $v\in V_{\sigma_{0}}$ the probability of being picked up is just high enough so that the expected penalty paid for $v$ is at most $\alpha$ times the fractional penalty $\pi_{v}(1-y^{\ast})$ paid by the LP solution. We will formally prove this fact in Section 3.1. For now, we simply define $F_{\sigma}(y)\coloneqq 1-\frac{\alpha(1-y)}{e^{-y}}$ . We remark that by definition of ${\sigma_{0}}$ , we have $\frac{\alpha(1-y)}{e^{-y}}=\frac{e^{y}(1-y)}{e^{\sigma_{0}}(1-{\sigma_{0}})}$ , from which it is easy to verify that $F_{\sigma}$ is indeed a cumulative probability function, i.e., $F_{\sigma}({\sigma_{0}})=0$ , $F_{\sigma}(1)=1$ and $F_{\sigma}$ is monotonously increasing on $[{\sigma_{0}},1]$ . To properly describe how we connect $V_{\sigma}\setminus T$ to $T$ , we use the following definition from [13].

Definition 8.

Let $X\subseteq Y\subseteq V$ . An $X$ -rooted spanning forest of $Y$ is a spanning forest of $Y$ such that each of its connected components contains a vertex of $X$ .

A cheapest $X$ -rooted spanning forest of $Y$ can be efficiently computed by contracting $X$ , computing a minimal spanning tree of $Y$ in the contracted graph and then reversing the contraction.

For our pickup step, we buy the cheapest forest ${F_{P}}$ that spans $V_{\sigma}$ and is rooted in ${V_{\sigma}\cap V(T)}$ . This forest ${F_{P}}$ has two crucial properties: (i) after buying ${F_{P}}$ , each vertex $v\in V_{\sigma}\setminus T$ will be connected to $T$ ; and (ii) ${F_{P}}$ only spans vertices with $y^{\ast}_{v}\geq\sigma$ . Property (ii) follows immediately from the definition of ${F_{P}}$ , and property (i) follows from the fact that $O\subseteq V_{\sigma}\cap T$ for any choice of $\sigma$ .

Finally, we remark that our pickup and pruning steps target the disjoint vertex sets $\{v\in V\mid y^{\ast}_{v}\geq\sigma_{0}\}$ and $\{v\in V\mid\theta\leq y^{\ast}_{v}<\sigma_{0}\}$ , so neither step interferes with the other.

Algorithm 1 The Prize-collecting Ordered TSP Algorithm.

2.3.2 Obtaining an ordered tour

To turn $T^{\prime\prime}$ into a feasible tour, we first need to correct parities, i.e., we ensure that every vertex has an even degree. Let $H$ be the graph that is obtained by adding a cheapest $\text{odd}(T^{\prime\prime})$ -join to $T^{\prime\prime}$ , where $\text{odd}(T^{\prime\prime})$ is defined as the set of odd degree vertices of $T^{\prime\prime}$ . Observe that initially, each tree $T_{i}$ contains a path $P_{i}$ between $o_{i}$ and $o_{i+1}$ . Since all terminals $o\in O$ have $y^{\ast}_{o}=1$ , all of these paths survive the pruning step. So the multigraph $H$ still contains $k$ edge-disjoint $o_{i}$ – $o_{i+1}$ –paths which, taken together, form a closed walk $C$ . By removing $C$ from $H$ , we obtain a graph whose connected components have even degree and can thus be shortcut into cycles. Furthermore, since $H$ was connected, each of these cycles has a common vertex with $C$ . Hence we can obtain a feasible tour of no greater cost than $H$ by following the walk $C$ and grafting in the cycles obtained by shortcutting the components of $H\setminus C$ on the way.

3 Analysis of Algorithm 1

In this section, we prove Theorem 3. We will compare the expected cost of our computed solution to the cost of the optimal LP solution $(x^{\ast},y^{\ast})$ . In particular, we compare the expected cost of our computed tour to the cost of $x^{\ast}$ (Section 3.3) and the expected sum of our incurred penalties to the fractional penalty cost defined by $y^{*}$ (Section 3.1). For a derandomized version of our algorithm, we refer to the appendix.

In the following, we use $(x^{\ast},y^{\ast})$ to refer to the optimal LP solution computed by the algorithm, and $(x,y)$ to refer to the LP solution after the splitting-off step.

3.1 Bounding the Penalty Ratios

In this subsection, we prove the following lemma:

Lemma 9.

The expected total penalty cost paid by Algorithm 1 is at most

{\alpha\cdot\sum_{v\in V}\pi_{v}\cdot(1-y^{*}_{v})}.

We note that we can express the expected total penalty cost paid by our algorithm as $\sum_{v\in V}\pi_{v}\cdot{\text{Pr}\bigl{[}v\not\in V(T^{\prime\prime})\bigr{]}}$ . We can thus prove Lemma 9 by showing that for each vertex $v$ , the ratio ${\text{Pr}\bigl{[}v\not\in V(T^{\prime\prime})\bigr{]}}/(1-y^{*}_{v})$ is at most $\alpha$ .

Due to Step 2 in Algorithm 1, i.e., due to the splitting-off step, $T^{\prime\prime}$ spans only vertices whose $y$ -value is at least $\theta$ . Vertices $v$ with $y^{*}_{v}<\theta$ are therefore included in $V(T^{\prime\prime})$ with probability 0. However, our choice of $\theta=1-\frac{1}{\alpha}$ guarantees that for the vertices $v$ with $y^{*}_{v}<\theta$ , we have ${\text{Pr}\bigl{[}v\not\in V(T^{\prime\prime})\bigr{]}}/(1-y^{*}_{v})\leq 1/(1% -\theta)=\alpha$ . To continue our analysis for the remaining vertices, i.e., those with $y^{*}_{v}\geq\theta$ , we show the following lemma:

Lemma 10.

Let $v$ be a vertex with $y^{*}_{v}\geq\theta$ . Then the probability that $v$ is not in $T$ is at most $e^{-y^{*}}$ .

Proof.

By Lemma 6, the probability that $v$ is not in $T_{i}$ is at most $1-y^{\ast}_{i,v}$ for any fixed $i\in[k]$ . The probability that this happens for all $i\in[k]$ is at most

\prod_{i=1}^{k}\text{Pr}[v\not\in V(T_{i})]\leq\prod_{i=1}^{k}(1-y^{\ast}_{i,v% })\leq\exp\left(-\sum_{i=1}^{k}y^{\ast}_{i,v}\right)=e^{-y^{\ast}_{v}}.\

$\hfill\blacktriangleleft$ Note that the distributions from which $\sigma$ and $\gamma$ are chosen guarantee that $\theta\leq\gamma<{\sigma_{0}}\leq\sigma\leq 1$ , i.e., only vertices $v$ with $y^{*}_{v}\in[\theta,{\sigma_{0}})$ can be pruned and only vertices with $y^{*}_{v}\in[{\sigma_{0}},1]$ can be picked up.

Now consider a vertex $v$ with $y^{*}\in[\theta,{\sigma_{0}})$ . There are two cases in which we have to pay $v$ ’s penalty: if $v$ is not sampled, and if $v$ is sampled but immediately pruned afterwards.

By Lemma 10, the probability of the former is at most $e^{-y^{\ast}_{v}}$ , whereas the probability of being pruned – given that $v$ was sampled in the first place – can be bounded from above by $\text{Pr}[\gamma>y^{\ast}_{v}]=1-F_{\gamma}(y^{\ast}_{v})$ .⁷⁷7The probability can be lower if one of the sampled trees has a vertex with higher $y^{\ast}$ -value in the subtree rooted at $v$ . Our choice of $F_{\gamma}(y)=\frac{1-\alpha(1-y)}{1-e^{-y}}$ guarantees that the expected penalty is just high enough:

	$\displaystyle\frac{{\text{Pr}\bigl{[}v\not\in V(T^{\prime\prime})\bigr{]}}}{1-% y^{*}_{v}}$	$\displaystyle\leq\frac{(1-\text{Pr}[v\in T])+\text{Pr}[v\in T]\cdot(1-F_{% \gamma}(y^{\ast}_{v}))}{1-y^{*}_{v}}$
		$\displaystyle=\frac{1-\text{Pr}[v\in T]F_{\gamma}(y^{\ast}_{v})}{1-y^{}_{v}}% \leq\frac{1-(1-e^{-y^{\ast}_{y}})F_{\gamma}(y^{\ast}_{v})}{1-y^{}_{v}}=\alpha.$

Finally, consider a vertex $v$ with $y^{*}_{v}\in[{\sigma_{0}},1]$ . This time, there is only one case in which we have to pay the penalty for $v$ : when $v$ is neither sampled, nor picked up afterwards. Again, the probability of not being sampled is at most $e^{-y^{\ast}_{v}}$ whereas the probability of not being picked up – given that $v$ was not sampled previously – is $\text{Pr}[\sigma>y^{\ast}_{v}]=1-F_{\sigma}(y^{\ast}_{v})$ . Again, our choice of $F_{\sigma}(y)=1-\frac{\alpha(1-y)}{e^{-y}}$ guarantees that

\displaystyle\frac{{\text{Pr}\bigl{[}v\not\in V(T^{\prime\prime})\bigr{]}}}{1-% y^{*}_{v}}

\displaystyle\leq\frac{e^{-y^{\ast}_{v}}\cdot(1-F_{\sigma}(y^{\ast}_{v}))}{1-y% ^{*}_{v}}=\alpha.

This concludes our proof of Lemma 9.

3.2 Parity Correction

In order to analyze the expected cost of both the parity correction step and of our final tour, we first need to further investigate the structure of our computed solution and introduce some additional notation. Recall that $T$ denotes the union of all sampled trees and that it includes the closed walk $C$ , which is obtained by joining all $s_{i}$ - $t_{i}$ -paths $P_{i}\subseteq T_{i}$ . Let $R$ be the graph that contains all remaining edges, i.e., the (multi-)graph induced by $E(T)\setminus E(C)$ .

While it is convenient to think of $C$ as a cycle, note that the paths $P_{1},\ldots,P_{k}$ are not necessarily vertex disjoint. However, we can guarantee that $C$ is a Eulerian multigraph spanning all terminals. In the same sense, $R$ can be thought of as a collection of small trees which are all rooted at the cycle $C$ (in reality, some of those trees may intersect each other). In the following, we will call the edges of $C$ cycle edges and the edges of $R$ tree edges. A depiction of $C$ and $R$ can be seen in Fig. 1.

When we prune the trees $T_{i}$ into $T_{i}^{\prime}$ , the paths $P_{i}$ are not affected (because the pruning step guarantees that $o_{i}$ and $o_{i+1}$ stay connected in $T_{i}^{\prime}$ ). So our pruning step can only remove edges from $R$ . We thus define $\text{core}(R,\gamma)$ as the graph that is obtained by pruning all trees in $T$ with pruning threshold $\gamma$ and then removing the cycle $C$ . We now partition the edge set of $R$ into layers. Intuitively, the $i$ -th layer of $R$ contains those edges that are contained in $\text{core}(R,\gamma)$ if and only if $\gamma$ does not exceed some value $\eta_{i}$ .

Let $\eta_{1}>\ldots>\eta_{\ell}$ be the $y^{*}$ -values of all vertices that might be affected by our pruning step, i.e., $\{\eta_{1},\ldots,\eta_{\ell}\}=\{y_{v}\mid v\in V\text{ and }\theta\leq y_{v}% \leq\sigma_{0}\}\cup\{\sigma_{0}\}$ . By definition, we always have $\eta_{1}=\sigma_{0}$ . Now let $E_{1}=\text{core}(T,\eta_{1})$ and $E_{i}=\text{core}(T,\eta_{i})\setminus E_{i-1}$ for $i=2,\ldots,\ell$ . One can see that $E_{1}\cup E_{2}\cup\ldots\cup E_{\ell}$ is a partitioning of $E(R)$ .

To be able to bound the cost of the cheapest $\text{odd}(T^{\prime\prime})$ -join $J$ , we define the following vector

z\coloneqq\beta x+\underbrace{\sum_{i:\eta_{i}\geq\gamma}(1-2\eta_{i}\beta)% \chi^{E_{i}}}_{z_{\gamma}}+\underbrace{\max\{0,1-2\beta\sigma\}\chi^{{F_{P}}}}% _{z_{\sigma}}

(1)

where $\beta=\frac{1}{3\sigma_{0}-\theta}$ . To simplify future arguments, we also define the two components $z_{\gamma}$ and $z_{\sigma}$ of $z$ , as specified in (1). We remark that the vector $\beta x+z_{\gamma}$ has been used (for a different value of $\beta$ ) in [8]. In Section 3.3, the cost of $z$ will be used as an upper bound for $c(J)$ . To this end, we now prove the following lemma.

Lemma 11.

$z$ lies in the dominant of the $\text{odd}({T^{\prime\prime}})$ -join polytope.

Proof.

First, observe that all coefficients $z_{e}$ are non-negative. We now consider an arbitrary $S\subseteq V$ for which $|S\cap\text{odd}(H)|$ is odd and show that $z(\delta(S))\geq 1$ .

We begin with the case where $S$ cuts the terminal set, i.e., where $0<|S\cap\{o_{1},\ldots,o_{k}\}|<k$ . In this case, $S$ separates at least two terminal pairs $(o_{i_{1}},o_{i_{1}+1})$ and $(o_{i_{2}},o_{i_{2}+1})$ , which implies $x(\delta(S))\geq 2$ and therefore $z(\delta(S))\geq\beta x(\delta(S))\geq 1$ . In the following we can thus assume that $\emptyset\neq S\subseteq V\setminus\{o_{1},\ldots,o_{k}\}$ .

Now suppose that $\delta(S)$ contains a pickup edge $e=\{u,v\}\in E({F_{P}})$ s.t. $u\in S$ and $v\not\in S$ . Recall that ${F_{P}}$ only spans vertices $w$ with $y_{w}\geq\sigma\geq\sigma_{0}$ . We therefore have $u\in S\subseteq V\setminus\{o_{1},\ldots,o_{k}\}$ which (by the connectivity constraints of (OLP)) implies $x(\delta(S))\geq 2y_{u}\geq 2\sigma_{0}$ and therefore $z(\delta(S))\geq\beta x(\delta(S))+\max\{0,1-2\beta\sigma_{0}\}\geq 1$ .

It remains to show the bound for the case when $\delta(S)$ only contains cycle and tree edges. By a simple counting argument, one can see that $|\delta_{T^{\prime\prime}}(S)|$ has the same parity as $|S\cap\text{odd}({T^{\prime\prime}})|$ and therefore must be odd. Now observe that by our assumption, $|\delta_{{F_{P}}}(S)|=0$ and that because $C$ is a Eulerian multigraph, $|\delta_{C}(S)|$ must be even. If follows that $\delta(S)$ must have an odd number of tree edges. We finish our proof by marginally adapting an argument from [8].

First, we consider the case where $\delta(S)$ contains exactly one tree edge $e$ . This is only possible if $S$ includes a whole subtree $T_{e}$ of one of the trees in $T^{\prime}$ . Because this subtree $T_{e}$ has survived the pruning step, $e$ must lie in some layer $E_{i}$ for which $\eta_{i}\geq\gamma$ . Furthermore, if $e\in E_{i}$ , then $T_{e}$ must contain at least one vertex $v$ with $y_{v}\geq\eta_{i}$ , which implies that $x(\delta(S))\geq 2\eta_{i}$ . So we have

z(\delta(S))\geq\beta x(\delta(S))+(1-2\eta_{i})\geq 2\beta\eta_{i}+(1-2\beta% \eta_{i})\geq 1.

If, however, $\delta(S)$ contains at least three tree edges, we know that all vertices $v\in S$ have $y_{v}\geq\theta$ and that each tree edge contributes at least $(1-2\beta\sigma_{0})$ to $z_{\gamma}$ . Therefore $z(\delta(S))\geq 2\beta\theta+3(1-2\beta\sigma_{0})\geq 1$ . $\hfill\blacktriangleleft$

(a)

(b)

Figure 1: (a) The graph

T^{\prime\prime}

after pruning and picking up critical vertices. The terminals in

O

are drawn as black rectangles. The cycle

C

is depicted in red, the surviving edges of

R

are drawn in black and edges in

{F_{P}}

in green. The greyed out vertices and edges do not belong to

T^{\prime\prime}

. They have either been pruned (the dashed vertices and edges), not sampled, or split off. (b) The same graph

T^{\prime\prime}

with the various cuts that are considered in the proof of Lemma 11 drawn in different colors. The dashed blue cut is an example for the case where

0<|S\cap O|<k

. The dashed pink cut shows the case where a pickup edge (drawn in green) is cut. Note that even though the vertex in

S

was not picked up itself, it is still part of

{F_{P}}

and thus must have a high

y

-value. The remaining two cuts (drawn in brown and light blue) show the two cases where

\delta(S)

contains an odd number of tree edges.

3.3 Bounding the Tour Cost

In this section, we show the following bound on the cost of our computed tour:

Lemma 12.

${\mathbb{E}\bigl{[}{c(T^{\prime\prime})}\bigr{]}}\leq\alpha\cdot c(x^{\ast})$

The total cost of our computed tour can be bounded from above by

\displaystyle{\mathbb{E}\bigl{[}{c(H)}\bigr{]}}\leq{\mathbb{E}\bigl{[}{c(T^{% \prime})}\bigr{]}}+{\mathbb{E}\bigl{[}{c({F_{P}})}\bigr{]}}+{\mathbb{E}\bigl{[% }{c(J)}\bigr{]}}.

Our bound on the cost of $J$ is given by the cost of the parity correction vector $z=\beta x+z_{\gamma}+z_{\sigma}$ . We start by analyzing the expected combined value of $c(T^{\prime})$ and $c(z_{\gamma})$ .

Lemma 13.

${\mathbb{E}\bigl{[}{c(T^{\prime})+c(z_{\gamma})}\bigr{]}}\leq c(x)\Bigl{(}2+{% \hat{\alpha}}-\alpha-(2+2{\hat{\alpha}})\beta\sigma_{0}+2\alpha\beta\sigma_{0}% \Bigr{)}$ where ${\hat{\alpha}}$ denotes the approximation guarantee of the current best PCTSP algorithm.

The main idea of the proof of Lemma 13 is that the weight which each layer $E_{i}$ is assigned in $z_{\gamma}$ is large when $\eta_{i}$ is small and vice versa. At the same time, the chance for layer $E_{i}$ to be present after pruning and thus to contribute at all to both $z_{\gamma}$ and $c(T^{\prime})$ is an increasing function of $\eta_{i}$ . By balancing out these two values, we obtain an upper bound or the contribution of all layers in $R$ to $c(T^{\prime})+c(z_{\gamma})$ . At the same time we take into account that the cycle $C$ only contributes toward $c(T^{\prime})$ but crucially not towards the cost of the parity correction vector. This is where we utilize our assumption that $\hat{C}$ and therefore also $C$ can be lower bounded by a constant fraction of opt. For formally proving Lemma 13, we need the following technical Lemma 14.

Lemma 14.

For $\frac{2}{3}\leq\sigma_{0}\leq 1$ , the function $g(y)=\text{Pr}[\gamma\leq y](2-2\beta y)$ attains its maximum value over the interval $[\theta,\sigma_{0}]$ at $y=\sigma_{0}$ .

Proof.

First, observe that $\text{Pr}[\gamma\leq y]=F_{\gamma}(y)=\frac{1-\alpha(1-y)}{1-e^{-y}}$ and therefore $g(y)=2F_{\gamma}(y)(1-\beta y)$ . The claim follows from proving that $g$ is monotonously increasing on $[\theta,\sigma_{0}]$ by showing that its derivative is strictly positive:

$\displaystyle\frac{1}{2}\frac{d}{dy}g(y)$	$\displaystyle=f_{\gamma}(y)(1-\beta y)-\beta F_{\gamma}(y)\geq f_{\gamma}(y)(1% -\beta\sigma_{0})-\beta$
	$\displaystyle\geq\alpha(1-\beta\sigma_{0})-\beta$	(2)
	$\displaystyle>0,$	(3)

where $f_{\gamma}$ denotes the density function of $\gamma$ . In (2) we used that for $y\in[\theta,\sigma_{0}]$

	$\displaystyle f_{\gamma}(y)$	$\displaystyle=\frac{\alpha}{1-e^{-y}}-(1-\alpha(1-y))\frac{e^{-y}}{(1-e^{-y})^% {2}}=\frac{\alpha}{1-e^{-y}}-F_{\gamma}(y)\frac{e^{-y}}{1-e^{-y}}$
		$\displaystyle\geq\frac{\alpha-e^{-y}}{1-e^{-y}}=\alpha+\frac{\alpha-1}{e^{y}-1% }\geq\alpha.$

Furthermore, (3) follows because $\alpha$ and $\beta$ are determined by $\sigma_{0}$ :

\alpha=\frac{e^{-{\sigma_{0}}}}{1-\sigma_{0}},\quad\theta=1-\frac{1}{\alpha},% \text{ and}\quad\beta=\frac{1}{3\sigma_{0}-\theta}.

By plugging in these dependencies we obtain the expression

\displaystyle\alpha(1-\beta\sigma_{0})-\beta=\alpha-\beta(\alpha\sigma_{0}+1)=% \frac{\text{e}^{-\sigma_{0}}}{1-\sigma_{0}}-\frac{1}{3\sigma_{0}-1+\frac{1}{% \frac{e^{-\sigma_{0}}}{1-\sigma_{0}}}}\biggl{(}\frac{\text{e}^{-\sigma_{0}}}{1% -\sigma_{0}}\sigma_{0}+1\biggr{)},

which has a positive value if $\sigma_{0}\in(\frac{2}{3},1)$ . $\hfill\blacktriangleleft$

Proof of Lemma 13.

Our analysis follows the basic approach from [8], but distinguishes between the cycle $C$ and the remaining part $R$ of the sampled solution $T$ to leverage the lower bound on the the cost of the cycle, which we get by running the simple algorithm from Section 2 in parallel. Another difference to [8] is that $T$ is not a single tree, but consists of $k$ distinct sampled trees, which requires some additional notation.

Let $\mathcal{T}=\mathcal{T}_{1}\times\ldots\times\mathcal{T}_{k}$ denote the set of all possible outcomes obtained by sampling the $k$ trees as described in Section 2. One can see that the probability of a fixed outcome $\pi=(T_{1},\ldots,T_{k})\in\mathcal{T}$ is $\mu(\pi)=\prod\limits_{i=1}^{k}\mu_{i}(T_{i})$ and that ${\sum\limits_{\pi\in\mathcal{T}}\mu(\pi)=1}$ .

Recall that technically, the subgraphs $C, T$ as well as the layers $E_{i}$ depend on the combination of sampled trees $\pi$ . We therefore write, e.g., $C_{\pi}$ to refer to the concrete cycle $C$ that arises from sampling the trees $T_{i}$ in $\pi=(T_{1},\ldots,T_{k})$ . Now

$\displaystyle{\mathbb{E}\bigl{[}{c(T^{\prime})+c(z_{\gamma})}\bigr{]}}$	$\displaystyle\leq{\mathbb{E}\bigl{[}{c(C)}\bigr{]}}+c(R^{\prime})+{\mathbb{E}% \bigl{[}{c(z_{\gamma})}\bigr{]}}$
	$\displaystyle=\sum_{\pi\in\mathcal{T}}\mu(\pi)\left(c(C_{\pi})+\mathbb{E}_{% \gamma}\bigl{[}c(R^{\prime}_{\pi})\bigr{]}+\sum_{i=1}^{\ell_{\pi}}F_{\gamma}(% \eta_{j})\left(1-2\beta\eta_{j}\right)c(E_{i,\pi})\right)$
	$\displaystyle=\sum_{\pi\in\mathcal{T}}\mu(\pi)\left(c(C_{\pi})+\sum_{i=1}^{% \ell_{\pi}}\underbrace{\text{Pr}[\gamma\leq\eta_{i}]\left(2-2\beta\eta_{j}% \right)}_{\eqqcolon g(\eta_{i})}c(E_{i,\pi})\right)$
	$\displaystyle\leq\sum\limits_{\pi\in\mathcal{T}}\mu(\pi)\Bigl{(}c(C_{\pi})+g(% \sigma_{0})c(R_{\pi})\Bigr{)}$	(4)
	$\displaystyle=\sum\limits_{\pi\in\mathcal{T}}\mu(\pi)\Bigl{(}c(C_{\pi})+g(% \sigma_{0})\bigl{(}c(T^{\prime}_{\pi})-c(C_{\pi})\bigr{)}\Bigr{)}$
	$\displaystyle=\sum\limits_{\pi\in\mathcal{T}}\mu(\pi)\Bigl{(}g(\sigma_{0})c(T^% {\prime}_{\pi})-\bigl{(}f(\sigma_{0})-1\bigr{)}c(C_{\pi})\Bigr{)}$
	$\displaystyle\leq g(\sigma_{0})c(x)-\bigl{(}g(\sigma_{0})-1\bigr{)}c(\hat{C})=% 2\bigl{(}1-\beta\sigma_{0}\bigr{)}c(x)-\bigl{(}1-2\beta\sigma_{0}\bigr{)}c(% \hat{C}).$

In (4) we used the fact from Lemma 14 that $g(\eta_{i})$ is maximized at $\eta_{i}=\sigma_{0}$ and that the layers $E_{i,\pi}$ partition the edge set of $R_{\pi}$ . Now recall that we may assume that $c(\hat{C})\geq(\alpha-{\hat{\alpha}})\text{opt}_{PCOTSP}\geq(\alpha-{\hat{% \alpha}})c(x)$ , which yields a bound of

	$\displaystyle{\mathbb{E}\bigl{[}{c(T^{\prime})+c(z_{\gamma})}\bigr{]}}$	$\displaystyle\leq 2\bigl{(}1-\beta\sigma_{0}\bigr{)}c(x)-\bigl{(}1-2\beta% \sigma_{0}\bigr{)}c(\hat{C})$
		$\displaystyle\leq c(x)\Bigl{(}2-2\beta\sigma_{0}-(1-2\beta\sigma_{0})(\alpha-{% \hat{\alpha}})\Bigr{)}$
		$\displaystyle=c(x)\Bigl{(}2-2\beta\sigma_{0}-\alpha+{\hat{\alpha}}+2\alpha% \beta\sigma_{0}-2{\hat{\alpha}}\beta\sigma_{0}\Bigr{)}$
		$\displaystyle=c(x)\Bigl{(}2+{\hat{\alpha}}-\alpha-(2+2{\hat{\alpha}})\beta% \sigma_{0}+2\alpha\beta\sigma_{0}\Bigr{)}.\$

$\hfill\blacktriangleleft$ We will next analyze the cost of ${F_{P}}$ , in Lemma 16. The proof builds on the following theorem.

Theorem 15 (Böhm et al. [13]).

Let $X\subseteq U\subseteq V$ . Furthermore, let $S\subseteq U\setminus X$ be a randomly chosen subset such that $\text{Pr}[v\not\in S]\leq\rho$ for each $v\in U\setminus X$ . Let $F_{X}$ and $F_{X\cup S}$ denote the cheapest $X$ -rooted spanning forest and the cheapest $(X\cup S)$ -rooted spanning forest of $U$ respectively. Then ${\mathbb{E}\bigl{[}{c(F_{X\cup S})}\bigr{]}}\leq\rho\cdot c(F_{X})$ .

Lemma 16.

For a fixed value of $\sigma\in[\sigma_{0},1]$ , the cost of ${F_{P}}$ is at most $\frac{e^{-\sigma}}{\sigma}\cdot c(x)$ .

Proof.

We invoke Theorem 15 with $U=V_{\sigma}$ , $X=O$ and $S=(V_{\sigma}-O)\cap V(T)$ . Recall that our pickup forest ${F_{P}}$ is the cheapest $V_{\sigma}\cap V(T)$ rooted spanning forest of $V_{\sigma}$ and observe that $V_{\sigma}\cap V(T)=X\cup S$ . By Lemma 10, each vertex $v\in U\setminus X$ is not sampled (and therefore not in $S$ ) with probability at most $e^{-\sigma}$ . It remains to show that the cost of the cheapest $O$ -rooted spanning forest of $V_{\sigma}$ is at most $\frac{c(x)}{\sigma}$ .

As we have already observed in Section 2, the cost of the cheapest $O$ -rooted spanning forest of $V_{\sigma}$ is equal to the cost of a minimum spanning tree in the graph obtained by contracting all vertices of $O$ into a single vertex $r$ , i.e., on $V^{\prime\prime}=(V_{\sigma}-O)\cup\{r\}$ . Furthermore, we have also observed that by applying the very same contraction to our solution ( $x, y$ ), we obtain a feasible solution $(x^{\prime},y^{\prime})$ to the (non-ordered) PCTSP relaxation. By splitting off all vertices in $V-V_{\sigma}$ and scaling up by a factor of $\frac{1}{\sigma}$ , we obtain a vector $x^{\prime\prime}$ for which $x^{\prime}(\delta(s))\geq 2$ for all $\emptyset\neq S\subseteq V^{\prime\prime}$ , i.e., a feasible point in the dominant⁸⁸8It is possible to obtain a feasible point in the polytope itself by applying a sequence of splitting-off operations. of the subtours elimination polytope of $V^{\prime\prime}$ .

The cost of the minimum spanning tree on $V^{\prime\prime}$ is therefore at most $c(x^{\prime\prime})\leq\frac{1}{\sigma}c(x)$ , which concludes the proof. $\hfill\blacktriangleleft$ Equipped with this upper bound, we can now bound the expected cost of ${F_{P}}$ , utilizing the density function $f_{\sigma}(y)=\frac{d}{dy}F_{\sigma}(y)=\alpha ye^{y}$ and integrating over the range $[\sigma_{0},1]$ from which $\sigma$ is drawn:

\displaystyle\mathbb{E}\bigl{[}c({F_{P}})\bigr{]}=c(x)\int_{\sigma_{0}}^{1}f_{% \sigma}(y)\frac{e^{-y}}{y}dy=\alpha c(x)\int_{\sigma_{0}}^{1}1dy=\alpha(1-% \sigma_{0})c(x)=e^{-\sigma_{0}}c(x).

(5)

For the last equality we have used the definition of $\sigma_{0}$ . We emphasize that by randomizing the choice of $\sigma$ instead of flatly using $\sigma=\sigma_{0}$ , we have gained a factor of $\sigma_{0}$ . In a similar way, we can use Lemma 16 to compute the expected cost of $z_{\sigma}$ .

Lemma 17.

${\mathbb{E}\bigl{[}{c(z_{\sigma})}\bigr{]}}=\alpha c(x)\left(\frac{1}{4\beta}-% \sigma_{0}+\beta\sigma_{0}^{2}\right)$ .

Proof.

Note that when $\sigma>\frac{1}{2\beta}$ , then $z_{\sigma}$ is by definition the zero-vector. We compute

	$\displaystyle\mathbb{E}\bigl{[}c(z_{\sigma})\bigr{]}$	$\displaystyle=\mathbb{E}\bigl{[}\max\{0,1-2\beta\sigma\}c(\chi^{{F_{P}}})\bigr% {]}=c(x)\int_{\sigma_{0}}^{\frac{1}{2\beta}}f_{\sigma}(y)\frac{e^{-y}(1-2\beta y% )}{y}dy$
		$\displaystyle=\alpha c(x)\int_{\sigma_{0}}^{\frac{1}{2\beta}}(1-2\beta y)dy=% \alpha c(x)\left(\frac{1}{2\beta}-\sigma_{0}-2\beta\int_{\sigma_{0}}^{\frac{1}% {2\beta}}y\ dy\right)$
		$\displaystyle=\alpha c(x)\left(\frac{1}{2\beta}-\sigma_{0}-2\beta\left(\frac{1% }{8\beta^{2}}-\frac{\sigma_{0}^{2}}{2}\right)\right)=\alpha c(x)\left(\frac{1}% {4\beta}-\sigma_{0}+\beta\sigma_{0}^{2}\right).\$

$\hfill\blacktriangleleft$ Finally, we are able to combine the bounds on the various parts of $c(H)$ , and obtain our final upper bound on the expected tour cost:

	$\displaystyle{\mathbb{E}\bigl{[}{c(H)}\bigr{]}}$	$\displaystyle\leq{\mathbb{E}\bigl{[}{c(T^{\prime})}\bigr{]}}+{\mathbb{E}\bigl{% [}{c({F_{P}})}\bigr{]}}+{\mathbb{E}\bigl{[}{c(J)}\bigr{]}}$
		$\displaystyle={\mathbb{E}\bigl{[}{c(T^{\prime})+c(z_{\gamma})}\bigr{]}}+{% \mathbb{E}\bigl{[}{c({F_{P}})}\bigr{]}}+{\mathbb{E}\bigl{[}{c(z_{\sigma})}% \bigr{]}}+\beta c(x^{\ast})$
		$\displaystyle\leq c(x^{\ast})\Bigl{(}2+{\hat{\alpha}}+\beta-\alpha-(2+2{\hat{% \alpha}})\beta\sigma_{0}+2\alpha\beta\sigma_{0}+e^{-\sigma_{0}}+\frac{\alpha}{% 4\beta}-\alpha\sigma_{0}+\alpha\beta\sigma_{0}^{2}\Bigr{)}.$

Note that the values of $\beta$ and $\sigma_{0}$ are functions of $\alpha$ . We therefore can express our upper bound as ${\mathbb{E}\bigl{[}{c(H)}\bigr{]}}\leavevmode\nobreak\ \leq\leavevmode\nobreak% \ c(x^{\ast})f(\alpha,{\hat{\alpha}})$ . By Lemma 9, Algorithm 1 pays at most $\alpha$ times the penalty incurred by $(x^{\ast},y^{\ast})$ . Running Algorithm 1 with $\theta$ and $\sigma$ determined by the single parameter $\alpha$ thus yields an approximation factor of $\max(\alpha,f(\alpha,{\hat{\alpha}}))$ . Recall that the value of ${\hat{\alpha}}$ is currently slightly below $1.599$ . For ${\hat{\alpha}}=1.599$ , the term $\max(\alpha,f(\alpha,{\hat{\alpha}}))$ is minimized at $\alpha\approx 2.096896<2.097$ . In fact, if we set $\alpha=2.097$ , the term evaluates to $2.097$ .

For $\alpha=2.097$ and ${\hat{\alpha}}=1.599$ , we have $\beta\approx 0.548775$ and $\sigma_{0}\approx 0.781790$ . Thus we have proven Theorem 3.

4 Prize-collecting Multi-Path TSP

PC-Multi-Path-TSP can be modeled as the following linear program.

\begin{array}[]{llll}\text{minimize}&\lx@intercol\displaystyle\sum\limits_{e% \in E}\sum\limits_{i=1}^{k}c_{e}x_{i,e}+\sum\limits_{v\in V-\mathcal{T}}\pi_{v% }(1-y_{v})\hfil\lx@intercol\\[12.0pt] \text{subject to \quad}&\displaystyle y_{v}=\sum\limits_{i=1}^{k}y_{i,v}&% \forall v\in V\\[10.0pt] &(x_{i},y_{i})\text{ lies in the }s_{i}\text{-}t_{i}\text{-stroll relaxation}&% \forall i\in[k]\end{array}\quad\quad\quad\text{(kLP)}

Similar to [13], we describe two algorithms for the problem and show that an appropriate combination of the two algorithms gives an $2.41$ -approximation.

In one algorithm, (which we call Algorithm $A$ ), we first sample $k$ trees using Lemma 6 and an optimal solution $(x^{\ast},y^{\ast})$ of (kLP), where tree $T_{i}$ connects terminals $s_{i},t_{i}$ . The remaining vertices are picked up in a probabilistic fashion akin to Algorithm 1, i.e., we choose a random threshold $\sigma\in[\sigma_{0}^{\prime},1]$ and buy a $\bigl{(}V_{\sigma}\cap\bigcup_{i=1}^{k}V(T_{i})\bigr{)}$ -rooted spanning forest ${F_{P}}$ of $V_{\sigma}$ . Here, ${\sigma^{\prime}_{0}}$ is a constant whose value will be determined later. Then we double every edge that does not, for any $i$ , lie on the $s_{i}$ - $t_{i}$ path in $T_{i}$ . This gives a tour $H_{A}$ . Define $\Delta:=\sum c(s_{i},t_{i})$ and $\eta\coloneqq\frac{\Delta}{c(x^{\ast})}$ . Then it is easy to see that

\displaystyle c(H_{A})

\displaystyle\leq 2c(x^{\ast})+2{\mathbb{E}\bigl{[}{c({F_{P}})}\bigr{]}}-\Delta.

One can already see that intuitively, this yields good results whenever $\Delta$ is large.

Now we define the distribution from which $\sigma$ is drawn. We choose $\sigma$ s.t. $\text{Pr}[\sigma\leq y]=F^{\prime}_{\sigma}(y)$ where $F^{\prime}_{\sigma}(y)=1-\frac{e^{y}(1-y)}{e^{{\sigma^{\prime}_{0}}}(1-{\sigma% ^{\prime}_{0}})}$ . Note that except for the constant ${\sigma^{\prime}_{0}}$ , this is exactly the same distribution that we used in Algorithm 1. In fact, if we define $\rho\coloneqq\frac{e^{-{\sigma^{\prime}_{0}}}}{1-{\sigma^{\prime}_{0}}}$ , we obtain $F^{\prime}_{\sigma}(y)=1-\frac{\rho(1-y)}{e^{-y}}$ (compare this to $F_{\sigma}(y)=1-\frac{\alpha(1-y)}{e^{-y}}$ ), so any result about $F_{\sigma}$ obtained in the previous setting carries over to $F^{\prime}_{\sigma}$ if we replace $\alpha$ by $\rho$ . We remark that we use the symbol $\rho$ instead of $\alpha^{\prime}$ because unlike in the previous case, $\rho$ will not be our final approximation factor.

We can thus bound the expected cost of ${F_{P}}$ in the same way as we did for PCOTSP. This is possible because the distribution $F_{\sigma}$ as well as the (lower bound on the) probability of a vertex $v\in V_{\sigma}$ being sampled at least once are the same as in Section 3.3.

First, we prove an equivalent of Lemma 16, i.e., we show that $c({F_{P}})=\frac{e^{-\sigma}}{\sigma}$ for any fixed value of $\sigma$ , and then we compute the expected cost ${\mathbb{E}\bigl{[}{c({F_{P}})}\bigr{]}}=e^{-{\sigma^{\prime}_{0}}}$ by integrating over the range $[{\sigma^{\prime}_{0}},1]$ from which $\sigma$ is chosen as we have done in Equation 5. This gives us the following upper bound on the expected tour cost:

\displaystyle c(H_{A})

\displaystyle\leq\biggl{(}2+2e^{-{\sigma^{\prime}_{0}}}-\eta\biggr{)}c(x^{\ast% }).

Furthermore, by similar reasoning as in Section 3.1, we know that the expected total penalty paid by this algorithm is at most $\rho=\frac{e^{-{\sigma^{\prime}_{0}}}}{1-{\sigma^{\prime}_{0}}}$ times the fractional penalty cost incurred by $(x^{\ast},y^{\ast})$ .

Now we give a simple Algorithm $B$ that works well when $\eta$ is small. Contract all the $2k$ terminals into one mega-vertex $w$ with penalty $\infty$ , solve the PCTSP LP for this instance, and sample a single tree $T$ from the solution, using Lemma 6. In the original graph, $T$ corresponds to a $\mathcal{T}$ -rooted forest of cost at most $c(x^{\ast})$ that contains each vertex $v$ with probability at least $y_{v}$ . Now double every edge of $T$ (obtained in the original graph), and then add the $k$ edges $\{s_{i},t_{i}\}$ to get a final solution $H_{B}$ . It is easy to see that $c(H_{B})\leq(2+\eta)c(x^{\ast})$ and that the incurred penalty is no higher than the fractional penalty cost of $(x^{\ast},y^{\ast})$ .

Running both algorithms $A$ and $B$ and returning each solution with probability $\frac{1}{2}$ , yields a tour of expected cost $\frac{c(H_{A})+c(H_{B})}{2}\leq(2+e^{-\sigma_{0}^{\prime}})\cdot c(x^{*})$ and an expected total penalty cost of at most

\frac{1}{2}\biggl{(}\frac{e^{-\sigma_{0}^{\prime}}}{1-\sigma_{0}^{\prime}}+1% \biggr{)}\cdot\sum_{v\in V-\mathcal{T}}(1-y^{\ast}_{v}).

The approximation ratio that we get from combining both algorithms is

\max\biggl{\{}2+e^{-\sigma_{0}^{\prime}},\frac{1}{2}\biggl{(}\frac{e^{-\sigma_% {0}^{\prime}}}{1-\sigma_{0}^{\prime}}+1\biggr{)}\biggr{\}},

which is minimized at $\sigma_{0}\approx 0.892769$ . This yields an approximation guarantee slightly below $2.41$ , proving Theorem 4. We remark that both algorithm $A$ and algorithm $B$ can be derandomized in the same fashion as Algorithm 1.

5 Derandomization

In this section we discuss how our algorithm can be derandomized. Note that we used randomization for the choice of the thresholds $\sigma$ and $\gamma$ , as well as for sampling the trees $T_{1},\ldots,T_{k}$ .

For the thresholds, we follow the approach of [8]: since the thresholds are used to prune (pick up) vertices whose $y$ -value is below (above) the respective threshold, we may generate all possible outcomes by running our algorithm once for each pair of values $\gamma,\sigma\in\{0,1\}\cup\{y_{v}\mid v\in V\}$ .

The sampling of the trees $T_{1},\ldots,T_{k}$ can be derandomized using the method of conditional expectations, as it is done in [4].

The basic idea is to iteratively fix the trees $T_{i}\coloneqq T_{i}^{\ast}$ for $i=1,\ldots,k$ , while minimizing the conditional expectation

\mathop{\mathbb{E}}_{\sigma,\gamma,T_{i+1},\ldots,T_{k}}[c(T^{\prime})+c(F_{p}% )+c(J)\mid T_{1}=T_{1}^{\ast},\ldots,T_{i}=T_{i}^{\ast}]

at each step. The expected conditional costs of $T^{\prime}$ and $F_{p}$ can be computed in a similar manner as it is done in [4], whereas the expected conditional cost of $J$ can be bounded by a linear combination of the expected conditional costs of $T^{\prime}$ and $F_{p}$ , and the costs of $x^{\ast}$ and $\hat{C}$ . Thus, the conditional expectation can be computed efficiently each round.

6 Discussion

The PCOTSP, as a generalization of both PCTSP and OTSP, poses the challenges of each of the individual problems plus new challenges. The best approximation algorithms for OTSP ([13, 4]) both pick up vertices which have been left out of sampling, which imposes a cost of $\frac{1}{e}$ over the solution. In the latest PCTSP result ([8]), the cost of parity correction is slightly below $0.6$ . So even if we simply add up the overheads of the two problems for parity correction and vertex pickup, the approximation factor would be close to two. But a straightforward application of the techniques from the latest results on Ordered TSP and Prize-Collecting TSP to PCOTSP actually leads to an approximation factor much higher than 2. This is because in PCOTSP the sampling probability for each vertex is lower than both PCTSP and OTSP; this makes pickup more expensive. Together with the need for parity correction for the picked up vertices, this also makes parity correction more costly than PCTSP.

It is not difficult to see that for the special cases of PCTSP, OTSP, our algorithms produce the best previously-known approximation factors for these problems. For example, when $k=1$ , PCOTSP is simply PCTSP. In this case, the cycle length over the one vertex is zero, and the output of our algorithm would be no worse than the output PCTSP algorithm of [8] on the remaining vertices. Likewise, setting all penalties to $\infty$ turns PCOTSP into OTSP (and thus all $y$ values are 1). Thus all vertices that have not been sampled will be picked up (i.e., $\sigma=1$ ), and no vertex is split off. This is equivalent to the algorithms of [13] and [4]. In a similar vein, setting all penalties to $\infty$ turns PC-Multi-Path-TSP into Multi-Path-TSP, and here our algorithm would do the same as the factor $2.367$ algorithm of [13]. Their factor $2.2131$ algorithm, however, does not directly carry over to our setting. The issue is that in the prize collecting setting, we require a picking-up step which leads to additional costs exceeding the additional gains as soon as we have to sample more than one tree.

The distributions used in this article have been carefully balanced to achieve the target approximation factor; it seems unlikely that their further tuning leads to better factors. It is an intriguing open question whether the problem admits an approximation factor of 2 or below, which we believe requires improvements of at least one of PCTSP or OTSP.

References

[1] Ali Ahmadi, Iman Gholami, MohammadTaghi Hajiaghayi, Peyman Jabbarzade, and Mohammad Mahdavi. 2-approximation for prize-collecting steiner forest. In Proceedings of the 2024 Annual ACM-SIAM Symposium on Discrete Algorithms (SODA), pages 669–693. SIAM, 2024. doi:10.1137/1.9781611977912.25.
[2] Ali Ahmadi, Iman Gholami, MohammadTaghi Hajiaghayi, Peyman Jabbarzade, and Mohammad Mahdavi. Prize-collecting steiner tree: A 1.79 approximation. In Proceedings of the 56th Annual ACM Symposium on Theory of Computing, pages 1641–1652, 2024. doi:10.1145/3618260.3649789.
[3] Aaron Archer, MohammadHossein Bateni, MohammadTaghi Hajiaghayi, and Howard Karloff. Improved approximation algorithms for prize-collecting Steiner tree and TSP. SIAM journal on computing, 40(2):309–332, 2011. doi:10.1137/090771429.
[4] Susanne Armbruster, Matthias Mnich, and Martin Nägele. A (3/2+ 1/e)-approximation algorithm for ordered TSP. In Approximation, Randomization, and Combinatorial Optimization. Algorithms and Techniques (APPROX/RANDOM 2024). Schloss Dagstuhl – Leibniz-Zentrum für Informatik, 2024.
[5] Jørgen Bang-Jensen, András Frank, and Bill Jackson. Preserving and increasing local edge-connectivity in mixed graphs. SIAM Journal on Discrete Mathematics, 8(2):155–178, 1995. doi:10.1137/S0036142993226983.
[6] Gerardo Berbeglia, Jean-François Cordeau, Irina Gribkovskaia, and Gilbert Laporte. Static pickup and delivery problems: a classification scheme and survey. Top, 15:1–31, 2007.
[7] Daniel Bienstock, Michel X Goemans, David Simchi-Levi, and David Williamson. A note on the prize collecting traveling salesman problem. Mathematical programming, 59(1):413–420, 1993. doi:10.1007/BF01581256.
[8] Jannis Blauth, Nathan Klein, and Martin Nägele. A better-than-1.6-approximation for prize-collecting TSP. In International Conference on Integer Programming and Combinatorial Optimization, pages 28–42. Springer, 2024. doi:10.1007/978-3-031-59835-7_3.
[9] Jannis Blauth and Martin Nägele. An improved approximation guarantee for prize-collecting TSP. In Proceedings of the 55th Annual ACM Symposium on Theory of Computing, pages 1848–1861, 2023. doi:10.1145/3564246.3585159.
[10] Hans-Joachim Böckenhauer, Juraj Hromkovič, Joachim Kneis, and Joachim Kupke. On the approximation hardness of some generalizations of TSP. In Algorithm Theory–SWAT 2006: 10th Scandinavian Workshop on Algorithm Theory, Riga, Latvia, July 6-8, 2006. Proceedings 10, pages 184–195. Springer, 2006. doi:10.1007/11785293_19.
[11] Hans-Joachim Böckenhauer, Tobias Mömke, and Monika Steinová. Improved approximations for TSP with simple precedence constraints. J. Discrete Algorithms, 21:32–40, 2013. doi:10.1016/J.JDA.2013.04.002.
[12] Martin Böhm, Zachary Friggstad, Tobias Mömke, and Joachim Spoerhase. Approximating TSP variants using a bridge lemma. arXiv preprint arXiv:2405.12876, 2024. doi:10.48550/arXiv.2405.12876.
[13] Martin Böhm, Zachary Friggstad, Tobias Mömke, and Joachim Spoerhase. Approximating traveling salesman problems using a bridge lemma. In Proceedings of the 2025 Annual ACM-SIAM Symposium on Discrete Algorithms (SODA). SIAM, 2025.
[14] Moses Charikar and Balaji Raghavachari. The finite capacity dial-a-ride problem. In Proceedings 39th Annual Symposium on Foundations of Computer Science (Cat. No. 98CB36280), pages 458–467. IEEE, 1998. doi:10.1109/SFCS.1998.743496.
[15] Nicos Christofides. Worst-case analysis of a new heuristic for the travelling salesman problem. Operations Research Forum, 3(1):20, 2022. doi:10.1007/S43069-021-00101-Z.
[16] Jean-François Cordeau and Gilbert Laporte. The dial-a-ride problem: models and algorithms. Annals of operations research, 153:29–46, 2007. doi:10.1007/S10479-007-0170-8.
[17] Michel X Goemans. Combining approximation algorithms for the prize-collecting TSP. arXiv preprint arXiv:0910.0553, 2009. arXiv:0910.0553.
[18] Michel X Goemans and David P Williamson. A general approximation technique for constrained forest problems. SIAM Journal on Computing, 24(2):296–317, 1995. doi:10.1137/S0097539793242618.
[19] Inge Li Gørtz, Viswanath Nagarajan, and Ramamoorthi Ravi. Minimum makespan multi-vehicle dial-a-ride. ACM Transactions on Algorithms (TALG), 11(3):1–29, 2015. doi:10.1145/2629653.
[20] Anna R Karlin, Nathan Klein, and Shayan Oveis Gharan. A (slightly) improved approximation algorithm for metric TSP. In Proceedings of the 53rd Annual ACM SIGACT Symposium on Theory of Computing, pages 32–45, 2021. doi:10.1145/3406325.3451009.
[21] Ian Post and Chaitanya Swamy. Linear programming-based approximation algorithms for multi-vehicle minimum latency problems (extended abstract). In Piotr Indyk, editor, Proceedings of the Twenty-Sixth Annual ACM-SIAM Symposium on Discrete Algorithms, SODA 2015, San Diego, CA, USA, January 4-6, 2015, pages 512–531. SIAM, 2015. doi:10.1137/1.9781611973730.35.
[22] Martin WP Savelsbergh and Marc Sol. The general pickup and delivery problem. Transportation science, 29(1):17–29, 1995. doi:10.1287/TRSC.29.1.17.
[23] AI Serdjukov. Some extremal bypasses in graphs [in russian]. Upravlyaemye Sistemy, 17(89):76–79, 1978.
[24] Vera Traub and Jens Vygen. Approximation Algorithms for Traveling Salesman Problems. Cambridge University Press, 2024. URL: https://books.google.de/books?id=o5jV0AEACAAJ.
[25] Vera Traub, Jens Vygen, and Rico Zenklusen. Reducing path TSP to TSP. In Proceedings of the 52nd Annual ACM SIGACT Symposium on Theory of Computing, pages 14–27, 2020. doi:10.1145/3357713.3384256.

[bib.bib1] [1] Ali Ahmadi, Iman Gholami, MohammadTaghi Hajiaghayi, Peyman Jabbarzade, and Mohammad Mahdavi. 2-approximation for prize-collecting steiner forest. In Proceedings of the 2024 Annual ACM-SIAM Symposium on Discrete Algorithms (SODA), pages 669–693. SIAM, 2024. doi:10.1137/1.9781611977912.25.

[bib.bib2] [2] Ali Ahmadi, Iman Gholami, MohammadTaghi Hajiaghayi, Peyman Jabbarzade, and Mohammad Mahdavi. Prize-collecting steiner tree: A 1.79 approximation. In Proceedings of the 56th Annual ACM Symposium on Theory of Computing, pages 1641–1652, 2024. doi:10.1145/3618260.3649789.

[bib.bib3] [3] Aaron Archer, MohammadHossein Bateni, MohammadTaghi Hajiaghayi, and Howard Karloff. Improved approximation algorithms for prize-collecting Steiner tree and TSP. SIAM journal on computing, 40(2):309–332, 2011. doi:10.1137/090771429.

[bib.bib4] [4] Susanne Armbruster, Matthias Mnich, and Martin Nägele. A (3/2+ 1/e)-approximation algorithm for ordered TSP. In Approximation, Randomization, and Combinatorial Optimization. Algorithms and Techniques (APPROX/RANDOM 2024). Schloss Dagstuhl – Leibniz-Zentrum für Informatik, 2024.

[bib.bib5] [5] Jørgen Bang-Jensen, András Frank, and Bill Jackson. Preserving and increasing local edge-connectivity in mixed graphs. SIAM Journal on Discrete Mathematics, 8(2):155–178, 1995. doi:10.1137/S0036142993226983.

[bib.bib6] [6] Gerardo Berbeglia, Jean-François Cordeau, Irina Gribkovskaia, and Gilbert Laporte. Static pickup and delivery problems: a classification scheme and survey. Top, 15:1–31, 2007.

[bib.bib7] [7] Daniel Bienstock, Michel X Goemans, David Simchi-Levi, and David Williamson. A note on the prize collecting traveling salesman problem. Mathematical programming, 59(1):413–420, 1993. doi:10.1007/BF01581256.

[bib.bib8] [8] Jannis Blauth, Nathan Klein, and Martin Nägele. A better-than-1.6-approximation for prize-collecting TSP. In International Conference on Integer Programming and Combinatorial Optimization, pages 28–42. Springer, 2024. doi:10.1007/978-3-031-59835-7_3.

[bib.bib9] [9] Jannis Blauth and Martin Nägele. An improved approximation guarantee for prize-collecting TSP. In Proceedings of the 55th Annual ACM Symposium on Theory of Computing, pages 1848–1861, 2023. doi:10.1145/3564246.3585159.

[bib.bib10] [10] Hans-Joachim Böckenhauer, Juraj Hromkovič, Joachim Kneis, and Joachim Kupke. On the approximation hardness of some generalizations of TSP. In Algorithm Theory–SWAT 2006: 10th Scandinavian Workshop on Algorithm Theory, Riga, Latvia, July 6-8, 2006. Proceedings 10, pages 184–195. Springer, 2006. doi:10.1007/11785293_19.

[bib.bib11] [11] Hans-Joachim Böckenhauer, Tobias Mömke, and Monika Steinová. Improved approximations for TSP with simple precedence constraints. J. Discrete Algorithms, 21:32–40, 2013. doi:10.1016/J.JDA.2013.04.002.

[bib.bib12] [12] Martin Böhm, Zachary Friggstad, Tobias Mömke, and Joachim Spoerhase. Approximating TSP variants using a bridge lemma. arXiv preprint arXiv:2405.12876, 2024. doi:10.48550/arXiv.2405.12876.

[bib.bib13] [13] Martin Böhm, Zachary Friggstad, Tobias Mömke, and Joachim Spoerhase. Approximating traveling salesman problems using a bridge lemma. In Proceedings of the 2025 Annual ACM-SIAM Symposium on Discrete Algorithms (SODA). SIAM, 2025.

[bib.bib14] [14] Moses Charikar and Balaji Raghavachari. The finite capacity dial-a-ride problem. In Proceedings 39th Annual Symposium on Foundations of Computer Science (Cat. No. 98CB36280), pages 458–467. IEEE, 1998. doi:10.1109/SFCS.1998.743496.

[bib.bib15] [15] Nicos Christofides. Worst-case analysis of a new heuristic for the travelling salesman problem. Operations Research Forum, 3(1):20, 2022. doi:10.1007/S43069-021-00101-Z.

[bib.bib16] [16] Jean-François Cordeau and Gilbert Laporte. The dial-a-ride problem: models and algorithms. Annals of operations research, 153:29–46, 2007. doi:10.1007/S10479-007-0170-8.

[bib.bib17] [17] Michel X Goemans. Combining approximation algorithms for the prize-collecting TSP. arXiv preprint arXiv:0910.0553, 2009. arXiv:0910.0553.

[bib.bib18] [18] Michel X Goemans and David P Williamson. A general approximation technique for constrained forest problems. SIAM Journal on Computing, 24(2):296–317, 1995. doi:10.1137/S0097539793242618.

[bib.bib19] [19] Inge Li Gørtz, Viswanath Nagarajan, and Ramamoorthi Ravi. Minimum makespan multi-vehicle dial-a-ride. ACM Transactions on Algorithms (TALG), 11(3):1–29, 2015. doi:10.1145/2629653.

[bib.bib20] [20] Anna R Karlin, Nathan Klein, and Shayan Oveis Gharan. A (slightly) improved approximation algorithm for metric TSP. In Proceedings of the 53rd Annual ACM SIGACT Symposium on Theory of Computing, pages 32–45, 2021. doi:10.1145/3406325.3451009.

[bib.bib21] [21] Ian Post and Chaitanya Swamy. Linear programming-based approximation algorithms for multi-vehicle minimum latency problems (extended abstract). In Piotr Indyk, editor, Proceedings of the Twenty-Sixth Annual ACM-SIAM Symposium on Discrete Algorithms, SODA 2015, San Diego, CA, USA, January 4-6, 2015, pages 512–531. SIAM, 2015. doi:10.1137/1.9781611973730.35.

[bib.bib22] [22] Martin WP Savelsbergh and Marc Sol. The general pickup and delivery problem. Transportation science, 29(1):17–29, 1995. doi:10.1287/TRSC.29.1.17.

[bib.bib23] [23] AI Serdjukov. Some extremal bypasses in graphs [in russian]. Upravlyaemye Sistemy, 17(89):76–79, 1978.

[bib.bib24] [24] Vera Traub and Jens Vygen. Approximation Algorithms for Traveling Salesman Problems. Cambridge University Press, 2024. URL: https://books.google.de/books?id=o5jV0AEACAAJ.

[bib.bib25] [25] Vera Traub, Jens Vygen, and Rico Zenklusen. Reducing path TSP to TSP. In Proceedings of the 52nd Annual ACM SIGACT Symposium on Theory of Computing, pages 14–27, 2020. doi:10.1145/3357713.3384256.