Improved Approximation Algorithms for (1,2)-TSP and Max-TSP Using Path Covers in the Semi-Streaming Model
Abstract
We investigate semi-streaming algorithms for the Traveling Salesman Problem (TSP). Specifically, we focus on a variant known as the -TSP, where the distances between any two vertices are either one or two. Our primary emphasis is on the closely related Maximum Path Cover Problem, which aims to find a collection of vertex-disjoint paths that covers the maximum number of edges in a graph. We propose an algorithm that, for any , achieves a -approximation of the maximum path cover size for an -vertex graph, using passes. This result improves upon the previous -approximation by Behnezhad et al. [3] in the semi-streaming model. Building on this result, we design a semi-streaming algorithm that constructs a tour for an instance of -TSP with an approximation factor of , improving upon the previous -approximation factor algorithm by Behnezhad et al. [3]111Although Behnezhad et al. do not explicitly state that their algorithm works in the semi-streaming model, it is easy to verify..
Furthermore, we extend our approach to develop an approximation algorithm for the Maximum TSP (Max-TSP), where the goal is to find a Hamiltonian cycle with the maximum possible weight in a given weighted graph . Our algorithm provides a -approximation for Max-TSP in passes, improving on the previously known -approximation obtained via maximum weight matching in the semi-streaming model.
Keywords and phrases:
-TSP, Max-TSP, Maximum Path Cover, Semi-Streaming Algorithms, Approximation Algorithms, Graph AlgorithmsFunding:
Tobias Mömke: Partially supported by DFG Grant 439522729 (Heisenberg-Grant).Copyright and License:
![[Uncaptioned image]](x1.png)
2012 ACM Subject Classification:
Theory of computation Graph algorithms analysis ; Theory of computation Streaming, sublinear and near linear time algorithms ; Theory of computation Approximation algorithms analysisEditors:
Olaf Beyersdorff, Michał Pilipczuk, Elaine Pimentel, and Nguyễn Kim ThắngSeries and Publisher:

1 Introduction
The Traveling Salesman Problem (TSP) is a fundamental problem in combinatorial optimization. Given a graph with distances assigned to the edges, the objective is to find a Hamiltonian cycle with the lowest possible cost. The general form of TSP is known to be inapproximable unless [22]. Consequently, research often focuses on specific types of distance functions, particularly the metric TSP, where distances satisfy the triangle inequality. Two notable metric versions of TSP are the graphic TSP, where distances correspond to the shortest path lengths in an unweighted graph, and -TSP, a variant of TSP with distances restricted to either one or two [1, 4, 5, 6, 12, 14, 15, 17, 18, 19, 23, 24, 25].
Most research is conducted within the classic centralized model of computation. However, with the surge of large data sets in various real-world applications (as reviewed in [7]), there is a growing demand for algorithms capable of handling massive inputs. For very large graphs, classical algorithms are not only too slow but also suffer from excessive space complexity. When a graph’s size exceeds the memory capacity of a single machine, algorithms that rely on random access to the data become impractical, necessitating alternative computational models. One such model that has gained significant attention recently is the graph stream model, introduced by Feigenbaum et al. [8, 9]. In this model, edges of the graph are not stored in memory but arrive sequentially in a stream, requiring processing in that order. The challenge is to design algorithms that use minimal space and ideally make only a small constant number of passes over the stream. A widely studied variant of this is the semi-streaming model. In the semi-streaming model, as outlined by Feigenbaum et al. [9], we consider a graph with vertices. The algorithm processes the graph’s edges as they arrive in the stream and aims to compute results with minimal passes while using limited memory, constrained to .
It is straightforward to design a deterministic one-pass streaming algorithm to compute the cost of a Minimum Spanning Tree (MST) exactly, even in graph streams, which in turn immediately provides an space algorithm to estimate TSP cost within a factor of 2. Thus, in the semi-streaming regime, the key challenge is to estimate TSP cost within a factor that is strictly better than 2. Recently, Chen, Khanna, and Tan [5] proposed a deterministic two-pass 1.96-approximation factor algorithm for metric TSP cost estimation in the semi-streaming model. For the case of -TSP, using the approach of Behnezhad et al. [3], it is possible to provide a -approximation factor algorithm in the semi-streaming model. In [3], authors presented a sub-linear version of their algorithm; however, it is straightforward to implement their algorithm in the semi-streaming model.
In section 11 of [3], the authors showed a reduction from (1,2)-TSP to maximum matching and stated that achieving better approximation than for (1,2)-TSP in the sub-linear model, solves an important open problem in sub-linear maximum matching. Considering the same reduction in semi-streaming model shows achieving non-trivial approximations for (1,2)-TSP in semi-streaming model is challenging. Since the maximum matching problem is studied in the semi-streaming model extensively, the following question naturally arises.
Question. What is the trade off between the approximation ratio and the number of passes for (1,2)-TSP in the semi-streaming model?
Maximum Path Cover.
In an unweighted graph , a subset of edges is called a path cover if it forms a union of vertex-disjoint paths. A maximum path cover (MPC222Throughout this paper we use this acronym for ’Maximum Path Cover’. Please note that we do not refer to the common abbreviation for ’Massively Parallel Computation’.) in an unweighted graph is a path cover with the maximum number of edges (not paths) among all possible path covers in the graph. The problem of finding an MPC is known to be -complete. It is straightforward to see that a maximum matching provides a -approximation for MPC. Therefore, computing a maximal matching, which is a -approximation for maximum matching, yields a -approximate solution for MPC.
Behnezhad et al. [3] developed a -approximate MPC algorithm, which provides a -approximate solution for -TSP. Their algorithm can be implemented in one pass within the semi-streaming model using space to return the cost, and in two passes if the approximate solution itself is required. Our primary contribution is an improvement in the approximation factor of their algorithm.
Result 1 (Formally as Theorem 8). For a given unweighted graph , there is a semi-streaming algorithm that returns a -approximation of MPC in passes.
-TSP.
The classical problem -TSP is well-studied and known to be -hard [15], and even -hard [20]. One can easily observe that in an instance of -TSP, the optimal tour is almost the same as finding the MPC of the induced subgraph on edges with weight 1 and then joining their endpoints with edges with weight 2, except for a possible difference of 1 (in the case that there exists a Hamiltonian cycle all of whose edges have weight 1). A simple computation shows that if one can find a set of vertex-disjoint paths that is at least times the optimal size (), then one can also find a tour whose cost is no more than times the optimal cost for -TSP. Thus, Result 1 implies the following result.
Result 2 (Formally as Theorem 10). For an instance of -TSP, there is a semi-streaming algorithm that returns a ()-approximation of -TSP in passes.
In the second part of the paper, we examine Max-TSP in the semi-streaming model.
Max-TSP.
For a given complete weighted graph , the goal of Max-TSP is to find a Hamiltonian cycle such that the sum of the weights of the edges in this cycle is maximized.
It is evident that a maximum weighted matching provides a -approximation for the cost of Max-TSP. Consequently, the result of Huang and Saranurak [13], which computes a -approximate maximum weight matching in the semi-streaming model, yields a ()-approximation for Max-TSP. In this paper, we improve this bound to . Our result is as follows.
Result 3 (Formally as Theorem 16). For a given weighted graph , there is a semi-streaming algorithm that returns a -approximation of Max-TSP in passes.
To the best of our knowledge, this is the first non-trivial approximation algorithm for Max-TSP in the semi-streaming model.
Further related work
Our approach for computing MPC, -TSP and Max-TSP mainly uses the subroutines for computing maximum matching in unweighted graphs and maximum weight matching in weighted graphs.
In the semi-streaming model, Fischer, Mitrovic and Uitto [10] gave a -approximation for the maximum matching problem in passes. This result was an improvement over the passes algorithm by McGregor [16].
For the maximum weight matching in the semi-streaming model, Paz and Schwartzman gave a simple deterministic single-pass -approximation algorithm [21]. Gamlath, Kale, Mitrovic, and Svensson gave a -approximation streaming algorithm that uses passes and memory. This was the first -approximation streaming algorithm for weighted matching that uses a constant number of passes (only depending on ) [11]. Also, Huang and Su in [13], gave a deterministic -approximation for maximum weighted matching using passes in the semi-streaming model. When is smaller than a constant but at least , their algorithm is more efficient than [11].
1.1 Notation
Let be a simple graph. We denote the set of vertices and edges of by and respectively. We also denote the maximum matching size in by and the size of the MPC in by .
For a subset of edges , we denote as the contraction of on , which is the graph derived by repeatedly removing edges of (it is well-known that the order does not matter) from the graph and merging its endpoints to be a single node in the new graph. Note that after contraction the graph might have parallel edges, but this does not interfere with our algorithm. In the weighted case, if is the weight of edge , then we define to be the sum of weights of the elements of i.e. . Let be a path of length ( for ). We call and end points of and for middle points of .
2 Technical Overview and Our Contribution
We propose a simple algorithm that constructs a path cover with an approximation factor of almost for MPC. This new algorithm merely depends on basic operations and computing matching and approximate matching.
Our algorithm for MPC is as follows. Assume that is a -approximation algorithm for computing maximum matching in an unweighted graph . We use as a subroutine in our algorithm, which has two phases. In the first phase, we run to compute a -approximate maximum matching, denoted by . In the next phase, we contract all edges in to obtain a new graph . Then, we compute a -approximate maximum matching, denoted by , for using . Finally, we return the edges of and as a path cover for (see Algorithm 1).
We will show that the output of our algorithm is a collection of vertex-disjoint paths, i.e., a valid path cover (see Lemma 1).
The algorithm is simple, but proving that its approximation factor is is challenging. As a warm-up, it is straightforward to see that by computing a maximum matching in the first phase, we achieve a -approximate MPC. However, the challenge lies in the second phase, which helps to improve the approximation factor.
Now we explain the idea of our proof to find the approximation factor of Algorithm 1. For the matching in graph , we provide a lower bound for . We show that if we consider a maximum path cover and contract on , the contracted graph becomes a particular graph in which we can find a lower bound on the size of its maximum matching. Let be a maximum matching of , then this results in a lower bound for . Finally, we exploit this lower bound for together with a lower bound for , to come up with the approximation factor of Algorithm 1.
We explain how to implement this algorithm in the semi-streaming model, achieving an improved approximation factor for -TSP within this model.
For the Max-TSP, we use a similar algorithm, except we compute maximum weight matching instead of maximum matching (see Algorithm 3). By computing the approximation factor of this algorithm, we provide a non-trivial approximation algorithm for Max-TSP in the semi-streaming model. Despite the extensive study of the weighted version of the maximum matching problem, Max-TSP has not been studied extensively in the literature within the semi-streaming model. One reason could be that it is not possible to extend the approaches for the unweighted version to the weighted version. Fortunately, we can extend our algorithm to the weighted version and improve the approximation factor of Max-TSP in the semi-streaming model. However, our method for analyzing the approximation factor of Algorithm 1 does not apply to the weighted version, so we present a different proof approach for computing the approximation factor of Algorithm 3.
3 Improved Approximation Factor Semi-Streaming Algorithm for MPC
In Algorithm 1, we presented our novel algorithm for MPC. This section provides an analysis of its approximation factor, followed by a detailed explanation of its streaming implementation.
We start by proving the correctness of this algorithm.
Lemma 1.
If and are the matchings obtained in Algorithm 1, then forms a path cover for .
Proof.
We claim that is a vertex-disjoint union of paths of length or . As a result, it is a path cover. Suppose . Let us denote the vertices of by where represents the vertices and , merged in the contracted graph and ’s for are the rest of the vertices. Let be an arbitrary edge. By symmetry between and , there are three cases as follows:
-
1.
.
In this case, and are intact vertices after contraction, which means there are no edges in adjacent to and . Since and is a matching, there are no other edges in adjacent to and in . As a result, would be a path of length in . -
2.
and .
In this case, for some . As a result, would be or in . By symmetry, assume that in . Since is a matching, no other edges in are adjacent to and . No edge in is adjacent to . Since is a matching, the only edge in adjacent to at least one of and in is . Finally, we can see that is a path of length in . -
3.
.
In this case, let and and by symmetry, assume that is the edge connecting to in . Since is a matching, the only edges in adjacent to at least one of are and . Since is a matching, the only edge in adjacent to at least one of is . As a result, would be a path of length in .
So, is a union of vertex-disjoint paths of length or .
3.1 Analysis of the Approximation Factor of Algorithm 1
We start with a simple and basic lemmas that is crucial in our proof.
Lemma 2.
Let be an arbitrary graph. We have:
Proof.
Since every matching is a path cover, we have . Also, given an MPC, we can select every other edge in this MPC to obtain a matching that contains at least half of its edges, which implies .
Corollary 3.
If is a -approximation of maximum matching in graph , then
We now present a lemma regarding the size of the maximum matching in a specific type of graph. This lemma may be of independent interest. In this paper we utilize this lemma on to derive a lower bound for .
Lemma 4.
Assume is a graph without loops such that each vertex of has degree ,, or . If denotes the set of vertices of degree in , then we have
Proof.
The proof of this lemma is a bit long and technical. Here, we provide the proof sketch of the lemma due to page limit constraints and we refer the reader to the full version of the paper to see the complete proof.333A full version of this extended abstract can be found at [2]. The main tools for the proof are induction and a charging scheme. The induction is on the number of edges between nodes of degree 2 or 4 in graph denoted by .
In a high level, we provide some special local topological properties of the graph. If does not satisfy at least one of the properties, then we do some delicate local changes on to get such that . Next, we improve a matching in to a matching for with the desired size.
In the case that satisfies all of the properties, we provide a charging scheme that takes some tokens for each edge in an arbitrary maximum matching and spread these tokens between incident edges, which proves the desired inequality.
Using above lemma, we have the main lemma of this section as follows.
Lemma 5.
If is an arbitrary matching in a graph , then
Proof.
Assume is a maximum path cover in such that is maximal. We claim that every connects two middle points of . The proof of this claim follows from a case by case argument. For the sake of contradiction, assume does not connect two middle points of . We have three cases for and as follows.
-
None of and belong to (case 1).
-
Exactly one of them (say ) belongs to . Then, is an end point (case 2), or is a middle point (case 3).
-
Both and belong to . Then, we have two sub cases.
and are on different paths. Then either they are both end points (case 4), or one is a middle point (say ) and the other one is an end point (case 5). Note that we have considered that both of and are not middle points at the same time.
and belong to the same path. Then, either they are both end points (case 6), or one is a middle point (say ) and the other one is an end point (case 7). Note that we have assumed both of them are not middle points at the same time.
Now, we explain each case in detail.
-
1.
Neither nor belongs to .
This case is impossible because is a path cover with a size larger than , which is in contradiction with being MPC (see Figure 1(a)). -
2.
is an end point of a path in and is not contained in .
Again, this case is impossible since is a path cover with a size larger than , which is in contradiction with being MPC (see Figure 1(b)). -
3.
is a middle point of a path in and is not contained in .
Let be the path in containing . Replace by which is an MPC of (see Figure 1(c)). Since , we have . Therefore, increments. This is in contradiction with being maximal. -
4.
and are end points of different paths in .
In this case, let and be the paths in containing and , respectively. would be a path cover of size greater than which is in contradiction with being MPC (see Figure 1(d)). -
5.
and are the middle and end points of different paths in , respectively.
In this case, let and be the paths in containing and respectively. Replace by which is an MPC of (see Figure 1(e)). Since , we have . Therefore, increments. This is in contradiction with being maximal. -
6.
and are end points of the same path in .
In this case, let be the path in containing and . Replace by which is an MPC of (see Figure 1(f)). Since we have . Therefore, increments. This is in contradiction with being maximal. -
7.
and are the middle and end points of the same path in , respectively.
In this case, let be the path in containing and (since , we have ). Replace by which is an MPC of (see Figure 1(g)). Since , we have . Therefore, increments. This is in contradiction with being maximal.
Each case leads to a contradiction, implying that every connects two middle points of .
Now, contraction of each makes a vertex of degree 4 in . Contraction of each makes a vertex of degree 2 and decrements the number of edges in . As a result, is a graph whose vertices’ degrees are 1,2 or 4, and . Finally, using Lemma 4 for we have
Since is a subgraph of we have
Using the above results, we compute the approximation factor of Algorithm 1.
Theorem 6.
The approximation factor of Algorithm 1 is , i.e.,
(1) |
Proof.
By Corollary 3 and, Lemma 5, we have
Since is a path cover, we have . Hence, the approximation factor of Algorithm 1 is at least .
Now, we show that our analysis of the approximation factor of Algorithm 1 is tight. Consider the graph in Figure 2(a) and denote it by . If we run Algorithm 1 on , then the edges of could be the red edges shown in Figure 2(b). After contracting on , we have shown in Figure 2(c). Finally, the second matching found by Algorithm 1 in contains at most one edge which implies . On the other hand, maximum path cover in contains edges shown in Figure 2(d).
As a result, , so this example and Theorem 6 imply that the approximation factor of Algorithm 1 is .
3.2 Implementation of Algorithm 1 in the Semi-Streaming Model
Now we explain how to implement Algorithm 1 in the semi-streaming model. We start with the following theorem by Fischer, Mitrovic and Uitto [10].
Theorem 7 (Theorem 1.1 in [10]).
Given a graph on vertices, there is a deterministic -approximation algorithm for maximum matching that runs in passes in the semi-streaming model. Furthermore, the algorithm requires words of memory.
To implement Algorithm 1 in the semi-streaming model, we proceed as follows: In the first phase, by applying Theorem 7, we compute a -approximate matching for the graph , denoted as . At the end of this phase, we have the edges of this matching. In the second phase, we again apply Theorem 7 to compute a matching for in the streaming model.
During the second phase, when we apply the algorithm of Theorem 7, while processing each edge in the stream, we follow these rules: If , we ignore this edge. If , but one of or is an endpoint of an edge in (e.g., ), then since is contracted, we consider and as a single vertex, . In this case, is considered adjacent to the new vertex . Consequently, we can compute a -approximation matching for in the next passes.
Thus, combining the results of Algorithm 1, Theorem 6, and Theorem 7, we have the main result of this section:
Theorem 8.
Given an unweighted graph on vertices, there is a deterministic algorithm that returns a -approximate MPC in the semi-streaming model in passes.
4 -TSP
In this section, we present our algorithm for the -TSP, detailed in Algorithm 2, and analyze its approximation factor. We also provide an explanation of how to implement this algorithm in the semi-streaming model.
Theorem 9.
The approximation factor of Algorithm 2 for -TSP is .
Proof.
Let be the optimal solution of (1,2)-TSP, be the size of an MPC in , and be the number of vertices of . Since every Hamiltonian cycle contains edges with weights or , we have . We also have or , where occurs only when contains a Hamiltonian cycle consisting solely of edges with weight 1.
Let be the size of the path cover obtained by Algorithm 1 in . The Hamiltonian cycle obtained by Algorithm 2 has a cost of at most . Let be the approximation factor of Algorithm 1, then we have . As a result, . We also have:
(2) | |||||
By Theorem 6, we have . Using Equation 2, we conclude that:
Hence, . So, the approximation factor of our algorithm for (1,2)-TSP is .
4.1 Implementation of Algorithm 2 in the Semi-Streaming Model
For a given instance of -TSP in the streaming model, we compute an approximate MPC for the induced subgraph on the edges of weight as explained in Theorem 8, then we add extra edges to connect these paths and vertices not in these paths arbitrarily to construct a Hamiltonian cycle, which gives us a -approximate tour for -TSP. So, we have the main result of this section as follows.
Theorem 10.
Given an instance of -TSP on vertices, there is a deterministic algorithm that returns a -approximate -TSP in the semi-streaming model in passes.
5 Max-TSP
In this section, we introduce our algorithm for Max-TSP, which closely resembles our approach for MPC. The key difference is that, instead of using , we employ a subroutine to compute an approximate maximum weight matching in a weighted graph.
Let be a subroutine for computing a -approximate maximum weighted matching in a weighted graph . First, we compute a matching for using . Then, we contract the edges of to obtain another graph and compute another matching, , for using again. We derive the union of the two weighted matchings, . Similar to Lemma 1, it is evident that forms a union of vertex-disjoint paths in . Finally, since the graph is complete, there can be only one vertex that is not in . In this case we connect this vertex to one of the paths in . Now, we add edges arbitrarily between the endpoints of the paths in to obtain a Hamiltonian cycle for .
Note that after contracting on to obtain , this new graph might have parallel edges between to vertices. Since we aim to find a maximum matching in , we can simply consider the edge with the largest weight for parallel edges and ignore the rest.
5.1 Analysis of the Approximation Factor of Algorithm 3
To analyze the approximation factor of Algorithm 3, we begin with a series of lemmas.
Lemma 11.
Suppose is a cycle of length in a weighted graph . Then, there exists a matching such that .
Proof.
Assume that is the edge with the minimum weight. Hence, . Since is a path, there is a matching (which is also a subset of ) whose weight is at least . Finally,
Lemma 12.
Suppose is a path or a cycle in a weighted graph . Then there exists a matching such that .
Proof.
We have two cases
-
is a path. Enumerate the edges of from one end point to the other. The odd numbered edges form a matching called . The same applies for even numbered edges, which form a matching called . Since , at least one of these two matchings has weight no less than .
-
is a cycle. If it is a cycle of length (i.e. consists of two parallel edges), then obviously we can pick the edge with bigger weight that satisfies . If the length of is at least , then using Lemma 11, we conclude that there is a matching such that .
Now, we provide a lemma similar to Lemma 5 which works for the weighted version.
Lemma 13.
Suppose is a matching in a weighted graph and is a maximum weight Hamiltonian cycle of . Then,
Proof.
We contract on in two steps. First, we contract on the edges in . Next, we contract the resulting graph on . After the first step, is a cycle with weight (see Figure 3).
Here is a matching that connects some vertices of together (see Figure 4(a), Figure 4(b) and Figure 4(c)).
Assume that the length of is .
Using Lemma 11, there is a matching whose weight is at least
(see Figure 4(d)).
Since the matching contains at most half of the edges of , we conclude that . As a result,
(3) | |||||
Since , we conclude that . Because and are matchings, it follows that is a union of disjoint paths and cycles (see Figure 4(e)).
As a result, after doing the second step of contraction, would also be a disjoint union of paths and cycles (whose number of edges is equal to since ) in . For instance, if contains a cycle of length , then would contain a cycle of length which contains parallel edges.
Now, consider each connected component of . This component is either a path or a cycle. Hence, by Lemma 12, We obtain a matching with a weight of at least one-third of the weight of the component.
( and correspond to blue and red edges respectively).
Finally, since these components are vertex-disjoint, the union of obtained matching would be a matching whose weight is at least . Note that this matching is also a matching in . Hence, using Equation 3, we have
So, we have the following theorem which is a lower bound for the approximation factor of Algorithm 3.
Theorem 14.
The approximation factor of Algorithm 3 is at least .
Proof.
Let be a maximum weight Hamiltonian cycle in . By Lemma 11, there exists at least one matching whose weight is at least
Since is a -approximation of the maximum weighted matching in we have
By using Lemma 13 for on , we have
Since the weight of the edges of are nonnegative, we have
Finally, is a Hamiltonian cycle which means . Hence, the approximation factor of Algorithm 1 is at least .
5.2 Implementation of Algorithm 3 in the semi-streaming model
The implementation of Algorithm 3 in the semi-streaming model follows a similar approach as described in the previous section. Therefore, we omit a detailed explanation here. However, note that in this part, we should use a subroutine for computing a -approximate maximum weight matching in the semi-streaming model. First, we recall the following theorem from [13]. We use the algorithm of this theorem as in our semi-streaming implementation of Algorithm 3.
Theorem 15 (Theorem 1.3 in [13]).
There exists a deterministic algorithm that returns a -approximate maximum weight matching using passes in the semi-streaming model. The algorithm requires words of memory where is the maximum edge weight in the graph.
Thus, Algorithm 3, Theorem 14, and Theorem 15 present the following theorem for Max-TSP in the semi-streaming model.
Theorem 16.
Given an instance of Max-TSP on vertices, there is an algorithm that returns a -approximate Max-TSP the semi-streaming model in passes. The algorithm requires words of memory where is the maximum edge weight in the graph.
6 Future Work
As a future work we propose the following algorithm that can help to improve the approximation factor for MPC in the semi-streaming model. The algorithm improves Algorithm 1 by iteratively finding new matchings and contracting the graph over these matchings. It is crucial to ensure that this process preserves the path cover property. Hence, during the th iteration of our loop, we must remove all the edges in that are incident to a middle point of any path (connected component) within . This is because must remain a path cover, which means cannot include any edge incident to a middle point of a path in . See Algorithm 4.
We leave the computation of the approximation factor of Algorithm 4 as a challenging open problem. Currently, we know that the approximation factor is at most . Consider the graph in Figure 5(a): the algorithm may select the red edges as . After contraction, it might select the red edge in Figure 5(b) as . In the next iteration, the graph becomes empty, as we must remove any edge incident to a middle point of . Thus, the algorithm terminates with a path of length 3. However, the MPC has 4 edges (see Figure 5(c)).
The main bottleneck to find the approximation ratio of Algorithm 4 is that after the second iteration, there might be a lot of edges that we have to remove from the contracted graph in order to make sure that the union of matchings remains a path cover. More precisely, while running line 5 of Algorithm 4, a bunch of edges that are contained in every maximum path cover might be removed. We are not aware of any argument how to bound the number of these edges. This prevents us to provide an argument like Lemma 5 and Lemma 13. Finding the exact approximation ratio of Algorithm 4 seems to require clever new ideas, already for three matchings.
References
- [1] Anna Adamaszek, Matthias Mnich, and Katarzyna Paluch. New approximation algorithms for (1, 2)-tsp. In Ioannis Chatzigiannakis, Christos Kaklamanis, Dániel Marx, and Donald Sannella, editors, 45th International Colloquium on Automata, Languages, and Programming, ICALP 2018, July 9-13, 2018, Prague, Czech Republic, volume 107 of LIPIcs, pages 9:1–9:14. Schloss Dagstuhl – Leibniz-Zentrum für Informatik, 2018. doi:10.4230/LIPIcs.ICALP.2018.9.
- [2] Sharareh Alipour, Ermiya Farokhnejad, and Tobias Mömke. Improved approximation algorithms for (1,2)-tsp and max-tsp using path covers in the semi-streaming model, 2025. arXiv:2501.04813.
- [3] Soheil Behnezhad, Mohammad Roghani, Aviad Rubinstein, and Amin Saberi. Sublinear algorithms for TSP via path covers. CoRR, abs/2301.05350, 2023. doi:10.48550/arXiv.2301.05350.
- [4] Yu Chen, Sampath Kannan, and Sanjeev Khanna. Sublinear algorithms and lower bounds for metric TSP cost estimation. In Artur Czumaj, Anuj Dawar, and Emanuela Merelli, editors, 47th International Colloquium on Automata, Languages, and Programming, ICALP 2020, July 8-11, 2020, Saarbrücken, Germany (Virtual Conference), volume 168 of LIPIcs, pages 30:1–30:19. Schloss Dagstuhl – Leibniz-Zentrum für Informatik, 2020. doi:10.4230/LIPIcs.ICALP.2020.30.
- [5] Yu Chen, Sanjeev Khanna, and Zihan Tan. Sublinear algorithms and lower bounds for estimating MST and TSP cost in general metrics. In Kousha Etessami, Uriel Feige, and Gabriele Puppis, editors, 50th International Colloquium on Automata, Languages, and Programming, ICALP 2023, July 10-14, 2023, Paderborn, Germany, volume 261 of LIPIcs, pages 37:1–37:16. Schloss Dagstuhl – Leibniz-Zentrum für Informatik, 2023. doi:10.4230/LIPIcs.ICALP.2023.37.
- [6] Artur Czumaj and Christian Sohler. Estimating the weight of metric minimum spanning trees in sublinear-time. In László Babai, editor, Proceedings of the 36th Annual ACM Symposium on Theory of Computing, Chicago, IL, USA, June 13-16, 2004, pages 175–183. ACM, 2004. doi:10.1145/1007352.1007386.
- [7] Doratha E. Drake and Stefan Hougardy. Improved linear time approximation algorithms for weighted matchings. In Sanjeev Arora, Klaus Jansen, José D. P. Rolim, and Amit Sahai, editors, Approximation, Randomization, and Combinatorial Optimization: Algorithms and Techniques, 6th International Workshop on Approximation Algorithms for Combinatorial Optimization Problems, APPROX 2003 and 7th International Workshop on Randomization and Approximation Techniques in Computer Science, RANDOM 2003, Princeton, NJ, USA, August 24-26, 2003, Proceedings, volume 2764 of Lecture Notes in Computer Science, pages 14–23. Springer, 2003. doi:10.1007/978-3-540-45198-3_2.
- [8] Joan Feigenbaum, Sampath Kannan, Andrew McGregor, Siddharth Suri, and Jian Zhang. Graph distances in the streaming model: the value of space. In Proceedings of the Sixteenth Annual ACM-SIAM Symposium on Discrete Algorithms, SODA 2005, Vancouver, British Columbia, Canada, January 23-25, 2005, pages 745–754. SIAM, 2005. URL: http://dl.acm.org/citation.cfm?id=1070432.1070537.
- [9] Joan Feigenbaum, Sampath Kannan, Andrew McGregor, Siddharth Suri, and Jian Zhang. On graph problems in a semi-streaming model. Theor. Comput. Sci., 348(2-3):207–216, 2005. doi:10.1016/J.TCS.2005.09.013.
- [10] Manuela Fischer, Slobodan Mitrovic, and Jara Uitto. Deterministic (1+)-approximate maximum matching with poly(1/) passes in the semi-streaming model and beyond. In Stefano Leonardi and Anupam Gupta, editors, STOC ’22: 54th Annual ACM SIGACT Symposium on Theory of Computing, Rome, Italy, June 20 - 24, 2022, pages 248–260. ACM, 2022. doi:10.1145/3519935.3520039.
- [11] Buddhima Gamlath, Sagar Kale, Slobodan Mitrovic, and Ola Svensson. Weighted matchings via unweighted augmentations. In Peter Robinson and Faith Ellen, editors, Proceedings of the 2019 ACM Symposium on Principles of Distributed Computing, PODC 2019, Toronto, ON, Canada, July 29 - August 2, 2019, pages 491–500. ACM, 2019. doi:10.1145/3293611.3331603.
- [12] Shayan Oveis Gharan, Amin Saberi, and Mohit Singh. A randomized rounding approach to the traveling salesman problem. In Rafail Ostrovsky, editor, IEEE 52nd Annual Symposium on Foundations of Computer Science, FOCS 2011, Palm Springs, CA, USA, October 22-25, 2011, pages 550–559. IEEE Computer Society, 2011. doi:10.1109/FOCS.2011.80.
- [13] Shang-En Huang and Hsin-Hao Su. (1-1013)-approximate maximum weighted matching in poly(1/1013, log n) time in the distributed and parallel settings. In Rotem Oshman, Alexandre Nolin, Magnús M. Halldórsson, and Alkida Balliu, editors, Proceedings of the 2023 ACM Symposium on Principles of Distributed Computing, PODC 2023, Orlando, FL, USA, June 19-23, 2023, pages 44–54. ACM, 2023. doi:10.1145/3583668.3594570.
- [14] Anna R. Karlin, Nathan Klein, and Shayan Oveis Gharan. A (slightly) improved approximation algorithm for metric TSP. In Samir Khuller and Virginia Vassilevska Williams, editors, STOC ’21: 53rd Annual ACM SIGACT Symposium on Theory of Computing, Virtual Event, Italy, June 21-25, 2021, pages 32–45. ACM, 2021. doi:10.1145/3406325.3451009.
- [15] Richard M. Karp. Reducibility among combinatorial problems. In Raymond E. Miller and James W. Thatcher, editors, Proceedings of a symposium on the Complexity of Computer Computations, held March 20-22, 1972, at the IBM Thomas J. Watson Research Center, Yorktown Heights, New York, USA, The IBM Research Symposia Series, pages 85–103. Plenum Press, New York, 1972. doi:10.1007/978-1-4684-2001-2_9.
- [16] Andrew McGregor. Finding graph matchings in data streams. In Chandra Chekuri, Klaus Jansen, José D. P. Rolim, and Luca Trevisan, editors, Approximation, Randomization and Combinatorial Optimization, Algorithms and Techniques, 8th International Workshop on Approximation Algorithms for Combinatorial Optimization Problems, APPROX 2005 and 9th InternationalWorkshop on Randomization and Computation, RANDOM 2005, Berkeley, CA, USA, August 22-24, 2005, Proceedings, volume 3624 of Lecture Notes in Computer Science, pages 170–181. Springer, 2005. doi:10.1007/11538462_15.
- [17] Matthias Mnich and Tobias Mömke. Improved integrality gap upper bounds for traveling salesperson problems with distances one and two. Eur. J. Oper. Res., 266(2):436–457, 2018. doi:10.1016/j.ejor.2017.09.036.
- [18] Tobias Mömke and Ola Svensson. Approximating graphic TSP by matchings. CoRR, abs/1104.3090, 2011. arXiv:1104.3090.
- [19] Marcin Mucha. 13/9-approximation for graphic TSP. In Christoph Dürr and Thomas Wilke, editors, 29th International Symposium on Theoretical Aspects of Computer Science, STACS 2012, February 29th - March 3rd, 2012, Paris, France, volume 14 of LIPIcs, pages 30–41. Schloss Dagstuhl – Leibniz-Zentrum für Informatik, 2012. doi:10.4230/LIPIcs.STACS.2012.30.
- [20] Christos H. Papadimitriou and Mihalis Yannakakis. Optimization, approximation, and complexity classes. J. Comput. Syst. Sci., 43(3):425–440, 1991. doi:10.1016/0022-0000(91)90023-X.
- [21] Ami Paz and Gregory Schwartzman. A (2+ )-approximation for maximum weight matching in the semi-streaming model. ACM Transactions on Algorithms (TALG), 15(2):1–15, 2018. doi:10.1145/3274668.
- [22] Sartaj Sahni and Teofilo F. Gonzalez. P-complete approximation problems. J. ACM, 23(3):555–565, 1976. doi:10.1145/321958.321975.
- [23] András Sebö and Jens Vygen. Shorter tours by nicer ears: 7/5-approximation for the graph-tsp, 3/2 for the path version, and 4/3 for two-edge-connected subgraphs. Comb., 34(5):597–629, 2014. doi:10.1007/s00493-014-2960-3.
- [24] Xianghui Zhong. On the approximation ratio of the k-opt and lin-kernighan algorithm for metric and graph TSP. In Fabrizio Grandoni, Grzegorz Herman, and Peter Sanders, editors, 28th Annual European Symposium on Algorithms, ESA 2020, September 7-9, 2020, Pisa, Italy (Virtual Conference), volume 173 of LIPIcs, pages 83:1–83:13. Schloss Dagstuhl – Leibniz-Zentrum für Informatik, 2020. doi:10.4230/LIPIcs.ESA.2020.83.
- [25] Xianghui Zhong. On the approximation ratio of the 3-opt algorithm for the (1, 2)-tsp. CoRR, abs/2103.00504, 2021. arXiv:2103.00504.