Fantastic Flips and Where to Find Them: A General Framework for Parameterized Local Search on Partitioning Problems
Abstract
Parameterized local search combines classic local search heuristics with the paradigm of parameterized algorithmics. While most local search algorithms aim to improve given solutions by performing one single operation on a given solution, the parameterized approach aims to improve a solution by performing simultaneous operations. Herein, is a parameter called search radius for which the value can be chosen by a user. One major goal in the field of parameterized local search is to outline the trade-off between the size of and the running time of the local search step.
In this work, we introduce an abstract framework that generalizes natural parameterized local search approaches for a large class of partitioning problems: Given items that are partitioned into bins and a target function that evaluates the quality of the current partition, one asks whether it is possible to improve the solution by removing up to items from their current bins and reassigning them to other bins. Among others, our framework applies for the local search versions of problems like Cluster Editing, Vector Bin Packing, and Nash Social Welfare. Motivated by a real-world application of the problem Vector Bin Packing, we introduce a parameter called number of types and show that all problems fitting in our framework can be solved in time, where denotes the total input size. In case of Cluster Editing, the parameter generalizes the well-known parameter neighborhood diversity of the input graph.
We complement these algorithms by showing that for all considered problems, an algorithm significantly improving over our algorithm with running time would contradict the Exponential Time Hypothesis. Additionally, we show that even on very restricted instances, all considered problems are W[1]-hard when parameterized by the search radius alone. In case of the local search version of Vector Bin Packing, we provide an even stronger W[1]-hardness result.
Keywords and phrases:
Flip-Neighborhood, Cluster Editing, Vector Bin Packing, Vertex Cover, NP-hard problem, Max -CutFunding:
Niels Grüttemeier: Supported by the project Datenfabrik.NRW, a project by KI.NRW, funded by the Ministry for Economics, Innovation, Digitalization and Energy of the State of North Rhine-Westphalia (MWIDE).Copyright and License:
2012 ACM Subject Classification:
Theory of computation Parameterized complexity and exact algorithms ; Computing methodologies Discrete space searchAcknowledgements:
We would like to thank Lea Hoeing, Kai Lebbing, Christiane Markuse-Schneider, and Lukas Ptock (Schmitz Cargobull AG) for showing us the interesting application of Vector Bin Packing.Editors:
Pat Morin and Eunjin OhSeries and Publisher:
Leibniz International Proceedings in Informatics, Schloss Dagstuhl – Leibniz-Zentrum für Informatik
1 Introduction
The principle of local search is among the most important heuristic approaches in combinatorial optimization and it is highly relevant to find good solutions to NP-hard problems in practice [19]. The idea is to apply small modifications on a given starting solution to obtain a new solution with a better target value than the starting solution. Local search has been studied extensively and it has been proven to be highly efficient [2, 19, 26, 1]. Furthermore, it is easy to understand and it is also a good plugin to improve already competitive solutions provided by other metaheuristics [31, 10].
Consider a classic partitioning problem as Multi Knapsack, where the goal is to assign items (each with a weight) to multiple knapsacks (each with a weight capacity and specific values for the items) in a way that all capacity constraints are satisfied and the total value is maximal. One of the most natural ways to apply small modifications in a local search scenario is an item flip, where one removes a single item from its current knapsack and inserts it into another knapsack. This natural idea of flipping the assignment of single items to improve a solution is studied for knapsack problems [4] and for other partitioning problems [8, 27]. A general drawback in performing these single item flips – and also in local search in general – is the chance of getting stuck in poor local optimal solutions. Thus, to obtain a robust local search application, a strategy to prevent getting stuck in these poor local optima is required.
In this work, we consider the approach of parameterized local search [29, 7] to decrease the chance of getting stuck in poor local optima reached by item flips. Instead of searching for an improving solution by one single operation (here: a single item flip), the user may set a search radius to extend the search space for possible improvements that can be reached with up to simultaneous operations. Thus, the idea is to make the search space larger so that getting stuck in poor local optima becomes less likely. Parameterized local search combines local search with the paradigm of parameterized algorithmics [3] and aims to outline the trade-off between the size of the search radius and the running time of the search step. Parameterized local search has been studied extensively in the algorithmic community: for vertex deletion and partitioning problem in graphs [7, 13, 16, 18, 5, 21, 11, 10], for problems on strings and phylogenetic networks [17, 23], and many other problems [29, 32, 9, 15]. One major goal in parameterized local search is to show that finding an improvement within search radius is fixed parameter tractable (FPT) for . That is, finding an improving solution can be done in time , where is the total encoding length of the input instance and is some computational function only depending on . Note that an algorithm with such a running time nicely outlines the trade-off between radius size and running time, as the superpolynomial part only depends on while only contributes as a polynomial factor to the running time. Unfortunately, most parameterized local search problems are W[1]-hard for [29, 7, 17, 30] and therefore, an algorithm with running time presumably does not exist.
Motivated by the negative results for the parameter , one often studies the combination of and some structural parameter to obtain algorithms with running . This approach has been successful both from a theoretical and experimental perspective [18, 21, 12, 10]. In this work, we follow this direction by studying parameterization by and an additional parameter that we call the number of types. We consider the local search versions of a large class of well-known combinatorial problems including Max -Cut, Multi Knapsack, Cluster Editing and Vector Bin Packing, where the search space is defined by performing flips. Recall that one flip intuitively removes one item from its assigned set and inserts in into some other set. The interpretation of flips and of our parameter always depends on the concrete problem and will be explained for each problem individually.
Our approach of exploiting the parameter is motivated by a real-world production planning application at the company Schmitz Cargobull AG. Planning the production at Schmitz Cargobull AG corresponds to solving an instance of Vector Bin Packing [28]. In Vector Bin Packing, one is given a large collection of vectors together with a vector and an integer . The question is, whether there exists a partition of into parts , such that for all . In the application at Schmitz Cargobull AG, we have and the vectors correspond to customer orders where the entries of specify whether a specific option is chosen ( the entry has value ) or not ( the entry has value ). The entries of correspond to the production capacities available for the corresponding product option at one day of production. Instead of asking for a partition where each resulting vector set satisfies , one asks for a partition minimizing the total overload. More precisely, the overload of a single set is defined as , and the total overload is the sum of all overloads of all sets . Note that the total overload is if and only if the given instance is a yes-instance of the decision version of Vector Bin Packing. While Schmitz Cargobull AG usually receives a large number of customer orders, relatively many of these orders request the exact same product option combinations. Consequently, the number of distinct vectors in is much smaller than . Therefore, our research is motivated by setting to be the number of distinct vectors in an instance of Vector Bin Packing and study parameterized local search for the combination of and where the target is to minimize the total overload.
From a more abstract point of view, Vector Bin Packing is a problem where one assigns a collection of items (vectors) to a collection of bins (sets of the resulting partition) in a way that a target function (total overload) is minimized. Furthermore, if two elements from have the same vector, these elements have the same effect on the target function. In other words, there are only distinct ways in which a specific item might have an influence on the target function when assigned to a specific bin. This more abstract view leads to a framework providing running times and for the parameterized local search versions for a wide range of well-known combinatorial problems that behave in the same way as Vector Bin Packing. The running-time is particularly motivated since the value of is a small constant chosen by the user [24].
Recall that the concrete interpretation of the parameter , of the items, and of the bins always depends on the concrete problem and will be explained for each problem individually. Besides the number of distinct vectors, can be interpreted as the neighborhood diversity of an input graph in case of Max -Cut or Cluster Editing, or as the number of distinct value-weight combinations in case of Multi Knapsack.
Our Contributions.
In the first part (see Section 3), we introduce the LS Generalized Bin Problem, which is a general framework capturing many parameterized local search problems where the search radius is defined by a number of item flips. Moreover, we introduce the parameter number of types as an abstract concept for LS Generalized Bin Problem capturing well-known parameters such as the neighborhood diversity of a graph. We describe general algorithms for the abstract LS Generalized Bin Problem leading to running times of and for the local search versions of many problems that fit into our framework (see Theorem 3.5).
| Problem | Items | Bins | Parameter | Section |
|---|---|---|---|---|
| Max -Cut | vertices | color classes | neighborhood diversity | 4.1 |
| Cluster Editing | vertices | clusters | neighborhood diversity | 4.2 |
| Vector Bin Packing | vectors | bins | number of distinct vectors | 4.3 |
| Vertex Cover | vertices | (vc, is) | neighborhood diversity | 4.4 |
| Nash Social Welfare | items | agents | number of distinct items | 4.5 |
| Multi Knapsack | items | knapsacks | number of distinct items | 4.5 |
In the second part (see Section 4) we provide simple example applications of the introduced framework leading to new results for the parameterized local search versions of classic combinatorial optimization problems, graph problems, and one problem from computational social choice (Nash Social Welfare). The formal problem definitions can be found in Section 4. All results in this part mainly rely on simply reformulating the concrete problem as LS Generalized Bin Problem. An overview of the studied problems is given in Table 1. To the best of our knowledge, this is the first work studying parameterized local search for Nash Social Welfare, Vector Bin Packing, and Multi Knapsack.
In Section 5, we complement our results by studying parameterization by the search radius alone. We show that the parameterized local search versions of Nash Social Welfare and Multi Knapsack are W[1]-hard when parameterized by . Furthermore, we provide a strong hardness result for Vector Bin Packing showing W[1]-hardness for , where denotes the maximum number of non-zero-entries over all vectors from the input. The parameter is particularly motivated by the real-world application from the company Schmitz Cargobull AG, where is even smaller than . We finally show that all of our algorithms with running time are tight in the sense of the Exponential Time Hypothesis (ETH) [20].
2 Preliminaries
For integers and with , we define . Given a set and some integer , we call a mapping a -partition of or a -coloring of . For every , we let . The flip between two -partitions and is defined as . The flip distance between and is then defined as . For a set , we let denote the power set of .
For a graph , by we denote the number of vertices and by we denote the number of edges. By and by we denote the open neighborhood of and , respectively. Two vertices and have the same neighborhood class if . By we denote the number of neighborhood classes of . A vertex set is a vertex cover for if each edge of has at least one endpoint in . A vertex set is an independent set of , if no edge has of has both endpoints in .
Proofs of statements marked with are deferred to the full version.
3 Generalized Parameterized Local Search for Partitioning Problems
We present a framework that captures many computational problems such as Vector Bin Packing, Cluster Editing, and Nash Social Welfare. On a high level, all these problems have in common, that one aims to partition some set (e.g. a set of vectors, a set of vertices, or a set of items) into multiple “bins”. A target function assigns a value to the resulting partition. The goal is to find a partition that minimizes (or maximizes) the target function. This section is structured as follows. We first introduce a “Generalized Bin Problem” that generalizes the computational problems studied in this work. Afterwards, we introduce a parameter called “types”. Finally, we provide the algorithmic results for this parameter.
3.1 The Generalized Bin Problem
Intuitively, we aim to partition a given set into a given number of “bins” , and a target value specifies how good this -partition of is. This target value is determined by an “individual bin evaluation”, which is a collection of local target values of each single bin. These values are then combined to obtain the target value for the whole -partition of .
Definition 3.1.
Let be a set, let , and let . An individual bin evaluation (IBE) is a -tuple of functions . An IBE defines a target value for every -partition by
where is a commutative and associative binary operation satisfying for all .
While our general framework works for arbitrary commutative and associative operations , this work only considers concrete problems where is either the summation or the multiplication of integer numbers. The value corresponds to infeasible assignments of bins, for example, violating capacity constraints of a knapsack. In case of a minimization problem we consider IBE with and in case of a maximization problem we have . In the remainder of this section, all problem statements and algorithms are given for the case where one aims to minimize the target function. Maximization problems are defined analogously. With Definition 3.1 at hand, we define the following general computational problem for every fixed commutative and associative operation .
Generalized Bin Problem Input: An integer , a set , and an IBE . Goal: Find a -partition that minimizes .
Note that we did not yet specify how the IBE from the input is given. As this depends on the concrete problems, this will be discussed for each problem individually. Throughout this section, we analyze the algorithms running times with respect to the parameter denoting the running time needed for one evaluation of a value with .
Recall that our aim is to study parameterized local search for the flip neighborhood. More precisely, we aim to find a -partition that has a better target value than some given -partition , while for a given . The corresponding computational problem is defined as follows.
LS Generalized Bin Problem Input: An integer , a set , and an IBE , a -partition , and an integer . Question: Is there a -partition with such that ?
3.2 Types in Generalized Bins
We next define a parameter called “number of types” . The idea is, that in a polynomial-time preprocessing step, the set is partitioned into classes of elements in a way that all elements in each have the exact same impact on the target value for every possible bin assignment. The intuitive idea of “same impact” is formalized as follows.
Definition 3.2.
Let be a set, let , and let be an IBE for and . Two (not necessarily distinct) elements and are target equivalent (), if for every and for every with we have .
Proposition 3.3 ().
The relation is an equivalence relation on .
The main idea of our algorithm is that target equivalent elements can be treated equally as they have the same influence on the target value. When considering concrete problems, we always use a simple pairwise relation between the elements leading to a partition of into classes of pairwise target equivalent elements. These classes do not necessarily need to be maximal under this constraint. Thus, it suffices to consider the following relaxation of the equivalence classes.
Definition 3.4.
Let be a set, let , and let be an IBE for and . A tuple of disjoint sets with is called a type partition of if the elements of each are pairwise target equivalent. For every , we say that the elements of have type .
Obviously, the equivalence classes for always form a type partition with the minimum number of sets. Throughout this work, we assume that each instance of LS Generalized Bin Problem is associated with a specified type partition, and the parameter number of types is defined as the number of sets of the type partition associated with .
3.3 Algorithmic Results for LS Generalized Bin Problem
Our goal is to study LS Generalized Bin Problem parameterized by . Applying this on concrete problems then leads to FPT algorithms for a great range of parameterized local search versions of well-known computational problems. The interpretation of the parameter always depends on the concrete problem. We provide the following algorithmic results. Recall that denotes the running time needed for one evaluation of a value with .
Theorem 3.5.
LS Generalized Bin Problem can be solved
-
a)
in time, and
-
b)
in time
if a type partition is additionally given as part of the input.
To present the algorithms behind Theorem 3.5, we introduce the notion of type specification and type specification operations. Recall that elements of the same type behave in the same way when assigned to a bin. Thus, when evaluating the target function, one may consider the types of the assigned elements instead of the concrete elements. A type specification is a vector that intuitively corresponds to a collection of elements in containing exactly elements from the class .
Definition 3.6.
Let be an integer, let be a set, and let be an IBE. Moreover, let be a type partition. A type specification is a vector with for each .
Since elements of the same type have the same impact on the target function, we can address the change of target values by just specifying the types of elements added to and removed from with . To ensure that after adding and removing elements of specific types from a set corresponds to an actual subset of , we introduce the notion of subtractive and additive compatibility.
Definition 3.7.
Let be an integer, let be a set, and let be an IBE. Moreover, let be a type partition. We say that a type specification is
-
a)
subtractive compatible with a set if for every , we have .
-
b)
additive compatible with a set if for every , we have .
We use to denote that is subtractive compatible with and is additive compatible with . Given and with , the type vector operation for each is defined as
where is some set containing exactly arbitrary elements from for every , and is some set containing exactly arbitrary elements from for every .
The notion of target equivalence (Definition 3.2) together with the notion of subtractive and additive compatibility guarantees the following.
Proposition 3.8.
The type vector operation is well-defined.
Proof.
Since is subtractive compatible with , there are at least elements in and thus, the set exists. Analogously, the set exists due to the additive compatibility of with . Consequently, and therefore, is defined.
It remains to show that the value does not depend on the choice of elements in and . First, let and let with . Then,
by Definition 3.2. Analogously, let and let with we have
Therefore, the type vector operation is well-defined.
With the notion of type specifications at hand, we next present the algorithmic results. Intuitively, the algorithms behind Theorem 3.5 work as follows: One considers every possibility of how many elements of which type belong to the (at most) elements in . For each such choice, one computes the best improving flip corresponding to the choice. Note that the information how many elements of which type are flipped, can be encoded as a type specification in the way that all correspond to the number of flipped elements of type . We first describe the subroutine computing the improving -partition corresponding to a given type specification . Afterwards, we describe how to efficiently iterate over all possible to obtain the running times stated in Theorem 3.5.
We use the notation to indicate that for all vector entries, and we let denote the component-wise difference between and .
Lemma 3.9.
Let be an integer, let be a set, let be an IBE, and let be a -partition. Moreover, let be a type partition and let be a type specification and we let . A -partition with
that minimizes among all such -partitions can be computed in
-
a)
time, or in
-
b)
time.
Proof.
We first provide an algorithm based on dynamic programming and afterwards we discuss the running times and .
Intuition.
Before we formally describe the dynamic programming algorithm, we provide some intuition: Every gets removed from its bin and is inserted into a new bin. We fix an arbitrary ordering of the bins and compute solutions for prefixes of this ordering in a bottom-up manner. For every bin, there is a set of removed elements and a set of inserted elements . Since the target value depends on the types of the elements rather than the concrete sets and , we may only consider how many elements of which type are removed from a bin and inserted into a bin. Therefore, we compute the partial solutions for given and , where corresponds to the removed types, and corresponds to the inserted types. After the computation, we consider the solution, where , that is, the solution where the exact same types were removed and inserted. Going back from abstract types to concrete elements then yields the resulting -partition .
Dynamic programming algorithm.
The dynamic programming table has entries of the form where , , and . One such table entry corresponds to the minimal value of
under every choice of sets and for that satisfy
The table is filled for increasing values of . As base case, we set if . Recall that is true if and only if is subtractive compatible with and is additive compatible with . If or is incompatible, we set . The recurrence to compute an entry with is
Note that is always true, and therefore, such a minimum always exists. Herein, denotes the -dimensional vector where each entry is .
Claim 3.10.
The recurrence is correct.
Proof.
Throughout the proof of this claim, we call the properties posed on the sets and in the definition of the table entries the desired properties for , , and .
We show that the recurrence is correct via induction over . If , we have . Thus, by Definition 3.7, there are sets and where is a set containing exactly elements from for every , and is a set containing exactly elements from for every and we have . Since the value is invariant under the concrete choices of elements in and due to Proposition 3.8, the value is minimal among all such and . Thus, the Recurrence holds for the base case .
Next, let and assume that the recurrence holds for . Let and be sets with the desired properties for , , and . We show
Since is associative and commutative, we have
Note that the vectors and satisfy and . Moreover, since and , it holds that is true. Thus, the minimum of the right-hand-side of the recurrence includes and , and by the definition of type specification operations and Proposition 3.8, we have . Since by the inductive hypothesis we have , we conclude .
Let and be the vectors minimizing the right-hand-side of the recurrence. By the definition of type specification operations, there are sets and with and . By inductive hypothesis, the value corresponds to the minimal value of for sets and that satisfy the desired properties for , , and . Then, corresponds to the value for sets and that satisfy the desired properties for , , and . Thus, it follows that .
Computation of a solution .
We next describe how to compute the desired -partition after filling the dynamic programming table . The sets and corresponding to the table entry can be found via trace back. We let . Due to the property for all , all are disjoint. Together with the properties on the vectors and , this implies for every . Consequently, for every , there is a mapping , that maps exactly elements of to for every . Intuitively, the mapping assigns a new bin to each item of type that was removed from some bin.
The -partition is defined via the mappings as follows: For every , we set . For every , we set for the corresponding with .
By the definition of , the -partitions and may only differ on and therefore, implies . It remains to show that is minimal. By the construction of we have , where . Then, by the construction of the we have
Thus, the sets and contain the exact same number of elements from each . Consequently, we have for every and thus, by the definition of types, the minimality of implies the minimality of .
Running time.
We finally provide two different ways to analyze the running time.
-
a)
The vector can be regarded as a set containing individual identifiers from which exactly are labeled with type for each . With this view on , the vectors and correspond to subsets of , so we have possible choices for and possible choices for . Consequently, the size of the table is . To compute one entry, one needs to consider up to choices of and . For each such choice, can be checked in time. With the evaluation of the IBE, this leads to a total running time of .
-
b)
The number of possible vectors that are component-wise smaller than is , since each entry is an element of . Since , the product of all is maximal if all have roughly the same size . Thus, we have . Consequently, the size of is upper-bounded by . To compute one entry, one needs to consider up to choices of and . For each such choice, can be checked in time. With the evaluation of the IBE, this leads to a total running time of .
We next use Lemma 3.9 to prove Theorem 3.5.
Proof of Theorem 3.5.
Recall that the idea is to consider every possible vector specifying how many elements of which type belong to the elements of the flip. For each possible we then use the algorithm behind Lemma 3.9. It remains to describe how to efficiently iterate over the possible choices of to obtain the running times and .
-
a)
Let be the -dimensional unit vectors. That is, only the th entry of equals and all other entries equal . We enumerate all with by considering all possibilities to repetitively draw up to -times from the set . For each such choice , we check whether for each to ensure that is a type specification. If this is the case, we apply the algorithm behind Lemma 3.9. With the running time from Lemma 3.9 , this leads to a total running time of .
-
b)
Note that for a vector with , we have for every . Thus, in time we can enumerate all possible . Analogously to , we check whether for each . With the running time from Lemma 3.9 , this leads to a total running time of .
4 Applications of the Framework
We next study parameterized local search versions of a wide range of well-known problems. For each of these problems, we consider parameterization by the search radius in combination with some structural parameter. All results rely on the algorithm behind Theorem 3.5. Therefore, it suffices to express an instance of a local search problem as a corresponding instance of LS Generalized Bin Problem. The proofs in this section are given in the following structure:
-
1.
GBP Construction: Given the instance of the local search problem, we specify the universe , the number of bins and the IBE of the instance . This also includes a description of how to efficiently evaluate the IBE from the input instance .
-
2.
Type Partition: We express how a type partition for is computed from and describe how the targeted structural parameter corresponds to the number of types as defined in Section 3.
-
3.
Solution Correspondence: Recall that an instance of a local search problem always contains a solution of the underlying problem. Let be the solution given in the instance . To show the solution correspondence, we describe how solutions of can be transformed into solutions of and vice versa, such that the following holds: A solution of is better than the given solution of if and only if the corresponding -partition has strictly smaller (larger) target value than with respect to the IBE .
4.1 Max -Cut
Let and let be an undirected graph. We say that an edge is properly colored by a -partition of , if assigns distinct colors to the endpoints of .
Max -Cut Input: An undirected graph . Goal: Find a -partition of that maximizes the number of properly colored edges under .
Note that the goal of Max -Cut can also be equivalently redefined as: Find a -partition of that minimizes the number of edges that are not properly colored under , that is, a -partition that minimizes .
The input of the corresponding local search problem LS Max -Cut additionally consists of a -partition and some , and one aims to find a -partition with and .
LS Max -Cut is W[1]-hard parameterized by [10] and cannot be solved in time unless the ETH is false [30]. Form a positive side, it was shown that the problem can be solved in time on apex-minor-free graphs [7]. Moreover, on general graphs, the problem can be solved in time [10], where denotes the maximum degree of the input graph. This algorithms found successful application as a post-processing algorithm for a state-of-the-art heuristic for Max -Cut [10].
We show the following.
Theorem 4.1.
LS Max -Cut can be solved in and time.
Proof.
The proof relies on the algorithm behind Theorem 3.5. We describe how to use this algorithm to solve an instance of LS Max -Cut.
1. GBP Construction.
We describe how to obtain an instance of LS Generalized Bin Problem. Our universe is exactly the vertex set of , and the number of bins is exactly the number of colors. We next define the IBE. For each and each vertex set , we define . Note that for each value can obviously be computed in time, and thus, the IBE can be evaluated in polynomial time from the input instance . Our operation is the sum of integer numbers.
2. Type Partition.
Our type partition is the collection of neighborhood classes of . Recall that this collection can be computed in time.
To show that this collection is in fact a type partition, we show that two vertices from the same neighborhood class are target equivalent according to Definition 3.2. To this end, let be a neighborhood class of , let and be vertices of . We show that and are target equivalent. Let be a vertex set with . We show that for each , . Let . Since all IBEs are identically defined, it suffices to only consider . By the fact that is a neighborhood class of , and have the same neighbors in . Hence, . This implies that and are target equivalent.
3. Solution Correspondence.
Note that the set of solutions of LS Max -Cut is exactly the set of all -partitions of . Since and , the solutions of have a one-to-one correspondence to the solutions of . Moreover, note that the definition of the IBE is based on the reformulation of the objective function of Max -Cut. Since is the sum of integer numbers, we have for every -partition . Therefore, is a better solution than for LS Max -Cut if and only if is a smaller target value than with respect to the IBE.
4.2 Cluster Editing
A cluster graph is an undirected graph in which each connected component forms a clique. Let be an undirected graph. In Cluster Editing, one aims to apply a minimum number of edge modification (edge insertions and edge deletions) on , such that the resulting graph is a cluster graph. We let denote the set of two-element subsets of vertices corresponding to edge modifications. Given a set , we let denote the symmetric difference corresponding to the application of the graph modifications.
Cluster Editing Input: An undirected graph . Goal: Find a set such that is a cluster graph and is minimal under this property.
Note that the maximum number of clusters corresponds to the number of vertices . We consider an equivalent definition of Cluster Editing where one asks for an -labeling with partitioning into (possibly empty) classes , such that
is maximal [30]. Intuitively, maximizing this score corresponds to maximizing the number of present edges in the connected components minus the required edge insertions such that these components form a cliques.
We study a parameterized local search version of Cluster Editing where one flip in the flip neighborhood corresponds to moving single vertices from their current cluster into another cluster. The input of the corresponding local search problem LS Cluster Editing consists of the input of Cluster Editing, together with an additional integer and an -labeling of for some. The task is to compute a -labeling such that and .
With respect to this neighborhood, LS Cluster Editing is known to be W[1]-hard parameterized by and cannot be solved in time unless the ETH is false [11]. Form a positive side, it was shown that the problem can be solved in time [11], where denotes the maximum degree of the input graph.
We now show the following.
Theorem 4.2.
LS Cluster Editing can be solved in time and in time.
Proof.
The proof relies on the algorithm behind Theorem 3.5. We describe how to use this algorithm to solve an instance .
1. GBP Construction.
We describe how corresponds to an LS Generalized Bin Problem-instance . Our universe is exactly the vertex set of , and the number of bins is exactly . We next define the IBE. For each and each vertex set , we define . Note that each value can obviously be computed in time from the input graph . Our operation is the sum of integer numbers.
2. Type Partition.
Our type partition is the collection of neighborhood classes of . Recall that this collection can be computed in time.
To show that this collection is in fact a type partition, we show that two vertices from the same neighborhood class of are target equivalent according to Definition 3.2. To this end, let be a neighborhood class of and let and be vertices of . Let be a vertex set with . We obviously have . Moreover, since and have the exact same neighbors in , we have . Then, by the definition of the IBE we have for all .
3. Solution Correspondence.
Note that the set of solutions of LS Cluster Editing is exactly the set of all -partitions of . Since and , the solutions of have a one-to-one correspondence to the solutions of . Moreover, note that the definition of the IBE is based on the reformulation of the objective function of Cluster Editing and is the sum of integer numbers, we have for every -partition . Therefore, is a better solution than for Cluster Editing if and only if has a larger target value than with respect to the IBE.
4.3 Vector Bin Packing
Let be integers. In this section for , is a bin. Each bin is associated with a weight vector . Let be a set of vectors from . A -partition of is called a bin assignment.
Given a subset , we say that the overload of bin with respect to is . Our target is to find an assignment minimizing the total overload, which is defined as .
Vector Bin Packing Input: Integers , a set of vectors from , a vector for each . Goal: Find a bin assignment of such that the total overload is minimized, that is, is minimized.
The input of the corresponding local search problem LS Vector Bin Packing additionally consists of a bin assignment and some , and one aims to find a bin assignment with and .
We say that two vectors and are identical if they agree in all dimensions, that is, if for each dimensions and two vectors are distinct otherwise.
To the best of our knowledge, parameterized local search for Vector Bin Packing has not yet been studied. We now show the following.
Theorem 4.3.
Let denote the maximum number of pairwise distinct items in an instance of LS Vector Bin Packing. LS Vector Bin Packing can be solved in time and in time.
Proof.
The proof relies on the algorithm behind Theorem 3.5. We describe how to use this algorithm to solve an instance
of LS Vector Bin Packing
1. GBP construction.
Initially, we describe how corresponds to an instance of LS Generalized Bin Problem. Our universe is exactly the set and the number is the number of bins in . It remains to define the IBE. For , we define . That is, the th IBE corresponds to the overload of with respect to the vectors . Our operation is the sum of integer numbers.
2. Type Partition.
Let be a partition of where each is an inclusion-wise maximal set of pairwise identical elements. Note tat this partition can clearly be computed in time.
To show that this collection is in fact a type partition, we show that two elements from one set are target equivalent according to Definition 3.2. Let and be two vectors of the same set . Let be a set with . Let . We show that for each , . Without loss of generality, we consider the IBE . Since is a type class containing and , we have . Consequently, and thus and are target equivalent.
3. Solution Correspondence.
Since is the sum of integer numbers and for all , we have for every bin assignment .
In Theorem 5.3 we complement the above algorithm by showing that LS Vector Bin Packing is W[1]-hard parameterized by and cannot be solved in time unless the ETH fails.
4.4 Vertex Deletion Distance to Specific Graph Properties
We next consider a general class of graph problems. Given a graph, one aims to delete a minimum number of vertices such that the remaining graph satisfies a specific property (for example: being bipartite). For any fixed graph-property that can be verified in polynomial time, we define the following problem.
Vertex Deletion Input: A graph . Goal: Find a subset such that fulfills property and is minimal.
Parameterized local search was considered for Vertex Deletion for general graph properties [7] and in particular for the special case of being the family of all edgeless graphs, that is, for Vertex Cover [7, 13, 21, 24]. For the corresponding local search problems LS Vertex Deletion and LS Vertex Cover, the considered local neighborhood is the -swap neighborhood, that is, the set of all solutions that have a symmetric difference with the current solution of size at most . This neighborhood coincides with the -flip neighborhood if a solution for Vertex Deletion and Vertex Cover is represented as a -coloring of the vertex set , where fulfills property and the goal is to minimize the number of vertices that receive color under .
It was shown that LS Vertex Deletion is -hard when parameterized by for all hereditary graph properties that contain all edgeless graphs but not all cliques or vice versa [7]. This holds in particular for the special case of LS Vertex Cover [7, 13, 24]. Furthermore, a closer inspection shows that LS Vertex Cover cannot be solved in time unless the ETH fails [24, Theorem 3.7]. Since LS Vertex Cover is a special case of LS Vertex Deletion, we also obtain the same hardness results for the more general LS Vertex Deletion. From the positive side, LS Vertex Cover admits an FPT-algorithm for , if the input graph has a bounded local treewidth. Moreover, algorithms for LS Vertex Cover are known that run in time, where is any of (i) the maximum degree [21], (ii) the -index of the input graph [24], or (iii) the treewidth of the input graph [24]. In particular, the performance of the algorithm combining and was shown to be very successful [21].
We show that our framework is applicable for an even more general problem LS Multi Component Deletion in which we aim to find a vertex set of minimal size to remove, so that the remaining vertices can be partitioned into sets, that each fulfill property . Note that LS Vertex Deletion is the special case of .
Theorem 4.4 ().
LS Multi Component Deletion can be solved in time and in time.
4.5 Further Applications of the Framework
With similar applications the algorithm behind Theorem 3.5, we can obtain results for the local search versions of the well-known problems Nash Social Welfare and Multi Knapsack. Here, we only present the problem definitions and the respective results. References to the related literature and the proofs of our theorems for both problems are deferred to the full version.
Nash Social Welfare.
Let be integers. In this section, is a set of agents and is a set of items. Furthermore, we have utility functions . An -partition of is called an allocation. For an allocation , the function is called the Nash score of .
Nash Social Welfare Input: A set of agents, a set of items, and a set of utilities. Goal: Find an allocation that maximizes .
The input of the corresponding local search problem LS Nash Social Welfare additionally consists of an allocation and some and one asks for an allocation with and .
We say that two items and are identical if each agents values and similarly, that is, if for each . Two items are distinct if they are not identical.
Theorem 4.5 ().
Let denote the maximum number of pairwise distinct items in an instance of LS Nash Social Welfare. LS Nash Social Welfare can be solved in time and in time.
Multi Knapsack.
Let be a number of knapsacks, with their capacities and let be a set of items with values and weights for all and . We say that an -partition of fits into the knapsacks, if for every . We define the score of as . Intuitively, for , the set corresponds to the chosen items for the th knapsack and corresponds to the set of non-chosen items.
Multi Knapsack Input: Knapsacks with capacities , items with values and weights for all and . Goal: Find an -partition of that fits into the knapsacks, such that is maximal.
The input of the corresponding local search problem LS Multi Knapsack additionally consists of an -partition that fits into the knapsacks and some and one asks for an improving -partition with that fits into the knapsacks.
We say that two items and are identical if and for all . Two elements are distinct if they are not identical.
To the best of our knowledge, parameterized local search for Multi Knapsack has not yet been studied. We now show the following.
Theorem 4.6 ().
Let denote the maximum number of pairwise distinct items in an instance of LS Multi Knapsack. LS Multi Knapsack can be solved in time and in time.
5 Intractability Results for the Considered Local Search Problems
In this section, we present (parameterized) intractability results for the considered local search problems in this work and tight running-time lower bounds with respect to the -time algorithms derived in Section 4. As already discussed in the previous sections, some of the considered problems in this work are known to be -hard when parameterized by and cannot be solved in time, unless the ETH fails. This is the case for LS Max -Cut [30], LS Cluster Editing [11], LS Vertex Deletion [7, 13, 24]. In this section we thus only show that these intractability results also hold for the remaining local search problems considered in this work, that is, for LS Multi Knapsack, LS Nash Social Welfare, and LS Vector Bin Packing.
We start by presenting our (parameterized) intractability results and running-time lower bounds for the local search problems LS Multi Knapsack and LS Nash Social Welfare.
Theorem 5.1 ().
LS Multi Knapsack is -hard with respect to and cannot be solved in time, unless the ETH fails, where denotes the number of items in the input instance. This holds even if there is only a single knapsack.
Theorem 5.2 ().
LS Nash Social Welfare is -hard with respect to and cannot be solved in time, unless the ETH fails. This holds even if there are only two agents and both have the same evaluation for each item.
Now, we provide matching hardness results for LS Vector Bin Packing. More precisely, we provide two hardness results even if each entry in each vector is only 0 or 1. First, we show that LS Vector Bin Packing is W[1]-hard with respect to and that an algorithm with running time violates the ETH. Consequently, our time algorithm presented in Theorem 4.3 is tight if the ETH is true. Second, we show W[1]-hardness for LS Vector Bin Packing parameterized by . Here, is the maximal sum of entries over all vectors. Recall that is usually smaller than in our real-world application.
Theorem 5.3.
LS Vector Bin Packing is -hard with respect to and cannot be solved in time, unless the ETH fails. This holds even if , each entry in each vector is only 0 or 1, and each entry of the vector for each bin is 1.
Proof.
We present a simple reduction from LS Max -Cut which provides the desired intractability results. Let and let be an instance of LS Max -Cut where is locally optimal if and only if is globally optimal. Even under these restrictions, LS Max -Cut is -hard with respect to and cannot be solved in time, unless the ETH fails [10]. Let and let . We obtain an equivalent instance of LS Vector Bin Packing as follows: For each bin , we define the vector as the vector of length that has a 1 in each dimension. For each vertex , we create a vector that has a 1 at dimension if and only if vertex is incident with edge . Let be the set of these vectors and let be the coloring obtained from by assigning for each vertex , color to vector , that is, . Finally, we set .
For the sake of simplicity, we may identify each vector by its corresponding vertex . Similarly, we may consider -partitions of instead of -partitions of as solutions for based on the obvious bijection between vertices and vectors. Next, we show that is a yes-instance of LS Max -Cut if and only if is a yes-instance of LS Vector Bin Packing. To this end, we first analyze the objective functions of both instances with respect to corresponding solutions.
Let be a -coloring of and let be the corresponding -coloring of . Let be an edge of having endpoints of distinct color under . Then, no bin produces an overload in dimension because only the vectors and have a 1 at dimension and both vectors receive distinct colors under . Similarly, let be an edge of having endpoints of the same color under . Then, no bin except produces an overload at dimension and bin produces an overhead of exactly in dimension , because only the vectors and have a 1 at dimension and both vectors receive color under . Consequently, the total overload of over all dimensions and all bins equals minus the number of properly colored edges of under . In other words, is a better solution for than if and only if is a better solution for than . Since , this implies that is a yes-instance of LS Max -Cut if and only if is a yes-instance of LS Vector Bin Packing. Furthermore, since is a locally optimal solution if and only if is a globally optimal solution, is a locally optimal solution if and only if is a globally optimal solution.
Recall that LS Max -Cut is -hard with respect to and cannot be solved in time, unless the ETH fails [10]. Since , each vector has dimensions, and , this implies that LS Vector Bin Packing is -hard with respect to and cannot be solved in time, unless the ETH fails.
Now, we present our second hardness result for the parameter plus , the maximal sum of entries over all vectors.
Theorem 5.4 ().
LS Vector Bin Packing is -hard with respect to , even if each entry in each vector is only 0 or 1, and each entry of the vector for each bin is 1.
6 Discussion
There are several ways of extending our work: In all of our applications of the framework we extensively used the expressive power of the IBEs and the flexibility of the bins. So far, we have not exploited the power of the types. For future work it is thus interesting to find examples where the number of types is smaller than the neighborhood diversity or the number of distinct vectors. Also, it is interesting to study the parameterized complexity of the non-local search versions of the considered problems parameterized by alone. For example, Bin Packing admits an FPT-algorithm when parameterized by alone [22]. It appears to be reasonable that the algorithm can be modified to obtain fixed-parameter tractability for Vector Bin Packing parameterized by as well. Furthermore, Vertex Cover admits an FPT-algorithm when parameterized by [25] and Max -Cut admits an FPT-algorithm when parameterized by [14]. However, it is not known whether such algorithms are possible for all (classic) variants of our study; for example for Nash Social Welfare such an algorithm is not known.
References
- [1] Thomas Bläsius, Philipp Fischbeck, Lars Gottesbüren, Michael Hamann, Tobias Heuer, Jonas Spinner, Christopher Weyand, and Marcus Wilhelm. PACE Solver Description: KaPoCE: A Heuristic Cluster Editing Algorithm. In Proceedings of the 16th International Symposium on Parameterized and Exact Computation (IPEC ’21), volume 214 of LIPIcs, pages 31:1–31:4. Schloss Dagstuhl – Leibniz-Zentrum für Informatik, 2021. doi:10.4230/LIPICS.IPEC.2021.31.
- [2] Shaowei Cai, Kaile Su, Chuan Luo, and Abdul Sattar. NuMVC: An Efficient Local Search Algorithm for Minimum Vertex Cover. Journal of Artificial Intelligence Research, 46:687–716, 2013. doi:10.1613/JAIR.3907.
- [3] Marek Cygan, Fedor V. Fomin, Lukasz Kowalik, Daniel Lokshtanov, Dániel Marx, Marcin Pilipczuk, Michal Pilipczuk, and Saket Saurabh. Parameterized Algorithms. Springer, 2015. doi:10.1007/978-3-319-21275-3.
- [4] Juan A. Díaz and Elena Fernández. A tabu search heuristic for the generalized assignment problem. European Journal of Operational Research, 132(1):22–38, 2001. doi:10.1016/S0377-2217(00)00108-9.
- [5] Martin Dörnfelder, Jiong Guo, Christian Komusiewicz, and Mathias Weller. On the parameterized complexity of consensus clustering. Theoretical Computer Science, 542:71–82, 2014. doi:10.1016/J.TCS.2014.05.002.
- [6] Rodney G. Downey and Michael R. Fellows. Fundamentals of Parameterized Complexity. Texts in Computer Science. Springer, 2013. doi:10.1007/978-1-4471-5559-1.
- [7] Michael R. Fellows, Fedor V. Fomin, Daniel Lokshtanov, Frances A. Rosamond, Saket Saurabh, and Yngve Villanger. Local search: Is brute-force avoidable? Journal of Computer and System Sciences, 78(3):707–719, 2012. doi:10.1016/J.JCSS.2011.10.003.
- [8] Paola Festa, Panos M Pardalos, Mauricio GC Resende, and Celso C Ribeiro. Randomized heuristics for the max-cut problem. Optimization methods and software, 17(6):1033–1058, 2002. doi:10.1080/1055678021000090033.
- [9] Robert Ganian, Fabian Klute, and Sebastian Ordyniak. On structural parameterizations of the bounded-degree vertex deletion problem. Algorithmica, 83(1):297–336, 2021. doi:10.1007/S00453-020-00758-8.
- [10] Jaroslav Garvardt, Niels Grüttemeier, Christian Komusiewicz, and Nils Morawietz. Parameterized Local Search for Max -Cut. In Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence (IJCAI ’23), pages 5586–5594. ijcai.org, 2023. doi:10.24963/IJCAI.2023/620.
- [11] Jaroslav Garvardt, Nils Morawietz, André Nichterlein, and Mathias Weller. Graph clustering problems under the lens of parameterized local search. In Proceedings of the 18th International Symposium on Parameterized and Exact Computation (IPEC ’23), volume 285 of LIPIcs, pages 20:1–20:19. Schloss Dagstuhl – Leibniz-Zentrum für Informatik, 2023. doi:10.4230/LIPICS.IPEC.2023.20.
- [12] Serge Gaspers, Joachim Gudmundsson, Mitchell Jones, Julián Mestre, and Stefan Rümmele. Turbocharging treewidth heuristics. Algorithmica, 81(2):439–475, 2019. doi:10.1007/S00453-018-0499-1.
- [13] Serge Gaspers, Eun Jung Kim, Sebastian Ordyniak, Saket Saurabh, and Stefan Szeider. Don’t be strict in local search! In Proceedings of the Twenty-Sixth AAAI Conference on Artificial Intelligence (AAAI ’12). AAAI Press, 2012. doi:10.1609/AAAI.V26I1.8128.
- [14] Tomas Gavenciak, Martin Koutecký, and Dusan Knop. Integer programming in parameterized complexity: Five miniatures. Discrete Optimization, 44(Part):100596, 2022. doi:10.1016/J.DISOPT.2020.100596.
- [15] Niels Grüttemeier, Christian Komusiewicz, and Nils Morawietz. Efficient Bayesian network structure learning via parameterized local search on topological orderings. In Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence (AAAI ’21), pages 12328–12335. AAAI Press, 2021. Full version available at https://doi.org/10.48550/arXiv.2204.02902. doi:10.1609/AAAI.V35I14.17463.
- [16] Jiong Guo, Sepp Hartung, Rolf Niedermeier, and Ondrej Suchý. The parameterized complexity of local search for TSP, more refined. Algorithmica, 67(1):89–110, 2013. doi:10.1007/S00453-012-9685-8.
- [17] Jiong Guo, Danny Hermelin, and Christian Komusiewicz. Local search for string problems: Brute-force is essentially optimal. Theoretical Computer Science, 525:30–41, 2014. doi:10.1016/J.TCS.2013.05.006.
- [18] Sepp Hartung and Rolf Niedermeier. Incremental list coloring of graphs, parameterized by conservation. Theoretical Computer Science, 494:86–98, 2013. doi:10.1016/J.TCS.2012.12.049.
- [19] Holger H. Hoos and Thomas Stützle. Stochastic Local Search: Foundations & Applications. Elsevier / Morgan Kaufmann, 2004.
- [20] Russell Impagliazzo, Ramamohan Paturi, and Francis Zane. Which problems have strongly exponential complexity? Journal of Computer and System Sciences, 63(4):512–530, 2001. doi:10.1006/JCSS.2001.1774.
- [21] Maximilian Katzmann and Christian Komusiewicz. Systematic exploration of larger local search neighborhoods for the minimum vertex cover problem. In Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence (AAAI ’17), pages 846–852. AAAI Press, 2017. doi:10.1609/AAAI.V31I1.10659.
- [22] Dusan Knop, Martin Koutecký, Asaf Levin, Matthias Mnich, and Shmuel Onn. Parameterized complexity of configuration integer programs. Operations Research Letters, 49(6):908–913, 2021. doi:10.1016/J.ORL.2021.11.005.
- [23] Christian Komusiewicz, Simone Linz, Nils Morawietz, and Jannik Schestag. On the complexity of parameterized local search for the maximum parsimony problem. In Proceedings of the 34th Annual Symposium on Combinatorial Pattern Matching (CPM ’23), volume 259 of LIPIcs, pages 18:1–18:18. Schloss Dagstuhl – Leibniz-Zentrum für Informatik, 2023. doi:10.4230/LIPICS.CPM.2023.18.
- [24] Christian Komusiewicz and Nils Morawietz. Parameterized Local Search for Vertex Cover: When Only the Search Radius Is Crucial. In Proceedings of the 17th International Symposium on Parameterized and Exact Computation (IPEC ’22), volume 249 of LIPIcs, pages 20:1–20:18. Schloss Dagstuhl – Leibniz-Zentrum für Informatik, 2022. doi:10.4230/LIPICS.IPEC.2022.20.
- [25] Martin Kouteckỳ. Solving hard problems on neighborhood diversity. Master’s thesis, Charles University in Prague, 2013. URL: https://koutecky.name/mgr/mgr.pdf.
- [26] Ruizhi Li, Shuli Hu, Shaowei Cai, Jian Gao, Yiyuan Wang, and Minghao Yin. NuMWVC: A novel local search for minimum weighted vertex cover problem. Journal of the Operational Research Society, 71(9):1498–1509, 2020. doi:10.1080/01605682.2019.1621218.
- [27] Andrea Lodi, Silvano Martello, and Daniele Vigo. Approximation algorithms for the oriented two-dimensional bin packing problem. European Journal of Operational Research, 112(1):158–166, 1999. doi:10.1016/S0377-2217(97)00388-3.
- [28] Christiane Markuse-Schneider. Personal communication. Schmitz Cargobull production site in Vreden, Germany, 2023.
- [29] Dániel Marx. Searching the -change neighborhood for TSP is W[1]-hard. Oper. Res. Lett., 36(1):31–36, 2008. doi:10.1016/J.ORL.2007.02.008.
- [30] Nils Morawietz. On the complexity of local search problems with scalable neighborhoods. PhD thesis, Friedrich-Schiller-Universität Jena, 2024. Dissertation. URL: https://www.db-thueringen.de/receive/dbt_mods_00064137.
- [31] Quan Ouyang and Hong Yun Xu. Genetic algorithm for single machine scheduling problem with setup times. Applied Mechanics and Materials, 457:1678–1681, 2014.
- [32] Stefan Szeider. The parameterized complexity of -flip local search for SAT and MAX SAT. Discrete Optimization, 8(1):139–145, 2011. doi:10.1016/J.DISOPT.2010.07.003.
