Abstract 1 Introduction 2 Preliminaries 3 Rendering Unit Interval Graphs Edgeless in 𝑶(𝒏𝐥𝐨𝐠𝒏) time 4 Minimising the Total Moving Distance for 𝚷edgeless on Weighted Interval Graphs is Hard 5 Minimising the Maximum Moving Distance for 𝚷edgeless on Unit Disk Graphs is Hard 6 Concluding Remarks References

On the Complexity of Minimising the Moving Distance for Dispersing Objects

Nicolás Honorato-Droguett ORCID Nagoya University, Japan Kazuhiro Kurita ORCID Nagoya University, Japan Tesshu Hanaka ORCID Kyushu University, Fukuoka, Japan Hirotaka Ono ORCID Nagoya University, Japan
Abstract

We study Geometric Graph Edit Distance (GGED), a graph-editing model to compute the minimum edit distance of intersection graphs that uses moving objects as an edit operation. We first show an O(nlogn)-time algorithm that minimises the total moving distance to disperse unit intervals. This algorithm is applied to render a given unit interval graph (i) edgeless, (ii) acyclic and (iii) k-clique-free. We next show that GGED becomes strongly 𝖭𝖯-hard when rendering a weighted interval graph (i) edgeless, (ii) acyclic and (iii) k-clique-free. Lastly, we prove that minimising the maximum moving distance for rendering a unit disk graph edgeless is strongly 𝖭𝖯-hard over the L1 and L2 distances.

Keywords and phrases:
Intersection graphs, Optimisation, Graph modification
Funding:
Kazuhiro Kurita: This work is partially supported by JSPS KAKENHI Grant Numbers JP21K17812, JP22H03549, and JST ACT-X Grant Number JPMJAX2105.
Tesshu Hanaka: This work is partially supported by JSPS KAKENHI Grant Numbers JP21K17707, JP23H04388, JST CRONOS Grant Number JPMJCS24K2.
Hirotaka Ono: This work is partially supported by JSPS KAKENHI Grant Numbers JP20H05967, JP21K19765, JP22H00513, JST CRONOS Grant Number JPMJCS24K2.
Copyright and License:
[Uncaptioned image] © Nicolás Honorato-Droguett, Kazuhiro Kurita, Tesshu Hanaka, and Hirotaka Ono; licensed under Creative Commons License CC-BY 4.0
2012 ACM Subject Classification:
Theory of computation Mathematical optimization
Related Version:
Full Version: https://doi.org/10.48550/arXiv.2502.12903 [17]
Editors:
Pat Morin and Eunjin Oh

1 Introduction

Graph modification is a fundamental topic to address graph similarity and dissimilarity, where a given graph is deformed by adding or deleting vertices or edges to satisfy a specific non-trivial graph property, while minimising the cost of edit operations. The problem of determining this cost is commonly known as graph modification problem (GMP) and has applications in various disciplines, such as computer vision [3], network interdiction [15], and molecular biology [14]. GMPs are often categorised into vertex and edge modification problems, with edit operations restricted to the vertex and edge sets, respectively.

The cost of a single edit operation in a GMP is often determined by the specific application. In theoretical studies, a unit-cost model is often assumed, where each addition or deletion of a vertex or edge has a uniform cost. However, for such models, it is known that determining whether a graph can be modified to obtain a member of a given class is 𝖭𝖯-hard for a wide range of graphs and classes [19, 2, 12, 25]. These negative bounds of GMPs motivate alternative formulations for graph editing that consider domain-specific constraints and cost measures.

The choice of edit operations and their associated costs is a crucial aspect of GMPs, as different formulations capture different structural properties and computational challenges. Analogous to string similarity analysis, where modifications are based on biologically significant operations such as DNA mutations and repeats [20], graph modification problems should reflect the inherent constraints and structural properties of the graphs being studied. In particular, geometric intersection graphs (hereafter intersection graphs) provide a suitable framework for studying GMPs for scenarios where graphs represent spatial relationships (see, e.g., [23, 6, 10]). Given a collection of geometric objects 𝒮, an intersection graph G(𝒮) is a graph where there is a one-to-one correspondence between the vertex set V(G(𝒮)) and 𝒮, and two vertices are adjacent if and only if their corresponding objects intersect. This model includes many well-known graph classes, such as interval graphs and disk graphs. These graphs can be frequently found in real-world applications such as network modelling and bioinformatics [22].

Motivated by this context, this paper investigates GMPs for intersection graphs. In this context, two natural questions arise:

  1. 1.

    Are standard graph edit operations suitable for modifying intersection graphs?

  2. 2.

    How can the geometric properties of objects be exploited to overcome the hardness of GMPs?

To answer these questions, we introduce Geometric Graph Edit Distance, a model for modifying intersection graphs from a geometric perspective.

In the intersection graph model, a natural edit operation is to move the objects in 𝒮. We treat this movement as a graph edit operation and focus on minimising the cost required to modify an intersection graph so that the resulting graph is in a specific graph class. The cost is quantified by the total moving distance, which is the sum of the distances by which objects in 𝒮 are moved. More precisely, we define the problem as follows:

We assume that Π is given by an oracle, i.e. we have an algorithm to determine whether the intersection graph G(𝒮) is in Π.

Related work

Numerous GMPs are known to be computationally hard. In the early 1980s, Lewis and Yannakakis [19] showed that vertex-deletion problems are 𝖭𝖯-complete for any hereditary graph class. Similarly, many edge modification problems have been shown to be 𝖭𝖯-complete, such as transforming a graph into a perfect, chordal, or interval graph [2]. As a result, the past decade has seen a growing interest in addressing these problems from the perspective of parameterised complexity. The recent survey by Crespelle et al.[5] provides a comprehensive overview of this subject (see also[8]).

Although classical GMPs focus on structural modifications of graphs, recent studies have explored models that include geometric constraints. Honorato-Droguett et al. [16] introduced the above geometric approach to graph modification, demonstrating that graphs of certain classes, such as graph completeness and the existence of a k-clique, can be efficiently obtained on interval graphs. Their work highlights how the underlying geometric properties of intersection graphs can be exploited to design appropriate modification models.

In a similar vein, Fomin et al. [10] studied the disk dispersal problem, where a set 𝒮 of n disks, an integer k0, and a real number d0 are given, and the goal is to determine whether an edgeless disk graph can be realised by moving at most k disks by at most d distance each. They proved that this problem is 𝖭𝖯-hard when d=2 and k=n and also 𝖥𝖯𝖳 when parameterised by k+d. Furthermore, they showed that the problem becomes 𝖶[1]-hard when parameterised by k when disk movement is restricted to rectilinear directions.

Expanding on this line of research, Fomin et al. [11] conducted a parameterised complexity study of edge modification problems where scaling objects is considered as the edit operation. Their results illustrate how alternative edit operations in geometric intersection graphs can impact computational complexity, enabling further study of geometric modification graph models. In particular, their work includes several 𝖥𝖯𝖳 results to achieve independence, acyclicity and connectivity on disk graphs.

Our work continues these developments by introducing Geometric Graph Edit Distance, a model that considers object movement as an edit operation to modify intersection graphs. Unlike prior studies that focus on vertex and edge modifications or object scaling, our approach explicitly considers movement costs by quantifying the total moving distance required to obtain a graph in a given class. This approach enables the exploration of new algorithmic and complexity-theoretic questions in the context of geometric intersection graphs.

Our contribution

Our results are mainly focused on interval graphs and summarised in Table 1. In this paper, we deal with the following graph classes: Πedgeless (edgeless graphs), Πacyc (acyclic graphs) and Πk-clique¯ (k-clique-free graphs).

Table 1: Summary of our results. In this table, L1 and L2 are the Manhattan and Euclidean distances, respectively. The terms IG, UIG and UDG are abbreviations of interval graphs, unit interval graphs and unit disk graphs, respectively.
Problem Type Graph Target Graph Class Metric Weighted Complexity
minsum UIG Πedgeless L2(=L1) No O(nlogn)
UIG Πacyc L2(=L1) No O(nlogn)
UIG Πk-clique¯ L2(=L1) No O(nlogn)
IG Πedgeless L2(=L1) Yes strongly 𝖭𝖯-hard
IG Πacyc L2(=L1) Yes strongly 𝖭𝖯-hard
IG Πk-clique¯ L2(=L1) Yes strongly 𝖭𝖯-hard
for any 1kn
minimax UDG Πedgeless L2,L1 Yes strongly 𝖭𝖯-hard

In [16], the model presented is studied mainly for classes of dense graphs. This inspires the present paper as a subsequent work, where we instead focus on classes for sparse graphs. As two fundamental classes of sparse graphs, we consider edgeless graphs (Πedgeless) and acyclic graphs (Πacyc). These classes have also been studied in related work on geometric intersection graphs[10, 11].

As we shall detail, Πacyc is contained in Πk-clique¯ in our context. As a result, one might argue that the distinction of both classes is irrelevant. However, we still consider them distinctively, as forests are a well-known class of graphs. Our analysis highlights the computational complexity of modifying intersection graphs while considering movement-based edit operations, a perspective distinct from prior work that focuses on exclusively modifying the graph structure.

Paper Organisation.

Section 2 formally describes the definitions needed to address the above ideas. Section 3 presents the problem Interval Dispersal and shows that it can be solved in O(nlogn) time. Using this algorithm, we establish that Geometric Graph Edit Distance can also be solved in O(nlogn) time for classes Πedgeless, Πacyc, and Πk-clique¯ on unit interval graphs. Section 4 demonstrates that Geometric Graph Edit Distance becomes strongly 𝖭𝖯-hard on weighted interval graphs for classes Πedgeless, Πacyc, and Πk-clique¯. Section 5 shows that the minimax version of Geometric Graph Edit Distance is strongly 𝖭𝖯-hard on weighted unit disk graphs for Πedgeless under both the L1 and L2 distance metrics. Section 6 concludes with remarks on our results and potential future directions.

Due to space restrictions, we omit in-depth explanations and all full proofs of statements with a -mark. The reader is referred to the full version of this paper [17] for these details.

2 Preliminaries

This section provides the main definitions used in the paper, referencing geometry, graph, and convexity terminology from textbooks [24, 4, 7, 1].

Objects.

An interval I is a line segment on the real line of length len(I)+. Intervals are assumed to be open, unless explicitly stated otherwise. An interval such that len(I)=1 is called unit interval. The left endpoint (I) of an interval I is the point that satisfies (I)y for any yI. Similarly, the right endpoint r(I) of I is the point that satisfies yr(I) for any yI. The centre c(I) of I is the point c(I)=(r(I)(I))/2. Throughout the paper, we assume that the indices of a collection of intervals ={I1,,In} follow the order given by centres of intervals. That is, c(Ii)c(Ii+1) for all 1in1. However, it is not assumed that collections are ordered when given as the input graph. Given a radius r>0 and a point p, a disk D centred at p is the set D={x2x,p2r}. An open disk D is a disk without its boundary circle; that is, D={x2x,p2<r}. We assume that the disks are open, unless we mention the contrary. A unit disk is a disk of radius r=1/2.

Graphs.

Throughout the paper, a graph G=(V,E) is assumed to be a simple, finite, and undirected graph with vertex set V and edge set E. An edgeless graph is a graph G=(V,E) such that E=. A k-clique of a graph G=(V,E) is a subset WV such that |W|=k and for all u,vW,uv, {u,v}E, for kn. If such W exists in V, we say that G contains a k-clique. An interval graph is an intersection graph G()=(V,E) where the vertex set V={v1,,vn} corresponds to a collection of intervals ={I1,,In} and an edge {vi,vj}E exists if and only if IiIj, for any 1i,jn,ij. An interval graph is called unit interval graph if len(I)=1 for all I. Similarly, a disk graph is an intersection graph G(𝒟)=(V,E) where the vertex set V={v1,,vn} corresponds to a disk collection 𝒟={D1,,Dn}. An edge (vi,vj)E exists if and only if DiDj, for any 1i,jn,ij. A unit disk graph is a disk graph in which the collection contains exclusively unit disks. Unless stated otherwise, all intersection graphs are assumed to be unweighted. A weighted intersection graph assigns a multiplicative weight, called the distance weight, to the moving distance function of each object. The formal definition of distance weight appears in later sections when required. An (infinite) set of graphs Π is a graph class (or simply a class), and we say that G is in Π if GΠ. A graph class Π is non-trivial if infinitely many graphs belong to Π and infinitely many graphs do not belong to Π. In this paper, we deal with the following non-trivial classes: (i) Πedgeless={G:G is an edgeless graph.}, (ii) Πacyc={G:G is an acyclic graph.}, (iii) Πk-clique={G:G contains a k-clique.} and (iv) Πk-clique¯={G:GΠk-clique}.

3 Rendering Unit Interval Graphs Edgeless in 𝑶(𝒏𝐥𝐨𝐠𝒏) time

We show that a graph in Πedgeless can be obtained in O(nlogn) time given a collection of n unit intervals. We start by defining a problem that we call Interval Dispersal and then use the algorithm designed to obtain a graph in Πedgeless, Πacyc and Πk-clique¯. Interval Dispersal receives as input a collection of n intervals and a real s1, and asks for the minimum value of the total moving distance to obtain a collection that satisfies c(Ij)c(Ii)s for each Ii,Ij, i<j. When s=1, Interval Dispersal is equivalent to Geometric Graph Edit Distance on unit interval graphs and Πedgeless. For simplicity, the intervals are assumed to be open. This avoids the need to address infinitesimally small distances required to separate closed intervals. We must first introduce some basic definitions and notation to describe the algorithm. Given a collection of n intervals ={I1,,In}, let D=(d1,,dn) be a vector such that di is the moving distance applied to Ii. We denote by D={I1D,InD} the collection of intervals such that c(IiD)=c(Ii)+di. The set 𝒟()n is the set of vectors that describe the moving distance applied to intervals such that the condition of Interval Dispersal is satisfied. In other words, for all D=(d1,,dn)𝒟(), c(IjD)+c(IiD)s holds for i<j. We use 𝒟𝑜𝑝𝑡()𝒟() to denote the subset of vectors in 𝒟() that minimises the total moving distance applied to intervals; i.e. 𝒟𝑜𝑝𝑡()={D=(d1,,dn)𝒟()1in|di|=minD=(d1,,dn)𝒟()1in|di|}.

Intuitively, we aim to find a vector D𝒟𝑜𝑝𝑡() to move each interval so that the distance between each pair of intervals is at least s. Given an arbitrary D𝒟𝑜𝑝𝑡(), the order of D may be different from the order of . However, it was previously shown [16] that there is always a vector D𝒟𝑜𝑝𝑡() such that the order of D preserves the order of . This implies that there always exists an optimal solution of Interval Dispersal for which checking the inequality (c(Ii+1)+di+1)(c(Ii)+di)s for in1 is sufficient.

We now define the equispace function, which moves intervals so that the distance between their centres is exactly s, maintaining the order induced by interval centres.

Definition 1 (Equispace function).

Let (,s) be an instance of Interval Dispersal where is a collection of unit intervals. The equispace function of to a point x is a function E:× defined as:

E(,x)=i=1nfi(x),fi(x)=|xc(Ii)(ni)s|.

The vector that describes the moving distances given by E(,x) is defined as Ex()=(e1,,en)=(α1f1(x),,αnfn(x)) where αi=1 if xc(Ii)+(ni)s and αi=1 otherwise, for 1in. We also denote by Ex()={I1Ex(),InEx()} the collection of intervals where c(IiEx())=c(Ii)+αifi(x) for 1in.  

By the above, Ex()𝒟() for all x. Moreover, c(Ii+1Ex())c(IiEx())=s for all 1in1. We first prove that for certain collections of intervals, minimising E gives a vector contained in 𝒟𝑜𝑝𝑡().

Lemma 2 ().

The equispace function E(,x) is a piecewise-linear convex function.

We define the set of breakpoints of E(,x) to be the set B={b1,,bn}={c(Ii)+(ni)sIi, 1in}. Given a collection of intervals , we define the equispace function E(,x) as a sequence of linear functions E1(,x),,E||+1(,x). The slope of Ei(,x) is less than the slope of Ej(,x) for 1i<j||. Since the equispace function is convex and piecewise linear, the points that minimise E are located within a range bxbr, where bbr and b,brB. We prove that b and br can be easily found.

Lemma 3 ().

The minimum value of E(,x) is given by the breakpoint b(n+1)/2 if n is odd, and by breakpoints bn/2 and b(n/2)+1 otherwise.

By Lemma 3, the minimum value of E for an arbitrary collection of intervals is given by the median value(s) of B. We now show which collections allow minimising E to obtain a vector in 𝒟𝑜𝑝𝑡(), characterised as follows:

Definition 4 (Optimally Equispaceable Collections).

Given a collection of intervals , we say that is optimally equispaceable if there exists a D𝒟𝑜𝑝𝑡() such that D=Ex() and xargminxE(,x). Equivalently, is optimally equispaceable if Ex()𝒟𝑜𝑝𝑡() for all xargminxE(,x).

Lemma 5.

Let ={I1,,In} be a collection of unit intervals such that c(Ii+1)c(Ii)s for 1in1. Then is optimally equispaceable. Moreover, there exists a D𝒟𝑜𝑝𝑡() such that c(Ii+1D)c(IiD)=s holds for all 1in1.

Proof.

We only prove the latter, as the existence of D in 𝒟𝑜𝑝𝑡() directly implies the optimal equispaceability of . That is, we show that D satisfies c(Ii+1D)c(IiD)=s, for 1in1. By the definition of Interval Dispersal, we have c(Ii+1D)c(IiD) and c(Ii+1D)c(IiD)s for 1in1. Suppose that there exists a pair of intervals Ii and Ii+1 that satisfies c(Ii+1D)c(IiD)>s. Let s=c(Ii+1D)c(IiD) and δ=ss. We show how to obtain a total moving distance D such that dD|d|<dD|d| and c(Ii+1D)c(IiD)=s.

We divide the proof into three cases: (i) di0, (ii) di+10 and (iii) di0 and di+10. For case (i), it follows that djdj10 for i+1jn and (c(Ii+1D)δ)c(IiD)=c(Ii+1)+(di+1δ)(c(Ii)+di)=s holds. Let D(d1,,dn)=(d1,,di,di+1δ,,dnδ). The dispersal condition is satisfied by D. Furthermore, since δ>0, the total moving distance satisfies dD|d|=j=1i|dj|+j=i+1ndjδ<dD|d|, which contradicts the optimality of D.

For case (ii), djdj+1 for 1ji holds, and the argument for case (i) applies analogously for D=(d1,,dn)=(d1+δ,,di+δ,di+1,,dn).

We only need to prove case (iii). Let δ=ss as in the previous cases. If δdi+1, then we move the intervals as in the first case. If δdi, then we move intervals as in the second case. In both cases, the same argument applies and the total moving distance contradicts the optimality of D. Thus we assume that δ>di+1,di holds. Without loss of generality, we move intervals Ij for i+1jn by di+1 to the left by δ=di+1 and intervals Ij for 1ji to the right by δ′′=(c(Ii+1D)δ)c(IiD)s. Then (c(Ii+1D)δ)(c(IiD)+δ′′)=s holds since di+1δ=0. Let D=(d1,,dn)=(d1+δ′′,,di+δ′′,di+1δ,,dnδ). The inequality dD|d|=j=1idj+δ′′+j=i+1ndjδ<dD|d| holds since δ,δ′′>0, which contradicts the optimality of D. Therefore, in an optimal solution, must satisfy c(Ii+1)+di+1(c(Ii)+di)=c(Ii+1D)c(IiD)=s, for 1in1.

Let ={I1,,In} and 𝒥={J1,,Jm} be two collections of unit intervals and let x1,x2argminxE(,x), x1x2, and y1,y2argminxE(𝒥,x), y1y2, be the breakpoints that minimise E for and 𝒥, respectively. We say that and 𝒥 intersect when equispaced when y1x2+|𝒥|s. In other words, and 𝒥 intersect when equispaced whenever there exist points x1xx2 and y1yy2 such that there exist IEx() and I𝒥Ey(𝒥) for which c(J)c(I)<s.

Lemma 6 ().

Given that 𝒥={I1,,In,J1,,Jm}, 𝒥 is optimally equispaceable if and only if y1x2+|𝒥|s.

Corollary 7 is directly implied by Lemma 6.

Corollary 7.

If y1>x2+|𝒥|s, then 𝒥 is not optimally equispaceable. Moreover, the minimum total moving distance for dispersing 𝒥 is equal to E(,x)+E(𝒥,y) for arbitrary x1xx2 and y1yy2.

Given a collection of n unit intervals, we note that can be partitioned into mn subcollections a1,b1,,am,bm such that for all 1im, c(Ij+1)c(Ij)s for aijbi1. By Lemma 5, each ai,bi is an optimally equispaceable collection. We use Lemma 6 and prove the statement of Lemma 8.

Lemma 8 ().

Let ={I1,,In}=a1,b1am,bm be a collection of n unit intervals partitioned as above. If there exist integers α1,,αk such that aαi,bαi and aαi+1,bαi+1 intersect when equispaced, then there exists an optimal solution for dispersing that disperses the intervals in a way that c(Ij+1)+dj+1(c(Ij)+dj)=s holds for 1ik and aαij<bαi+1.

Outline of Algorithm 1

Given a collection of unit intervals and a dispersal value s1, the algorithm starts by sorting and partitioning into mn disjoint subcollections a1,b1,,am,bm such that each ai,bi satisfies Lemma 5. Subsequently, the optimal breakpoints are determined for each E(ai,bi,x). Whenever there exist two subcollections ai,bj,ij and ak,b,k that intersect when equispaced, the algorithm considers both subcollections as a unique subcollection ai,b=ai,bjak,b and recursively determines the optimal breakpoints of E(ai,b,x) using the breakpoint sets of E(ai,bj,x) and E(ak,b,x). Lemma 8 ensures that this recursion partitions into non-intersecting subcollections when equispaced. Lastly, the algorithm returns the total moving distance, which is calculated as the sum of the optimal values of E for each subcollection.

Before showing the complexity of the algorithm, we must characterise the set of breakpoints further. When a collection of unit intervals ={I1,,In} is partitioned into m disjoint subcollections a1,b1,,am,bm of intervals that satisfy Lemma 5, the set of breakpoints Bai,bi is equal to {c(Ij)+(|ai,bi|j)sIi,aijbi} for each 1im. Consequently, B can be reformulated as follows:

B={c(Ij)+(|ai,bi|j+k=i+1m|ak,bk|)s1im,aijbi}.

As a result, if b and b are the breakpoints for I in Bai,bi and B, respectively, then b=bj=i+1m|aj,bj| holds. Moreover, the breakpoints of any union of subcollections ai,bj=ai,biaj,bj can be calculated in the same way by subtracting k=j+1m|ak,bk| from any breakpoint bB calculated using an interval Iai,bj. It follows that the order of Bai,bj is the same as the order of the corresponding breakpoints in B.

The above implies that the breakpoints of any (union of) subcollection(s) can be obtained from B. We denote the set ikj{b+sl=k+1m|al,bl|bBak,bk} by Bai,bj and call it the cumulative set of breakpoints of Bai,bj. We prove that Ba1,b1,,Bam,bm can be found in O(nlogn) time.

Algorithm 1 Dispersing n unit intervals in O(nlogn) time.
Lemma 9 ().

Let ={I1,,In}=a1,b1am,bm be a collection of n unit intervals partitioned as above. Then the cumulative sets of breakpoints Ba1,b1,,Bam,bm such that each Bai,bi is sorted can be obtained in O(nlogn) total time.

Lemma 10 ().

Let ={I1,,In}=a1,b1am,bm be a collection of n unit intervals partitioned as above. If cumulative breakpoint sets Ba1,b1,,Bam,bm are given so that each Bai,bi is sorted, then merging them into one sorted set can be done in O(nlogn) total time.

Theorem 11.

Given a collection of unit intervals and a value s1, Interval Dispersal can be solved in O(nlogn) time.

Proof.

We show the complexity of Algorithm 1. Line 2 can be done in O(nlogn) time for sorting and O(n) time to determine the initial m partitions. Similarly, line 3 can be done in O(nlogn) time by Lemma 9. Given that each Bai,bi is sorted, the ((|ai,bi|+1)/2)th element (resp. (|ai,bi|/2)th and ((|ai,bi|/2)+1)th element) can be calculated in O(log|ai,bi|) time using binary search on Bai,bi. This ensures that line 4 is done for all 1im in O(mlogn) total time. We initialise D𝑜𝑝𝑡 as a doubly linked list where each node i contains the information of (Bai,bi,xai,bi1,xai,bi2). We show the complexity of the loop in line 6. We merge both Bai,bj and Bak,b to obtain a sorted Bai,b. Hence, the median value(s) of Bai,b can be calculated in O(logn) time by binary search. At each execution of line 7, two partitions are merged; thus the number of partitions is reduced by one unit at each iteration. Initially, there exist m partitions, and hence the loop of line 6 iterates at most m1 times. Moreover, merging m cumulative sets of breakpoints into one sorted set can be done in O(nlogn) time by Lemma 10, which implies that any partial merge of these sets is also bounded by O(nlogn). Consequently, the total running time of line 6 is O(nlogn) time. Lastly, in line 9 the two merged sets are deleted and the new one is added. Since D𝑜𝑝𝑡 is a doubly linked list, this can be done in O(1) time by connecting the previous and next node of Bai,bj and Bak,b to a new node containing Bai,b, respectively. Once there is no pair of subcollections left to merge, the total moving distance is calculated in O(n) time in line 10 following the definition of cumulative set of breakpoints, which concludes that the total running time of Algorithm 1 is O(nlogn) time.

Theorem 11 implies the following result for Πedgeless on unit interval graphs when s=1.

Corollary 12 ().

Given a collection of n unit intervals , Geometric Graph Edit Distance can be solved in O(nlogn) time so that G()Πedgeless.

3.1 Classes 𝚷acyc and 𝚷𝒌-clique¯ on Unit Interval Graphs

This section shows how to use Algorithm 1 for obtaining graphs in Πacyc and Πk-clique¯ on unit interval graphs. We first show the case for Πk-clique¯.

It is shown in [16] that given a collection of unit intervals , G() does not contain a k-clique if and only if c(Ii+k1)c(Ii)1 for all 1ink+1. This inequality can be decomposed into k1 inequalities of the following form: for each 0rk2, c(Ii+k1)c(Ii)1 for all 1ink+1 such that imodk1=r. If is decomposed into k1 subcollections such that =1ik1i, i={Ij1jn,j(modk1)=i}, then Algorithm 1 can be applied to each i independently for s=1 to satisfy the above inequalities. Since unit interval graphs are chordal, G is acyclic if it is triangle-free; i.e. G is contained in Π3-clique¯. Consequently Πacyc is equivalent to Πk-clique¯ when k=3. The above ideas imply Corollary 13.

Corollary 13.

Given a collection of n unit intervals , Geometric Graph Edit Distance can be solved in O(nlogn) time so that (i) G()Πacyc and (ii) G()Πk-clique¯.

4 Minimising the Total Moving Distance for 𝚷edgeless on Weighted Interval Graphs is Hard

In this section we show that Geometric Graph Edit Distance is strongly 𝖭𝖯-hard on weighted interval graphs for Πedgeless. We show a reduction from 3-Partition [13]. 3-Partition receives as input a set A of 3m elements, a bound B+ and a size s(a)+ such that B/4<s(a)<B/2 and aAs(a)=mB, and the task is to decide whether A can be partitioned into m disjoint sets A1,,Am such that for 1im, |Ai|=3 and aAis(a)=B.

Given an instance (A,B,s) of 3-Partition, we construct a collection of intervals A and show that A can be partitioned if and only if A can be modified so that G(A)Πedgeless with at most total moving distance T. Given two intervals I,I such that c(I)c(I), we say that I and I intersect if c(I)c(I)<(len(I)+len(I))/2.

We show the construction of A (see Figure 1). We define A as the collection sb where ={I1,,I3m},s={I1s,,Im1s},b={I,Ir} and,

  1. (i)

    for 1i3m, Ii is an interval such that len(Ii)=s(ai) and c(Ii)=s(ai)/2 (that is, r(Ii)=0),

  2. (ii)

    for 1im1, Iis is an interval where len(Iis)=B and c(Iis)=(2i1)B+B/2 and

  3. (iii)

    I and Ir are intervals such that len(I)=len(Ir)=3Bm2+maxaAs(a), c(I)=3Bm2/2 and c(Ir)=(2m1)B+3Bm2/2.

Figure 1: Reduction Overview.

For an interval IA, we define the moving distance function dI: as:

dI(x)={|c(I)x|,I,12Bm2|c(I)x|,Isb.

Given an instance (A,B,s) of 3-Partition, we show the following properties.

Lemma 14 ().

Given an arbitrary partition of A of m disjoint sets A1,,Am such that Ai={a1i,a2i,a3i} for 1im, i=1m6(i1)B+i=1m(3a1i+2a2i+a3i)<3Bm2 holds.

We note that Lemma 14 works for any partition of A as described above, even without the restrictions of the 3-Partition output.

Lemma 15.

Given an instance (A,B,s) of 3-Partition, A can be partitioned into m disjoint sets A1,,Am such that for 1im Ai={a1i,a2i,a3i}, |Ai|=3 and aAis(a)=B if and only if IA can be modified so that G(A)Πedgeless with total moving distance of at most 3Bm2.

Lastly, we remark that the polynomial construction of A is straightforward by iterating over A and following the definitions given at the beginning of the section. We summarise the main result of this section as follows:

Theorem 16.

Geometric Graph Edit Distance is strongly 𝖭𝖯-hard on weighted interval graphs for the class Πedgeless.

We notice that Theorem 16 can be extended to show that obtaining graphs in Πacyc and Πk-clique¯ is also strongly 𝖭𝖯-hard.

In particular, when obtaining a graph in Πk-clique¯, we create k1 overlapping copies of the intervals in sb and add k1 overlapping intervals of size B into the spaces between intervals of sb with the same moving distance function. Any interval forms a k-clique with the k copies of overlapping intervals. Consequently, moving the intervals of with total moving distance of at most 3Bm2 is equivalent to removing all k-cliques from A with at most the same distance. Moreover, by the chordality of interval graphs, it is sufficient to obtain a graph in Πk-clique¯ when k=3 for class Πacyc. As a result, Corollary 17 is obtained.

Corollary 17.

Geometric Graph Edit Distance is strongly 𝖭𝖯-hard on weighted interval graphs for classes Πacyc and Πk-clique¯.

5 Minimising the Maximum Moving Distance for 𝚷edgeless on Unit Disk Graphs is Hard

In this section, we deal with the minimax version of Geometric Graph Edit Distance, defined as follows:

We show that minimax-Geometric Graph Edit Distance is strongly 𝖭𝖯-hard on unit disk graphs for Π=Πedgeless over the L1 and L2 distances by reducing from Planar 3-SAT. Specifically, we show a proof for Theorem 18.

Theorem 18.

minimax-Geometric Graph Edit Distance is strongly 𝖭𝖯-hard on unit disk graphs for Πedgeless over the L1 and L2 distances.

Due to space constraints, we only give an overview of the reduction. The complete reduction and proofs can be found in the full-version of the paper [17].

5.1 Proof Overview of Theorem 18: Reducing Planar 3-SAT to minimax-Geometric Graph Edit Distance

We show a reduction from the following 𝖭𝖯-complete variation of Planar 3-SAT [21, 18, 26]. Given CNF formula Φ equipped with a planar rectilinear embedding GΦ, a set X of n variables, a set C of m clauses over X such that each cC has length |c|3, each variable xX appears in at most three clauses, and Φ=cCc, Planar 3-SAT asks whether Φ is satisfiable. We give a simplified overview of the reduction. The idea is to emulate each component (clauses, variables and connectors) of GΦ using disk gadgets and construct a collection of disks 𝒟Φ equivalent to GΦ. That is, our objective is to construct a 𝒟Φ such that Φ is satisfiable if and only if 𝒟Φ is a yes-instance of minimax-Geometric Graph Edit Distance for Πedgeless. To do this, we emulate the truth assignment using a proper movement of disks. To force the disk movement, we deliberately insert intersecting disks in 𝒟Φ. In particular, we insert intersecting disks in clause gadgets and restrict the movement of such disks to moving a sequence of disks such that a free slot of a variable gadget is used. To allow the removal of the intersection, the gadgets are connected following the structure of GΦ using consecutive disks separated by distance K. For example, consider the boolean formula Φ and its rectilinear embedding GΦ, illustrated in Figure 2.

Figure 2: Reduction Overview: An arbitrary instance Φ of Planar 3-SAT with its rectilinear embedding GΦ.

A skeleton of the reduction is shown in Figure 3(a), where representations of clause and variable gadgets are connected following GΦ.

Figure 3: Reduction Overview: (a) The skeleton given by the instance (Φ,GΦ) of Figure 2; (b) The intersection of the gadget for c=(x1x2¯x4) is removed by moving disks in a way that a free slot of the gadget for x2 is used. Since c=𝑡𝑟𝑢𝑒 when x2=𝑓𝑎𝑙𝑠𝑒, the free slots for the other two gadgets become blocked, being unable to remove their intersection using the variable gadget for x2.

Let c=(x1x2¯x4) and suppose that x2 is assigned to 𝑓𝑎𝑙𝑠𝑒. This assignment implies a movement of disks that (i) removes the intersections in the clause gadget for c and (ii) blocks the truth value of the variable gadget for x2 (see Figure 3(b)). We must block the truth value of the variable gadget so that another clause gadget c does not use the free slot in the variable gadget for x2 when x2=𝑡𝑟𝑢𝑒. Consequently, their intersections must be removed using other gadgets. It can be shown that removing all intersections in this way is equivalent to a valid assignment of variables for which Φ=𝑡𝑟𝑢𝑒. The disks are moved by assigning a new location, and the distance is calculated using a function that we call moving distance function, which is the L1 or L2 distance metric multiplied by a distance weight. We employ two types of disks classified by their distance weight, called transition disk and heavy disk. The transition disks are the disks that we aim to move, whereas heavy disks are used to restrict the movement of transition disks. The moving distance function of a heavy disk is intuitively defined such that any significant movement that alters the construction exceeds a distance of K. We show that a solution that allows removing all intersections from 𝒟Φ with minimum maximum moving distance K exclusively relies on the movement of transition disks. We remark that, although heavy disks can move, their movement is negligible. Combining this condition and the above construction, it can be shown that Φ is satisfiable if and only if 𝒟Φ can be modified so that G(𝒟Φ)Πedgeless using minimum maximum moving distance K.

6 Concluding Remarks

The main contribution of this paper is two-fold. First, we continued the study of Geometric Graph Edit Distance originally presented in [16], showing complexity results for obtaining graphs in several classes for sparse graphs on interval graphs. In particular, we showed that obtaining a graph in Πedgeless, Πacyc and Πk-clique¯ is solvable in O(nlogn) time on unit interval graphs. In contrast, we showed that the problem becomes strongly 𝖭𝖯-hard on weighted interval graphs for the same classes. Second, we defined minimax-Geometric Graph Edit Distance as a variation of the above problem and showed that it is strongly 𝖭𝖯-hard for Πedgeless on weighted unit disk graphs over the L1 and L2 distances.

There are several directions for further research. Our results provide a comprehensive picture of the complexity of Geometric Graph Edit Distance on interval graphs. In particular, we showed that the problem becomes hard even in lower dimensions when the input is not restricted by interval size and distance weight. As a result, a potential future work is to study the complexity when exclusively one of the restrictions is applied. Another interesting direction is to study the model for Πedgeless in higher dimensions. Related works [10, 11, 9] suggest that our model on more complex intersection graphs becomes intractable for some of the graph classes presented in this work. In general, we deal with the edit operation that moves the objects of the given intersection graph. However, the model is not restricted to this operation. Determining Geometric Graph Edit Distance using other geometric edit operations (such as shrinking or rotating objects) is left for future research for all intersection graphs and graph classes presented in this work.

References

  • [1] Stephen P. Boyd and Lieven Vandenberghe. Convex Optimization. Cambridge University Press, 2004. doi:10.1017/cbo9780511804441.
  • [2] Pablo Burzyn, Flavia Bonomo, and Guillermo Durán. NP-completeness results for edge modification problems. Discrete Applied Mathematics, 154(13):1824–1844, 2006. doi:10.1016/j.dam.2006.03.031.
  • [3] Fan R. K. Chung and David Mumford. Chordal Completions of Planar Graphs. J. Comb. Theory Ser. B, 62(1):96–106, 1994. doi:10.1006/jctb.1994.1056.
  • [4] Thomas H. Cormen, Charles E. Leiserson, Ronald L. Rivest, and Clifford Stein. Introduction to Algorithms, 3rd Edition. MIT Press, 2009.
  • [5] Christophe Crespelle, Pål Grønås Drange, Fedor V. Fomin, and Petr A. Golovach. A survey of parameterized algorithms and the complexity of edge modification. Computer Science Review, 48:100556, 2023. doi:10.1016/j.cosrev.2023.100556.
  • [6] Mark de Berg, Sándor Kisfaludi-Bak, and Gerhard J. Woeginger. The complexity of dominating set in geometric intersection graphs. Theoretical Computer Science, 769:18–31, 2019. doi:10.1016/j.tcs.2018.10.007.
  • [7] Reinhard Diestel. Graph Theory, 5th Edition. Graduate texts in mathematics. Springer, Berlin, Germany, 2017. doi:10.1007/978-3-662-53622-3.
  • [8] Pål Grønås Drange. Parameterized Graph Modification Algorithms. PhD thesis, The University of Bergen, 2015. URL: https://bora.uib.no/bora-xmlui/handle/1956/10774.
  • [9] Jirí Fiala, Jan Kratochvíl, and Andrzej Proskurowski. Systems of distant representatives. Discrete Applied Mathematics, 145(2):306–316, 2005. doi:10.1016/j.dam.2004.02.018.
  • [10] Fedor V. Fomin, Petr A. Golovach, Tanmay Inamdar, Saket Saurabh, and Meirav Zehavi. Kernelization for spreading points. In ESA 2023, volume 274 of LIPIcs, pages 48:1–48:16, 2023. doi:10.4230/LIPICS.ESA.2023.48.
  • [11] Fedor V. Fomin, Petr A. Golovach, Tanmay Inamdar, Saket Saurabh, and Meirav Zehavi. Parameterized geometric graph modification with disk scaling. In ITCS 2025, volume 325 of LIPIcs, pages 51:1–51:17. Schloss Dagstuhl – Leibniz-Zentrum für Informatik, 2025. doi:10.4230/LIPICS.ITCS.2025.51.
  • [12] Fedor V. Fomin, Saket Saurabh, and Neeldhara Misra. Graph modification problems: A modern perspective. In Frontiers in Algorithmics, pages 3–6. Springer International Publishing, 2015. doi:10.1007/978-3-319-19647-3_1.
  • [13] M. R. Garey and David S. Johnson. Computers and Intractability: A Guide to the Theory of NP-Completeness. W.H. Freeman, 1979.
  • [14] Marc Hellmuth, Manuela Geiß, and Peter F. Stadler. Complexity of modification problems for reciprocal best match graphs. Theoretical Computer Science, 809:384–393, 2020. doi:10.1016/j.tcs.2019.12.033.
  • [15] Hung P. Hoang, Stefan Lendl, and Lasse Wulf. Assistance and interdiction problems on interval graphs. Discrete Applied Mathematics, 340:153–170, 2023. doi:10.1016/j.dam.2023.06.046.
  • [16] Nicolás Honorato-Droguett, Kazuhiro Kurita, Tesshu Hanaka, and Hirotaka Ono. Algorithms for optimally shifting intervals under intersection graph models. In IJTCS-FAW 2024, volume 14752, pages 66–78. Springer, 2024. doi:10.1007/978-981-97-7752-5_5.
  • [17] Nicolás Honorato-Droguett, Kazuhiro Kurita, Tesshu Hanaka, and Hirotaka Ono. On the complexity of minimising the moving distance for dispersing objects, 2025. doi:10.48550/arXiv.2502.12903.
  • [18] Donald E. Knuth and Arvind Raghunathan. The problem of compatible representatives. SIAM Journal on Discrete Mathematics, 5(3):422–427, 1992. doi:10.1137/0405033.
  • [19] John M. Lewis and Mihalis Yannakakis. The node-deletion problem for hereditary properties is NP-Complete. Journal of Computer and System Sciences, 20(2):219–230, 1980. doi:10.1016/0022-0000(80)90060-4.
  • [20] Heng Li and Richard Durbin. Fast and accurate short read alignment with burrows–wheeler transform. Bioinformatics, 25(14):1754–1760, 2009. doi:10.1093/bioinformatics/btp324.
  • [21] David Lichtenstein. Planar formulae and their uses. SIAM Journal on Computing, 11(2):329–343, 1982. doi:10.1137/0211025.
  • [22] Terry A. McKee and F. R. McMorris. Topics in Intersection Graph Theory. Discrete Mathematics and Applications. Society for Industrial and Applied Mathematics, 1999. doi:10.1137/1.9780898719802.
  • [23] Fahad Panolan, Saket Saurabh, and Meirav Zehavi. Contraction decomposition in unit disk graphs and algorithmic applications in parameterized complexity. ACM Trans. Algorithms, 20(2):15, 2024. doi:10.1145/3648594.
  • [24] Franco P. Preparata and Michael Ian Shamos. Computational Geometry - An Introduction. Texts and Monographs in Computer Science. Springer, 1985. doi:10.1007/978-1-4612-1098-6.
  • [25] R. Sritharan. Graph modification problem for some classes of graphs. Journal of Discrete Algorithms, 38-41:32–37, 2016. doi:10.1016/j.jda.2016.06.003.
  • [26] Craig A. Tovey. A simplified NP-complete satisfiability problem. Discrete Applied Mathematics, 8(1):85–89, 1984. doi:10.1016/0166-218x(84)90081-7.