When Distances Lie: Euclidean Embeddings in the Presence of Outliers and Distance Violations

Bentert, Matthias; Fomin, Fedor V.; Golovach, Petr A.; Ramanujan, M. S.; Saurabh, Saket

doi:10.4230/LIPIcs.SoCG.2025.15

Matthias Bentert, University of Bergen, Norway

Fedor V. Fomin, University of Bergen, Norway

Petr A. Golovach, University of Bergen, Norway

M. S. Ramanujan, University of Warwick, UK

Saket Saurabh, Institute of Mathematical Sciences, Chennai, India, and University of Bergen, Norway

Abstract

Distance geometry explores the properties of distance spaces that can be exactly represented as the pairwise Euclidean distances between points in $\mathbb{R}^{d}$ ( $d\geq 1$ ), or equivalently, distance spaces that can be isometrically embedded in $\mathbb{R}^{d}$ . In this work, we investigate whether a distance space can be isometrically embedded in $\mathbb{R}^{d}$ after applying a limited number of modifications. Specifically, we focus on two types of modifications: outlier deletion (removing points) and distance modification (adjusting distances between points). The central problem, Euclidean Embedding Editing, asks whether an input distance space on $n$ points can be transformed, using at most $k$ modifications, into a space that is isometrically embeddable in $\mathbb{R}^{d}$ .

We present several fixed-parameter tractable (FPT) and approximation algorithms for this problem. Our first result is an algorithm that solves Euclidean Embedding Editing in time $(dk)^{\mathcal{O}(d+k)}+n^{\mathcal{O}(1)}$ . The core subroutine of this algorithm, which is of independent interest, is a polynomial-time method for compressing the input distance space into an equivalent instance of Euclidean Embedding Editing with $\mathcal{O}((dk)^{2})$ points.

For the special but important case of Euclidean Embedding Editing where only outlier deletions are allowed, we improve the parameter dependence of the FPT algorithm and obtain a running time of $\min\{(d+3)^{k},2^{d+k}\}\cdot n^{\mathcal{O}(1)}$ . Additionally, we provide an FPT-approximation algorithm for this problem, which outputs a set of at most $2\cdot\mathsf{Opt}$ outliers in time $2^{d}\cdot n^{\mathcal{O}(1)}$ . This 2-approximation algorithm improves upon the previous $(3+\varepsilon)$ -approximation algorithm by Sidiropoulos, Wang, and Wang [SODA ’17]. Furthermore, we complement our algorithms with hardness results motivating our choice of parameterizations.

Keywords and phrases:

Parameterized Complexity, Euclidean Embedding, FPT-approximation

Funding:

Matthias Bentert: Supported by the European Research Council (ERC) under the European Union’s Horizon 2020 research and innovation programme (grant agreement No. 819416).

Fedor V. Fomin: Supported by the Research Council of Norway under BWCA project (grant no. 314528).

Petr A. Golovach: Supported by the Research Council of Norway under BWCA project (grant no. 314528).

M. S. Ramanujan: Supported by Engineering and Physical Sciences Research Council (EPSRC) grant EP/V044621/1.

Saket Saurabh: Supported by the European Research Council (ERC) under the European Union’s Horizon 2020 research and innovation programme (grant agreement No. 819416); and Swarnajayanti Fellowship grant DST/SJF/MSA-01/2017-18.

Copyright and License:

© Matthias Bentert, Fedor V. Fomin, Petr A. Golovach, M. S. Ramanujan, and Saket Saurabh; licensed under Creative Commons License CC-BY 4.0

2012 ACM Subject Classification:

Mathematics of computing $\rightarrow$ Combinatorial algorithms; Theory of computation $\rightarrow$ Fixed parameter tractability

Editors:

Oswin Aichholzer and Haitao Wang

Series and Publisher:

Leibniz International Proceedings in Informatics, Schloss Dagstuhl – Leibniz-Zentrum für Informatik

1 Introduction

A Euclidean Distance Matrix (EDM) is a matrix containing the squared Euclidean distances between the points of a set. A central problem in Distance Geometry is determining whether a given matrix is an EDM [5, 25, 11, 13]. That is, the task is to decide whether a given distance space can be isometrically embedded into an $\ell_{2}$-space, or equivalently, whether its distances can be realized as the pairwise Euclidean distances among points in $\mathbb{R}^{d}$ ($d\geq 1$). This problem has a rich history, originating with Cayley [7], whose observations were formalized by Menger [28], leading to Cayley-Menger determinants. Schoenberg [30] further advanced the field by characterizing $\ell_{2}$-embeddable distances using negative type inequalities. Blumenthal’s monograph [5] remains a foundational text, documenting the theoretical underpinnings of this area.

This problem has numerous applications, including sensor network localization [29, 14], molecular conformation [21], data visualization [6], statistics [17], psychology [34, 31], learning image manifolds [36], handwriting recognition [24], studying musical rhythms [12], and signal processing [15]. The complexity of determining whether a matrix is an EDM is well understood, and it can be efficiently addressed using Singular Value Decomposition (SVD) [6, 35].

In many practical applications involving EDMs, errors due to noise, missing values, or approximation inaccuracies are common. To address these challenges, several heuristic approaches based on Multidimensional Scaling, Low-Rank Matrix Approximation, and Semidefinite Programming have been developed to reconstruct the matrix and create an embedding that minimizes the impact of these errors. We refer readers to [6, 15] for a comprehensive overview of the extensive literature on this topic.

Despite the importance and numerous practical applications of the EDM recognition problem with noise and errors, little was known about its computational complexity until recently. A notable exception is the work of Sidiropoulos, Wang, and Wang [32], which initiates the study of EDM in the presence of outliers. The work of Sidiropoulos et al. serves as the initial foundation for our studies.

In this paper, we address the algorithmic question of minimizing the number of edits required to transform a given distance matrix into EDM. Specifically, we aim to apply the minimum number of modifications to a given distance space $(X,\operatorname{\rho})$ with distance function $\operatorname{\rho}$ , such that the resulting distance space can be isometrically embedded into a $d$ -dimensional Euclidean space $\mathbb{R}^{d}$ . We consider two types of editing operations.

The first operation is element deletion. In matrix terms, this operation corresponds to deleting the row and column associated with the element we choose to remove. Following Sidiropoulos, Wang, and Wang [32], we refer to the elements removed from the distance space as outliers.

The second operation is distance modification. Let $X^{(2)}$ denote the set of unordered pairs of distinct elements in $X=\{x_{1},x_{2},\ldots\}$ . For a pair of elements $x_{i},x_{j}\in X$ , the modification operation alters the distance $\rho(x_{i},x_{j})$ . In terms of the distance matrix, this operation changes the ${ij}$ -th and the ${ji}$ -th entries. The problem of minimizing the number of modification operations to embed the resulting distance space in general metric spaces was studied in [18, 9], and embedding into ultrametric spaces was investigated in [9, 8].

Our primary algorithmic question is as follows: given integers $k_{O}$ and $k_{M}$ , can a distance space $(X,\operatorname{\rho})$ be transformed into a distance space that is embeddable in $\mathbb{R}^{d}$ by removing at most $k_{O}$ outliers and performing at most $k_{M}$ modifications? In fact, we address an even more general weighted version of this problem, where each operation (outlier deletion or distance modification) is assigned a specific weight. More precisely, we study:

Two special cases of Euclidean Embedding Editing are of particular importance. The variant of the problem with $k_{M}=0$, that is, of placing all but $k_{O}$ outlier points of a distance space $(X,\operatorname{\rho})$ into a Euclidean space $\mathbb{R}^{d}$ of a given dimension $d$ such that the Euclidean distance between any pair of placed points $x,y$ is equal to $\operatorname{\rho}(x,y)$, is called Euclidean Embedding with Outliers [32]. For the variant with $k_{O}=0$, that is, without deletion of outliers, following [18], we use the name Euclidean Metric Violation Distance.

EXAMPLE: The distance space ${{\cal X}}=(X,\operatorname{\rho})$ with $X=\{1,\dots,9\}$ and $\operatorname{\rho}$ defined by distance matrix $D$ , where the $i j$ -th entry of $D$ is $\operatorname{\rho}^{2}(i,j)$

D=\begin{bmatrix}
0&{\color{red}7}&1&2&4&5&{\color{blue}1}&4&5\\
{\color{red}7}&0&2&1&1&2&{\color{blue}10}&5&4\\
1&2&0&1&{\color{red}11}&4&{\color{blue}4}&1&2\\
2&1&1&0&2&1&{\color{blue}8}&2&1\\
4&1&{\color{red}11}&2&0&1&{\color{blue}12}&8&5\\
5&2&4&1&1&0&{\color{blue}7}&5&2\\
{\color{blue}1}&{\color{blue}10}&{\color{blue}4}&{\color{blue}8}&{\color{blue}12}&{\color{blue}7}&{\color{blue}0}&{\color{blue}8}&{\color{blue}9}\\
4&5&1&2&8&5&{\color{blue}8}&0&1\\
5&4&2&1&5&2&{\color{blue}9}&1&0
\end{bmatrix},

and $k_{O}=1$, $k_{M}=2$, $d=2$, all weights equal to one, and $W=3$, is a yes-instance of Euclidean Embedding Editing. Indeed, the distance space ${{\cal X}}^{\prime}=(X^{\prime},\operatorname{\rho}^{\prime})$ defined by the distance matrix $D^{\prime}$ in Fig. 1, which is obtained from ${{\cal X}}$ by modifying the distances of the pairs ${\color{red}\{1,2\}}$ and ${\color{red}\{3,5\}}$ and deleting the outlier ${\color{blue}7}$, is isometrically embeddable in $\mathbb{R}^{2}$.

Figure 1: Matrix $D^{\prime}$ and an embedding of its distance space in $\mathbb{R}^{2}$.
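The example can be checked mechanically. The sketch below is only an illustration: the coordinates are an assumption reconstructed from the entries of $D$ (a $3\times 3$ integer grid with one position unused), not taken from Fig. 1, and the helper name `mismatched_pairs` is hypothetical. It lists the pairs of non-deleted points whose squared distance in $D$ disagrees with the assumed planar placement.

```python
import numpy as np

# Squared distances from the example (points are numbered 1..9).
D = np.array([
    [0, 7, 1, 2, 4, 5, 1, 4, 5],
    [7, 0, 2, 1, 1, 2, 10, 5, 4],
    [1, 2, 0, 1, 11, 4, 4, 1, 2],
    [2, 1, 1, 0, 2, 1, 8, 2, 1],
    [4, 1, 11, 2, 0, 1, 12, 8, 5],
    [5, 2, 4, 1, 1, 0, 7, 5, 2],
    [1, 10, 4, 8, 12, 7, 0, 8, 9],
    [4, 5, 1, 2, 8, 5, 8, 0, 1],
    [5, 4, 2, 1, 5, 2, 9, 1, 0],
], dtype=float)

# Assumed planar coordinates for all points except the outlier 7.
coords = {1: (0, 0), 2: (0, 1), 3: (1, 0), 4: (1, 1),
          5: (0, 2), 6: (1, 2), 8: (2, 0), 9: (2, 1)}

def mismatched_pairs(D, coords):
    """Pairs {i, j} whose entry in D differs from the squared distance
    between the assumed coordinates of i and j."""
    labels = sorted(coords)
    bad = []
    for a, i in enumerate(labels):
        for j in labels[a + 1:]:
            p, q = np.array(coords[i]), np.array(coords[j])
            if D[i - 1, j - 1] != float(np.sum((p - q) ** 2)):
                bad.append((i, j))
    return bad

print(mismatched_pairs(D, coords))
```

Under these assumed coordinates only the pairs $\{1,2\}$ and $\{3,5\}$ disagree with $D$, which matches the two modifications and one deletion used in the example.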

Our results.

First, we present a compression for Euclidean Embedding Editing: a polynomial-time algorithm that reduces an instance of the problem to an equivalent instance with the number of points bounded by $\mathcal{O}(k^{2}d^{2})$ (Theorem 8), where $k=k_{O}+k_{M}$. As part of this compression, we also propose a $(d+3)$-approximation algorithm for Euclidean Embedding with Outliers, which runs in polynomial time (Lemma 10). Using this compression algorithm, we design an FPT algorithm for Euclidean Embedding Editing with a running time of $(dk)^{\mathcal{O}(d+k)}+n^{\mathcal{O}(1)}$ (Theorem 20).

For Euclidean Embedding with Outliers, a particular but important special case of Euclidean Embedding Editing – where the goal is to determine whether the distance space, excluding up to $k_{O}$ outliers, can be embedded in $\mathbb{R}^{d}$ – we obtain better dependence on the parameters with a running time of $\min\{(d+3)^{k_{O}},2^{d+k_{O}}\}\cdot n^{\mathcal{O}(1)}$ (Theorem 21). Furthermore, for this problem, we propose a 2-approximation algorithm (Theorem 22). This randomized algorithm, with a running time of $2^{d}\cdot n^{\mathcal{O}(1)}$ , guarantees a solution with at most $2\cdot\mathsf{Opt}$ outliers. As is common in Computational Geometry, all our algorithms operate under the real RAM computational model, assuming that basic operations over real numbers can be executed in unit time.

We also complement our algorithmic results with lower-bound proofs, establishing that both Euclidean Embedding with Outliers and Euclidean Metric Violation Distance are $\operatorname{NP}$-hard even when $d=1$ (Theorems 26 and 27). So, FPT algorithms for these problems parameterized by $d$ alone appear to be out of reach, motivating our FPT-approximation algorithm. Additionally, we prove that Euclidean Embedding with Outliers is W[1]-hard when parameterized by $k_{O}$ alone (Theorem 28). These lower bounds indicate that to get an FPT algorithm for Euclidean Embedding Editing, it is important that we parameterize by both $d$ and the solution size $k_{O}+k_{M}$. Importantly, our lower-bound results remain valid even in the unweighted case.

Related work.

The computational complexity study of Euclidean Embedding with Outliers was initiated by Sidiropoulos, Wang, and Wang [32]. They demonstrated that, assuming the Unique Games Conjecture, the problem of computing a minimum outlier embedding into $d$ -dimensional Euclidean space for any $d\geq 2$ is NP-hard to approximate within a factor of $2-\varepsilon$ for any $\varepsilon>0$ . On the algorithmic side, they showed that a $2$ -approximation can be achieved in $\mathcal{O}(n^{d+3})$ time. Additionally, they established that a $(3+\varepsilon)$ -approximation is achievable in $(2/\varepsilon)^{d}d^{\mathcal{O}(1)}n^{2}\log{n}$ time. For exact solutions, they noted an algorithm with a runtime of $\mathcal{O}(n^{d+3})+2^{k_{O}}n^{2}$ .

They also obtained results related to bicriteria approximation involving nonisomorphic embeddings, approximation of the number of outliers, and distortion. The first result is an $(\mathcal{O}(\sqrt{\delta}),(2d+2)k_{O})$-relative outlier embedding in $\mathbb{R}^{d}$ in time $\frac{1}{\delta^{\mathcal{O}(1)}}2^{\mathcal{O}(d)}n^{2}\log{n}$. The second result is an $(\mathcal{O}(\sqrt{\delta}),2k_{O})$-relative outlier embedding in time $\frac{1}{\delta^{\mathcal{O}(1)}}{k_{O}}^{\mathcal{O}(d)}n^{2}$. (They define an algorithm as an $\mathcal{O}(f(\delta),g(k_{O}))$-relative outlier embedding of ${{\cal X}}=(X,\operatorname{\rho})$ to $\mathbb{R}^{d}$, for some functions $f$ and $g$, if it either correctly decides that no embedding with distortion at most $\delta$ exists after removing $k_{O}$ outliers, or outputs a set $Y$ of size $g(k_{O})$ such that there is an embedding of $X\setminus Y$ into $\mathbb{R}^{d}$ of distortion $\mathcal{O}(f(\delta))$.) Since allowing distortion could significantly decrease the number of outliers, our 2-approximation algorithm for Euclidean Embedding with Outliers, Theorem 22, is incomparable with these results.

We are not aware of any algorithms with guaranteed performance for Euclidean Metric Violation Distance. Parameterized and approximation algorithms for metric violation distance problems for general metrics, tree distances, and ultrametrics can be found in [18, 8, 9, 20]. More generally, embeddings of various metric spaces are a fundamental primitive in the design of algorithms [22, 23, 27, 26, 2, 3], though prior work often focuses on minimizing embedding distortion.

2 Preliminaries

Let $X$ be a set. A function $\operatorname{\rho}\colon X\times X\to\mathbb{R}_{\geq 0}$ is a distance on $X$ if: (i) $\operatorname{\rho}$ is symmetric, that is, for any $x,y\in X$, $\operatorname{\rho}(x,y)=\operatorname{\rho}(y,x)$, and (ii) $\operatorname{\rho}(x,x)=0$ for all $x\in X$. Then, $(X,\operatorname{\rho})$ is called a distance space. If, in addition, $\operatorname{\rho}$ satisfies the triangle inequality $\operatorname{\rho}(x,z)\leq\operatorname{\rho}(x,y)+\operatorname{\rho}(y,z)$ for any $x,y,z\in X$, then $\operatorname{\rho}$ is called a semimetric on $X$. And if $\operatorname{\rho}(x,y)=0$ only for $x=y$, then $\operatorname{\rho}$ is called a metric, and $(X,\operatorname{\rho})$ a metric space.

More precisely, recall that for two points $p,q\in\mathbb{R}^{d}$ , the Euclidean distance between $p$ and $q$ is $\|p-q\|_{2}=\sqrt{\langle p,p\rangle+\langle q,q\rangle-2\langle p,q\rangle}$ . We say that distance space $(X,\operatorname{\rho})$ is isometrically embeddable into $\mathbb{R}^{d}$ if there is a map, called isometric embedding, $\varphi\colon X\to\mathbb{R}^{d}$ such that $\operatorname{\rho}(x,y)=\|\varphi(x)-\varphi(y)\|_{2}$ for all $x,y\in X$ . Notice that we do not require $\varphi$ to be injective, that is, several points of $(X,\operatorname{\rho})$ may be mapped to the same point of $\mathbb{R}^{d}$ . Throughout the paper, whenever we mention an embedding, we mean an isometric embedding. Moreover, when we use the term $d$ -embedding, we are referring to embedding into $\mathbb{R}^{d}$ . A $d$ -embeddable distance space is strongly $d$ -embeddable if it is not $(d-1)$ -embeddable. We use $X^{(2)}$ to denote the set of unordered pairs of two distinct elements of $X$ . As convention, we assume that the empty set of points is $d$ -embeddable for every $d$ .

Let ${{\cal X}}=(X,\operatorname{\rho})$ be a distance space where $X=\{x_{1},\ldots,x_{n}\}$ , and let $\operatorname{\rho}_{i,j}=\operatorname{\rho}(x_{i},x_{j})$ for all $i,j\in\{1,\ldots,n\}$ . Then ${{\cal X}}$ is equivalently defined by the distance matrix $D(\rho)$ , where $D(\rho)_{i,j}=\operatorname{\rho}_{i,j}^{2}$ .

We drop the explicit reference to $\rho$ when it is clear from the context and instead of $D(\rho)$ , simply write $D$ . Suppose that ${{\cal X}}$ is embeddable into $\mathbb{R}^{d}$ . The ordered set $P=(p_{1},\ldots,p_{n})$ of points in $\mathbb{R}^{d}$ is said to be a realization of ${{\cal X}}$ if there is an embedding $\varphi\colon X\to\mathbb{R}^{d}$ such that $\varphi(x_{i})=p_{i}$ for all $i\in\{1,\ldots,n\}$ . We use the well-known property (see e.g. [15]) that a realization is unique up to rigid transformations of $\mathbb{R}^{d}$ , that is, distance-preserving transformations of Euclidean space (rotations, reflections, translations).

Proposition 1 ([15]).

Let $(p_{1},\ldots,p_{n})$ and $(q_{1},\ldots,q_{n})$ be ordered sets of points in $\mathbb{R}^{d}$. Then $\|p_{i}-p_{j}\|_{2}=\|q_{i}-q_{j}\|_{2}$ for all $i,j\in\{1,\ldots,n\}$ if and only if there is a rigid transformation of $\mathbb{R}^{d}$ mapping $p_{i}$ to $q_{i}$ for all $i\in\{1,\ldots,n\}$.

A realization can be constructed (if it exists) in polynomial time.

Proposition 2 ([1, 33]).

Given a distance space ${{\cal X}}=(X,\operatorname{\rho})$ with $n$ points and a positive integer $d$ , in $\mathcal{O}(n^{3})$ time, it can be decided whether ${{\cal X}}$ can be embedded into $\mathbb{R}^{d}$ and, if such an embedding exists, then a realization can be constructed in this running time.
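As an illustration of Proposition 2, the following sketch tests $d$-embeddability and builds a realization. It is not the algorithm of [1, 33]: it swaps in the classical eigendecomposition of the Gram matrix (the idea behind classical multidimensional scaling) and uses a floating-point tolerance `tol`, which is an assumption standing in for the exact real-RAM arithmetic of the paper. The function names are hypothetical.

```python
import numpy as np

def gram_matrix(D2):
    """Gram matrix <p_i - p_0, p_j - p_0> recovered from squared distances D2."""
    D2 = np.asarray(D2, dtype=float)
    return 0.5 * (D2[:, :1] + D2[:1, :] - D2)

def realize(D2, d, tol=1e-9):
    """Return an (n, d) array of points realizing D2 in R^d, or None if
    no isometric embedding into R^d exists (up to the tolerance)."""
    G = gram_matrix(D2)
    n = G.shape[0]
    if n == 0:
        return np.zeros((0, d))          # the empty space is d-embeddable
    w, V = np.linalg.eigh(G)             # eigenvalues in ascending order
    scale = max(1.0, float(np.abs(w).max()))
    if w[0] < -tol * scale:              # negative eigenvalue: not Euclidean
        return None
    keep = w > tol * scale
    if int(keep.sum()) > d:              # more than d dimensions are needed
        return None
    P = V[:, keep] * np.sqrt(w[keep])    # coordinates in R^rank
    return np.hstack([P, np.zeros((n, d - P.shape[1]))])
```

For instance, the squared distances of the unit square are realized by `realize(sq, 2)` but `realize(sq, 1)` returns `None`. The eigendecomposition costs $\mathcal{O}(n^{3})$ arithmetic operations, in line with the bound in the proposition.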

Definition 3 (Metric basis).

Let $(X,\operatorname{\rho})$ be a $d$ -embeddable distance space. A set $Y\subseteq X$ is a metric basis if, given an isometric embedding $\varphi$ of $(Y,\operatorname{\rho})$ into $\mathbb{R}^{d}$ , there is a unique way to extend $\varphi$ to an isometric embedding of $(X,\operatorname{\rho})$ . Equivalently, if a realization of $(Y,\operatorname{\rho})$ is fixed then the embedding of any point of $X\setminus Y$ in a $d$ -embedding of $(X,\operatorname{\rho})$ is unique.

We will use the well-known characterization of strong embeddability of a distance space into a Euclidean space. For $r+1$ points $x_{0},x_{1},\dots,x_{r}$ of a distance space $(X,\operatorname{\rho})$, the Cayley-Menger determinant is the determinant of the matrix obtained from the distance matrix by prepending a row and a column whose first element is zero and whose other elements are one. Formally, let $\operatorname{\rho}_{i,j}=\operatorname{\rho}(x_{i},x_{j})$, $i,j\in\{0,\dots,r\}$. Then the Cayley-Menger determinant is

CM(x_{0},x_{1},\dots,x_{r})=\det\left(\begin{matrix}
0&1&1&1&\dots&1\\
1&0&\operatorname{\rho}_{0,1}^{2}&\operatorname{\rho}_{0,2}^{2}&\dots&\operatorname{\rho}_{0,r}^{2}\\
1&\operatorname{\rho}_{0,1}^{2}&0&\operatorname{\rho}_{1,2}^{2}&\dots&\operatorname{\rho}_{1,r}^{2}\\
1&\operatorname{\rho}_{0,2}^{2}&\operatorname{\rho}_{1,2}^{2}&0&\dots&\operatorname{\rho}_{2,r}^{2}\\
\vdots&\vdots&\vdots&\vdots&\ddots&\vdots\\
1&\operatorname{\rho}_{0,r}^{2}&\operatorname{\rho}_{1,r}^{2}&\operatorname{\rho}_{2,r}^{2}&\dots&0
\end{matrix}\right)
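A minimal numerical sketch of this definition, assuming the input is the matrix of squared distances $\operatorname{\rho}_{i,j}^{2}$ among the chosen points (the function name `cayley_menger` is hypothetical):

```python
import numpy as np

def cayley_menger(D2):
    """Cayley-Menger determinant of points with squared-distance matrix D2:
    border D2 with a first row and column (0, 1, ..., 1) and take det."""
    D2 = np.asarray(D2, dtype=float)
    m = D2.shape[0]
    B = np.ones((m + 1, m + 1))
    B[0, 0] = 0.0
    B[1:, 1:] = D2
    return float(np.linalg.det(B))

# Two points at distance 3: CM = 2 * 3^2 = 18.
print(cayley_menger([[0, 9], [9, 0]]))
# A 3-4-5 right triangle (area 6): CM = -16 * area^2 = -576.
print(cayley_menger([[0, 9, 16], [9, 0, 25], [16, 25, 0]]))
```

The signs agree with the pattern $(-1)^{j+1}CM>0$ for non-degenerate simplices on $j+1$ points: positive for a segment ($j=1$) and negative for a triangle ($j=2$). Three collinear points give a value of zero up to rounding.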

Proposition 4 ([5, Chapter IV]).

A distance space ${{\cal X}}=(X,\rho)$ with $n$ points is strongly $d$ -embeddable if and only if there exist $d+1$ points, say $X_{d}=\{x_{0},\ldots,x_{d}\}$ , such that:

1. $(-1)^{j+1}CM(x_{0},x_{1},\dots,x_{j})>0$ for $1\leq j\leq d$, and
2. for any $x,y\in X\setminus X_{d}$, $CM(x_{0},x_{1},\dots,x_{d},x)=CM(x_{0},x_{1},\dots,x_{d},y)=CM(x_{0},x_{1},\dots,x_{d},x,y)=0.$

Equivalently (see, for example, [32]), ${{\cal X}}$ is strongly $d$-embeddable if and only if there is a set of $d+1$ points $X_{d}=\{x_{0},\ldots,x_{d}\}$ such that $(\{x_{0},\ldots,x_{j}\},\operatorname{\rho})$ is strongly $j$-embeddable for all $j\in\{1,\ldots,d\}$, and for every $x,y\in X\setminus X_{d}$, $(X_{d}\cup\{x\}\cup\{y\},\operatorname{\rho})$ is $d$-embeddable.

Thus, the embedding of the distance space ${{\cal X}}=(X,\rho)$ into $\mathbb{R}^{d}$ is essentially characterized by $d+1$ “anchor” points of $X$ . Note that in a realization of ${{\cal X}}$ that is strongly $d$ -embeddable, the anchor points correspond to a set of $d+1$ points in general position in $\mathbb{R}^{d}$ .

Following [5], we define sets of independent points.

Definition 5.

Let $(X,\operatorname{\rho})$ be a distance space. For a nonnegative integer $r$ , we say that $Y\subseteq X$ of size $r+1$ is independent if $(Y,\operatorname{\rho})$ is strongly $r$ -embeddable.
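Definition 5 can be tested numerically. In the sketch below, which is an assumption-laden illustration and not the paper's procedure, a set $Y$ of size $r+1$ is reported as independent when the Gram matrix of $(Y,\operatorname{\rho})$ is positive semidefinite with rank exactly $r$. The names `embedding_dim` and `is_independent` and the tolerance `tol` are hypothetical.

```python
import numpy as np

def embedding_dim(D2, tol=1e-9):
    """Least t such that squared distances D2 are realizable in R^t,
    or None if D2 is not Euclidean (up to the tolerance)."""
    D2 = np.asarray(D2, dtype=float)
    if D2.shape[0] == 0:
        return 0
    G = 0.5 * (D2[:, :1] + D2[:1, :] - D2)   # Gram matrix, first point at origin
    w = np.linalg.eigvalsh(G)                # ascending eigenvalues
    scale = max(1.0, float(np.abs(w).max()))
    if w[0] < -tol * scale:
        return None
    return int((w > tol * scale).sum())

def is_independent(D2, Y):
    """True if the points indexed by Y are strongly (|Y|-1)-embeddable."""
    Y = list(Y)
    sub = np.asarray(D2, dtype=float)[np.ix_(Y, Y)]
    return embedding_dim(sub) == len(Y) - 1
```

With points $(0,0),(1,0),(2,0),(0,1)$ indexed $0,\ldots,3$, the triple $\{0,1,3\}$ is independent, while the collinear triple $\{0,1,2\}$ and the full four-point set are not.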

Then our “anchors” are independent sets of size $d+1$ . We use the following fact that the family of independent sets has matroid-like properties (implicit in [5, Chapter IV]).

Proposition 6 ([5, Chapter IV]).

Let $(X,\operatorname{\rho})$ be a strongly $d$ -embeddable distance space. Then, (i) any single-element set is independent, (ii) if $Y\subseteq X$ is independent, then any $\emptyset\subset Z\subseteq Y$ is independent, (iii) if $Y,Z\subseteq X$ are independent and $|Y|>|Z|$ then there is a $y\in Y\setminus Z$ such that $Z\cup\{y\}$ is independent. Furthermore, the maximum size of an independent set is $d+1$ and any independent set $Y$ of size $d+1$ is a metric basis.

Lemma 7 ( $\star$ ).

Suppose $(X,\rho)$ is a $d$ -embeddable distance space, $Y\subset X$ is independent and $x\in X\setminus Y$ . Then, $(Y\cup\{x\},\rho)$ is $|Y|$ -embeddable.

For statements marked with ($\star$), the proofs are omitted in this extended abstract and are deferred to the full version.

3 Compression for Euclidean Embedding Editing

In this section, we show that, given an instance of Euclidean Embedding Editing, one can in polynomial time construct an equivalent instance where the number of points is upper-bounded by a polynomial of $k=k_{O}+k_{M}$ and $d$ and, moreover, the encoding of the weights is also polynomial in these parameters.

Theorem 8.

There is a polynomial-time algorithm that, given an instance $({{\cal X}}=(X,\operatorname{\rho}),w_{O},w_{M},W,k_{O},k_{M},d)$ of Euclidean Embedding Editing, either solves the problem or constructs an equivalent instance $({{\cal X}}^{\prime}=(X^{\prime},\operatorname{\rho}),w_{O}^{\prime},w_{M}^{\prime},W^{\prime},k_{O}^{\prime},k_{M}^{\prime},d)$ such that $X^{\prime}\subseteq X$, and for $k=k_{O}+k_{M}$, it holds that $|X^{\prime}|=\mathcal{O}((kd)^{2})$, $W^{\prime}=2^{\mathcal{O}((kd)^{12})}$, $w_{O}^{\prime}(x)=2^{\mathcal{O}((kd)^{12})}$ for $x\in X^{\prime}$, and $w_{M}^{\prime}(x,y)=2^{\mathcal{O}((kd)^{12})}$ for all $\{x,y\}\in X^{\prime(2)}$.

We remark that Theorem 8 does not provide a kernelization algorithm according to the standard definition [10] as the size of the output instance is not upper-bounded by a function of the parameter – the distances between the points remain the same as in the input instance. Moreover, since Euclidean Embedding Editing is a decision problem, the word “solve” in the statement of Theorem 8 technically only refers to deciding correctly whether the instance is a yes-instance. However, if the instance is determined to be a yes-instance, then our algorithm can also output a solution that witnesses this, i.e., the outlier set and modified new distance values (if any). This additional feature of our compression algorithm will be used later in our FPT algorithm for Euclidean Embedding Editing.

In the proof of Theorem 8, we first design a polynomial-time $(d+3)$-approximation algorithm (Section 3.1). Given an instance of the problem, this algorithm allows us to either find a set $A$ of at most $(d+3)k$ points such that $(X\setminus A,\operatorname{\rho})$ is $d$-embeddable or conclude that we have a no-instance.

In the first case, we iteratively construct a sequence of metric bases $Y_{1},\ldots,Y_{\ell}$ for $(Z,\operatorname{\rho})$ using Proposition 6, where initially $Z:=X\setminus A$ and in each iteration, we delete the points of the constructed basis from $Z$ . If $X\setminus A$ is sufficiently large, then there is a subsequence of bases of the same size with at least $2(k_{O}+2k_{M})+1$ elements. We select the first subsequence with this property. The crucial property exploited by our algorithm is that this subsequence uniquely defines embeddings of the majority of the points assuming that we have a yes-instance of the problem. We use this to give a series of reduction rules that identify points in $A$ that should be outliers in any solution and irrelevant points of $X\setminus A$ that could be deleted.

3.1 Bootstrapping the Compression through an Approximation

For a distance space ${{\cal X}}=(X,\operatorname{\rho})$, a $d$-outlier set is a set $A$ of points such that $(X\setminus A,\operatorname{\rho})$ is $d$-embeddable. In unweighted Euclidean Embedding with Outliers (which we call UEEO), the function $w_{O}$ assigns unit weight to every point. Sidiropoulos et al. [32] observed that this problem admits a simple greedy $(d+3)$-approximation running in $n^{\mathcal{O}(d)}$ time. We show that this algorithm can be sped up to run in $n^{\mathcal{O}(1)}$ time, i.e., with an exponent independent of $d$. To do so, we give a polynomial-time subroutine that produces a small hitting set for all minimal solutions. This subroutine will later be used as the first step of our compression algorithm for Euclidean Embedding Editing.

Lemma 9.

There is a polynomial-time algorithm that, given a distance space ${{\cal X}}=(X,\operatorname{\rho})$ and an integer $d\geq 1$ , outputs a set of points $\hat{A}$ of size at most $(d+3)$ such that if ${{\cal X}}$ is not $d$ -embeddable, then $\hat{A}$ intersects every inclusionwise minimal $d$ -outlier set of ${{\cal X}}$ .

Proof.

Assume that the distance space ${{\cal X}}=(X,\operatorname{\rho})$ does not admit a $d$ -embedding. Note that, in particular, this means that $|X|\geq 3$ . We then do the following.

1. Set $A:=\{p\}$ for an arbitrary point $p\in X$.
2. While $|A|\leq d$, do the following:
   (a) If there is a point $x\in X\setminus A$ such that $(A\cup\{x\},\operatorname{\rho})$ is not $|A|$-embeddable, then set $\hat{A}:=A\cup\{x\}$ and return $\hat{A}$.
   (b) If there is a point $x\in X\setminus A$ such that $A\cup\{x\}$ is independent, then set $A:=A\cup\{x\}$ and continue the loop, else exit the loop.
3. If there are two distinct points $x,y\in X\setminus A$ such that $(A\cup\{x,y\},\operatorname{\rho})$ is not $(|A|-1)$-embeddable, then set $\hat{A}:=A\cup\{x,y\}$ and return $\hat{A}$.

Clearly, the algorithm runs in polynomial time. Since the while loop has at most $d$ iterations, the set $A$ when the loop is exited has size at most $d+1$ and so, the returned set $\hat{A}$ has size at most $d+3$ . $\hfill\blacktriangleleft$
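The three steps of the procedure can be sketched as follows. This is a floating-point illustration under stated assumptions, not the paper's real-RAM algorithm: points are indices $0,\ldots,n-1$, the input is the matrix of squared distances, embeddability is tested through the rank and sign of a Gram matrix with a tolerance `tol`, and returning `None` when no violating set is found is a convention of this sketch. All function names are hypothetical.

```python
from itertools import combinations
import numpy as np

def embedding_dim(D2, tol=1e-9):
    """Least t such that D2 is realizable in R^t, or None if not Euclidean."""
    if D2.shape[0] == 0:
        return 0
    G = 0.5 * (D2[:, :1] + D2[:1, :] - D2)
    w = np.linalg.eigvalsh(G)
    scale = max(1.0, float(np.abs(w).max()))
    if w[0] < -tol * scale:
        return None
    return int((w > tol * scale).sum())

def dim_of(D2, S):
    """Embedding dimension of the subspace induced by the index list S."""
    return embedding_dim(D2[np.ix_(S, S)])

def embeds(D2, S, d):
    t = dim_of(D2, S)
    return t is not None and t <= d

def hitting_set(D2, d):
    """Follow steps 1-3: return at most d+3 indices, or None if no
    violating set is found (convention of this sketch)."""
    D2 = np.asarray(D2, dtype=float)
    n = D2.shape[0]
    if n == 0:
        return None
    A = [0]                                        # step 1: arbitrary start
    while len(A) <= d:                             # step 2
        rest = [x for x in range(n) if x not in A]
        for x in rest:                             # step 2(a)
            if not embeds(D2, A + [x], len(A)):
                return A + [x]
        # step 2(b): A + [x] is independent iff its dimension equals |A|
        ext = next((x for x in rest if dim_of(D2, A + [x]) == len(A)), None)
        if ext is None:
            break
        A.append(ext)
    rest = [x for x in range(n) if x not in A]
    for x, y in combinations(rest, 2):             # step 3
        if not embeds(D2, A + [x, y], len(A) - 1):
            return A + [x, y]
    return None
```

As a small check, take four points at positions $0,1,2,3$ on a line with $d=1$ and corrupt the squared distance between the last two points. The sketch then returns all four indices, a set of size $d+3$ that meets both minimal $1$-outlier sets $\{2\}$ and $\{3\}$.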

We can now use Lemma 9 as a subroutine in the greedy approximation for the minimization variant of unweighted Euclidean Embedding with Outliers.

Lemma 10 ( $\star$ ).

There is a polynomial-time algorithm that, given a distance space ${{\cal X}}=(X,\operatorname{\rho})$ and an integer $d\geq 1$ , outputs a set of points $A$ of size at most $(d+3)\cdot\mathsf{Opt}$ such that $(X\setminus A,\operatorname{\rho})$ is embeddable into $\mathbb{R}^{d}$ where $\mathsf{Opt}$ is the size of a smallest $d$ -outlier set.

3.2 Compression

We are now ready to give the compression algorithm for Euclidean Embedding Editing.

Proof of Theorem 8.

Let $({{\cal X}}=(X,\operatorname{\rho}),w_{O},w_{M},W,k_{O},k_{M},d)$ be an instance of Euclidean Embedding Editing. First, we preprocess the instance to reduce the number of points and then we explain how to reassign the weights.

Notice that if $({{\cal X}},w_{O},w_{M},W,k_{O},k_{M},d)$ is a yes-instance, then it is possible to delete at most ${k=k_{O}+k_{M}}$ points to obtain a distance space embeddable in $\mathbb{R}^{d}$. In the first step of our algorithm, we call the algorithm from Lemma 10 for the instance $({{\cal X}},k,d)$ of Euclidean Embedding with Outliers without weights. This algorithm outputs a set of points $A$ of size at most $(d+3)\cdot\mathsf{Opt}$ such that $(X\setminus A,\operatorname{\rho})$ is embeddable into $\mathbb{R}^{d}$, where $\mathsf{Opt}$ is the minimum number of outliers. If $|A|>(d+3)k$ then we conclude that $({{\cal X}},w_{O},w_{M},W,k_{O},k_{M},d)$ is a no-instance of Euclidean Embedding Editing and stop. From now on, we assume that $|A|\leq(d+3)(k_{O}+k_{M})$.

Let $Y=X\setminus A$ . We first show that if $Y$ has bounded size, then we can return the original instance of the problem and stop.

Reduction Rule 1.

If $|Y|\leq 2(k_{O}+2k_{M})(d+1)^{2}$ then return $({{\cal X}},w_{O},w_{M},W,k_{O},k_{M},d)$ .

We can apply this rule because if $|Y|\leq 2(k_{O}+2k_{M})(d+1)^{2}$ then $|X|=|Y|+|A|\leq 2(k_{O}+2k_{M})(d+1)^{2}+(k_{O}+k_{M})(d+3)\leq 5(k_{O}+k_{M})(d+3)^{2}=\mathcal{O}((kd)^{2})$. From now on, we assume that $|Y|>2(k_{O}+2k_{M})(d+1)^{2}$.

Because $A$ is a feasible solution to Euclidean Embedding with Outliers, $(Y,\operatorname{\rho})$ can be embedded into $\mathbb{R}^{d}$ . In the subsequent steps, we may delete some points in $A$ and reduce the parameter $k_{O}$ . This may result in a trivial instance which we solve by applying the following rule whenever possible.

Reduction Rule 2.

If $k_{O}<0$ then return a no-answer and stop. If $k_{O}\geq 0$ and $A=\emptyset$ then return a yes-answer and stop.

We greedily partition $Y$ into sets $Y_{1},\ldots,Y_{\ell}$ of size at most $d+1$ such that each $Y_{i}$ is an inclusion-maximal independent set of points of $Y\setminus\bigcup_{j=1}^{i-1}Y_{j}$ , that is, $Y_{i}$ is an inclusion-maximal subset such that $(Y_{i},\operatorname{\rho})$ is strongly $(|Y_{i}|-1)$ -embeddable.

Assume that $i\geq 1$ and the sets $Y_{1},\ldots,Y_{i-1}$ are already constructed. We set ${Z:=Y\setminus\bigcup_{j=1}^{i-1}Y_{j}}$ . If $Z=\emptyset$ , then the partition is constructed. If $Z\neq\emptyset$ , then we do the following:

1. To initiate the construction of $Y_{i}$, we set $Y_{i}=\{x\}$ for an arbitrary point $x\in Z$ and set $h=1$.
2. While there is a point $y\in Z\setminus Y_{i}$ such that $Y_{i}\cup\{y\}$ is independent, set $Y_{i}:=Y_{i}\cup\{y\}$ and $h:=h+1$.

Because $Y$ is $d$-embeddable, $Z$ is strongly $t$-embeddable for some $t\leq d$ and contains an independent set of size $t+1$ by Proposition 4. By Proposition 6, the described procedure will construct a set $Y_{i}$ of size $t+1\leq d+1$ such that $(Y_{i},\operatorname{\rho})$ is strongly $t$-embeddable. Notice that by Proposition 6, $Y_{i}$ is a metric basis for $(Z,\operatorname{\rho})$.
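The greedy partition described above can be sketched as follows, assuming points are given as indices into a matrix of squared distances and independence is tested numerically through the rank of a Gram matrix. The names `embedding_dim`, `is_independent`, and `partition_into_bases` and the tolerance `tol` are hypothetical, and floating-point arithmetic replaces the exact real-RAM model.

```python
import numpy as np

def embedding_dim(D2, tol=1e-9):
    """Least t such that D2 is realizable in R^t, or None if not Euclidean."""
    if D2.shape[0] == 0:
        return 0
    G = 0.5 * (D2[:, :1] + D2[:1, :] - D2)
    w = np.linalg.eigvalsh(G)
    scale = max(1.0, float(np.abs(w).max()))
    if w[0] < -tol * scale:
        return None
    return int((w > tol * scale).sum())

def is_independent(D2, S):
    """True if the points indexed by S are strongly (|S|-1)-embeddable."""
    return embedding_dim(D2[np.ix_(S, S)]) == len(S) - 1

def partition_into_bases(D2, Y):
    """Greedily split the index list Y into inclusion-maximal independent
    sets Y_1, Y_2, ..., each taken from the points not yet used."""
    D2 = np.asarray(D2, dtype=float)
    Z = list(Y)
    parts = []
    while Z:
        Yi = [Z[0]]                        # step 1: start from an arbitrary point
        while True:                        # step 2: extend while possible
            ext = next((y for y in Z
                        if y not in Yi and is_independent(D2, Yi + [y])), None)
            if ext is None:
                break
            Yi.append(ext)
        parts.append(Yi)
        Z = [z for z in Z if z not in Yi]  # remove the basis from Z
    return parts
```

For the six planar points $(0,0),(1,0),(0,1),(2,0),(3,0),(4,0)$, taken in this order, the sketch produces parts of sizes $3$, $2$, and $1$: once the first basis is removed, the remaining points are collinear, so the next basis has only two elements.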

Figure 2: A pair $(Y_{i},Y_{j})$ of $x$ -compatible sets. Notice that if $j\geq i$ and $|Y_{j}|\leq|Y_{i}|=t+1$ then $(Y_{i}\cup Y_{j},\operatorname{\rho})$ admits a strong embedding into $\mathbb{R}^{t}$ by Proposition 6 because $Y_{i}$ is a metric basis for $(Y_{i}\cup Y_{j},\operatorname{\rho})$ . Furthermore, given a realization of $(Y_{i},\operatorname{\rho})$ such that the points are mapped into an affine subspace $L$ of dimension $t$ , the points of $Y_{j}$ are mapped into $L$ and the mapping is unique. If $(Y_{i}\cup\{x\},\operatorname{\rho})$ is strongly $t$ -embeddable then $x$ is embedded in $L$ and the embedding is unique. If $(Y_{i}\cup\{x\},\operatorname{\rho})$ is strongly $(t+1)$ -embeddable, that is, $Y_{i}\cup\{x\}$ is independent then by Proposition 1, the distance space is embedded into a subspace $L^{\prime}$ of dimension $t+1$ containing $L$ and the embedding of $x$ is unique up to the reflection with respect to $L$ .

Let $x\in A$ . We say that $Y_{i}$ for $i\in\{1,\ldots,\ell\}$ is $x$ -compatible if $(Y_{i}\cup\{x\},\operatorname{\rho})$ is embeddable in $\mathbb{R}^{d}$ . Otherwise, $Y_{i}$ is $x$ -incompatible. Similarly, for two distinct sets $Y_{i},Y_{j}$ for $i,j\in\{1,\dots,\ell\}$ , the pair $(Y_{i},Y_{j})$ is $x$ -compatible if $(Y_{i}\cup Y_{j}\cup\{x\},\operatorname{\rho})$ is embeddable in $\mathbb{R}^{d}$ , and $(Y_{i},Y_{j})$ is $x$ -incompatible otherwise (see Figure 2).

Notice that if $Y_{i}$ is $x$ -incompatible then for each solution to the instance either $x$ is an outlier, or $Y_{i}$ contains an outlier, or at least one distance between the points $Y_{i}\cup\{x\}$ should be modified.

For each $i\in\{1,\ldots,\ell\}$ , $1\leq|Y_{i}|\leq d+1$ . We partition the family $\{Y_{1},\ldots,Y_{\ell}\}$ of sets into parts $C_{1},\ldots,C_{d+1}$ according to their size (some parts may be empty). Formally, for $h\in\{1,\ldots,d+1\}$ , $C_{h}=\{Y_{i}\mid 1\leq i\leq\ell\text{ and }|Y_{i}|=h\}$ .

Consider the part $C_{h}$ for some $h\in\{1,\ldots,d+1\}$ . For a point $x\in A$ , we say that two $x$ -compatible sets $Y_{i},Y_{j}\in C_{h}$ are $x$ -equivalent if the pair $(Y_{i},Y_{j})$ is $x$ -compatible, that is, $(Y_{i}\cup Y_{j}\cup\{x\},\operatorname{\rho})$ is embeddable into $\mathbb{R}^{d}$ . We show the following claim.

Claim 11 ( $\star$ ).

$x$ -equivalency is an equivalence relation on the family of $x$ -compatible sets in $C_{h}$ . Furthermore, if $Y_{i},Y_{j}\in C_{h}$ are $x$ -equivalent and the pair $(Y_{j},Y_{s})$ for $Y_{s}\in C_{h^{\prime}}$ with $h^{\prime}\leq h$ is $x$ -compatible then $(Y_{i},Y_{s})$ is $x$ -compatible.

The definition of $x$ -equivalency implies the following property.

Claim 12.

Let $R_{1}$ and $R_{2}$ be two distinct classes of $x$ -equivalent $x$ -compatible sets. Then for any solution $(O,D)$ to the considered instance, either (i) $x\in O$ , or (ii) for each set $Y_{i}$ in $R_{1}$ there is a $y\in Y_{i}$ such that $y\in O$ or there exists a $z$ such that $\{y,z\}\in D$ , or (iii) for each set $Y_{j}$ in $R_{2}$ there is a $y\in Y_{j}$ such that $y\in O$ or there exists a $z$ such that $\{y,z\}\in D$ .

We use this property in the following rule. Because $|Y|>2(k_{O}+2k_{M})(d+1)^{2}$ and each $Y_{i}$ is of size at most $d+1$ , there is $h\in\{1,\ldots,d+1\}$ with $C_{h}$ being of size at least $2(k_{O}+2k_{M})+1$ . Let $h^{*}$ be the maximum value of $h\in\{1,\ldots,d+1\}$ with this property.
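The grouping by size and the choice of $h^{*}$ can be illustrated with a short Python sketch. The function names `parts_by_size` and `choose_h_star` and the toy input are assumptions made for this illustration only; each part is represented by its index in the list `parts`.

```python
from typing import Dict, List, Optional, Sequence


def parts_by_size(parts: Sequence[Sequence], d: int) -> Dict[int, List[int]]:
    """C_h = indices i of the sets Y_i with |Y_i| = h, for h = 1, ..., d+1."""
    classes: Dict[int, List[int]] = {h: [] for h in range(1, d + 2)}
    for i, part in enumerate(parts):
        classes[len(part)].append(i)
    return classes


def choose_h_star(classes: Dict[int, List[int]], k_o: int, k_m: int) -> Optional[int]:
    """Largest h such that C_h has at least 2(k_O + 2 k_M) + 1 sets."""
    threshold = 2 * (k_o + 2 * k_m) + 1
    big = [h for h, members in classes.items() if len(members) >= threshold]
    return max(big) if big else None


# Toy instance with d = 1, k_O = 1, k_M = 0: |Y| = 9 > 2 * 1 * (1 + 1)^2 = 8.
parts = [[0, 1], [2, 3], [4, 5], [6, 7], [8]]
classes = parts_by_size(parts, d=1)
h_star = choose_h_star(classes, k_o=1, k_m=0)
print(classes, h_star)
```

In the toy instance the part $C_{2}$ holds four sets, which meets the threshold of three, while $C_{1}$ holds a single set.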

Reduction Rule 3.

If there is $x\in A$ such that each $x$ -equivalence class $R$ of $x$ -compatible sets of $C_{h^{*}}$ has size at most $|C_{h^{*}}|-(k_{O}+2k_{M})-1$ , then set $X:=X\setminus\{x\}$ , $A:=A\setminus\{x\}$ , and $k_{O}:=k_{O}-1$ .

After the exhaustive application of Reduction Rule 3, we have that for each $x\in A$ , there is an $x$ -equivalence class $R_{x}$ of $x$ -compatible sets in $C_{h^{*}}$ that contains at least $|C_{h^{*}}|-k_{O}-2k_{M}\geq(k_{O}+2k_{M})+1$ sets and all other classes in $C_{h^{*}}$ contain at most $k_{O}+2k_{M}$ elements combined. We say that $R_{x}$ is large. We exhaustively apply the following rule using the fact that $R_{x}$ contains at least $(k_{O}+2k_{M})+1$ sets.

Reduction Rule 4.

If there is $x\in A$ such that there are at least $k_{O}+k_{M}+1$ sets $Y_{j}\in C_{h}$ for any $h\leq h^{*}$ such that $(Y_{i},Y_{j})$ is $x$ -incompatible for some $Y_{i}\in R_{x}$ , then set $X:=X\setminus\{x\}$ , $A:=A\setminus\{x\}$ , and $k_{O}:=k_{O}-1$ .

In the next crucial step of our algorithm, we identify important sets $Y_{i}$ for $i\in\{1,\ldots,\ell\}$ and delete all other sets that are irrelevant.

We apply the following marking procedure that labels important sets $Y_{i}$ for $i\in\{1,\ldots,\ell\}$ :

$\blacksquare$

mark every $Y_{i}$ in $C_{h}$ for $h>h^{*}$ ,
$\blacksquare$

for each $x\in A$ , mark arbitrary $(k_{O}+2k_{M})+1$ sets of $R_{x}$ ,
$\blacksquare$

for each $x\in A$ and each $h\in\{1,\ldots,h^{*}\}$ , mark each set $Y_{j}\in C_{h}$ such that there is a set $Y_{i}\in R_{x}$ such that the pair $(Y_{i},Y_{j})$ is $x$ -incompatible.

We set $I=\{i\mid 1\leq i\leq\ell\text{ and }Y_{i}\text{ is marked}\}$ and $Y^{\prime}=\bigcup_{i\in I}Y_{i}$ , and define $X^{\prime}=A\cup Y^{\prime}$ .

Claim 13 ( $\star$ ).

The instance $({{\cal X}}=(X,\operatorname{\rho}),w_{O},w_{M},W,k_{O},k_{M},d)$ of Euclidean Embedding Editing is equivalent to the instance $({{\cal X}}^{\prime}=(X^{\prime},\operatorname{\rho}),w_{O},w_{M},W,k_{O},k_{M},d)$ .

We have that the original instance $({{\cal X}}=(X,\operatorname{\rho}),w_{O},w_{M},W,k_{O},k_{M},d)$ and the obtained instance $({{\cal X}}^{\prime}=(X^{\prime},\operatorname{\rho}),w_{O},w_{M},W,k_{O},k_{M},d)$ of Euclidean Embedding Editing are equivalent. We prove that $|X^{\prime}|=\mathcal{O}(k^{2}d^{2})$ .

Claim 14 ( $\star$ ).

$|X^{\prime}|\leq 9(k_{O}+k_{M})^{2}(d+3)^{2}$ .

Claim 15 ( $\star$ ).

The overall running time of our algorithm is polynomial.

To compress the weights, we use the standard technique (see [16]) based on the algorithm of Frank and Tardos [19]. $\hfill\blacktriangleleft$

4 FPT algorithm for Euclidean Embedding Editing parameterized by solution size and dimension

In this section, we give our FPT algorithm for Euclidean Embedding Editing.

We prepare for it by recalling the relevant definitions and facts from [4]. In what follows, let $R$ be a real closed field and $\ell\in{\mathbb{N}}$ . Let ${{\cal P}}_{\ell}\subset R[X_{1},\dots,X_{t}]$ be a finite set of $s$ polynomials each of degree at most $\ell$ .

Definition 16 ([4]).

A ${{\cal P}}_{\ell}$ -atom is one of $P=0$ , $P\neq 0$ , $P\geq 0$ , $P\leq 0$ , where $P$ is a polynomial in ${{\cal P}}_{\ell}$ and a quantifier-free ${{\cal P}}_{\ell}$ -formula is a formula constructed only from ${{\cal P}}_{\ell}$ -atoms together with the logical connectives $\wedge$ , $\vee$ and $\neg$ .

Proposition 17 (Theorem 13.13, [4]).

Let $(\exists X_{1})\dots(\exists X_{t})F(X_{1},\dots,X_{t})$ be a sentence, where $F(X_{1},\dots,X_{t})$ is a quantifier-free ${{\cal P}}_{\ell}$ -formula. There exists an algorithm to decide the truth of the sentence with complexity $s^{t+1}\cdot\ell^{\mathcal{O}(t)}$ in $D$ , where $D$ is the ring generated by the coefficients of the polynomials in ${{\cal P}}_{\ell}$ . (The measure of complexity here is the number of field operations.)

Definition 18.

Consider points $x_{0},x_{1},\dots,x_{r}$ of distance space $(X,\operatorname{\rho})$ and a set $Z$ of pairs in $X$ . For every $i<j\in\{0,\dots,r\}$ , if $\{x_{i},x_{j}\}\notin Z$ , then $\hat{\operatorname{\rho}}_{i,j}=\operatorname{\rho}(x_{i},x_{j})$ , otherwise $\hat{\operatorname{\rho}}_{i,j}=z_{i,j}$ where $z_{i,j}$ is an indeterminate. Then the $Z$ -Augmented Cayley-Menger determinant is obtained from the Cayley-Menger determinant by replacing each $\operatorname{\rho}_{i,j}$ with $\hat{\operatorname{\rho}}_{i,j}$ .

Observation 19.

Consider $r+1$ points $x_{0},x_{1},\dots,x_{r}$ of distance space $(X,\operatorname{\rho})$ and a set $Z$ of pairs in $X$ . The $Z$ -Augmented Cayley-Menger determinant is a multi-variate polynomial with real coefficients, over the set $\{z_{i,j}\mid i<j,\leavevmode\nobreak\ \{x_{i},x_{j}\}\in Z\}$ of indeterminates and where each monomial has degree at most $2\cdot(r+1)$ .

Theorem 20.

Euclidean Embedding Editing is solvable in $(d(k_{O}+k_{M}))^{\mathcal{O}(d+k_{M}+k_{O})}+n^{\mathcal{O}(1)}$ time.

Proof.

Our FPT algorithm for Euclidean Embedding Editing works as follows. We begin by running the compression algorithm (Theorem 8). Following this, based on insights from Proposition 4, we reduce the task of solving the resulting instance to the task of testing a bounded number (in terms of $k, d$ ) of sentences with purely existential quantifications as described in Proposition 17, where $s,\ell,t$ are all bounded by functions of $k$ and $d$ .

We now formalize this idea. Let $({{\cal X}}=(X,\operatorname{\rho}),w_{O},w_{M},W,k_{O},k_{M},d)$ be the given instance of Euclidean Embedding Editing. We run the algorithm of Theorem 8 on this instance. Recall that this algorithm either solves the instance or produces an equivalent instance $({{\cal X}}^{\prime}=(X^{\prime},\operatorname{\rho}),w_{O}^{\prime},w_{M}^{\prime},W^{\prime},k_{O}^{\prime},k_{M}^{\prime},d)$ such that $X^{\prime}\subseteq X$ , and for $k=k_{O}+k_{M}$ and a bound $\tau(k,d)=2^{\mathcal{O}((kd)^{12})}$ , we have that $|X^{\prime}|=\mathcal{O}((kd)^{2})$ . Note that since $X^{\prime}$ has size bounded by $\mathcal{O}((kd)^{2})$ , we may assume that $k_{O}^{\prime}$ is bounded by ${\mathcal{O}}((kd)^{2})$ and $k_{M}^{\prime}$ is bounded by ${\mathcal{O}}((kd)^{4})$ .

Since we now have a bounded number of points in $X^{\prime}$ , it is straightforward to guess a set $Z_{O}\subseteq X^{\prime}$ of at most $k_{O}^{\prime}$ outliers and a subset $Z_{M}$ of pairs from $X^{\prime\prime}=X^{\prime}\setminus Z_{O}$ of size at most $k_{M}^{\prime}$ such that (i) $w_{O}^{\prime}(Z_{O})+w_{M}^{\prime}(Z_{M})\leq W^{\prime}$ and (ii) if there is a solution to the given instance, then there is a modification of the pairs in $Z_{M}$ such that $(X^{\prime\prime},\rho^{\prime\prime})$ is embeddable into $\mathbb{R}^{d}$ , where $\rho^{\prime\prime}$ differs from $\rho$ (restricted to $X^{\prime\prime}$ ) only among the pairs in $Z_{M}$ . We next guess the embedding dimension of $(X^{\prime\prime},\rho^{\prime\prime})$ , denoted by $r$ . Note that $r\leq d$ .

By Proposition 4, we know that $(X^{\prime\prime},\rho^{\prime\prime})$ is $r$ -embeddable if and only if there is a subset $\hat{X}=\{x_{0},\dots,x_{r}\}$ of $X^{\prime\prime}$ such that (i) $(-1)^{j+1}CM(x_{0},x_{1},\dots,x_{j})>0$ for $1\leq j\leq r$ , and (ii) for any $x,y\in X^{\prime\prime}\setminus\hat{X}$ , $CM(x_{0},x_{1},\dots,x_{r},x)=CM(x_{0},x_{1},\dots,x_{r},y)=CM(x_{0},x_{1},\dots,x_{r},x,y)=0$ , where the matrices are derived from the distance matrix $D(\rho^{\prime\prime})$ .
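For a fully specified distance matrix (that is, with no indeterminates), conditions (i) and (ii) can be checked directly. The Python sketch below is an illustration under assumptions: it uses the standard bordered Cayley-Menger matrix with squared distances, exact rational arithmetic, and hypothetical helper names `det`, `cm` and `satisfies_prop4`. The demo points are taken from the plane and then one distance is corrupted.

```python
from fractions import Fraction
from itertools import combinations


def det(matrix):
    """Exact determinant by Gaussian elimination over the rationals."""
    a = [[Fraction(v) for v in row] for row in matrix]
    n, sign, result = len(a), 1, Fraction(1)
    for c in range(n):
        pivot = next((r for r in range(c, n) if a[r][c] != 0), None)
        if pivot is None:
            return Fraction(0)
        if pivot != c:
            a[c], a[pivot] = a[pivot], a[c]
            sign = -sign
        result *= a[c][c]
        for r in range(c + 1, n):
            factor = a[r][c] / a[c][c]
            for k in range(c, n):
                a[r][k] -= factor * a[c][k]
    return sign * result


def cm(sq, pts):
    """Cayley-Menger determinant of `pts`; sq[p][q] is a squared distance."""
    m = [[0] + [1] * len(pts)]
    for p in pts:
        m.append([1] + [sq[p][q] for q in pts])
    return det(m)


def satisfies_prop4(sq, basis, r):
    """Check conditions (i) and (ii) for a guessed basis x_0, ..., x_r."""
    assert len(basis) == r + 1
    for j in range(1, r + 1):
        if (-1) ** (j + 1) * cm(sq, basis[: j + 1]) <= 0:
            return False
    rest = [p for p in range(len(sq)) if p not in basis]
    if any(cm(sq, basis + [x]) != 0 for x in rest):
        return False
    return all(cm(sq, basis + [x, y]) == 0 for x, y in combinations(rest, 2))


coords = [(0, 0), (1, 0), (0, 1), (1, 1), (2, 1)]
sq = [[(a - c) ** 2 + (b - e) ** 2 for (c, e) in coords] for (a, b) in coords]
ok = satisfies_prop4(sq, [0, 1, 2], 2)
sq[3][4] = sq[4][3] = 4            # corrupt a single squared distance
bad = satisfies_prop4(sq, [0, 1, 2], 2)
print(ok, bad)
```

When some distances are indeterminates, the same determinants become polynomials and the check turns into the existential sentence handled by Proposition 17; the sketch only covers the case without indeterminates.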

We next guess $\hat{X}$ . It remains to verify that there is a modification of the distances between each pair in $Z_{M}$ such that the resulting space $(X^{\prime\prime},\rho^{\prime\prime})$ is $r$ -embeddable, i.e., satisfies the above two properties. To test this, we construct the formula $F$ obtained by taking the conjunction of the atoms below and invoke Proposition 17 on the sentence $\phi$ obtained by prepending $F$ with existential quantifications for every indeterminate $\{z_{i,j}\mid i<j,\leavevmode\nobreak\ \{x_{i},x_{j}\}\in Z_{M}\}$ :

1.

$(-1)^{j+1}CM_{Z_{M}}(x_{0},x_{1},\dots,x_{j})>0$ , where $1\leq j\leq r$ .
2.

$CM_{Z_{M}}(x_{0},x_{1},\dots,x_{r},x)=0$ for every $x\in X^{\prime\prime}\setminus\hat{X}$ .
3.

$CM_{Z_{M}}(x_{0},x_{1},\dots,x_{r},x,y)=0$ for every $x,y\in X^{\prime\prime}\setminus\hat{X}$ .
4.

$z_{i,j}\geq 0$ for each indeterminate defined by $Z_{M}$ .

$\hfill\blacktriangleleft$

To conclude this section, we remark that the parameter dependence can be improved for Euclidean Embedding with Outliers.

Theorem 21 ( $\star$ ).

Euclidean Embedding with Outliers can be solved in time $n^{\mathcal{O}(1)}\cdot\min\{(d+3)^{k},2^{d+k}\}$ .

5 FPT-Approximation for Unweighted Euclidean Embedding with Outliers

In this section, we give an FPT-time 2-approximation for Unweighted Euclidean Embedding with Outliers (UEEO) parameterized by the dimension $d$ . Recall that in the optimization version of this problem, the input is $({{\cal X}}=(X,\operatorname{\rho}),d)$ where $(X,\operatorname{\rho})$ is a distance space and the goal is to output a $d$ -outlier set of minimum size.

Our main idea is to give a randomized algorithm generating sequences $U_{0},\dots,U_{d+1}$ and $A_{1},\dots,A_{d+2}$ , where each $A_{i}$ is a $d$ -outlier set that is associated with the independent set $U_{i-1}$ of size $i-1$ . The construction of these sequences is roughly as follows, starting from $U_{0}=\emptyset$ and proceeding iteratively. For each $U_{i-1}$ :

1.

Any element that cannot be $d$ -embedded along with $U_{i-1}$ gets added to $A_{i}$ .
2.

We compute those elements that can be added to $U_{i-1}$ without increasing its embedding dimension, and associate a graph with these such that every $d$ -outlier set disjoint from $U_{i-1}$ includes a vertex cover of this graph. We then 2-approximate the minimum vertex cover of this graph and add it to $A_{i}$ .
3.
For the remaining elements (call this set $W$ ):
(a)

  Add all of $W$ to $A_{i}$ (which makes $A_{i}$ a 2-approximation if at least half of $W$ is in the solution),
(b)

  Uniformly at random add an element of $W$ to $U_{i-1}$ , resulting in $U_{i}$ (which gives an independent set with sufficiently high probability if at least half of $W$ is not in some fixed minimum $d$ -outlier set).

Theorem 22.

Unweighted Euclidean Embedding with Outliers can be $2$ -approximated in $2^{d}\cdot n^{\mathcal{O}(1)}$ time by a randomized algorithm with one-sided constant probability of error.

Proof.

Let $S_{\sf opt}$ be a hypothetical minimum-size $d$ -outlier set. Let $X^{\star}=X\setminus S_{\sf opt}$ . Let us assume that $(X^{\star},\operatorname{\rho})$ is strongly $d$ -embeddable. Otherwise, it is strongly $d^{\prime}$ -embeddable for some $d^{\prime}\leq d$ , in which case we guess $d^{\prime}$ and set $d:=d^{\prime}$ . In what follows, we give a randomized algorithm that aims to obtain a metric basis of $(X^{\star},\operatorname{\rho})$ if certain pre-conditions are satisfied.

Alg-3 $(X,\rho,d)$ :

1.

Set $U_{0}:=\emptyset$ .
2.
For $i=1$ to $d+2$ do as follows.
(a)

  Construct ${\cal C}_{\sf comp}^{i}$ , the set of elements $y\in X\setminus U_{i-1}$ such that $(U_{i-1}\cup\{y\},\operatorname{\rho})$ is $(i-2)$ -embeddable.
(b)

  Construct ${\cal C}_{\sf def}^{i}$ , the set of elements $y\in X\setminus U_{i-1}$ such that $(U_{i-1}\cup\{y\},\operatorname{\rho})$ is not $d$ -embeddable.
(c)

  Construct ${\cal C}_{\sf incomp}^{i}$ , the set of elements $y\in X\setminus(U_{i-1}\cup{\cal C}_{\sf comp}^{i}\cup{\cal C}_{\sf def}^{i})$ such that $(U_{i-1}\cup\{y\},\operatorname{\rho})$ is not $(i-2)$ -embeddable.
(d)

  Select an element $u_{i}$ uniformly at random from ${\cal C}_{\sf incomp}^{i}$ .
(e)

  Set $U_{i}:=U_{i-1}\cup\{u_{i}\}$ .
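A single run of Alg-3 can be sketched in Python as below. Several choices here are assumptions of the sketch rather than part of the paper: distances are given as a matrix `sq` of squared distances, embeddability is tested with the Gram-matrix (Schoenberg) criterion in exact arithmetic via the hypothetical helper `embedding_dimension`, and the loop stops as soon as ${\cal C}_{\sf incomp}^{i}$ is empty, since then no element can be sampled.

```python
import random
from fractions import Fraction


def embedding_dimension(sq, pts):
    """Smallest t such that `pts` embeds isometrically in R^t, or None.

    With base point pts[0], the Gram matrix
    G[a][b] = (sq[0][a] + sq[0][b] - sq[a][b]) / 2 must be positive
    semidefinite; its rank is the embedding dimension.
    """
    if len(pts) <= 1:
        return 0
    o, others = pts[0], pts[1:]
    g = [[Fraction(sq[o][a] + sq[o][b] - sq[a][b]) / 2 for b in others]
         for a in others]
    rank, n = 0, len(others)
    for k in range(n):
        if g[k][k] < 0:
            return None
        if g[k][k] == 0:
            # A zero diagonal entry of a PSD matrix forces a zero row.
            if any(g[k][j] != 0 for j in range(k + 1, n)):
                return None
            continue
        rank += 1
        for i in range(k + 1, n):          # Schur complement update
            f = g[i][k] / g[k][k]
            for j in range(k + 1, n):
                g[i][j] -= f * g[k][j]
    return rank


def alg3(sq, d, rng):
    """One sampling run; returns U and the classes built in each round."""
    u, rounds = [], []
    for i in range(1, d + 3):
        comp, defective, incomp = [], [], []
        for y in range(len(sq)):
            if y in u:
                continue
            dim = embedding_dimension(sq, u + [y])
            if dim is None or dim > d:
                defective.append(y)        # not d-embeddable together with U
            elif dim <= i - 2:
                comp.append(y)             # stays (i-2)-embeddable
            else:
                incomp.append(y)           # raises the dimension, still <= d
        rounds.append((comp, defective, incomp))
        if not incomp:                     # assumption: nothing left to sample
            break
        u.append(rng.choice(incomp))
    return u, rounds


coords = [(0, 0), (1, 0), (2, 0), (0, 1), (3, 4)]
sq = [[(a - c) ** 2 + (b - e) ** 2 for (c, e) in coords] for (a, b) in coords]
u, rounds = alg3(sq, d=2, rng=random.Random(0))
print(u, len(rounds))
```

On the planar toy input every run ends with three sampled points that are not collinear, whichever elements the random generator picks.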

Claim 23 ( $\star$ ).

For every $i\in\{0,\dots,d+1\}$ , the following holds:

1.

${\cal C}_{\sf def}^{i}$ and ${\cal C}_{\sf comp}^{i}$ are disjoint.
2.

Every $d$ -outlier set of $(X,\rho)$ disjoint from $U_{i}$ contains ${\cal C}_{\sf def}^{i+1}$ .
3.

If for all $j\leq i$ , ${\cal C}_{\sf incomp}^{j}\neq\emptyset$ , then $U_{i}$ is an independent set of size $i$ .

We next argue that for each $i\in[d+1]$ , the independent set $U_{i}$ is preserved in $X^{\star}=X\setminus S_{\sf opt}$ , with sufficiently high probability if certain conditions are met.

Claim 24 ( $\star$ ).

Let $i\in[d+1]$ such that for all $j\leq i$ , $|{\cal C}_{\sf incomp}^{j}\cap S_{\sf opt}|<\frac{|{\cal C}_{\sf incomp}^{j}|}{2}$ . Then, $\Pr[U_{i}\subseteq X^{\star}]\geq\frac{1}{2^{i}}$ and $U_{i}$ is an independent set of size $i$ .

Next, we build a series of $d$ -outlier sets $A_{1},\dots,A_{d+2}$ as follows. For each $i\in[d+2]$ , create an instance of the Vertex Cover problem on the graph $G_{\sf comp}^{i}$ , where $V(G_{\sf comp}^{i})={\cal C}_{\sf comp}^{i}$ , and there is an edge between two elements $x,y\in{\cal C}_{\sf comp}^{i}$ if $U_{i-1}\cup\{x,y\}$ is not $(i-2)$ -embeddable. Using a factor- $2$ approximation algorithm for Vertex Cover, obtain a set $Q_{i}$ such that $|Q_{i}|\leq 2|O_{i}|$ , where $O_{i}$ is a minimum vertex cover of $G_{\sf comp}^{i}$ . Finally, set $A_{i}:=Q_{i}\cup{\cal C}^{i}_{\sf incomp}\cup{\cal C}^{i}_{\sf def}$ . Once every $A_{i}$ is computed, we compute $i^{\star}=\arg\min_{i\in[d+2]}|A_{i}|$ and return $A_{i^{\star}}$ .
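The construction of a single candidate $A_{i}$ can be sketched as follows. The text does not fix which factor-2 approximation for Vertex Cover is used; the sketch assumes the classical one that takes both endpoints of a greedily built maximal matching. The function names and the oracle `conflict` (which should answer whether $U_{i-1}\cup\{x,y\}$ fails to be $(i-2)$ -embeddable) are hypothetical.

```python
from typing import Callable, Hashable, Iterable, List, Set, Tuple


def two_approx_vertex_cover(edges: Iterable[Tuple[Hashable, Hashable]]) -> Set[Hashable]:
    """Both endpoints of a greedily built maximal matching."""
    cover: Set[Hashable] = set()
    for x, y in edges:
        if x not in cover and y not in cover:
            cover.update((x, y))
    return cover


def build_outlier_candidate(
    comp: List[Hashable],
    defective: List[Hashable],
    incomp: List[Hashable],
    conflict: Callable[[Hashable, Hashable], bool],
) -> Set[Hashable]:
    """A_i = Q_i together with C_incomp and C_def."""
    edges = [(x, y) for a, x in enumerate(comp) for y in comp[a + 1:]
             if conflict(x, y)]
    return two_approx_vertex_cover(edges) | set(incomp) | set(defective)


# Toy conflict graph: a star centred at "a".
star = {frozenset(("a", "b")), frozenset(("a", "c")), frozenset(("a", "d"))}
candidate = build_outlier_candidate(
    comp=["a", "b", "c", "d"], defective=["f"], incomp=["e"],
    conflict=lambda x, y: frozenset((x, y)) in star,
)
print(sorted(candidate))
```

On the star the optimal cover is the centre alone, while the matching-based cover has two vertices, which matches the factor-2 guarantee.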

Claim 25 ( $\star$ ).

Let $\ell\in[d+1]$ be the least integer such that for all $j<\ell$ , it holds that

$$|{\cal C}_{\sf incomp}^{j}\cap S_{\sf opt}|<\frac{|{\cal C}_{\sf incomp}^{j}|}{2}.$$

Then, $A_{\ell}$ is a factor-2 approximation with probability at least $\frac{1}{2^{\ell}}$ .

Since we compute $i^{\star}=\arg\min_{i\in[d+2]}|A_{i}|$ and return $A_{i^{\star}}$ , by Claim 25 we are guaranteed that $|A_{i^{\star}}|\leq 2|S_{\sf opt}|$ with probability at least $\frac{1}{2^{d+1}}$ .

Running Time Analysis.

We have already argued that a single run succeeds with probability at least $\frac{1}{2^{d+1}}$ . We boost the success probability by independently running our polynomial-time algorithm $2^{d+1}$ times and returning the minimum size solution among these runs. The probability that the algorithm fails in all of the independent runs is upper bounded by $(1-\frac{1}{2^{d+1}})^{2^{d+1}}\leq\frac{1}{e}$ , so the algorithm succeeds with probability at least $1-\frac{1}{e}$ . Moreover, the total running time is upper bounded by $2^{d}\cdot n^{\mathcal{O}(1)}$ . $\hfill\blacktriangleleft$
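The effect of the repetition can be checked numerically. The short Python sketch below (the function name `failure_bound` is chosen for this illustration) evaluates the probability that all $2^{d+1}$ independent runs fail, assuming each run succeeds with probability exactly $\frac{1}{2^{d+1}}$ , and compares it with $\frac{1}{e}$ .

```python
import math


def failure_bound(d: int) -> float:
    """Probability that 2^(d+1) independent runs all fail when each run
    succeeds with probability 1 / 2^(d+1)."""
    runs = 2 ** (d + 1)
    return (1 - 1 / runs) ** runs


for d in range(11):
    print(d, 2 ** (d + 1), round(failure_bound(d), 4))
```

The values increase with $d$ but stay below $\frac{1}{e}$ , so a constant success probability is retained for every dimension.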

6 Lower bounds

In this section, we complement our algorithmic results by proving computational lower bounds. These lower bounds also motivate our choice of parameterizations. First, we show that Euclidean Embedding Editing is $\operatorname{Para-NP}$ -hard when parameterized by $d$ . More precisely, we show that the unweighted versions of both Euclidean Embedding with Outliers and Euclidean Metric Violation Distance are $\operatorname{NP}$ -hard even for $d=1$ .

Theorem 26 ( $\star$ ).

Euclidean Embedding with Outliers is strongly $\operatorname{NP}$ -hard even for instances with unit weights, integer distances, and $d=1$ .

Theorem 27 ( $\star$ ).

Euclidean Metric Violation Distance is strongly $\operatorname{NP}$ -hard even for instances with unit weights, integer distances, and $d=1$ .

Finally, we prove that Euclidean Embedding Editing is $\operatorname{W}[1]$ -hard when parameterized by $k_{O}+k_{M}$ only. Moreover, the hardness holds even for the unweighted variant of Euclidean Embedding with Outliers.

Theorem 28 ( $\star$ ).

Euclidean Embedding with Outliers parameterized by $k$ is $\operatorname{W}[1]$ -hard for $n$ -point instances with unit weights and integer distance matrices with entries in $\mathcal{O}(n)$ .

References

[1] Jorge Alencar, Tibérius O. Bonates, Carlile Lavor, and Leo Liberti. An algorithm for realizing euclidean distance matrices. Electron. Notes Discret. Math., 50:397–402, 2015. doi:10.1016/J.ENDM.2015.07.066.
[2] Sanjeev Arora, James Lee, and Assaf Naor. Euclidean distortion and the sparsest cut. Journal of the American Mathematical Society, 21(1):1–21, 2008.
[3] Sanjeev Arora, Satish Rao, and Umesh Vazirani. Expander flows, geometric embeddings and graph partitioning. J. ACM, 56(2):5, 2009.
[4] Saugata Basu, Richard Pollack, and Marie-Françoise Roy. Algorithms in Real Algebraic Geometry (Algorithms and Computation in Mathematics). Springer-Verlag, Berlin, Heidelberg, 2006.
[5] Leonard Mascot Blumenthal. Theory and applications of distance geometry. Clarendon Press, Oxford, 1953.
[6] Ingwer Borg and Patrick J.F. Groenen. Modern Multidimensional Scaling: Theory and Applications. Springer, New York, 2005.
[7] Arthur Cayley. On the theory of determinants. Philosophical Magazine, 19:1–16, 1841.
[8] Moses Charikar and Ruiquan Gao. Improved approximations for ultrametric violation distance. In Proceedings of the 2024 Annual ACM-SIAM Symposium on Discrete Algorithms (SODA), pages 1704–1737. SIAM, 2024. doi:10.1137/1.9781611977912.68.
[9] Vincent Cohen-Addad, Chenglin Fan, Euiwoong Lee, and Arnaud De Mesmay. Fitting metrics and ultrametrics with minimum disagreements. In Proceedings of the 63rd Annual Symposium on Foundations of Computer Science (FOCS), pages 301–311. IEEE, 2022. doi:10.1109/FOCS54457.2022.00035.
[10] Marek Cygan, Fedor V. Fomin, Lukasz Kowalik, Daniel Lokshtanov, Dániel Marx, Marcin Pilipczuk, Michał Pilipczuk, and Saket Saurabh. Parameterized Algorithms. Springer, 2015. doi:10.1007/978-3-319-21275-3.
[11] J. Dattorro. Convex Optimization & Euclidean Distance Geometry. Meboo Publishing USA, 2008.
[12] E. D. Demaine, F. Gomez-Martin, H. Meijer, D. Rappaport, P. Taslakian, G. T. Toussaint, T. Winograd, and D. R. Wood. The distance geometry of music. Computational Geometry, 42(5):429–454, July 2009. doi:10.1016/J.COMGEO.2008.04.005.
[13] Michel Deza, Monique Laurent, and Robert Weismantel. Geometry of cuts and metrics, volume 2. Springer, 1997.
[14] L. Doherty, K. Pister, and L. El Ghaoui. Convex position estimation in wireless sensor networks. In Proceedings of the IEEE Conference on Computer Communications (INFOCOM), volume 3, pages 1655–1663. IEEE, 2001.
[15] Ivan Dokmanic, Reza Parhizkar, Juri Ranieri, and Martin Vetterli. Euclidean distance matrices: essential theory, algorithms, and applications. IEEE Signal Processing Magazine, 32(6):12–30, 2015. doi:10.1109/MSP.2015.2398954.
[16] Michael Etscheid, Stefan Kratsch, Matthias Mnich, and Heiko Röglin. Polynomial kernels for weighted problems. Journal of Computer and System Sciences, 84:1–10, 2017. doi:10.1016/J.JCSS.2016.06.004.
[17] B. Everitt and S. Rabe-Hesketh. The Analysis of Proximity Data. Arnold, London, 1997.
[18] Chenglin Fan, Benjamin Raichel, and Gregory Van Buskirk. Metric violation distance: Hardness and approximation. In Proceedings of the Twenty-Ninth Annual ACM-SIAM Symposium on Discrete Algorithms (SODA), pages 196–209. SIAM, 2018. doi:10.1137/1.9781611975031.14.
[19] András Frank and Éva Tardos. An application of simultaneous diophantine approximation in combinatorial optimization. Combinatorica, 7(1):49–65, 1987. doi:10.1007/BF02579200.
[20] Anna C. Gilbert and Lalit Jain. If it ain’t broke, don’t fix it: Sparse metric repair. In 2017 55th Annual Allerton Conference on Communication, Control, and Computing (Allerton), pages 612–619. IEEE, 2017. doi:10.1109/ALLERTON.2017.8262793.
[21] T. F. Havel and K. Wüthrich. An evaluation of the combined use of nuclear magnetic resonance and distance geometry for the determination of protein conformations in solution. Journal of Molecular Biology, 182(2):281–294, 1985.
[22] Piotr Indyk. Algorithmic applications of low-distortion geometric embeddings. In Proceedings of the 42nd IEEE Symposium on Foundations of Computer Science (FOCS), pages 10–33. IEEE, 2001. doi:10.1109/SFCS.2001.959878.
[23] Piotr Indyk and Jiri Matousek. Low-distortion embeddings of finite metric spaces. In Handbook of Discrete and Computational Geometry, pages 177–196. CRC Press, 2004.
[24] Viren Jain and Lawrence K. Saul. Exploratory analysis and visualization of speech and music by locally linear embedding. In Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), volume 3, pages 984–987. IEEE, 2004. doi:10.1109/ICASSP.2004.1326712.
[25] Leo Liberti and Carlile Lavor. Euclidean distance geometry, volume 3. Springer, 2017.
[26] Nathan Linial. Finite metric-spaces – combinatorics, geometry and algorithms. In Proceedings of the International Congress of Mathematicians, Vol. III, pages 573–586, Beijing, 2002. Higher Ed. Press.
[27] Nathan Linial, Eran London, and Yuri Rabinovich. The geometry of graphs and some of its algorithmic applications. Combinatorica, 15(2):215–245, 1995. doi:10.1007/BF01200757.
[28] Karl Menger. Untersuchungen über allgemeine metrik. Mathematische Annalen, 100:75–163, 1928.
[29] N. Patwari, J. N. Ash, S. Kyperountas, A. O. Hero, R. L. Moses, and N. S. Correal. Locating the nodes: Cooperative localization in wireless sensor networks. IEEE Signal Processing Magazine, 22(4):54–69, July 2005. doi:10.1109/MSP.2005.1458287.
[30] Isaac J. Schoenberg. Remarks to maurice frechet’s article “sur la definition axiomatique d’une classe d’espace distances vectoriellement applicable sur l’espace de hilbert”. Annals of Mathematics, 36:724–732, 1935.
[31] R. Shepard. The analysis of proximities: multidimensional scaling with an unknown distance function, part i. Psychometrika, 27:125–140, 1962.
[32] Anastasios Sidiropoulos, Dingkang Wang, and Yusu Wang. Metric embeddings with outliers. In Proceedings of the Twenty-Eighth Annual ACM-SIAM Symposium on Discrete Algorithms (SODA), pages 670–689. SIAM, 2017. doi:10.1137/1.9781611974782.43.
[33] Manfred J. Sippl and Harold A. Scheraga. Solution of the embedding problem and decomposition of symmetric matrices. Proc. Nat. Acad. Sci. U.S.A., 82(8):2197–2201, 1985. doi:10.1073/pnas.82.8.2197.
[34] W. Torgerson. Theory and Methods of Scaling. Wiley, New York, 1958.
[35] Warren S. Torgerson. Multidimensional scaling: I. theory and method. Psychometrika, 17(4):401–419, 1952.
[36] Kilian Q. Weinberger and Lawrence K. Saul. Unsupervised learning of image manifolds by semidefinite programming. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), volume 2, pages 988–995. IEEE, 2004. doi:10.1109/CVPR.2004.256.

[bib.bib1] [1] Jorge Alencar, Tibérius O. Bonates, Carlile Lavor, and Leo Liberti. An algorithm for realizing euclidean distance matrices. Electron. Notes Discret. Math., 50:397–402, 2015. doi:10.1016/J.ENDM.2015.07.066.

[bib.bib2] [2] Sanjeev Arora, James Lee, and Assaf Naor. Euclidean distortion and the sparsest cut. Journal of the American Mathematical Society, 21(1):1–21, 2008.

[bib.bib3] [3] Sanjeev Arora, Satish Rao, and Umesh Vazirani. Expander flows, geometric embeddings and graph partitioning. J. ACM, 56(2):5, 2009.

[bib.bib4] [4] Saugata Basu, Richard Pollack, and Marie-Françoise Roy. Algorithms in Real Algebraic Geometry (Algorithms and Computation in Mathematics). Springer-Verlag, Berlin, Heidelberg, 2006.

[bib.bib5] [5] Leonard Mascot Blumenthal. Theory and applications of distance geometry. Clarendon Press, Oxford, 1953.

[bib.bib6] [6] Ingwer Borg and Patrick J.F. Groenen. Modern Multidimensional Scaling: Theory and Applications. Springer, New York, 2005.

[bib.bib7] [7] Arthur Cayley. On the theory of determinants. Philosophical Magazine, 19:1–16, 1841.

[bib.bib8] [8] Moses Charikar and Ruiquan Gao. Improved approximations for ultrametric violation distance. In Proceedings of the 2024 Annual ACM-SIAM Symposium on Discrete Algorithms (SODA), pages 1704–1737. SIAM, 2024. doi:10.1137/1.9781611977912.68.

[bib.bib9] [9] Vincent Cohen-Addad, Chenglin Fan, Euiwoong Lee, and Arnaud De Mesmay. Fitting metrics and ultrametrics with minimum disagreements. In Proceedings of the 63rd Annual Symposium on Foundations of Computer Science (FOCS), pages 301–311. IEEE, 2022. doi:10.1109/FOCS54457.2022.00035.

[bib.bib10] [10] Marek Cygan, Fedor V. Fomin, Lukasz Kowalik, Daniel Lokshtanov, Dániel Marx, Marcin Pilipczuk, Michał Pilipczuk, and Saket Saurabh. Parameterized Algorithms. Springer, 2015. doi:10.1007/978-3-319-21275-3.

[bib.bib11] [11] J. Dattorro. Convex Optimization & Euclidean Distance Geometry. Meboo Publishing USA, 2008.

[bib.bib12] [12] E. D. Demaine, F. Gomez-Martin, H. Meijer, D. Rappaport, P. Taslakian, G. T. Toussaint, T. Winograd, and D. R. Wood. The distance geometry of music. Computational Geometry, 42(5):429–454, July 2009. doi:10.1016/J.COMGEO.2008.04.005.

[bib.bib13] [13] Michel Deza, Monique Laurent, and Robert Weismantel. Geometry of cuts and metrics, volume 2. Springer, 1997.

[bib.bib14] [14] L. Doherty, K. Pister, and L. El Ghaoui. Convex position estimation in wireless sensor networks. In Proceedings of the IEEE Conference on Computer Communications (INFOCOM), volume 3, pages 1655–1663. IEEE, 2001.

[bib.bib15] [15] Ivan Dokmanic, Reza Parhizkar, Juri Ranieri, and Martin Vetterli. Euclidean distance matrices: essential theory, algorithms, and applications. IEEE Signal Processing Magazine, 32(6):12–30, 2015. doi:10.1109/MSP.2015.2398954.

[bib.bib16] [16] Michael Etscheid, Stefan Kratsch, Matthias Mnich, and Heiko Röglin. Polynomial kernels for weighted problems. Journal of Computer System Sciences, 84:1–10, 2017. doi:10.1016/J.JCSS.2016.06.004.

[bib.bib17] [17] B. Everitt and S. Rabe-Hesketh. The Analysis of Proximity Data. Arnold, London, 1997.

[bib.bib18] [18] Chenglin Fan, Benjamin Raichel, and Gregory Van Buskirk. Metric violation distance: Hardness and approximation. In Proceedings of the Twenty-Ninth Annual ACM-SIAM Symposium on Discrete Algorithms (SODA), pages 196–209. SIAM, 2018. doi:10.1137/1.9781611975031.14.

[bib.bib19] [19] András Frank and Éva Tardos. An application of simultaneous diophantine approximation in combinatorial optimization. Combinatorica, 7(1):49–65, 1987. doi:10.1007/BF02579200.

[bib.bib20] [20] Anna C. Gilbert and Lalit Jain. If it ain’t broke, don’t fix it: Sparse metric repair. In 2017 55th Annual Allerton Conference on Communication, Control, and Computing (Allerton), pages 612–619. IEEE, 2017. doi:10.1109/ALLERTON.2017.8262793.

[bib.bib21] [21] T. F. Havel and K. Wüthrich. An evaluation of the combined use of nuclear magnetic resonance and distance geometry for the determination of protein conformations in solution. Journal of Molecular Biology, 182(2):281–294, 1985.

[bib.bib22] [22] Piotr Indyk. Algorithmic applications of low-distortion geometric embeddings. In Proceedings of the 42nd IEEE Symposium on Foundations of Computer Science (FOCS), pages 10–33. IEEE, 2001. doi:10.1109/SFCS.2001.959878.

[bib.bib23] [23] Piotr Indyk and Jiri Matousek. Low-distortion embeddings of finite metric spaces. In in Handbook of Discrete and Computational Geometry, pages 177–196. CRC Press, 2004.

[bib.bib24] [24] Viren Jain and Lawrence K. Saul. Exploratory analysis and visualization of speech and music by locally linear embedding. In Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), volume 3, pages 984–987. IEEE, 2004. doi:10.1109/ICASSP.2004.1326712.

[bib.bib25] [25] Leo Liberti and Carlile Lavor. Euclidean distance geometry, volume 3. Springer, 2017.

[bib.bib26] [26] Nathan Linial. Finite metric-spaces – combinatorics, geometry and algorithms. In Proceedings of the International Congress of Mathematicians, Vol. III, pages 573–586, Beijing, 2002. Higher Ed. Press.

[bib.bib27] [27] Nathan Linial, Eran London, and Yuri Rabinovich. The geometry of graphs and some of its algorithmic applications. Combinatorica, 15(2):215–245, 1995. doi:10.1007/BF01200757.

[bib.bib28] [28] Karl Menger. Untersuchungen Ã¼ber allgemeine metrik. Mathematische Annalen, 100:75–163, 1928.

[bib.bib29] [29] N. Patwari, J. N. Ash, S. Kyperountas, A. O. Hero, R. L. Moses, and N. S. Correal. Locating the nodes: Cooperative localization in wireless sensor networks. IEEE Signal Processing Magazine, 22(4):54–69, July 2005. doi:10.1109/MSP.2005.1458287.

[bib.bib30] [30] Isaac J. Schoenberg. Remarks to maurice frechet’s article “sur la definition axiomatique d’une classe d’espace distances vectoriellement applicable sur l’espace de hilbert”. Annals of Mathematics, 36:724–732, 1935.

[bib.bib31] [31] R. Shepard. The analysis of proximities: multidimensional scaling with an unknown distance function, part i. Psychometrika, 27:125–140, 1962.

[bib.bib32] [32] Anastasios Sidiropoulos, Dingkang Wang, and Yusu Wang. Metric embeddings with outliers. In Proceedings of the Twenty-Eighth Annual ACM-SIAM Symposium on Discrete Algorithms (SODA), pages 670–689. SIAM, 2017. doi:10.1137/1.9781611974782.43.

[bib.bib33] [33] Manfred J. Sippl and Harold A. Scheraga. Solution of the embedding problem and decomposition of symmetric matrices. Proc. Nat. Acad. Sci. U.S.A., 82(8):2197–2201, 1985. doi:10.1073/pnas.82.8.2197.

[bib.bib34] [34] W. Torgerson. Theory and Methods of Scaling. Wiley, New York, 1958.

[bib.bib35] [35] Warren S. Torgerson. Multidimensional scaling: I. Theory and method. Psychometrika, 17(4):401–419, 1952.

[bib.bib36] [36] Kilian Q. Weinberger and Lawrence K. Saul. Unsupervised learning of image manifolds by semidefinite programming. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), volume 2, pages 988–995. IEEE, 2004. doi:10.1109/CVPR.2004.256.
