Preclustering Algorithms for Imprecise Points

Abam, Mohammad Ali; de Berg, Mark; Farahzad, Sina; Mirsadeghi, Mir Omid Haji; Saghafian, Morteza

doi:10.4230/LIPIcs.SWAT.2020.3

Abstract

We study the problem of preclustering a set B of imprecise points in ℝ^d: we wish to cluster the regions specifying the potential locations of the points such that, no matter where the points are located within their regions, the resulting clustering approximates the optimal clustering for those locations. We consider k-center, k-median, and k-means clustering, and obtain the following results.
Let B:={b₁,…,b_n} be a collection of disjoint balls in ℝ^d, where each ball b_i specifies the possible locations of an input point p_i. A partition 𝒞 of B into subsets is called an (f(k),α)-preclustering (with respect to the specific k-clustering variant under consideration) if (i) 𝒞 consists of f(k) preclusters, and (ii) for any realization P of the points p_i inside their respective balls, the cost of the clustering on P induced by 𝒞 is at most α times the cost of an optimal k-clustering on P. We call f(k) the size of the preclustering and we call α its approximation ratio. We prove that, even in ℝ^1, one may need at least 3k-3 preclusters to obtain a bounded approximation ratio - this holds for the k-center, the k-median, and the k-means problem - and we present a (3k,1) preclustering for the k-center problem in ℝ^1. We also present various preclusterings for balls in ℝ^d with d⩾2, including a (3k,α)-preclustering with α≈13.9 for the k-center and the k-median problem, and α≈254.7 for the k-means problem.

Cite As Get BibTex

Mohammad Ali Abam, Mark de Berg, Sina Farahzad, Mir Omid Haji Mirsadeghi, and Morteza Saghafian. Preclustering Algorithms for Imprecise Points. In 17th Scandinavian Symposium and Workshops on Algorithm Theory (SWAT 2020). Leibniz International Proceedings in Informatics (LIPIcs), Volume 162, pp. 3:1-3:12, Schloss Dagstuhl – Leibniz-Zentrum für Informatik (2020) https://doi.org/10.4230/LIPIcs.SWAT.2020.3

Author Details

Mohammad Ali Abam

Department of Computer Engineering, Sharif University of Technology, Iran

Mark de Berg

Department of Mathematics and Computer Science, TU Eindhoven, the Netherlands

Sina Farahzad

Department of Computer Engineering, Sharif University of Technology, Iran

Mir Omid Haji Mirsadeghi

Department of Mathematical Sciences, Sharif University of Technology, Iran

Morteza Saghafian

Department of Mathematical Sciences, Sharif University of Technology, Iran

Funding

The work in this paper is supported by the Netherlands Organisation for Scientific Research (NWO) through Gravitation-grant NETWORKS-024.002.003.

References

Pankaj K. Agarwal and Cecilia Magdalena Procopiuc. Exact and approximation algorithms for clustering. Algorithmica, 33(2):201-226, 2002. URL: https://doi.org/10.1007/s00453-001-0110-y.
Sanjeev Arora, Prabhakar Raghavan, and Satish Rao. Approximation schemes for euclidean k-medians and related problems. In Proceedings of the ACM Symposium on the Theory of Computing, pages 106-113, 1998. URL: https://doi.org/10.1145/276698.276718.
Kevin Buchin, Maarten Löffler, Pat Morin, and Wolfgang Mulzer. Preprocessing imprecise points for delaunay triangulation: Simplified and extended. Algorithmica, 61(3):674-693, 2011. URL: https://doi.org/10.1007/s00453-010-9430-0.
Dan Feldman, Morteza Monemizadeh, and Christian Sohler. A PTAS for k-means clustering based on weak coresets. In Proceedings of ACM Symposium on Computational Geometry, pages 11-18, 2007. URL: https://doi.org/10.1145/1247069.1247072.
Dorit S. Hochbaum and David B. Shmoys. A best possible heuristic for the k-center problem. Math. Oper. Res., 10(2):180-184, 1985. URL: https://doi.org/10.1287/moor.10.2.180.
Ragesh Jaiswal, Amit Kumar, and Sandeep Sen. A simple D 2-sampling based PTAS for k-means and other clustering problems. In Computing and Combinatorics COCOON 2012, volume 7434 of Lecture Notes in Computer Science, pages 13-24. Springer, 2012. URL: https://doi.org/10.1007/978-3-642-32241-9_2.
Wenqi Ju, Jun Luo, Binhai Zhu, and Ovidiu Daescu. Largest area convex hull of imprecise data based on axis-aligned squares. J. Comb. Optim., 26(4):832-859, 2013. URL: https://doi.org/10.1007/s10878-012-9488-5.
Stavros G. Kolliopoulos and Satish Rao. A nearly linear-time approximation scheme for the euclidean kappa-median problem. In Algorithms - ESA Proceedings, volume 1643 of Lecture Notes in Computer Science, pages 378-389. Springer, 1999. URL: https://doi.org/10.1007/3-540-48481-7_33.
Chih-Hung Liu and Sandro Montanari. Minimizing the diameter of a spanning tree for imprecise points. Algorithmica, 80(2):801-826, 2018. URL: https://doi.org/10.1007/s00453-017-0292-6.
Maarten Löffler. Data Imprecision in Computational Geometry. PhD thesis, Utrecht University, Netherlands, 2009.
Maarten Löffler and Marc J. van Kreveld. Largest and smallest convex hulls for imprecise points. Algorithmica, 56(2):235-269, 2010. URL: https://doi.org/10.1007/s00453-008-9174-2.
Takayuki Nagai and Nobuki Tokura. Tight error bounds of geometric problems on convex objects with imprecise coordinates. In JCDCG, volume 2098 of Lecture Notes in Computer Science, pages 252-263. Springer, 2000. URL: https://doi.org/10.1007/3-540-47738-1_24.
Farnaz Sheikhi, Ali Mohades, Mark de Berg, and Ali D. Mehrabi. Separability of imprecise points. Comput. Geom., 61:24-37, 2017. URL: https://doi.org/10.1016/j.comgeo.2016.10.001.
David B. Shmoys and Éva Tardos. An approximation algorithm for the generalized assignment problem. Math. Program., 62:461-474, 1993. URL: https://doi.org/10.1007/BF01585178.
István Talata. Exponential lower bound for the translative kissing numbers of d -dimensional convex bodies. Discrete and Computational Geometry, 19(3):447-455, 1998. URL: https://doi.org/10.1007/PL00009362.

Preclustering Algorithms for Imprecise Points

Authors Mohammad Ali Abam, Mark de Berg , Sina Farahzad, Mir Omid Haji Mirsadeghi, Morteza Saghafian

File

Document Identifiers

Subject Classification

ACM Subject Classification

Keywords

Metrics

Abstract

Cite As Get BibTex

Author Details

References

Thanks for your feedback!

Could not send message