Online k-Median with Consistent Clusters

Authors Benjamin Moseley , Heather Newman , Kirk Pruhs



PDF
Thumbnail PDF

File

LIPIcs.APPROX-RANDOM.2024.20.pdf
  • Filesize: 1 MB
  • 22 pages

Document Identifiers

Author Details

Benjamin Moseley
  • Tepper School of Business, Carnegie Mellon University, Pittsburgh, PA, USA
Heather Newman
  • Department of Mathematical Sciences, Carnegie Mellon University, Pittsburgh, PA, USA
Kirk Pruhs
  • Department of Computer Science, University of Pittsburgh, Pittsburgh, PA, USA

Cite AsGet BibTex

Benjamin Moseley, Heather Newman, and Kirk Pruhs. Online k-Median with Consistent Clusters. In Approximation, Randomization, and Combinatorial Optimization. Algorithms and Techniques (APPROX/RANDOM 2024). Leibniz International Proceedings in Informatics (LIPIcs), Volume 317, pp. 20:1-20:22, Schloss Dagstuhl – Leibniz-Zentrum für Informatik (2024)
https://doi.org/10.4230/LIPIcs.APPROX/RANDOM.2024.20

Abstract

We consider the problem in which n points arrive online over time, and upon arrival must be irrevocably assigned to one of k clusters where the objective is the standard k-median objective. Lower-bound instances show that for this problem no online algorithm can achieve a competitive ratio bounded by any function of n. Thus we turn to a beyond worst-case analysis approach, namely we assume that the online algorithm is a priori provided with a predicted budget B that is an upper bound to the optimal objective value (e.g., obtained from past instances). Our main result is an online algorithm whose competitive ratio (measured against B) is solely a function of k. We also give a lower bound showing that the competitive ratio of every algorithm must depend on k.

Subject Classification

ACM Subject Classification
  • Theory of computation → Online algorithms
Keywords
  • k-median
  • online algorithms
  • learning-augmented algorithms
  • beyond worst-case analysis

Metrics

  • Access Statistics
  • Total Accesses (updated on a weekly basis)
    0
    PDF Downloads

References

  1. Vijay Arya, Naveen Garg, Rohit Khandekar, Adam Meyerson, Kamesh Munagala, and Vinayaka Pandit. Local search heuristics for k-median and facility location problems. SIAM Journal of Computing, 33(3):544-562, 2004. Google Scholar
  2. Bahman Bahmani, Benjamin Moseley, Andrea Vattani, Ravi Kumar, and Sergei Vassilvitskii. Scalable k-means++. Proc. VLDB Endow., 5(7):622-633, 2012. Google Scholar
  3. Robi Bhattacharjee, Jacob Imola, Michal Moshkovitz, and Sanjoy Dasgupta. Online k-means clustering on arbitrary data streams. In International Conference on Algorithmic Learning Theory, pages 204-236. PMLR, 2023. Google Scholar
  4. Robi Bhattacharjee and Michal Moshkovitz. No-substitution k-means clustering with adversarial order. In Algorithmic Learning Theory, pages 345-366. PMLR, 2021. Google Scholar
  5. Christopher M. Bishop. Pattern Recognition and Machine Learning. Springer-Verlag, Berlin, Heidelberg, 2006. Google Scholar
  6. Jaroslaw Byrka, Thomas W. Pensyl, Bartosz Rybicki, Aravind Srinivasan, and Khoa Trinh. An improved approximation for k-median and positive correlation in budgeted optimization. ACM Transactions on Algorithms, 13(2):23:1-23:31, 2017. Google Scholar
  7. T-H. Hubert Chan, Arnaud Guerqin, and Mauro Sozio. Fully dynamic k-center clustering. In World Wide Web Conference, pages 579-587, 2018. Google Scholar
  8. Moses Charikar, Chandra Chekuri, Tomás Feder, and Rajeev Motwani. Incremental clustering and dynamic information retrieval. SIAM Journal of Computing, 33(6):1417-1440, 2004. Google Scholar
  9. Moses Charikar, Sudipto Guha, Éva Tardos, and David B. Shmoys. A constant-factor approximation algorithm for the k-median problem. Journal of Computer and Systems Sciences, 65(1):129-149, 2002. Google Scholar
  10. Vincent Cohen-Addad, Niklas Hjuler, Nikos Parotsidis, David Saulpic, and Chris Schwiegelshohn. Fully dynamic consistent facility location. In Conference on Neural Information Processing Systems, pages 3250-3260, 2019. Google Scholar
  11. Hendrik Fichtenberger, Silvio Lattanzi, Ashkan Norouzi-Fard, and Ola Svensson. Consistent k-clustering for general metrics. In ACM-SIAM Symposium on Discrete Algorithms, 2021. Google Scholar
  12. Xiangyu Guo, Janardhan Kulkarni, Shi Li, and Jiayi Xian. Consistent k-median: Simpler, better and robust. In International Conference on Artificial Intelligence and Statistics, volume 130, pages 1135-1143, 2021. Google Scholar
  13. K. Jain and V. V. Vazirani. Approximation algorithms for metric facility location and k-median problems using the primal-dual schema and lagrangian relaxation. Journal of the ACM, 48(2):274-296, 2001. Google Scholar
  14. Silvio Lattanzi and Sergei Vassilvitskii. Consistent k-clustering. In Doina Precup and Yee Whye Teh, editors, International Conference on Machine Learning, pages 1975-1984, 2017. Google Scholar
  15. Shi Li and Ola Svensson. Approximating k-median via pseudo-approximation. SIAM Journal of Computing, 45(2):530-547, 2016. Google Scholar
  16. Edo Liberty, Ram Sriharsha, and Maxim Sviridenko. k-means clustering. talk slides. Google Scholar
  17. Edo Liberty, Ram Sriharsha, and Maxim Sviridenko. An algorithm for online k-means clustering. In Workshop on Algorithm Engineering and Experiments, pages 81-89, 2016. Google Scholar
  18. A. Meyerson. Online facility location. In IEEE Symposium on Foundations of Computer Science, pages 426-431, 2001. Google Scholar
  19. Adam Meyerson, Liadan O'Callaghan, and Serge Plotkin. A k-median algorithm with running time independent of data size. Machine Learning, 56(1):61-87, 2004. Google Scholar
  20. Benjamin Moseley, Heather Newman, and Kirk Pruhs. Online k-median with consistent clusters. arXiv preprint, 2023. URL: https://arxiv.org/abs/2303.15379.
  21. Tim Roughgarden. Beyond the Worst-Case Analysis of Algorithms. Cambridge University Press, 2021. Google Scholar
Questions / Remarks / Feedback
X

Feedback for Dagstuhl Publishing


Thanks for your feedback!

Feedback submitted

Could not send message

Please try again later or send an E-mail