Heba Aamer, Xiao Hu, Bas Ketsman
Creative Commons Attribution 4.0 International license
We study the worst-case communication complexity of join query evaluation over large-scale data in distributed shared-nothing systems under the MPC model. We focus on multi-round MPC algorithms that run in a constant number of rounds. The problem is well understood for a few classes of queries, mainly acyclic queries and graph-like queries. For queries belonging to neither class, the complexity picture is much less clear. We study the class of degree-two queries and fragments thereof. In this paper, we narrow the gap between the upper and lower bounds for the studied classes and establish worst-case optimality for some fragments of them. We also refute a widely believed conjecture about which query-related quantity optimally captures, in the worst case, the communication complexity of the studied problem.
@InProceedings{aamer_et_al:LIPIcs.ICDT.2026.8,
author = {Aamer, Heba and Hu, Xiao and Ketsman, Bas},
title = {{Neither Cover nor Pack: Distributed Worst-Case Optimality of Degree-2 Joins}},
booktitle = {29th International Conference on Database Theory (ICDT 2026)},
pages = {8:1--8:20},
series = {Leibniz International Proceedings in Informatics (LIPIcs)},
ISBN = {978-3-95977-413-0},
ISSN = {1868-8969},
year = {2026},
volume = {365},
editor = {ten Cate, Balder and Funk, Maurice},
publisher = {Schloss Dagstuhl -- Leibniz-Zentrum f{\"u}r Informatik},
address = {Dagstuhl, Germany},
URL = {https://drops.dagstuhl.de/entities/document/10.4230/LIPIcs.ICDT.2026.8},
URN = {urn:nbn:de:0030-drops-256226},
doi = {10.4230/LIPIcs.ICDT.2026.8},
annote = {Keywords: degree-two joins, worst-case optimality, distributed algorithms}
}