An increased and growing interest in large-scale data processing has triggered a demand for specialized algorithms that thrive in massively parallel shared-nothing systems. To answer the question of how to efficiently compute join queries in this setting, a rich line of research has emerged specifically for the Massively Parallel Communication (MPC) model. In the MPC model, algorithms are executed in rounds, with each round consisting of a synchronized communication phase and a separate local computation phase. The main cost measure is the load of the algorithm, defined as the maximum number of messages received by any server in any round. We study worst-case optimal algorithms for the join query evaluation problem in the constant-round MPC model. In the single-round variant of MPC, the worst-case optimal load for this problem is well understood and algorithms exist that guarantee this load for any join query. In the constant-round variant of MPC, queries can often be computed with a lower load compared to the single-round variant, but the worst-case optimal load is only known for specific classes of join queries, including graph-like and acyclic join queries, and the associated algorithms use very different techniques. In this paper, we propose a new constant-round MPC algorithm for computing join queries. Our algorithm is correct for every join query and its load matches (up to a polylog factor) the worst-case optimal load for at least all join queries that are acyclic or graph-like.
@InProceedings{aamer_et_al:LIPIcs.ICDT.2025.6, author = {Aamer, Heba and Ketsman, Bas}, title = {{PAC: Computing Join Queries with Semi-Covers}}, booktitle = {28th International Conference on Database Theory (ICDT 2025)}, pages = {6:1--6:20}, series = {Leibniz International Proceedings in Informatics (LIPIcs)}, ISBN = {978-3-95977-364-5}, ISSN = {1868-8969}, year = {2025}, volume = {328}, editor = {Roy, Sudeepa and Kara, Ahmet}, publisher = {Schloss Dagstuhl -- Leibniz-Zentrum f{\"u}r Informatik}, address = {Dagstuhl, Germany}, URL = {https://drops.dagstuhl.de/entities/document/10.4230/LIPIcs.ICDT.2025.6}, URN = {urn:nbn:de:0030-drops-229474}, doi = {10.4230/LIPIcs.ICDT.2025.6}, annote = {Keywords: Worst-case optimal load, MPC model, join queries} }
Feedback for Dagstuhl Publishing