,
A. R. Sricharan
,
Teresa Anna Steiner
Creative Commons Attribution 4.0 International license
Privately counting distinct elements in a stream is a fundamental data analysis problem with many applications in machine learning. In the turnstile model, Jain et al. [NeurIPS2023] initiated the study of this problem parameterized by the maximum flippancy of any element, i.e., the number of times that the count of an element changes from 0 to above 0 or vice versa. They give an item-level (ε,δ)-differentially private algorithm whose additive error is tight with respect to that parameterization. In this work, we show that a very simple algorithm based on the sparse vector technique achieves a tight additive error for item-level (ε,δ)-differential privacy and item-level ε-differential privacy with regards to a different parameterization, namely the sum of all flippancies. Our second result is a bound which shows that for a large class of algorithms, including all existing differentially private algorithms for this problem, the lower bound from item-level differential privacy extends to event-level differential privacy. This partially answers an open question by Jain et al. [NeurIPS2023].
@InProceedings{henzinger_et_al:LIPIcs.APPROX/RANDOM.2024.40,
author = {Henzinger, Monika and Sricharan, A. R. and Steiner, Teresa Anna},
title = {{Private Counting of Distinct Elements in the Turnstile Model and Extensions}},
booktitle = {Approximation, Randomization, and Combinatorial Optimization. Algorithms and Techniques (APPROX/RANDOM 2024)},
pages = {40:1--40:21},
series = {Leibniz International Proceedings in Informatics (LIPIcs)},
ISBN = {978-3-95977-348-5},
ISSN = {1868-8969},
year = {2024},
volume = {317},
editor = {Kumar, Amit and Ron-Zewi, Noga},
publisher = {Schloss Dagstuhl -- Leibniz-Zentrum f{\"u}r Informatik},
address = {Dagstuhl, Germany},
URL = {https://drops.dagstuhl.de/entities/document/10.4230/LIPIcs.APPROX/RANDOM.2024.40},
URN = {urn:nbn:de:0030-drops-210335},
doi = {10.4230/LIPIcs.APPROX/RANDOM.2024.40},
annote = {Keywords: differential privacy, turnstile model, counting distinct elements}
}