El-Zein, Hicham ;
He, Meng ;
Munro, J. Ian ;
Nekrich, Yakov ;
Sandlund, Bryce
On Approximate Range Mode and Range Selection
Abstract
For any epsilon in (0,1), a (1+epsilon)-approximate range mode query asks for the position of an element whose frequency in the query range is at most a factor (1+epsilon) smaller than the frequency of the true mode. For this problem, we design a data structure occupying O(n/epsilon) bits of space to answer queries in O(lg(1/epsilon)) time. This is an encoding data structure which does not require access to the input sequence; the space cost of this structure is asymptotically optimal for constant epsilon as we also prove a matching lower bound. Furthermore, our solution improves the previous best result of Greve et al. (Cell Probe Lower Bounds and Approximations for Range Mode, ICALP'10) by reducing the space cost by a factor of lg n while achieving the same query time. In dynamic settings, we design an O(n)-word data structure that answers queries in O(lg n / lg lg n) time and supports insertions and deletions in O(lg n) time, for any constant epsilon in (0,1); the bounds for non-constant epsilon = o(1) are also given in the paper. This is the first result on dynamic approximate range mode; it can also be used to obtain the first static data structure for approximate 3-sided range mode queries in two dimensions.
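To make the query definition concrete, the following is a minimal brute-force sketch of what counts as a valid answer to a (1+epsilon)-approximate range mode query. It is not the data structure from the paper: it scans the whole range on every call. The function name `is_approx_range_mode`, the inclusive 0-based range convention, and the requirement that the returned position lie inside the query range are assumptions made for illustration.

```python
from collections import Counter


def is_approx_range_mode(seq, lo, hi, pos, epsilon):
    """Return True if seq[pos] is a (1+epsilon)-approximate mode of seq[lo..hi].

    Hypothetical helper: the range is 0-based and inclusive, and the
    position is assumed to have to lie inside the query range.
    """
    if not (lo <= pos <= hi):
        return False
    counts = Counter(seq[lo:hi + 1])
    mode_freq = max(counts.values())  # frequency of the true mode
    # Valid if the frequency is at most a factor (1+epsilon) below the mode's.
    return counts[seq[pos]] * (1 + epsilon) >= mode_freq
```

For example, in the sequence 1, 2, 2, 3, 3, 3, 1 the true mode 3 occurs three times and the element 1 occurs twice, so a position holding 1 is an acceptable answer for epsilon = 0.5 but not for epsilon = 0.25.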
Another problem we consider is approximate range selection. For any alpha in (0,1/2), an alpha-approximate range selection query asks for the position of an element whose rank in the query range is in [k - alpha s, k + alpha s], where k is a rank given by the query and s is the size of the query range. When alpha is a constant, we design an O(n)-bit encoding data structure that can answer queries in constant time and prove this space cost is asymptotically optimal. The previous best result by Krizanc et al. (Range Mode and Range Median Queries on Lists and Trees, Nordic Journal of Computing, 2005) uses O(n lg n) bits, or O(n) words, to achieve constant approximation for range median only. Thus we not only improve the space cost, but also provide support for an arbitrary k given at query time. We also analyse our solutions for non-constant alpha.
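The acceptance condition for an alpha-approximate range selection query can be sketched in the same brute-force way. Again, this only illustrates the definition and is not the encoding data structure from the paper. The function name `is_approx_range_selection`, the 1-based ranks, the inclusive 0-based range, and the treatment of ties (an element with duplicates is taken to occupy every rank its copies span) are assumptions for illustration.

```python
def is_approx_range_selection(seq, lo, hi, pos, k, alpha):
    """Return True if seq[pos] has rank within [k - alpha*s, k + alpha*s] in seq[lo..hi].

    Hypothetical helper: ranks are 1-based, the range is 0-based and
    inclusive, and tied values are assumed to share a contiguous rank interval.
    """
    if not (lo <= pos <= hi):
        return False
    window = seq[lo:hi + 1]
    s = len(window)  # size of the query range
    x = seq[pos]
    smaller = sum(1 for v in window if v < x)
    equal = sum(1 for v in window if v == x)
    lowest_rank, highest_rank = smaller + 1, smaller + equal
    # Accept if the rank interval of x meets the allowed interval around k.
    return lowest_rank <= k + alpha * s and highest_rank >= k - alpha * s
```

For the range 10, 40, 20, 30 with k = 2 and alpha = 0.25, the allowed ranks are 1 through 3, so the positions of 10, 20, and 30 are acceptable answers and the position of 40 is not.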
BibTeX Entry
@InProceedings{elzein_et_al:LIPIcs:2019:11553,
author = {Hicham El-Zein and Meng He and J. Ian Munro and Yakov Nekrich and Bryce Sandlund},
title = {{On Approximate Range Mode and Range Selection}},
booktitle = {30th International Symposium on Algorithms and Computation (ISAAC 2019)},
pages = {57:1--57:14},
series = {Leibniz International Proceedings in Informatics (LIPIcs)},
ISBN = {978-3-95977-130-6},
ISSN = {1868-8969},
year = {2019},
volume = {149},
editor = {Pinyan Lu and Guochuan Zhang},
publisher = {Schloss Dagstuhl--Leibniz-Zentrum fuer Informatik},
address = {Dagstuhl, Germany},
URL = {https://drops.dagstuhl.de/opus/volltexte/2019/11553},
URN = {urn:nbn:de:0030-drops-115531},
doi = {10.4230/LIPIcs.ISAAC.2019.57},
annote = {Keywords: data structures, approximate range query, range mode, range median}
}
Keywords: data structures, approximate range query, range mode, range median
Seminar: 30th International Symposium on Algorithms and Computation (ISAAC 2019)
Issue date: 2019
Date of publication: 28.11.2019