Creative Commons Attribution 3.0 Unported license
In this paper, we report progress on answering the open problem presented by Pagh [11], who considered the near neighbor search without false negatives for the Hamming distance. We show new data structures for solving the c-approximate near neighbors problem without false negatives for Euclidean high dimensional space
\mathcal{R}^d. These data structures work for any c = \omega(\sqrt{\log{\log{n}}}), where n is the number of points in the input set, with poly-logarithmic query time and polynomial
pre-processing time. This improves over the known algorithms, which require c to be \Omega(\sqrt{d}).
This improvement is obtained by applying a sequence of reductions, which are interesting on their own. First, we reduce the problem to d instances of dimension logarithmic in n. Next, these instances are reduced to a number of c-approximate near neighbor search without false negatives instances in \big(\Rspace^k\big)^L space equipped with metric m(x,y) = \max_{1 \le i \leL}(\dist{x_i - y_i}_2).
@InProceedings{sankowski_et_al:LIPIcs.ISAAC.2017.63,
author = {Sankowski, Piotr and Wygocki, Piotr},
title = {{Approximate Nearest Neighbors Search Without False Negatives For l\underline2 For c\ranglesqrt\{loglog\{n\}\}}},
booktitle = {28th International Symposium on Algorithms and Computation (ISAAC 2017)},
pages = {63:1--63:12},
series = {Leibniz International Proceedings in Informatics (LIPIcs)},
ISBN = {978-3-95977-054-5},
ISSN = {1868-8969},
year = {2017},
volume = {92},
editor = {Okamoto, Yoshio and Tokuyama, Takeshi},
publisher = {Schloss Dagstuhl -- Leibniz-Zentrum f{\"u}r Informatik},
address = {Dagstuhl, Germany},
URL = {https://drops.dagstuhl.de/entities/document/10.4230/LIPIcs.ISAAC.2017.63},
URN = {urn:nbn:de:0030-drops-82189},
doi = {10.4230/LIPIcs.ISAAC.2017.63},
annote = {Keywords: locality sensitive hashing, approximate near neighbor search, high- dimensional, similarity search}
}