The Wavelet Tree data structure introduced in Grossi, Gupta, and Vitter [Grossi et al., 2003] is a space-efficient technique for rank and select queries that generalizes from binary symbols to an arbitrary multisymbol alphabet. Over the last two decades, it has become a pivotal tool in modern full-text indexing and data compression because of its properties and capabilities in compressing and indexing data, with many applications to information retrieval, genome analysis, data mining, and web search. In this paper, we survey the fascinating history and impact of Wavelet Trees; no doubt many more developments are yet to come. Our survey borrows some content from the authors' earlier works. This paper is divided into two parts: The first part gives a brief history of Wavelet Trees, including its varieties and practical implementations, which appears in the Festschrift dedicated to Roberto Grossi; the second part (this one) deals with Wavelet Tree-based text indexing and is included in the Festschrift dedicated to Giovanni Manzini.
@InProceedings{ferragina_et_al:OASIcs.Manzini.4, author = {Ferragina, Paolo and Giancarlo, Raffaele and Grossi, Roberto and Rosone, Giovanna and Venturini, Rossano and Vitter, Jeffrey Scott}, title = {{Wavelet Tree, Part II: Text Indexing}}, booktitle = {The Expanding World of Compressed Data: A Festschrift for Giovanni Manzini's 60th Birthday}, pages = {4:1--4:10}, series = {Open Access Series in Informatics (OASIcs)}, ISBN = {978-3-95977-390-4}, ISSN = {2190-6807}, year = {2025}, volume = {131}, editor = {Ferragina, Paolo and Gagie, Travis and Navarro, Gonzalo}, publisher = {Schloss Dagstuhl -- Leibniz-Zentrum f{\"u}r Informatik}, address = {Dagstuhl, Germany}, URL = {https://drops.dagstuhl.de/entities/document/10.4230/OASIcs.Manzini.4}, URN = {urn:nbn:de:0030-drops-239127}, doi = {10.4230/OASIcs.Manzini.4}, annote = {Keywords: Wavelet tree, data compression, text indexing, compressed suffix array, Burrows-Wheeler transform, rank and select} }
Feedback for Dagstuhl Publishing