QuadRank: Engineering a High Throughput Rank

Groot Koerkamp, Ragnar

doi:10.4230/LIPIcs.SEA.2026.20

Abstract

Motivation. Given a text, a query rank(q, c) counts the number of occurrences of character c among the first q characters of the text. Space-efficient methods to answer these rank queries form an important building block in many succinct data structures. For example, the FM-index [Ferragina and Manzini, 2000] is a widely used data structure that uses rank queries to locate all occurrences of a pattern in a text.
In bioinformatics applications, the goal is usually to process large inputs as fast as possible. Thus, data structures should have high throughput when used with many threads.
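As a point of reference for the definition above, a rank query can be answered by a linear scan. The sketch below is only a baseline for checking faster structures, not one of the methods discussed here; the function name `rank_naive` is an assumption made for illustration.

```rust
/// rank(q, c): the number of occurrences of `c` among the first `q`
/// characters of `text`. Takes O(q) time and uses no extra space.
/// Panics if `q` exceeds the text length.
fn rank_naive(text: &[u8], q: usize, c: u8) -> usize {
    text[..q].iter().filter(|&&x| x == c).count()
}
```

For the text `ACGTACGA`, `rank_naive(text, 5, b'A')` counts the `A`s in `ACGTA`.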
Contributions. We first survey existing results on rank data structures. For the binary alphabet (σ = 2), we then develop BiRank, which has 3.28% space overhead. BiRank merges the central ideas of two recent papers: (1) we interleave (inline) offsets in each cache line of the underlying bit vector [Laws et al., 2024], reducing cache misses, and (2) these offsets point to the middle of each block, so that only half of each block needs to be popcounted [Gottlieb and Reinert, 2025]. In QuadRank (14.4% overhead), we extend these techniques to the DNA alphabet (σ = 4).
Both data structures typically require only a single cache miss per query, making them highly suitable for high-throughput and memory-bound settings. To enable efficient batch-processing, we support prefetching the cache lines required to answer upcoming queries.
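The three ideas above (inline offsets, offsets to the middle of a block, and prefetching for batches) can be sketched for the binary case as follows. This is a toy under stated assumptions, not the BiRank layout: the `Block` struct, its 64-bit offset with 448 data bits, and all function names are hypothetical, and this layout has far more overhead than the 3.28% reported for BiRank.

```rust
// Toy layout, assumed for illustration only: one 64-byte cache line holds
// a 64-bit offset and 448 data bits, so a query touches a single line.
#[repr(C, align(64))]
#[derive(Clone, Copy)]
struct Block {
    /// 1-bits in the whole vector before the middle (bit 224) of this block.
    mid_rank: u64,
    bits: [u64; 7],
}

const DATA_BITS: usize = 7 * 64;
const MID: usize = DATA_BITS / 2;

/// Count 1-bits at positions lo..hi (hi exclusive) within one block.
/// Only words overlapping the range are popcounted.
fn count_range(bits: &[u64; 7], lo: usize, hi: usize) -> u64 {
    let mut total = 0u64;
    for (w, &word) in bits.iter().enumerate() {
        let start = lo.max(w * 64);
        let end = hi.min((w + 1) * 64);
        if start < end {
            let len = end - start;
            let ones = if len == 64 { u64::MAX } else { (1u64 << len) - 1 };
            total += (word & (ones << (start - w * 64))).count_ones() as u64;
        }
    }
    total
}

fn build(input: &[bool]) -> Vec<Block> {
    let empty = Block { mid_rank: 0, bits: [0; 7] };
    // One extra block so that rank(input.len()) stays in bounds.
    let mut blocks = vec![empty; input.len() / DATA_BITS + 1];
    for (i, &bit) in input.iter().enumerate() {
        if bit {
            let j = i % DATA_BITS;
            blocks[i / DATA_BITS].bits[j / 64] |= 1u64 << (j % 64);
        }
    }
    let mut before = 0u64; // 1-bits before the start of the current block
    for block in blocks.iter_mut() {
        block.mid_rank = before + count_range(&block.bits, 0, MID);
        before += count_range(&block.bits, 0, DATA_BITS);
    }
    blocks
}

/// Number of 1-bits among the first `q` bits (q <= input length).
fn rank(blocks: &[Block], q: usize) -> u64 {
    let block = &blocks[q / DATA_BITS];
    let p = q % DATA_BITS;
    if p >= MID {
        // Right half: add the ones between the middle and p.
        block.mid_rank + count_range(&block.bits, MID, p)
    } else {
        // Left half: subtract the ones between p and the middle.
        block.mid_rank - count_range(&block.bits, p, MID)
    }
}

#[cfg(target_arch = "x86_64")]
fn prefetch(block: &Block) {
    use std::arch::x86_64::{_mm_prefetch, _MM_HINT_T0};
    // A prefetch is only a hint to the CPU; it does not change any result.
    unsafe { _mm_prefetch::<_MM_HINT_T0>(block as *const Block as *const i8) };
}

#[cfg(not(target_arch = "x86_64"))]
fn prefetch(_block: &Block) {} // no-op fallback on other architectures

/// Answer a small batch: first request all cache lines, then compute.
fn rank_batch(blocks: &[Block], queries: &[usize]) -> Vec<u64> {
    for &q in queries {
        prefetch(&blocks[q / DATA_BITS]);
    }
    queries.iter().map(|&q| rank(blocks, q)).collect()
}
```

Storing the count up to the middle means a query popcounts at most half of the data words, on whichever side of the middle it falls. The batch function assumes batches small enough that the prefetched lines are still cached when they are used.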
Results. BiRank and QuadRank are around 1.5× and 2× faster than similar-overhead methods that do not use interleaving. Prefetching gives an additional 2× speedup, at which point the dual-channel DDR4 RAM bandwidth becomes a hard limit on the total throughput. With prefetching, both methods outperform all other methods apart from SPIDER [Laws et al., 2024] by 2×.
Using QuadRank with prefetching in a toy count-only FM-index, QuadFm, yields a smaller index and up to a 4× speedup over Genedex, a state-of-the-art batching FM-index implementation.
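To illustrate how a count-only FM-index relies on rank queries, the sketch below runs the standard backward search with two rank queries per pattern character. It is not QuadFm: the `ToyFm` type is hypothetical, suffixes are sorted naively, and rank is a linear scan where a real index would use a structure such as QuadRank.

```rust
/// Toy count-only FM-index. Assumes the text contains no byte <= b'$'.
struct ToyFm {
    /// Burrows-Wheeler transform of the text plus a `$` sentinel.
    bwt: Vec<u8>,
    /// first[c] = number of characters (sentinel included) smaller than c.
    first: [usize; 256],
}

impl ToyFm {
    fn new(text: &[u8]) -> Self {
        let mut t = text.to_vec();
        t.push(b'$'); // sentinel, smaller than A, C, G, and T
        // Naive suffix sorting; acceptable only for tiny examples.
        let mut sa: Vec<usize> = (0..t.len()).collect();
        sa.sort_by(|&a, &b| t[a..].cmp(&t[b..]));
        // The BWT holds the character preceding each sorted suffix.
        let bwt = sa.iter().map(|&i| t[(i + t.len() - 1) % t.len()]).collect();
        let mut first = [0usize; 256];
        for &ch in &t {
            for larger in (ch as usize + 1)..256 {
                first[larger] += 1;
            }
        }
        Self { bwt, first }
    }

    /// Linear-scan rank over the BWT, standing in for a fast rank structure.
    fn rank(&self, q: usize, c: u8) -> usize {
        self.bwt[..q].iter().filter(|&&x| x == c).count()
    }

    /// Number of occurrences of a non-empty `pattern` in the text.
    fn count(&self, pattern: &[u8]) -> usize {
        let (mut lo, mut hi) = (0, self.bwt.len());
        // Extend the match one character at a time, from right to left.
        for &c in pattern.iter().rev() {
            lo = self.first[c as usize] + self.rank(lo, c);
            hi = self.first[c as usize] + self.rank(hi, c);
            if lo >= hi {
                return 0;
            }
        }
        hi - lo
    }
}
```

Each pattern character costs two rank queries at unrelated positions of the BWT, which is why rank throughput and cache misses dominate the running time of such an index.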
Conclusion. Optimizing data structures for high throughput, by minimizing cache misses and branch misses and by adding support for prefetching, can result in significant speedups when benchmarks are adjusted accordingly.

References

Tim Anderson and Travis J. Wheeler. An optimized FM-index library for nucleotide and amino acid search. Algorithms for Molecular Biology, 16(1), December 2021. URL: https://doi.org/10.1186/s13015-021-00204-6.
Piotr Beling. Bsuccinct: Rust libraries and programs focused on succinct data structures. SoftwareX, 26:101681, May 2024. URL: https://doi.org/10.1016/j.softx.2024.101681.
Alex Bowe. Multiary Wavelet Trees in Practice. Master’s thesis, School of Computer Science and Information Technology RMIT University, Melbourne, Australia, 2010. URL: https://raw.githubusercontent.com/alexbowe/wavelet-paper/thesis/thesis.pdf.
Michael Burrows and David J. Wheeler. A block-sorting lossless data compression algorithm. SRC Research Report, 124, 1994.
Matteo Ceregini, Florian Kurpicz, and Rossano Venturini. Faster wavelet trees with quad vectors. arXiv, 2023. URL: https://doi.org/10.48550/arXiv.2302.09239.
Matteo Ceregini, Florian Kurpicz, and Rossano Venturini. Faster wavelet tree queries. In 2024 Data Compression Conference (DCC). IEEE, March 2024. URL: https://doi.org/10.1109/dcc58796.2024.00030.
Alejandro Chacon, Santiago Marco-Sola, Antonio Espinosa, Paolo Ribeca, and Juan Carlos Moure. Boosting the FM-index on the GPU: Effective techniques to mitigate random memory access. IEEE/ACM Transactions on Computational Biology and Bioinformatics, 12(5):1048-1059, September 2015. URL: https://doi.org/10.1109/tcbb.2014.2377716.
Francisco Claude, Gonzalo Navarro, and Alberto Ordóñez. The wavelet matrix: An efficient wavelet tree for large alphabets. Information Systems, 47:15-32, January 2015. URL: https://doi.org/10.1016/j.is.2014.06.002.
Cydhra. Vers - very efficient rank and select (vers_vecs). github.com/Cydhra/vers, 2023.
Edsger W. Dijkstra. Why numbering should start at zero. https://www.cs.utexas.edu/~EWD/transcriptions/EWD08xx/EWD831.html, 1982.
Jack J. Dongarra. The evolution of mathematical software. Communications of the ACM, 65(12):66-72, November 2022. URL: https://doi.org/10.1145/3554977.
Felix Leander Droop. Genedex: a small and fast FM-index for Rust. https://github.com/feldroop/genedex, 2025.
P. Ferragina and G. Manzini. Opportunistic data structures with applications. In Proceedings 41st Annual Symposium on Foundations of Computer Science, SFCS-00, pages 390-398. IEEE Comput. Soc, 2000. URL: https://doi.org/10.1109/sfcs.2000.892127.
Paolo Ferragina, Giovanni Manzini, Veli Mäkinen, and Gonzalo Navarro. Compressed representations of sequences and full-text indexes. ACM Transactions on Algorithms, 3(2):20, May 2007. URL: https://doi.org/10.1145/1240233.1240243.
Simon Gog, Timo Beller, Alistair Moffat, and Matthias Petri. From Theory to Practice: Plug and Play with Succinct Data Structures, pages 326-337. Springer International Publishing, 2014. URL: https://doi.org/10.1007/978-3-319-07959-2_28.
Alexander Golynski. Optimal lower bounds for rank and select indexes. In Automata, Languages and Programming, pages 370-381. Springer Berlin Heidelberg, 2006. URL: https://doi.org/10.1007/11786986_33.
Rodrigo González, Szymon Grabowski, Veli Mäkinen, and Gonzalo Navarro. Practical implementation of rank and select queries. In Poster Proceedings of WEA 2005, 2005. WEA.
Simon Gene Gottlieb and Knut Reinert. Engineering rank queries on bit vectors and strings. Algorithms for Molecular Biology, 20(1), December 2025. URL: https://doi.org/10.1186/s13015-025-00291-9.
Simon Gene Gottlieb and Knut Reinert. Search schemes for approximate string matching. NAR Genomics and Bioinformatics, 7(1), January 2025. URL: https://doi.org/10.1093/nargab/lqaf025.
Ragnar Groot Koerkamp. RagnarGrootKoerkamp/quadrank. Software (visited on 2026-04-28). URL: https://github.com/RagnarGrootKoerkamp/quadrank. Full metadata available at: https://doi.org/10.4230/artifacts.25696.
Roberto Grossi, Ankur Gupta, and Jeffrey Scott Vitter. High-order entropy-compressed text indexes. In Proceedings of the Fourteenth Annual ACM-SIAM Symposium on Discrete Algorithms, SODA '03, pages 841-850, USA, 2003. Society for Industrial and Applied Mathematics. URL: http://dl.acm.org/citation.cfm?id=644108.644250.
Guy Joseph Jacobson. Succinct static data structures. PhD thesis, Carnegie Mellon University, 1988.
Sujay Jayakar. RsDict: Fast rank/select over bitmaps. https://github.com/sujayakar/rsdict, 2020.
Shunsuke Kanda. Succinct data structures in Rust (sucds). https://github.com/kampersanda/sucds, 2021.
Florian Kurpicz. Engineering compact data structures for rank and select queries on bit vectors. arXiv, 2022. URL: https://doi.org/10.48550/arXiv.2206.01149.
Florian Kurpicz. Engineering compact data structures for rank and select queries on bit vectors. In SPIRE 2022, pages 257-272. Springer International Publishing, 2022. URL: https://doi.org/10.1007/978-3-031-20643-6_19.
Florian Kurpicz, Niccolò Rigi-Luperti, and Peter Sanders. Theory meets practice for bit vectors supporting rank and select. arXiv, 2025. URL: https://doi.org/10.48550/arXiv.2509.17819.
Johannes Köster. Rust-bio: a fast and safe bioinformatics library. Bioinformatics, 32(3):444-446, October 2015. URL: https://doi.org/10.1093/bioinformatics/btv573.
Ben Langmead and Steven L Salzberg. Fast gapped-read alignment with Bowtie 2. Nature Methods, 9(4):357-359, March 2012. URL: https://doi.org/10.1038/nmeth.1923.
Ben Langmead, Cole Trapnell, Mihai Pop, and Steven L Salzberg. Ultrafast and memory-efficient alignment of short DNA sequences to the human genome. Genome Biology, 10(3), March 2009. URL: https://doi.org/10.1186/gb-2009-10-3-r25.
Matthew D. Laws, Jocelyn Bliven, Kit Conklin, Elyes Laalai, Samuel McCauley, and Zach S. Sturdevant. Spider: Improved succinct rank and select performance. In SEA 2024, volume 301, pages 21:1-21:18. Schloss Dagstuhl - Leibniz-Zentrum für Informatik, 2024. URL: https://doi.org/10.4230/LIPIcs.SEA.2024.21.
Heng Li. Aligning sequence reads, clone sequences and assembly contigs with BWA-MEM. arXiv, 2013. URL: https://doi.org/10.48550/arXiv.1303.3997.
Peter Bro Miltersen. Lower bounds on the size of selection and rank indexes. In SODA '05, pages 11-12. Society for Industrial and Applied Mathematics, 2005. URL: http://dl.acm.org/citation.cfm?id=1070432.1070435.
Wojciech Muła. SSSE3: fast popcount. http://0x80.pl/notesen/2008-05-24-sse-popcount.html, May 2008.
Wojciech Muła, Nathan Kurz, and Daniel Lemire. Faster population counts using AVX2 instructions. The Computer Journal, 61(1):111-120, May 2017. URL: https://doi.org/10.1093/comjnl/bxx046.
Gonzalo Navarro and Eliana Providel. Fast, small, simple rank/select on bitmaps. In SEA 2012, pages 295-306. Springer Berlin Heidelberg, 2012. URL: https://doi.org/10.1007/978-3-642-30850-5_26.
Giulio Ermanno Pibiri and Shunsuke Kanda. Rank/select queries over mutable bitmaps. Information Systems, 99:101756, July 2021. URL: https://doi.org/10.1016/j.is.2021.101756.
Christopher Pockrandt, Marcel Ehrhardt, and Knut Reinert. EPR-Dictionaries: A practical and fast data structure for constant time searches in unidirectional and bidirectional FM indices. In RECOMB 2017, pages 190-206. Springer International Publishing, 2017. URL: https://doi.org/10.1007/978-3-319-56970-3_12.
Luca Renders, Lore Depuydt, and Jan Fostier. Approximate pattern matching using search schemes and in-text verification. In Bioinformatics and Biomedical Engineering, pages 419-435. Springer International Publishing, 2022. URL: https://doi.org/10.1007/978-3-031-07802-6_36.
Luca Renders, Lore Depuydt, Travis Gagie, and Jan Fostier. Columba: fast approximate pattern matching with optimized search schemes. Bioinformatics, 41(12), December 2025. URL: https://doi.org/10.1093/bioinformatics/btaf652.
Arang Rhie, Sergey Nurk, Monika Cechova, Savannah J. Hoyt, Dylan J. Taylor, Nicolas Altemose, Paul W. Hook, Sergey Koren, Mikko Rautiainen, Ivan A. Alexandrov, Jamie Allen, Mobin Asri, Andrey V. Bzikadze, Nae-Chyun Chen, Chen-Shan Chin, Mark Diekhans, Paul Flicek, Giulio Formenti, Arkarachai Fungtammasan, Carlos Garcia Giron, Erik Garrison, Ariel Gershman, Jennifer L. Gerton, Patrick G. S. Grady, Andrea Guarracino, Leanne Haggerty, Reza Halabian, Nancy F. Hansen, Robert Harris, Gabrielle A. Hartley, William T. Harvey, Marina Haukness, Jakob Heinz, Thibaut Hourlier, Robert M. Hubley, Sarah E. Hunt, Stephen Hwang, Miten Jain, Rupesh K. Kesharwani, Alexandra P. Lewis, Heng Li, Glennis A. Logsdon, Julian K. Lucas, Wojciech Makalowski, Christopher Markovic, Fergal J. Martin, Ann M. Mc Cartney, Rajiv C. McCoy, Jennifer McDaniel, Brandy M. McNulty, Paul Medvedev, Alla Mikheenko, Katherine M. Munson, Terence D. Murphy, Hugh E. Olsen, Nathan D. Olson, Luis F. Paulin, David Porubsky, Tamara Potapova, Fedor Ryabov, Steven L. Salzberg, Michael E. G. Sauria, Fritz J. Sedlazeck, Kishwar Shafin, Valery A. Shepelev, Alaina Shumate, Jessica M. Storer, Likhitha Surapaneni, Angela M. Taravella Oill, Françoise Thibaud-Nissen, Winston Timp, Marta Tomaszkiewicz, Mitchell R. Vollger, Brian P. Walenz, Allison C. Watwood, Matthias H. Weissensteiner, Aaron M. Wenger, Melissa A. Wilson, Samantha Zarate, Yiming Zhu, Justin M. Zook, Evan E. Eichler, Rachel J. O’Neill, Michael C. Schatz, Karen H. Miga, Kateryna D. Makova, and Adam M. Phillippy. The complete sequence of a human Y chromosome. Nature, 621(7978):344-354, August 2023. URL: https://doi.org/10.1038/s41586-023-06457-y.
Jesse A. Tov. Succinct data structures for Rust (succinct). https://github.com/tov/succinct-rs, 2016.
Sebastiano Vigna. Broadword implementation of rank/select queries. In Catherine C. McGeoch, editor, WEA 2008, pages 154-168, Berlin, Heidelberg, 2008. Springer Berlin Heidelberg. URL: https://doi.org/10.1007/978-3-540-68552-4_12.
Sebastiano Vigna and Tommaso Fontana. Sux. https://github.com/vigna/sux-rs, 2024.
Mohsen Zakeri, Nathaniel K. Brown, Omar Y. Ahmed, Travis Gagie, and Ben Langmead. Movi: A fast and cache-efficient full-text pangenome index. iScience, 27(12):111464, December 2024. URL: https://doi.org/10.1016/j.isci.2024.111464.
Mohsen Zakeri, Nathaniel K. Brown, Travis Gagie, and Ben Langmead. Movi 2: Fast and space-efficient queries on pangenomes. bioRxiv, October 2025. URL: https://doi.org/10.1101/2025.10.16.682873.
Dong Zhou, David G. Andersen, and Michael Kaminsky. Space-efficient, high-performance rank and select structures on uncompressed bit sequences. In SEA 2013, pages 151-163. Springer Berlin Heidelberg, 2013. URL: https://doi.org/10.1007/978-3-642-38527-8_15.
