Fast Approximation of Search Trees on Trees with Centroid Trees

Berendsohn, Benjamin Aram; Golinsky, Ishay; Kaplan, Haim; Kozma, László

doi:10.4230/LIPIcs.ICALP.2023.19

Abstract

Search trees on trees (STTs) generalize the fundamental binary search tree (BST) data structure: in STTs the underlying search space is an arbitrary tree, whereas in BSTs it is a path. An optimal BST of size n can be computed for a given distribution of queries in 𝒪(n²) time [Knuth, Acta Inf. 1971], and centroid BSTs provide a nearly optimal alternative, computable in 𝒪(n) time [Mehlhorn, SICOMP 1977]. By contrast, optimal STTs are not known to be computable in polynomial time, and the fastest constant-approximation algorithm runs in 𝒪(n³) time [Berendsohn, Kozma, SODA 2022]. Centroid trees can be defined for STTs analogously to BSTs, and they have been used in a wide range of algorithmic applications. In the unweighted case (i.e., for a uniform distribution of queries), the centroid tree can be computed in 𝒪(n) time [Brodal, Fagerberg, Pedersen, Östlin, ICALP 2001; Della Giustina, Prezza, Venturini, SPIRE 2019]. These algorithms, however, do not readily extend to the weighted case. Moreover, no approximation guarantees were previously known for centroid trees in either the unweighted or the weighted case. In this paper we revisit centroid trees in a general, weighted setting, and we settle both the algorithmic complexity of constructing them and the quality of their approximation. For constructing a weighted centroid tree, we give an output-sensitive 𝒪(n log h) ⊆ 𝒪(n log n) time algorithm, where h is the height of the resulting centroid tree. If the weights are of polynomial complexity, the running time is 𝒪(n log log n). We show these bounds to be optimal in a general decision tree model of computation. For approximation, we prove that the cost of a centroid tree is at most twice the optimum, and that this guarantee is best possible, both in the weighted and unweighted cases. We also give tight, fine-grained bounds on the approximation ratio for bounded-degree trees and on the approximation ratio of more general α-centroid trees.
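
To make the notion of a weighted centroid tree concrete, here is a minimal Python sketch. It is a naive construction that takes roughly 𝒪(n·h) time and is not the 𝒪(n log h) algorithm of the paper. It assumes the usual definition: a centroid is a vertex whose removal leaves components of weight at most half the total, and the centroid tree is built by recursing on those components. The names `centroid_tree` and `search_cost`, the adjacency-dictionary input, and the cost convention (weighted sum of depths, root at depth 1) are illustrative assumptions, not taken from the paper.

```python
def centroid_tree(adj, weight):
    """Naive weighted centroid tree of a non-empty tree.

    adj: dict mapping each vertex to a list of its neighbours.
    weight: dict mapping each vertex to a non-negative weight.
    Returns (root, parent), where parent[v] is v's parent in the
    centroid tree (None for the root).
    """
    removed = set()   # vertices already placed in the centroid tree
    parent = {}

    def find_centroid(start):
        # Iterative DFS over the current component, rooted at `start`.
        order, par = [], {start: None}
        stack = [start]
        while stack:
            v = stack.pop()
            order.append(v)
            for u in adj[v]:
                if u not in removed and u != par[v]:
                    par[u] = v
                    stack.append(u)
        # Subtree weights: every vertex appears after its DFS parent,
        # so the reversed order visits children before parents.
        sub = {v: weight[v] for v in order}
        for v in reversed(order):
            if par[v] is not None:
                sub[par[v]] += sub[v]
        total = sub[start]
        # Walk towards the child subtree holding more than half the weight.
        v = start
        while True:
            heavy = next((u for u in adj[v]
                          if par.get(u) == v and 2 * sub[u] > total), None)
            if heavy is None:
                return v
            v = heavy

    root = None
    work = [(next(iter(adj)), None)]   # (vertex in component, centroid-tree parent)
    while work:
        start, p = work.pop()
        c = find_centroid(start)
        parent[c] = p
        if p is None:
            root = c
        removed.add(c)
        for u in adj[c]:
            if u not in removed:
                work.append((u, c))
    return root, parent


def search_cost(parent, weight):
    """Weighted sum of depths in the centroid tree (root at depth 1)."""
    total = 0
    for v in parent:
        depth, u = 0, v
        while u is not None:
            depth += 1
            u = parent[u]
        total += weight[v] * depth
    return total


# Usage: a path on 7 vertices with uniform weights.
path = {i: [j for j in (i - 1, i + 1) if 0 <= j < 7] for i in range(7)}
uniform = {i: 1 for i in range(7)}
root, parent = centroid_tree(path, uniform)
print(root, search_cost(parent, uniform))
```

On a uniformly weighted path the sketch produces a balanced BST, which matches the statement that BSTs are the special case of STTs whose underlying tree is a path.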

References

Stephen Alstrup, Jacob Holm, Kristian De Lichtenberg, and Mikkel Thorup. Maintaining information in fully dynamic trees with top trees. ACM Trans. Algorithms, 1(2):243-264, October 2005. URL: https://doi.org/10.1145/1103963.1103966.
Bengt Aspvall and Pinar Heggernes. Finding minimum height elimination trees for interval graphs in polynomial time. BIT, 34:484-509, 1994.
Yosi Ben-Asher, Eitan Farchi, and Ilan Newman. Optimal search in trees. SIAM J. Comput., 28(6):2090-2102, 1999. URL: https://doi.org/10.1137/S009753979731858X.
Michael A. Bender, Martin Farach-Colton, and Bradley C. Kuszmaul. Cache-oblivious string b-trees. In ACM SIGMOD-SIGACT-SIGART, pages 233-242, 2006.
Benjamin Aram Berendsohn. The diameter of caterpillar associahedra. In Artur Czumaj and Qin Xin, editors, 18th Scandinavian Symposium and Workshops on Algorithm Theory, SWAT 2022, June 27-29, 2022, Tórshavn, Faroe Islands, volume 227 of LIPIcs, pages 14:1-14:12. Schloss Dagstuhl - Leibniz-Zentrum für Informatik, 2022. URL: https://doi.org/10.4230/LIPIcs.SWAT.2022.14.
Benjamin Aram Berendsohn, Ishay Golinsky, Haim Kaplan, and László Kozma. Fast approximation of search trees on trees with centroid trees, 2022. URL: https://arxiv.org/abs/2209.08024.
Benjamin Aram Berendsohn and László Kozma. Splay trees on trees. In SODA, pages 1875-1900, 2022.
Hans L. Bodlaender, Jitender S. Deogun, Klaus Jansen, Ton Kloks, Dieter Kratsch, Haiko Müller, and Zsolt Tuza. Rankings of graphs. SIAM Journal on Discrete Mathematics, 11(1):168-181, 1998.
H.L. Bodlaender, J.R. Gilbert, H. Hafsteinsson, and T. Kloks. Approximating treewidth, pathwidth, frontsize, and shortest elimination tree. Journal of Algorithms, 18(2):238-255, 1995. URL: https://doi.org/10.1006/jagm.1995.1009.
Prosenjit Bose, Jean Cardinal, John Iacono, Grigorios Koumoutsos, and Stefan Langerman. Competitive online search trees on trees. In SODA, pages 1878-1891, 2020.
Gerth Stølting Brodal, Rolf Fagerberg, Christian N. S. Pedersen, and Anna Östlin. The complexity of constructing evolutionary trees using experiments. In ICALP, pages 140-151. Springer, 2001.
Jean Cardinal, Stefan Langerman, and Pablo Pérez-Lantero. On the diameter of tree associahedra. Electron. J. Comb., 25(4):P4.18, 2018. URL: http://www.combinatorics.org/ojs/index.php/eljc/article/view/v25i4p18, URL: https://doi.org/10.37236/7762.
Jean Cardinal, Lionel Pournin, and Mario Valencia-Pabon. Bounds on the diameter of graph associahedra. In Proceedings of the XI Latin and American Algorithms, Graphs and Optimization Symposium (LAGOS), volume 195 of Procedia Computer Science, pages 239-247. Elsevier, 2021.
Michael Carr and Satyan L. Devadoss. Coxeter complexes and graph-associahedra. Topology and its Applications, 153(12):2155-2168, 2006.
Cesar Ceballos, Thibault Manneville, Vincent Pilaud, and Lionel Pournin. Diameters and geodesic properties of generalizations of the associahedron. In Proceedings of the 27th International Conference on Formal Power Series and Algebraic Combinatorics (FPSAC), pages 345-356, 2015.
Panagiotis Charalampopoulos, Pawel Gawrychowski, Shay Mozes, and Oren Weimann. An almost optimal edit distance oracle. In Nikhil Bansal, Emanuela Merelli, and James Worrell, editors, 48th International Colloquium on Automata, Languages, and Programming, ICALP 2021, July 12-16, 2021, Glasgow, Scotland (Virtual Conference), volume 198 of LIPIcs, pages 48:1-48:20. Schloss Dagstuhl - Leibniz-Zentrum für Informatik, 2021. URL: https://doi.org/10.4230/LIPIcs.ICALP.2021.48.
Ferdinando Cicalese, Tobias Jacobs, Eduardo Laber, and Marco Molinaro. On the complexity of searching in trees and partially ordered structures. Theor. Comput. Sci., 412(50):6879-6896, 2011. URL: https://doi.org/10.1016/j.tcs.2011.08.042.
Ferdinando Cicalese, Tobias Jacobs, Eduardo Laber, and Marco Molinaro. Improved approximation algorithms for the average-case tree searching problem. Algorithmica, 68(4):1045-1074, 2014. URL: https://doi.org/10.1007/s00453-012-9715-6.
Erik D. Demaine, Dion Harmon, John Iacono, and Mihai Pǎtraşcu. Dynamic optimality - almost. SIAM J. Comput., 37(1):240-251, 2007. URL: https://doi.org/10.1137/S0097539705447347.
Jitender S Deogun, Ton Kloks, Dieter Kratsch, and Haiko Müller. On vertex ranking for permutation and other graphs. In STACS 1994, pages 747-758. Springer, 1994.
Satyan L. Devadoss. A realization of graph associahedra. Discrete Mathematics, 309(1):271-276, 2009.
Iain S Duff, Albert Maurice Erisman, and John Ker Reid. Direct methods for sparse matrices. Oxford University Press, 2017.
Guy Even and Shakhar Smorodinsky. Hitting sets online and unique-max coloring. Discret. Appl. Math., 178:71-82, 2014. URL: https://doi.org/10.1016/j.dam.2014.06.019.
Paolo Ferragina. On the weak prefix-search problem. Theor. Comput. Sci., 483:75-84, 2013. URL: https://doi.org/10.1016/j.tcs.2012.06.011.
Paolo Ferragina and Rossano Venturini. Compressed cache-oblivious string b-tree. ACM Trans. Algorithms, 12(4):52:1-52:17, 2016. URL: https://doi.org/10.1145/2903141.
Greg N. Frederickson and Donald B Johnson. Finding kth paths and p-centers by generating and searching good data structures. Journal of Algorithms, 4(1):61-80, 1983.
Travis Gagie, Danny Hermelin, Gad M Landau, and Oren Weimann. Binary jumbled pattern matching on trees and tree-like structures. Algorithmica, 73(3):571-588, 2015.
Davide Della Giustina, Nicola Prezza, and Rossano Venturini. A new linear-time algorithm for centroid decomposition. In Proceedings of the 26th International Symposium on String Processing and Information Retrieval (SPIRE), volume 11811 of Lecture Notes in Computer Science, pages 274-282. Springer, 2019.
Michael T. Goodrich and Roberto Tamassia. Dynamic trees and dynamic point location. SIAM J. Comput., 28(2):612-636, 1998. URL: https://doi.org/10.1137/S0097539793254376.
Leonidas J. Guibas, John Hershberger, Daniel Leven, Micha Sharir, and Robert Endre Tarjan. Linear-time algorithms for visibility and shortest path problems inside triangulated simple polygons. Algorithmica, 2:209-233, 1987. URL: https://doi.org/10.1007/BF01840360.
Brent Heeringa, Marius Catalin Iordan, and Louis Theran. Searching in dynamic tree-like partial orders. In WADS 2011, volume 6844 of Lecture Notes in Computer Science, pages 512-523. Springer, 2011. URL: https://doi.org/10.1007/978-3-642-22300-6_43.
D.S. Hirschberg, L.L. Larmore, and M. Molodowitch. Subtree weight ratios for optimal binary search trees. Technical Report TR 86-02, ICS Department, University of California, Irvine, 1986.
Ananth V. Iyer, H. Donald Ratliff, and Gopalakrishnan Vijayan. Optimal node ranking of trees. Inf. Process. Lett., 28(5):225-229, 1988. URL: https://doi.org/10.1016/0020-0190(88)90194-9.
Camille Jordan. Sur les assemblages de lignes. Journal für die reine und angewandte Mathematik, 70:185-190, 1869.
Meir Katchalski, William McCuaig, and Suzanne Seager. Ordered colourings. Discrete Mathematics, 142(1-3):141-154, 1995.
Donald E. Knuth. Optimum binary search trees. Acta Informatica, 1(1):14-25, 1971. URL: https://doi.org/10.1007/BF00264289.
Tomasz Kociumaka, Jakub Pachocki, Jakub Radoszewski, Wojciech Rytter, and Tomasz Waleń. Efficient counting of square substrings in a tree. Theoretical Computer Science, 544:60-73, 2014.
Eduardo Laber and Marco Molinaro. An approximation algorithm for binary searching in trees. Algorithmica, 59(4):601-620, 2011. URL: https://doi.org/10.1007/s00453-009-9325-0.
Eduardo Laber and Loana Nogueira. Fast searching in trees. Electronic Notes in Discrete Mathematics, 7:90-93, 2001. URL: https://doi.org/10.1016/S1571-0653(04)00232-X.
Lawrence L. Larmore. A subquadratic algorithm for constructing approximately optimal binary search trees. J. Algorithms, 8(4):579-591, 1987. URL: https://doi.org/10.1016/0196-6774(87)90052-6.
Charles E. Leiserson. Area-efficient graph layouts (for VLSI). In FOCS 1980, pages 270-281. IEEE Computer Society, 1980. URL: https://doi.org/10.1109/SFCS.1980.13.
Nathan Linial and Michael E. Saks. Every poset has a central element. J. Comb. Theory, Ser. A, 40(2):195-210, 1985. URL: https://doi.org/10.1016/0097-3165(85)90087-1.
Joseph W.H. Liu. The role of elimination trees in sparse factorization. SIAM journal on matrix analysis and applications, 11(1):134-172, 1990.
Kurt Mehlhorn. Nearly optimal binary search trees. Acta Informatica, 5(4):287-295, 1975.
Kurt Mehlhorn. A best possible bound for the weighted path length of binary search trees. SIAM Journal on Computing, pages 235-239, 1977.
Shay Mozes, Krzysztof Onak, and Oren Weimann. Finding an optimal tree searching strategy in linear time. In SODA 2008, pages 1096-1105. SIAM, 2008. URL: http://dl.acm.org/citation.cfm?id=1347082.1347202.
Jaroslav Nesetril and Patrice Ossona de Mendez. Sparsity - Graphs, Structures, and Algorithms, volume 28 of Algorithms and combinatorics. Springer, 2012. URL: https://doi.org/10.1007/978-3-642-27875-4.
Krzysztof Onak and Pawel Parys. Generalization of binary search: Searching in trees and forest-like partial orders. In FOCS 2006, pages 379-388, 2006. URL: https://doi.org/10.1109/FOCS.2006.32.
Alex Pothen, Horst D. Simon, and Kang-Pu Liou. Partitioning sparse matrices with eigenvectors of graphs. SIAM journal on matrix analysis and applications, 11(3):430-452, 1990.
Alejandro A. Schäffer. Optimal node ranking of trees in linear time. Information Processing Letters, 33(2):91-96, 1989.
Daniel Dominic Sleator and Robert Endre Tarjan. Self-adjusting binary search trees. J. ACM, 32(3):652-686, July 1985. URL: https://doi.org/10.1145/3828.3835.
