Largest Weight Common Subtree Embeddings with Distance Penalties

Authors Andre Droschinsky , Nils M. Kriege , Petra Mutzel



PDF
Thumbnail PDF

File

LIPIcs.MFCS.2018.54.pdf
  • Filesize: 0.49 MB
  • 15 pages

Document Identifiers

Author Details

Andre Droschinsky
  • Department of Computer Science, TU Dortmund University, Otto-Hahn-Str. 14, 44221 Dortmund, Germany
Nils M. Kriege
  • Department of Computer Science, TU Dortmund University, Otto-Hahn-Str. 14, 44221 Dortmund, Germany
Petra Mutzel
  • Department of Computer Science, TU Dortmund University, Otto-Hahn-Str. 14, 44221 Dortmund, Germany

Cite AsGet BibTex

Andre Droschinsky, Nils M. Kriege, and Petra Mutzel. Largest Weight Common Subtree Embeddings with Distance Penalties. In 43rd International Symposium on Mathematical Foundations of Computer Science (MFCS 2018). Leibniz International Proceedings in Informatics (LIPIcs), Volume 117, pp. 54:1-54:15, Schloss Dagstuhl – Leibniz-Zentrum für Informatik (2018)
https://doi.org/10.4230/LIPIcs.MFCS.2018.54

Abstract

The largest common embeddable subtree problem asks for the largest possible tree embeddable into two input trees and generalizes the classical maximum common subtree problem. Several variants of the problem in labeled and unlabeled rooted trees have been studied, e.g., for the comparison of evolutionary trees. We consider a generalization, where the sought embedding is maximal with regard to a weight function on pairs of labels. We support rooted and unrooted trees with vertex and edge labels as well as distance penalties for skipping vertices. This variant is important for many applications such as the comparison of chemical structures and evolutionary trees. Our algorithm computes the solution from a series of bipartite matching instances, which are solved efficiently by exploiting their structural relation and imbalance. Our analysis shows that our approach improves or matches the running time of the formally best algorithms for several problem variants. Specifically, we obtain a running time of O(|T| |T'|Delta) for two rooted or unrooted trees T and T', where Delta=min{Delta(T),Delta(T')} with Delta(X) the maximum degree of X. If the weights are integral and at most C, we obtain a running time of O(|T| |T'|sqrt Delta log (C min{|T|,|T'|})) for rooted trees.

Subject Classification

ACM Subject Classification
  • Theory of computation → Graph algorithms analysis
Keywords
  • maximum common subtree
  • largest embeddable subtree
  • topological embedding
  • maximum weight matching
  • subtree homeomorphism

Metrics

  • Access Statistics
  • Total Accesses (updated on a weekly basis)
    0
    PDF Downloads

References

  1. Moon Jung Chung. O(n^2.5) time algorithms for the subgraph homeomorphism problem on trees. Journal of Algorithms, 8(1):106-112, 1987. URL: http://dx.doi.org/10.1016/0196-6774(87)90030-7.
  2. James Trimble Ciaran McCreesh, Patrick Prosser. A partitioning algorithm for maximum common subgraph problems. In Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, IJCAI-17, pages 712-719, 2017. URL: http://dx.doi.org/10.24963/ijcai.2017/99.
  3. Andre Droschinsky, Nils Kriege, and Petra Mutzel. Finding Largest Common Substructures of Molecules in Quadratic Time, pages 309-321. Springer International Publishing, Cham, 2017. URL: http://dx.doi.org/10.1007/978-3-319-51963-0_24.
  4. Andre Droschinsky, Nils M. Kriege, and Petra Mutzel. Faster Algorithms for the Maximum Common Subtree Isomorphism Problem. In Piotr Faliszewski, Anca Muscholl, and Rolf Niedermeier, editors, 41st International Symposium on Mathematical Foundations of Computer Science (MFCS 2016), volume 58 of Leibniz International Proceedings in Informatics (LIPIcs), pages 33:1-33:14, Dagstuhl, Germany, 2016. Schloss Dagstuhl-Leibniz-Zentrum fuer Informatik. URL: http://dx.doi.org/10.4230/LIPIcs.MFCS.2016.33.
  5. Hans-Christian Ehrlich and Matthias Rarey. Maximum common subgraph isomorphism algorithms and their applications in molecular science: a review. Wiley Interdisciplinary Reviews: Computational Molecular Science, 1(1):68-79, 2011. URL: http://dx.doi.org/10.1002/wcms.5.
  6. Michael R. Garey and David S. Johnson. Computers and Intractability: A Guide to the Theory of NP-Completeness. W. H. Freeman, 1979. Google Scholar
  7. Andrew V. Goldberg, Sagi Hed, Haim Kaplan, and Robert E. Tarjan. Minimum-cost flows in unit-capacity networks. Theory of Computing Systems, 61(4):987-1010, Nov 2017. URL: http://dx.doi.org/10.1007/s00224-017-9776-7.
  8. Arvind Gupta and Naomi Nishimura. Finding largest subtrees and smallest supertrees. Algorithmica, 21:183-210, 1998. URL: http://dx.doi.org/10.1007/PL00009212.
  9. Ming-Yang Kao, Tak-Wah Lam, Wing-Kin Sung, and Hing-Fung Ting. An even faster and more unifying algorithm for comparing trees via unbalanced bipartite matchings. Journal of Algorithms, 40(2):212-233, 2001. URL: http://dx.doi.org/10.1006/jagm.2001.1163.
  10. Antoni Lozano and Gabriel Valiente. On the maximum common embedded subtree problem for ordered trees. In In C. Iliopoulos and T Lecroq, editors, String Algorithmics, chapter 7. King’s College London Publications, 2004. Google Scholar
  11. Daniel M. Martin and Bhalchandra D. Thatte. The maximum agreement subtree problem. Discrete Applied Mathematics, 161(13-14):1805-1817, 2013. URL: http://dx.doi.org/10.1016/j.dam.2013.02.037.
  12. David W. Matula. Subtree isomorphism in O(n^5/2). In P. Hell B. Alspach and D.J. Miller, editors, Algorithmic Aspects of Combinatorics, volume 2 of Annals of Discrete Mathematics, pages 91-106. Elsevier, 1978. URL: http://dx.doi.org/10.1016/S0167-5060(08)70324-8.
  13. L. Ramshaw and R. E. Tarjan. A weight-scaling algorithm for min-cost imperfect matchings in bipartite graphs. In 2012 IEEE 53rd Annual Symposium on Foundations of Computer Science, pages 581-590, Oct 2012. URL: http://dx.doi.org/10.1109/FOCS.2012.9.
  14. Lyle Ramshaw and Robert Tarjan. On minimum-cost assignments in unbalanced bipartite graphs, 04 2012. URL: http://www.hpl.hp.com/techreports/2012/HPL-2012-40R1.html.
  15. Matthias Rarey and J. Scott Dixon. Feature trees: A new molecular similarity measure based on tree matching. Journal of Computer-Aided Molecular Design, 12:471-490, 1998. URL: http://dx.doi.org/10.1023/A:1008068904628.
  16. John W. Raymond and Peter Willett. Maximum common subgraph isomorphism algorithms for the matching of chemical structures. Journal of Computer-Aided Molecular Design, 16(7):521-533, 2002. URL: http://ipsapp008.lwwonline.com/content/getfile/4830/45/6/abstract.htm.
  17. Leander Schietgat, Jan Ramon, and Maurice Bruynooghe. A polynomial-time maximum common subgraph algorithm for outerplanar graphs and its application to chemoinformatics. Annals of Mathematics and Artificial Intelligence, 69(4):343-376, 2013. URL: http://dx.doi.org/10.1007/s10472-013-9335-0.
  18. Gabriel Valiente. Algorithms on Trees and Graphs. Springer-Verlag, Berlin, 2002. Google Scholar
  19. Atsuko Yamaguchi, Kiyoko F. Aoki, and Hiroshi Mamitsuka. Finding the maximum common subgraph of a partial k-tree and a graph with a polynomially bounded number of spanning trees. Inf. Process. Lett., 92(2):57-63, 2004. URL: http://dx.doi.org/10.1016/j.ipl.2004.06.019.
Questions / Remarks / Feedback
X

Feedback for Dagstuhl Publishing


Thanks for your feedback!

Feedback submitted

Could not send message

Please try again later or send an E-mail