Finding Maximum Common Contractions Between Phylogenetic Networks

Authors Bertrand Marchand , Nadia Tahiri , Olivier Tremblay-Savard , Manuel Lafond



PDF
Thumbnail PDF

File

LIPIcs.WABI.2024.16.pdf
  • Filesize: 1.19 MB
  • 24 pages

Document Identifiers

Author Details

Bertrand Marchand
  • Department of Computer Science, University of Sherbrooke, Canada
Nadia Tahiri
  • Department of Computer Science, University of Sherbrooke, Canada
Olivier Tremblay-Savard
  • Department of Computer Science, University of Manitoba, Winnipeg, Canada
Manuel Lafond
  • Department of Computer Science, University of Sherbrooke, Canada

Cite AsGet BibTex

Bertrand Marchand, Nadia Tahiri, Olivier Tremblay-Savard, and Manuel Lafond. Finding Maximum Common Contractions Between Phylogenetic Networks. In 24th International Workshop on Algorithms in Bioinformatics (WABI 2024). Leibniz International Proceedings in Informatics (LIPIcs), Volume 312, pp. 16:1-16:24, Schloss Dagstuhl – Leibniz-Zentrum für Informatik (2024)
https://doi.org/10.4230/LIPIcs.WABI.2024.16

Abstract

In this paper, we lay the groundwork on the comparison of phylogenetic networks based on edge contractions and expansions as edit operations, as originally proposed by Robinson and Foulds to compare trees. We prove that these operations connect the space of all phylogenetic networks on the same set of leaves, even if we forbid contractions that create cycles. This allows to define an operational distance on this space, as the minimum number of contractions and expansions required to transform one network into another. We highlight the difference between this distance and the computation of the maximum common contraction between two networks. Given its ability to outline a common structure between them, which can provide valuable biological insights, we study the algorithmic aspects of the latter. We first prove that computing a maximum common contraction between two networks is NP-hard, even when the maximum degree, the size of the common contraction, or the number of leaves is bounded. We also provide lower bounds to the problem based on the Exponential-Time Hypothesis. Nonetheless, we do provide a polynomial-time algorithm for weakly galled trees, a generalization of galled trees.

Subject Classification

ACM Subject Classification
  • Applied computing → Bioinformatics
  • Theory of computation → Problems, reductions and completeness
Keywords
  • Phylogenetic networks
  • contractions
  • algorithms
  • weakly galled trees

Metrics

  • Access Statistics
  • Total Accesses (updated on a weekly basis)
    0
    PDF Downloads

References

  1. Sophie S Abby, Eric Tannier, Manolo Gouy, and Vincent Daubin. Lateral gene transfer as a support for the tree of life. Proceedings of the National Academy of Sciences, 109(13):4962-4967, 2012. Google Scholar
  2. Akanksha Agrawal, Lawqueen Kanesh, Saket Saurabh, and Prafullkumar Tale. Paths to trees and cacti. Theoretical Computer Science, 860:98-116, 2021. Google Scholar
  3. Tatsuya Akutsu, Avraham A Melkman, and Takeyuki Tamura. Improved hardness of maximum common subgraph problems on labeled graphs of bounded treewidth and bounded degree. International Journal of Foundations of Computer Science, 31(02):253-273, 2020. Google Scholar
  4. Dhanyamol Antony, Yixin Cao, Sagartanu Pal, and RB Sandeep. Switching classes: Characterization and computation. arXiv preprint arXiv:2403.04263, 2024. Google Scholar
  5. Allan Bai, Péter L Erdős, Charles Semple, and Mike Steel. Defining phylogenetic networks using ancestral profiles. Mathematical Biosciences, 332:108537, 2021. Google Scholar
  6. Hans-Jurgen Bandelt, Peter Forster, and Arne Röhl. Median-joining networks for inferring intraspecific phylogenies. Molecular biology and evolution, 16(1):37-48, 1999. Google Scholar
  7. Rémy Belmonte, Petr A Golovach, Pim van’t Hof, and Daniël Paulusma. Parameterized complexity of three edge contraction problems with degree constraints. Acta Informatica, 51(7):473-497, 2014. Google Scholar
  8. Vincent Berry, Celine Scornavacca, and Mathias Weller. Scanning phylogenetic networks is np-hard. In SOFSEM 2020: Theory and Practice of Computer Science: 46th International Conference on Current Trends in Theory and Practice of Informatics, SOFSEM 2020, Limassol, Cyprus, January 20-24, 2020, Proceedings 46, pages 519-530. Springer, 2020. Google Scholar
  9. Alix Boc, Alpha Boubacar Diallo, and Vladimir Makarenkov. T-rex: a web server for inferring, validating and visualizing phylogenetic trees and networks. Nucleic acids research, 40(W1):W573-W579, 2012. Google Scholar
  10. Magnus Bordewich, Simone Linz, and Charles Semple. Lost in space? generalising subtree prune and regraft to spaces of phylogenetic networks. Journal of theoretical biology, 423:1-12, 2017. Google Scholar
  11. Magnus Bordewich and Charles Semple. On the computational complexity of the rooted subtree prune and regraft distance. Annals of combinatorics, 8:409-423, 2005. Google Scholar
  12. Andries Evert Brouwer and Henk Jan Veldman. Contractibility and np-completeness. Journal of Graph Theory, 11(1):71-79, 1987. Google Scholar
  13. Pablo G Cámara, Arnold J Levine, and Raul Rabadan. Inference of ancestral recombination graphs through topological data analysis. PLoS computational biology, 12(8):e1005071, 2016. Google Scholar
  14. Gabriel Cardona, Mercè Llabrés, Francesc Rosselló, and Gabriel Valiente. Metrics for phylogenetic networks i: Generalizations of the robinson-foulds metric. IEEE/ACM Transactions on Computational Biology and Bioinformatics, 6(1):46-61, 2008. Google Scholar
  15. Gabriel Cardona, Mercè Llabrés, Francesc Rosselló, and Gabriel Valiente. Metrics for phylogenetic networks ii: Nodal and triplets metrics. IEEE/ACM Transactions on Computational Biology and Bioinformatics, 6(3):454-469, 2008. Google Scholar
  16. Gabriel Cardona, Mercè Llabrés, Francesc Rosselló, Gabriel Valiente, et al. The comparison of tree-sibling time consistent phylogenetic networks is graph isomorphism-complete. The Scientific World Journal, 2014, 2014. Google Scholar
  17. Gabriel Cardona, Joan Carles Pons, Gerard Ribas, and Tomas Martınez Coronado. Comparison of orchard networks using their extended μ-representation. IEEE/ACM Transactions on Computational Biology and Bioinformatics, pages 1-8, 2024. URL: https://doi.org/10.1109/TCBB.2024.3361390.
  18. Gabriel Cardona, Francesc Rosselló, and Gabriel Valiente. Tripartitions do not always discriminate phylogenetic networks. Mathematical Biosciences, 211(2):356-370, 2008. Google Scholar
  19. Charles Choy, Jesper Jansson, Kunihiko Sadakane, and Wing-Kin Sung. Computing the maximum agreement of phylogenetic networks. Theoretical Computer Science, 335(1):93-107, 2005. Google Scholar
  20. Andreas Darmann and Janosch Döcker. On a simple hard variant of not-all-equal 3-sat. Theoretical Computer Science, 815:147-152, 2020. Google Scholar
  21. Bhaskar DasGupta, Xin He, Tao Jiang, Ming Li, John Tromp, and Louxin Zhang. On computing the nearest neighbor interchange distance. Computing, 23(22):21-26, 1998. Google Scholar
  22. Reinhard Diestel. Graph Theory. Springer, 2016. Google Scholar
  23. Norman C Ellstrand and Kristina A Schierenbeck. Hybridization as a stimulus for the evolution of invasiveness in plants? Proceedings of the National Academy of Sciences, 97(13):7043-7050, 2000. Google Scholar
  24. Peter Forster, Lucy Forster, Colin Renfrew, and Michael Forster. Phylogenetic network analysis of sars-cov-2 genomes. Proceedings of the National Academy of Sciences, 117(17):9241-9243, 2020. Google Scholar
  25. Peter Forster, Lucy Forster, Colin Renfrew, and Michael Forster. Reply to sanchez-pacheco et al., chookajorn, and mavian et al.: explaining phylogenetic network analysis of sars-cov-2 genomes. Proceedings of the National Academy of Sciences of the United States of America, 117(23):12524, 2020. Google Scholar
  26. Philippe Gambette, Leo Van Iersel, Mark Jones, Manuel Lafond, Fabio Pardi, and Celine Scornavacca. Rearrangement moves on rooted phylogenetic networks. PLoS computational biology, 13(8):e1005611, 2017. Google Scholar
  27. Michael R Garey and David S Johnson. Computers and intractability, volume 174. freeman San Francisco, 1979. Google Scholar
  28. Dan Gusfield, Satish Eddhu, and Charles Langley. Optimal, efficient reconstruction of phylogenetic networks with constrained recombination. Journal of bioinformatics and computational biology, 2(01):173-213, 2004. Google Scholar
  29. Marc Hellmuth, David Schaller, and Peter F Stadler. Clustering systems of phylogenetic networks. Theory in Biosciences, 142(4):301-358, 2023. Google Scholar
  30. Daniel H Huson, Regula Rupp, and Celine Scornavacca. Phylogenetic networks: concepts, algorithms and applications. Cambridge University Press, 2010. Google Scholar
  31. Marcin Kamiński, Daniël Paulusma, and Dimitrios M Thilikos. Contractions of planar graphs in polynomial time. In Algorithms-ESA 2010: 18th Annual European Symposium, Liverpool, UK, September 6-8, 2010. Proceedings, Part I 18, pages 122-133. Springer, 2010. Google Scholar
  32. Jonathan Klawitter. Spaces of phylogenetic networks. PhD thesis, ResearchSpace@ Auckland, 2020. Google Scholar
  33. Eugene V Koonin, Kira S Makarova, and L Aravind. Horizontal gene transfer in prokaryotes: quantification and classification. Annual Reviews in Microbiology, 55(1):709-742, 2001. Google Scholar
  34. Nils Kriege, Florian Kurpicz, and Petra Mutzel. On maximum common subgraph problems in series-parallel graphs. European Journal of Combinatorics, 68:79-95, 2018. Google Scholar
  35. Kaari Landry, Aivee Teodocio, Manuel Lafond, and Olivier Tremblay-Savard. Defining phylogenetic network distances using cherry operations. IEEE/ACM Transactions on Computational Biology and Bioinformatics, 2022. Google Scholar
  36. Kaari Landry, Olivier Tremblay-Savard, and Manuel Lafond. A fixed-parameter tractable algorithm for finding agreement cherry-reduced subnetworks in level-1 orchard networks. Journal of Computational Biology, 2023. Google Scholar
  37. Asaf Levin, Daniël Paulusma, and Gerhard J Woeginger. The computational complexity of graph contractions i: Polynomially solvable and np-complete cases. Networks: An International Journal, 51(3):178-189, 2008. Google Scholar
  38. Eugene M Luks. Isomorphism of graphs of bounded valence can be tested in polynomial time. Journal of computer and system sciences, 25(1):42-65, 1982. Google Scholar
  39. Bertrand Marchand, Yann Ponty, and Laurent Bulteau. Tree diet: reducing the treewidth to unlock fpt algorithms in rna bioinformatics. Algorithms for Molecular Biology, 17(1):8, 2022. Google Scholar
  40. Jiří Matoušek and Robin Thomas. On the complexity of finding iso-and other morphisms for partial k-trees. Discrete Mathematics, 108(1-3):343-364, 1992. Google Scholar
  41. Bernard ME Moret, Luay Nakhleh, Tandy Warnow, C Randal Linder, Anna Tholse, Anneke Padolina, Jerry Sun, and Ruth Timme. Phylogenetic networks: modeling, reconstructibility, and accuracy. IEEE/ACM Transactions on Computational Biology and Bioinformatics, 1(1):13-23, 2004. Google Scholar
  42. David F Robinson and Leslie R Foulds. Comparison of phylogenetic trees. Mathematical biosciences, 53(1-2):131-147, 1981. Google Scholar
  43. Francesc Rosselló and Gabriel Valiente. All that glisters is not galled. Mathematical biosciences, 221(1):54-59, 2009. Google Scholar
  44. Santiago J Sánchez-Pacheco, Sungsik Kong, Paola Pulido-Santacruz, Robert W Murphy, and Laura Kubatko. Median-joining network analysis of sars-cov-2 genomes is neither phylogenetic nor evolutionary. Proceedings of the National Academy of Sciences, 117(23):12518-12519, 2020. Google Scholar
  45. Bohdan Zelinka. Contraction distance between isomorphism classes of graphs. Časopis pro pěstování matematiky, 115(2):211-216, 1990. Google Scholar
Questions / Remarks / Feedback
X

Feedback for Dagstuhl Publishing


Thanks for your feedback!

Feedback submitted

Could not send message

Please try again later or send an E-mail