Consensus Clusters in Robinson-Foulds Reticulation Networks

Authors Alexey Markin , Oliver Eulenstein



PDF
Thumbnail PDF

File

LIPIcs.WABI.2019.12.pdf
  • Filesize: 487 kB
  • 12 pages

Document Identifiers

Author Details

Alexey Markin
  • Department of Computer Science, Iowa State University, Ames, IA, USA
Oliver Eulenstein
  • Department of Computer Science, Iowa State University, Ames, IA, USA

Acknowledgements

The authors want to thank the reviewers for their valuable and constructive comments.

Cite AsGet BibTex

Alexey Markin and Oliver Eulenstein. Consensus Clusters in Robinson-Foulds Reticulation Networks. In 19th International Workshop on Algorithms in Bioinformatics (WABI 2019). Leibniz International Proceedings in Informatics (LIPIcs), Volume 143, pp. 12:1-12:12, Schloss Dagstuhl – Leibniz-Zentrum für Informatik (2019)
https://doi.org/10.4230/LIPIcs.WABI.2019.12

Abstract

Inference of phylogenetic networks - the evolutionary histories of species involving speciation as well as reticulation events - has proved to be an extremely challenging problem even for smaller datasets easily tackled by supertree inference methods. An effective way to boost the scalability of distance-based supertree methods originates from the Pareto (for clusters) property, which is a highly desirable property for phylogenetic consensus methods. In particular, one can employ strict consensus merger algorithms to boost the scalability and accuracy of supertree methods satisfying Pareto; cf. SuperFine. In this work, we establish a Pareto-like property for phylogenetic networks. Then we consider the recently introduced RF-Net method that heuristically solves the so-called RF-Network problem and which was demonstrated to be an efficient and effective tool for the inference of hybridization and reassortment networks. As our main result, we provide a constructive proof (entailing an explicit refinement algorithm) that the Pareto property applies to the RF-Network problem when the solution space is restricted to the popular class of tree-child networks. This result implies that strict consensus merger strategies, similar to SuperFine, can be directly applied to boost both accuracy and scalability of RF-Net significantly. Finally, we further investigate the optimum solutions to the RF-Network problem; in particular, we describe structural properties of all optimum (tree-child) RF-networks in relation to strict consensus clusters of the input trees.

Subject Classification

ACM Subject Classification
  • Applied computing → Computational biology
  • Mathematics of computing → Graph theory
Keywords
  • Phylogenetics
  • phylogenetic tree
  • phylogenetic network
  • reticulation network
  • Robinson-Foulds
  • Pareto
  • RF-Net

Metrics

  • Access Statistics
  • Total Accesses (updated on a weekly basis)
    0
    PDF Downloads

References

  1. Mukul S. Bansal, J. Gordon Burleigh, Oliver Eulenstein, and David Fernández-Baca. Robinson-Foulds supertrees. Algorithms Mol Biol, 5:18, 2010. Google Scholar
  2. Mihaela Baroni, Charles Semple, and Mike Steel. A framework for representing reticulate evolution. Annals of Combinatorics, 8(4):391-408, 2005. Google Scholar
  3. Olaf R.P. Bininda-Emonds, editor. Phylogenetic Supertrees: Combining Information to Reveal the Tree of Life, volume 4 of Computational Biology. Springer Verlag, 2004. Google Scholar
  4. Olaf R.P. Bininda-Emonds, John L. Gittleman, and Mike A. Steel. The (Super)tree of life: Procedures, problems, and prospects. The (Super)tree of life: Procedures, problems, and prospects, 33:265-289, 2002. Google Scholar
  5. Magnus Bordewich, Simone Linz, and Charles Semple. Lost in space? Generalising subtree prune and regraft to spaces of phylogenetic networks. J Theor Biol, 423:1-12, 2017. Google Scholar
  6. David Bryant. A classification of consensus methods for phylogenetics. In M. Janowitz, F.-J. Lapointe, F. McMorris, B. Mirkin, and F. Roberts, editors, Discrete Mathematics and Theoretical Computer Science, volume 61, pages 163-185. Am Math Soc, Providence, RI, 2003. Google Scholar
  7. Gabriel Cardona, Francesc Rossello, and Gabriel Valiente. Comparison of tree-child phylogenetic networks. IEEE/ACM TCBB, 6(4):552-569, 2009. Google Scholar
  8. Wen-Chieh Chang, Paweł Górecki, and Oliver Eulenstein. Exact solutions for species tree inference from discordant gene trees. J Bioinform Comput Biol, 11(5):1342005, 2013. Google Scholar
  9. Cedric Chauve, Nadia El-Mabrouk, Laurent Guéguen, Magali Semeria, and Eric Tannier. Duplication, Rearrangement and Reconciliation: A Follow-Up 13 Years Later, pages 47-62. Springer, London, 2013. Google Scholar
  10. Theodosius Dobzhansky. Nothing in biology makes sense except in the light of evolution. The american biology teacher, 75(2):87-91, 2013. Google Scholar
  11. Juliane C Dohm, André E Minoche, Daniela Holtgräwe, Salvador Capella-Gutiérrez, Falk Zakrzewski, Hakim Tafer, Oliver Rupp, Thomas Rosleff Sörensen, Ralf Stracke, Richard Reinhardt, et al. The genome of the recently domesticated crop plant sugar beet (Beta vulgaris). Nature, 505(7484):546, 2014. Google Scholar
  12. Oliver Eulenstein, Snehalata Huzurbazar, and David A Liberles. Evolution after Gene Duplication, chapter Reconciling Phylogenetic Trees, pages 185-206. John Wiley, 2010. Google Scholar
  13. Daniel H Huson, Scott M Nettles, and Tandy J Warnow. Disk-covering, a fast-converging method for phylogenetic tree reconstruction. J Comput Biol, 6(3-4):369-386, 1999. Google Scholar
  14. Daniel H Huson, Regula Rupp, and Celine Scornavacca. Phylogenetic networks: concepts, algorithms and applications. Cambridge University Press, 2010. Google Scholar
  15. Harris T. Lin, J. Gordon Burleigh, and Oliver Eulenstein. Consensus properties for the deep coalescence problem and their application for scalable tree search. BMC Bioinf, 13 Suppl 10:S12, 2012. Google Scholar
  16. Bin Ma, Ming Li, and Louxin Zhang. From Gene Trees to Species Trees. SIAM J. Comput., 30(3):729-752, 2000. URL: https://doi.org/10.1137/S0097539798343362.
  17. Alexey Markin, Tavis K Anderson, Venkata SKT Vadali, and Oliver Eulenstein. Robinson-Foulds Reticulation Networks. bioRxiv, 2019. URL: https://doi.org/10.1101/642793.
  18. Andreu Mas-Colell, Michael Dennis Whinston, Jerry R Green, et al. Microeconomic theory, volume 1. Oxford university press New York, 1995. Google Scholar
  19. Siavash Mirarab, Rezwana Reaz, Md S Bayzid, Théo Zimmermann, M Shel Swenson, and Tandy Warnow. ASTRAL: genome-scale coalescent-based species tree estimation. Bioinformatics, 30(17):541-548, 2014. Google Scholar
  20. Jucheol Moon and Oliver Eulenstein. Synthesizing large-scale species trees using the strict consensus approach. J Bioinform Comput Biol, 15(03), 2017. Google Scholar
  21. Jucheol Moon, Harris T Lin, and Oliver Eulenstein. Consensus properties and their large-scale applications for the gene duplication problem. J Bioinform Comput Biol, 14(03), 2016. Google Scholar
  22. Thomas J Near, Ron I Eytan, Alex Dornburg, Kristen L Kuhn, Jon A Moore, Matthew P Davis, Peter C Wainwright, Matt Friedman, and W Leo Smith. Resolution of ray-finned fish phylogeny and timing of diversification. PNAS, 109(34):13698-13703, 2012. Google Scholar
  23. DA Neumann. Faithful consensus methods for n-trees. Math Biosci, 63(2):271-287, 1983. Google Scholar
  24. David F Robinson and Leslie R Foulds. Comparison of phylogenetic trees. Math Biosci, 53:131-147, 1981. Google Scholar
  25. Mike Steel and Allen Rodrigo. Maximum likelihood supertrees. Syst Biol, 57(2):243-250, April 2008. Google Scholar
  26. M Shel Swenson, Rahul Suri, C Randal Linder, and Tandy Warnow. SuperFine: fast and accurate supertree estimation. Syst Biol, 61(2):214, 2011. Google Scholar
  27. Pranjal Vachaspati and Tandy Warnow. FastRFS: fast and accurate Robinson-Foulds Supertrees using constrained exact optimization. Bioinformatics, 33(5):631-639, 2016. Google Scholar
  28. André Wehe, Mukul S Bansal, J Gordon Burleigh, and Oliver Eulenstein. DupTree: a program for large-scale phylogenetic analyses using gene tree parsimony. Bioinformatics, 24(13):1540-1541, 2008. Google Scholar
  29. Yun Yu, R. Matthew Barnett, and Luay Nakhleh. Parsimonious Inference of Hybridization in the Presence of Incomplete Lineage Sorting. Syst Biol, 62(5):738-751, 2013. Google Scholar
Questions / Remarks / Feedback
X

Feedback for Dagstuhl Publishing


Thanks for your feedback!

Feedback submitted

Could not send message

Please try again later or send an E-mail