Consensus Clusters in Robinson-Foulds Reticulation Networks

Markin, Alexey; Eulenstein, Oliver

doi:10.4230/LIPIcs.WABI.2019.12

File

Author Details

Alexey Markin

Department of Computer Science, Iowa State University, Ames, IA, USA

Oliver Eulenstein

Department of Computer Science, Iowa State University, Ames, IA, USA

Cite As Get BibTex

Alexey Markin and Oliver Eulenstein. Consensus Clusters in Robinson-Foulds Reticulation Networks. In 19th International Workshop on Algorithms in Bioinformatics (WABI 2019). Leibniz International Proceedings in Informatics (LIPIcs), Volume 143, pp. 12:1-12:12, Schloss Dagstuhl – Leibniz-Zentrum für Informatik (2019) https://doi.org/10.4230/LIPIcs.WABI.2019.12

Abstract

Inference of phylogenetic networks - the evolutionary histories of species involving speciation as well as reticulation events - has proved to be an extremely challenging problem even for smaller datasets easily tackled by supertree inference methods. An effective way to boost the scalability of distance-based supertree methods originates from the Pareto (for clusters) property, which is a highly desirable property for phylogenetic consensus methods. In particular, one can employ strict consensus merger algorithms to boost the scalability and accuracy of supertree methods satisfying Pareto; cf. SuperFine. In this work, we establish a Pareto-like property for phylogenetic networks. Then we consider the recently introduced RF-Net method that heuristically solves the so-called RF-Network problem and which was demonstrated to be an efficient and effective tool for the inference of hybridization and reassortment networks. As our main result, we provide a constructive proof (entailing an explicit refinement algorithm) that the Pareto property applies to the RF-Network problem when the solution space is restricted to the popular class of tree-child networks. This result implies that strict consensus merger strategies, similar to SuperFine, can be directly applied to boost both accuracy and scalability of RF-Net significantly. Finally, we further investigate the optimum solutions to the RF-Network problem; in particular, we describe structural properties of all optimum (tree-child) RF-networks in relation to strict consensus clusters of the input trees.

Subject Classification

ACM Subject Classification

Applied computing → Computational biology
Mathematics of computing → Graph theory

Keywords

Phylogenetics
phylogenetic tree
phylogenetic network
reticulation network
Robinson-Foulds
Pareto
RF-Net

Metrics

Access Statistics
Total Accesses (updated on a weekly basis)

0

PDF Downloads

0

Metadata Views

References

Mukul S. Bansal, J. Gordon Burleigh, Oliver Eulenstein, and David Fernández-Baca. Robinson-Foulds supertrees. Algorithms Mol Biol, 5:18, 2010.
Mihaela Baroni, Charles Semple, and Mike Steel. A framework for representing reticulate evolution. Annals of Combinatorics, 8(4):391-408, 2005.
Olaf R.P. Bininda-Emonds, editor. Phylogenetic Supertrees: Combining Information to Reveal the Tree of Life, volume 4 of Computational Biology. Springer Verlag, 2004.
Olaf R.P. Bininda-Emonds, John L. Gittleman, and Mike A. Steel. The (Super)tree of life: Procedures, problems, and prospects. The (Super)tree of life: Procedures, problems, and prospects, 33:265-289, 2002.
Magnus Bordewich, Simone Linz, and Charles Semple. Lost in space? Generalising subtree prune and regraft to spaces of phylogenetic networks. J Theor Biol, 423:1-12, 2017.
David Bryant. A classification of consensus methods for phylogenetics. In M. Janowitz, F.-J. Lapointe, F. McMorris, B. Mirkin, and F. Roberts, editors, Discrete Mathematics and Theoretical Computer Science, volume 61, pages 163-185. Am Math Soc, Providence, RI, 2003.
Gabriel Cardona, Francesc Rossello, and Gabriel Valiente. Comparison of tree-child phylogenetic networks. IEEE/ACM TCBB, 6(4):552-569, 2009.
Wen-Chieh Chang, Paweł Górecki, and Oliver Eulenstein. Exact solutions for species tree inference from discordant gene trees. J Bioinform Comput Biol, 11(5):1342005, 2013.
Cedric Chauve, Nadia El-Mabrouk, Laurent Guéguen, Magali Semeria, and Eric Tannier. Duplication, Rearrangement and Reconciliation: A Follow-Up 13 Years Later, pages 47-62. Springer, London, 2013.
Theodosius Dobzhansky. Nothing in biology makes sense except in the light of evolution. The american biology teacher, 75(2):87-91, 2013.
Juliane C Dohm, André E Minoche, Daniela Holtgräwe, Salvador Capella-Gutiérrez, Falk Zakrzewski, Hakim Tafer, Oliver Rupp, Thomas Rosleff Sörensen, Ralf Stracke, Richard Reinhardt, et al. The genome of the recently domesticated crop plant sugar beet (Beta vulgaris). Nature, 505(7484):546, 2014.
Oliver Eulenstein, Snehalata Huzurbazar, and David A Liberles. Evolution after Gene Duplication, chapter Reconciling Phylogenetic Trees, pages 185-206. John Wiley, 2010.
Daniel H Huson, Scott M Nettles, and Tandy J Warnow. Disk-covering, a fast-converging method for phylogenetic tree reconstruction. J Comput Biol, 6(3-4):369-386, 1999.
Daniel H Huson, Regula Rupp, and Celine Scornavacca. Phylogenetic networks: concepts, algorithms and applications. Cambridge University Press, 2010.
Harris T. Lin, J. Gordon Burleigh, and Oliver Eulenstein. Consensus properties for the deep coalescence problem and their application for scalable tree search. BMC Bioinf, 13 Suppl 10:S12, 2012.
Bin Ma, Ming Li, and Louxin Zhang. From Gene Trees to Species Trees. SIAM J. Comput., 30(3):729-752, 2000. URL: https://doi.org/10.1137/S0097539798343362.
Alexey Markin, Tavis K Anderson, Venkata SKT Vadali, and Oliver Eulenstein. Robinson-Foulds Reticulation Networks. bioRxiv, 2019. URL: https://doi.org/10.1101/642793.
Andreu Mas-Colell, Michael Dennis Whinston, Jerry R Green, et al. Microeconomic theory, volume 1. Oxford university press New York, 1995.
Siavash Mirarab, Rezwana Reaz, Md S Bayzid, Théo Zimmermann, M Shel Swenson, and Tandy Warnow. ASTRAL: genome-scale coalescent-based species tree estimation. Bioinformatics, 30(17):541-548, 2014.
Jucheol Moon and Oliver Eulenstein. Synthesizing large-scale species trees using the strict consensus approach. J Bioinform Comput Biol, 15(03), 2017.
Jucheol Moon, Harris T Lin, and Oliver Eulenstein. Consensus properties and their large-scale applications for the gene duplication problem. J Bioinform Comput Biol, 14(03), 2016.
Thomas J Near, Ron I Eytan, Alex Dornburg, Kristen L Kuhn, Jon A Moore, Matthew P Davis, Peter C Wainwright, Matt Friedman, and W Leo Smith. Resolution of ray-finned fish phylogeny and timing of diversification. PNAS, 109(34):13698-13703, 2012.
DA Neumann. Faithful consensus methods for n-trees. Math Biosci, 63(2):271-287, 1983.
David F Robinson and Leslie R Foulds. Comparison of phylogenetic trees. Math Biosci, 53:131-147, 1981.
Mike Steel and Allen Rodrigo. Maximum likelihood supertrees. Syst Biol, 57(2):243-250, April 2008.
M Shel Swenson, Rahul Suri, C Randal Linder, and Tandy Warnow. SuperFine: fast and accurate supertree estimation. Syst Biol, 61(2):214, 2011.
Pranjal Vachaspati and Tandy Warnow. FastRFS: fast and accurate Robinson-Foulds Supertrees using constrained exact optimization. Bioinformatics, 33(5):631-639, 2016.
André Wehe, Mukul S Bansal, J Gordon Burleigh, and Oliver Eulenstein. DupTree: a program for large-scale phylogenetic analyses using gene tree parsimony. Bioinformatics, 24(13):1540-1541, 2008.
Yun Yu, R. Matthew Barnett, and Luay Nakhleh. Parsimonious Inference of Hybridization in the Presence of Incomplete Lineage Sorting. Syst Biol, 62(5):738-751, 2013.

Consensus Clusters in Robinson-Foulds Reticulation Networks

Authors Alexey Markin , Oliver Eulenstein

File

Document Identifiers

Author Details

Acknowledgements

Cite As Get BibTex

Abstract

Subject Classification

ACM Subject Classification

Keywords

Metrics

References

Thanks for your feedback!

Could not send message

Consensus Clusters in Robinson-Foulds Reticulation Networks

Authors Alexey Markin , Oliver Eulenstein

File

Document Identifiers

Author Details

Funding

Acknowledgements

Cite As Get BibTex

Abstract

Subject Classification

ACM Subject Classification

Keywords

Metrics

References

Thanks for your feedback!

Could not send message