Incomplete Directed Perfect Phylogeny in linear time

Bernardini, Giulia; Bonizzoni, Paola; Gawrychowski, Paweł

doi:10.1007/978-3-030-83508-8_13

G. Bernardini (Giulia), P. Bonizzoni (Paola) and P. Gawrychowski (Paweł)

2021-07-31

Incomplete Directed Perfect Phylogeny in linear time

Presented at the 17th Algorithms and Data Structures Symposium (August 2021), Virtual, Online

Reconstructing the evolutionary history of a set of species is a central task in computational biology. In real data, it is often the case that some information is missing: the Incomplete Directed Perfect Phylogeny (IDPP) problem asks, given a collection of species described by a set of binary characters with some unknown states, to complete the missing states in such a way that the result can be explained with a directed perfect phylogeny. Pe’er et al. [SICOMP 2004] proposed a solution that takes O~ (nm) time (the O~ (· ) notation suppresses polylog factors) for n species and m characters. Their algorithm relies on pre-existing dynamic connectivity data structures: a computational study recently conducted by Fernández-Baca and Liu showed that, in this context, complex data structures perform worse than simpler ones with worse asymptotic bounds. This gives us the motivation to look into the particular properties of the dynamic connectivity problem in this setting, so as to avoid the use of sophisticated data structures as a blackbox. Not only are we successful in doing so, and give a much simpler O(nmlog n) -time algorithm for the IDPP problem; our insights into the specific structure of the problem lead to an asymptotically optimal O(nm) -time algorithm.

Additional Metadata
Persistent URL	doi.org/10.1007/978-3-030-83508-8_13
Series	Lecture Notes in Computer Science/Lecture Notes in Artificial Intelligence
Project	Optimization for and with Machine Learning , Pan-genome Graph Algorithms and Data Integration
Conference	17th Algorithms and Data Structures Symposium
Grant	This work was funded by the The Netherlands Organisation for Scientific Research (NWO); grant id nwo/OCENW.2019.015 - Optimization for and with Machine Learning, This work was funded by the European Commission 7th Framework Programme; grant id h2020/872539 - Pan-genome Graph Algorithms and Data Integration (PANGAIA)
Organisation	Centrum Wiskunde & Informatica, Amsterdam (CWI), The Netherlands
Citation APA Style AAA Style APA Style Cell Style Chicago Style Harvard Style IEEE Style MLA Style Nature Style Vancouver Style American-Institute-of-Physics Style Council-of-Science-Editors Style BibTex Format Endnote Format RIS Format CSL Format DOIs only Format	Bernardini, G., Bonizzoni, P., & Gawrychowski, P. (2021). Incomplete Directed Perfect Phylogeny in linear time. In Workshop on Algorithms and Data Structures (pp. 172–185). doi:10.1007/978-3-030-83508-8_13

View at Publisher

Full Text ( Final Version , 535kb )

Incomplete Directed Perfect Phylogeny in linear time

Publication

Publication

Address

CWI researchers

Questions or comments?

Incomplete Directed Perfect Phylogeny in linear time

Publication

Publication

Workflow

Workflow

Add Content