Resolution and reconciliation of non-binary gene trees with transfers, duplications and losses

Edwin Jacox (1); Mathias Weller (2, 3); Eric Tannier (4, 5); Celine Scornavacca (1, 2)
3 MAB - Méthodes et Algorithmes pour la Bioinformatique
LIRMM - Laboratoire d'Informatique de Robotique et de Microélectronique de Montpellier
5 BEAGLE - Artificial Evolution and Computational Biology
LBBE - Laboratoire de Biométrie et Biologie Evolutive - UMR 5558, Inria Grenoble - Rhône-Alpes, LIRIS - Laboratoire d'InfoRmatique en Image et Systèmes d'information
Abstract: Gene trees reconstructed from sequence alignments contain poorly supported branches when the phylogenetic signal in the sequences is insufficient to determine them all. When a species tree is available, the signal of gene gains and losses can be used to correctly resolve the unsupported parts of the gene history. However, finding a most parsimonious binary resolution of a non-binary tree obtained by contracting the unsupported branches is NP-hard if transfer events are considered as possible gene-scale events, in addition to gene origination, duplication and loss. We propose an exact, parameterized algorithm that solves this problem in single-exponential time, where the parameter is the number of connected branches of the gene tree that show low support from the sequence alignment or, equivalently, the maximum number of children of any node of the gene tree once the low-support branches have been collapsed. This improves on the best known algorithm by an exponential factor. We propose a way to choose among optimal solutions based on the available information. We show the usability of this principle on several simulated and biological datasets. The results are comparable in quality to those of several other tested methods with similar goals, but our approach provides a lower running time and a guarantee that the produced solution is optimal.
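To make the parameter in the abstract concrete, here is a minimal Python sketch that contracts low-support branches of a gene tree and reports the maximum number of children of any node afterwards. It is an illustration only, not the paper's algorithm or software: the `Node` class, the function names `collapse_low_support` and `max_children`, the 0.5 support threshold, and the example tree are all hypothetical, and the sketch performs no reconciliation with a species tree.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class Node:
    """Hypothetical gene-tree node; `support` refers to the branch above it."""
    name: str = ""
    support: Optional[float] = None
    children: List["Node"] = field(default_factory=list)


def collapse_low_support(node: Node, threshold: float) -> Node:
    """Return a copy of the tree with low-support internal branches contracted."""
    new_children: List[Node] = []
    for child in node.children:
        collapsed = collapse_low_support(child, threshold)
        is_internal = bool(collapsed.children)
        low = collapsed.support is not None and collapsed.support < threshold
        if is_internal and low:
            # Contract the branch: attach the grandchildren directly here.
            new_children.extend(collapsed.children)
        else:
            new_children.append(collapsed)
    return Node(node.name, node.support, new_children)


def max_children(node: Node) -> int:
    """Maximum number of children over all nodes (0 for a single leaf)."""
    if not node.children:
        return 0
    return max(len(node.children), max(max_children(c) for c in node.children))


# Hypothetical example: (((a,b)0.2,c)0.3,d) with an assumed threshold of 0.5.
leaf = lambda n: Node(name=n)
inner = Node(support=0.2, children=[leaf("a"), leaf("b")])
outer = Node(support=0.3, children=[inner, leaf("c")])
root = Node(children=[outer, leaf("d")])

collapsed_root = collapse_low_support(root, threshold=0.5)
print(max_children(root), max_children(collapsed_root))  # prints "2 4"
```

In this toy case both internal branches fall below the assumed threshold, so the binary tree becomes a single node with four children; that count is the quantity the abstract uses as the parameter of the algorithm.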
Document type: Journal article

https://hal.archives-ouvertes.fr/hal-01526643
Contributor : Eric Tannier
Submitted on : Tuesday, May 23, 2017 - 12:18:29 PM
Last modification on : Wednesday, September 4, 2019 - 1:40:44 PM

Citation

Edwin Jacox, Mathias Weller, Eric Tannier, Celine Scornavacca. Resolution and reconciliation of non-binary gene trees with transfers, duplications and losses. Bioinformatics, Oxford University Press (OUP), 2017, 33 (7), pp.980-987. ⟨10.1093/bioinformatics/btw778⟩. ⟨hal-01526643⟩
