How Pseudo-Boolean Programming can help Genome Rearrangement Distance Computation

Sébastien Angibaud 1, * Guillaume Fertin 1 Irena Rusu 1 Stéphane Vialette 2
* Corresponding author
2 Algorithmics
LRI - Laboratoire de Recherche en Informatique
Abstract : Computing genomic distances between whole genomes is a fundamental problem in comparative genomics. Recent researches have resulted in different genomic distance definitions: number of breakpoints, number of common intervals, number of conserved intervals, Maximum Adjacency Disruption number (MAD), etc. Unfortunately, it turns out that, in presence of duplications, most problems are NP-hard, and hence several heuristics have been recently proposed. However, while it is relatively easy to compare heuristics between them, until now very little is known about the absolute accuracy of these heuristics. Therefore, there is a great need for algorithmic approaches that compute exact solutions for these genomic distances. In this paper, we present a novel generic pseudo-boolean approach for computing the exact genomic distance between two whole genomes in presence of duplications, and put strong emphasis on common intervals under the maximum matching model. Of particular importance, we show very strong evidence that the simple LCS heuristic provides very good results on a well-known public benchmark dataset of gamma-Proteobacteria.
Liste complète des métadonnées

Cited literature [15 references]  Display  Hide  Download

https://hal.archives-ouvertes.fr/hal-00418258
Contributor : Guillaume Fertin <>
Submitted on : Thursday, September 17, 2009 - 4:37:35 PM
Last modification on : Wednesday, May 23, 2018 - 3:44:02 PM
Document(s) archivé(s) le : Tuesday, June 15, 2010 - 10:04:48 PM

File

RECOMB-CG06.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : hal-00418258, version 1

Collections

Citation

Sébastien Angibaud, Guillaume Fertin, Irena Rusu, Stéphane Vialette. How Pseudo-Boolean Programming can help Genome Rearrangement Distance Computation. 4th RECOMB Comparative Genomics Satellite Workshop (RECOMB-CG 2006), 2007, Montréal, Canada. pp.75-86. ⟨hal-00418258⟩

Share

Metrics

Record views

486

Files downloads

220