A General Framework for Computing Rearrangement Distances between Genomes with Duplicates

Abstract : Computing genomic distances between whole genomes is a fundamental problem in comparative genomics. Recent researches have resulted in different genomic distance definitions: number of breakpoints, number of common intervals, number of conserved intervals, Maximum Adjacency Disruption number (MAD), etc. Unfortunately, it turns out that, in presence of duplications, most problems are NP-hard, and hence several heuristics have been recently proposed. However, while it is relatively easy to compare heuristics between them, until now very little is known about the absolute accuracy of these heuristics. Therefore, there is a great need for algorithmic approaches that compute exact solutions for these genomic distances. In this paper, we present a novel generic pseudo-boolean approach for computing the exact genomic distance between two whole genomes in presence of duplications, and put strong emphasis on common intervals under the maximum matching model. Of particular importance, we show three heuristics which provide very good results on a well-known public dataset of gamma-Proteobacteria.
Type de document :
Article dans une revue
Journal of Computational Biology, Mary Ann Liebert, 2007, 14 (4), pp.379-393
Liste complète des métadonnées

Littérature citée [22 références]  Voir  Masquer  Télécharger

https://hal.archives-ouvertes.fr/hal-00417735
Contributeur : Guillaume Fertin <>
Soumis le : mercredi 16 septembre 2009 - 16:56:28
Dernière modification le : mercredi 23 mai 2018 - 15:44:02
Document(s) archivé(s) le : mardi 15 juin 2010 - 23:46:35

Fichier

PSBJCB.pdf
Fichiers produits par l'(les) auteur(s)

Identifiants

  • HAL Id : hal-00417735, version 1

Citation

Sébastien Angibaud, Guillaume Fertin, Irena Rusu, Stéphane Vialette. A General Framework for Computing Rearrangement Distances between Genomes with Duplicates. Journal of Computational Biology, Mary Ann Liebert, 2007, 14 (4), pp.379-393. 〈hal-00417735〉

Partager

Métriques

Consultations de la notice

264

Téléchargements de fichiers

159