Genomes containing Duplicates are Hard to compare

Abstract : In this paper, we are interested in the algorithmic complexity of computing (dis)similarity measures between two genomes when they contain duplicated genes. In that case, there are usually two main ways to compute a given (dis)similarity measure M between two genomes G1 and G2: the rst model, that we will call the matching model, consists in making a one-to-one correspondence between genes of G1 and genes of G2, in such a way that M is optimized. The second model, called the exemplar model, consists in keeping in G1 (resp. G2) exactly one copy of each gene, thus deleting all the other copies, in such a way that M is optimized. We present here dierent results concerning the algorithmic complexity of computing three dierent similarity measures (number of common intervals, MAD number and SAD number) in those two models, basically showing that the problem becomes NP-complete for each of them as soon as genomes contain duplicates. We show indeed that for common intervals, MAD and SAD, the problem is NP-complete when genes are duplicated in genomes, in both the exemplar and matching models. In the case of MAD and SAD, we actually prove that, under both models, both MAD and SAD problems are APX-hard
Type de document :
Communication dans un congrès
International Workshop on Bioinformatics Research and Applications (IWBRA 2006), 2006, Reading, United Kingdom. Springer-Verlag, LNCS Vol. 3992, pp.783-790, 2006, Lecture Notes in Computer Science (LNCS)
Liste complète des métadonnées

Littérature citée [9 références]  Voir  Masquer  Télécharger

https://hal.archives-ouvertes.fr/hal-00418260
Contributeur : Guillaume Fertin <>
Soumis le : jeudi 17 septembre 2009 - 16:41:28
Dernière modification le : mercredi 23 mai 2018 - 15:44:02
Document(s) archivé(s) le : mardi 15 juin 2010 - 23:51:16

Fichier

DuplicatesIWBRA06.pdf
Fichiers produits par l'(les) auteur(s)

Identifiants

  • HAL Id : hal-00418260, version 1

Collections

Citation

Cedric Chauve, Guillaume Fertin, Romeo Rizzi, Stéphane Vialette. Genomes containing Duplicates are Hard to compare. International Workshop on Bioinformatics Research and Applications (IWBRA 2006), 2006, Reading, United Kingdom. Springer-Verlag, LNCS Vol. 3992, pp.783-790, 2006, Lecture Notes in Computer Science (LNCS). 〈hal-00418260〉

Partager

Métriques

Consultations de la notice

326

Téléchargements de fichiers

281