Gapped consensus motif discovery: evaluation of a new algorithm based on local multiple alignments and a sampling strategy - Archive ouverte HAL Accéder directement au contenu
Pré-Publication, Document De Travail Année : 2006

Gapped consensus motif discovery: evaluation of a new algorithm based on local multiple alignments and a sampling strategy

Résumé

We check the efficiency and faisability of a novel method designed for the discovery of a priori unknown motifs described as gaps alternating with specific regions. Such motifs are searched for as consensi of non homologous biological sequences. The only specifications required concern the maximal gap length, the minimal frequency for specific characters and the minimal percentage (quorum) of sequences sharing the motif. Our method is based on a cooperation between a multiple alignment method for a quick detection of local similarities and a sampling strategy running candidate position specific scoring matrices to convergence. This rather original way implemented for converging to the solution proves efficient both on simulated data, gapped instances of the so-called challenge problem, promoter sites in Dicot plants and transcription factor binding sites in E.Coli. Our algorithm compares favorably with the MEME and STARS approaches in terms of accuracy.
Fichier principal
Vignette du fichier
rapport_rr_sinoquet_20_04_05.pdf (1.25 Mo) Télécharger le fichier

Dates et versions

hal-00023162 , version 1 (20-04-2006)

Identifiants

  • HAL Id : hal-00023162 , version 1

Citer

Christine Sinoquet. Gapped consensus motif discovery: evaluation of a new algorithm based on local multiple alignments and a sampling strategy. 2006. ⟨hal-00023162⟩
47 Consultations
75 Téléchargements

Partager

Gmail Facebook X LinkedIn More