Learning Constrained Edit State Machines

Laurent Boyer; Olivier Gandrillon; Amaury Habrard; Mathilde Pellerin; Marc Sebban

Communication Dans Un Congrès Année : 2009

Learning Constrained Edit State Machines

(1) , (2) , (3) , (2) , (4)

1
2
3
4

Laurent Boyer

Fonction : Auteur
PersonId : 841787

Laboratoire de Mathématiques

Olivier Gandrillon

Fonction : Auteur
PersonId : 857147

Centre de génétique et de physiologie moléculaire et cellulaire

Amaury Habrard

Fonction : Auteur
PersonId : 439
IdHAL : amaury-habrard
ORCID : 0000-0003-3038-9347
IdRef : 084103655

Laboratoire d'informatique Fondamentale de Marseille - UMR 6166

Mathilde Pellerin

Fonction : Auteur

Centre de génétique et de physiologie moléculaire et cellulaire

Marc Sebban

Fonction : Auteur
PersonId : 5203
IdHAL : marc-sebban
ORCID : 0000-0001-6851-169X
IdRef : 050802623

Laboratoire Hubert Curien

Résumé

Learning the parameters of the edit distance has been increasingly studied during the past few years to improve the assessment of similarities between structured data, such as strings, trees or graphs. Often based on the optimization of the likelihood of pairs of data, the learned models usually take the form of probabilistic state machines, such as pair-Hidden Markov Models (pair-HMM), stochastic transducers, or probabilistic deterministic automata. Although the use of such models has lead to significant improvements of edit distance-based classification tasks, a new challenge has appeared on the horizon: How integrating background knowledge during the learning process? This is the subject matter of this paper in the case of (input,output) pairs of strings. We present a generalization of the pair-HMM in the form of a constrained state machine, where a transition between two states is driven by constraints fulfilled on the input string. Experimental results are provided on a task in molecular biology, aiming to detect transcription factor binding sites.

Domaines

Apprentissage [cs.LG]

Fichier principal

ICTAI09.pdf (212.35 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Marc Sebban : Connectez-vous pour contacter le contributeur

https://hal.science/hal-00485560

Soumis le : vendredi 21 mai 2010-09:25:12

Dernière modification le : jeudi 4 avril 2024-18:21:36

Archivage à long terme le : jeudi 16 septembre 2010-15:06:01

Dates et versions

hal-00485560 , version 1 (21-05-2010)

Identifiants

HAL Id : hal-00485560 , version 1

Citer

Laurent Boyer, Olivier Gandrillon, Amaury Habrard, Mathilde Pellerin, Marc Sebban. Learning Constrained Edit State Machines. 21st IEEE International Conference on Tools with Artificial Intelligence, Nov 2009, United States. pp.734-741. ⟨hal-00485560⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

UNIV-ST-ETIENNE UNIV-SAVOIE IOGS UGA LIF CNRS UNIV-AMU UNIV-LYON1 LAHC LAMA PARISTECH CGPHIMC LIS-LAB UDL

283 Consultations

140 Téléchargements

Learning Constrained Edit State Machines

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager