Learning Good Edit Similarities with Generalization Guarantees

Aurélien Bellet; Amaury Habrard; Marc Sebban

Communication Dans Un Congrès Année : 2011

Learning Good Edit Similarities with Generalization Guarantees

(1) , (2) , (1)

1
2

Aurélien Bellet

Fonction : Auteur
PersonId : 9877
IdHAL : aurelien-bellet
ORCID : 0000-0003-3440-1251
IdRef : 17653136X

Laboratoire Hubert Curien

Amaury Habrard

Fonction : Auteur
PersonId : 439
IdHAL : amaury-habrard
ORCID : 0000-0003-3038-9347
IdRef : 084103655

Laboratoire d'informatique Fondamentale de Marseille - UMR 6166

Marc Sebban

Fonction : Auteur
PersonId : 5203
IdHAL : marc-sebban
ORCID : 0000-0001-6851-169X
IdRef : 050802623

Laboratoire Hubert Curien

Résumé

Similarity and distance functions are essential to many learning algorithms, thus training them has attracted a lot of interest. When it comes to dealing with structured data (e.g., strings or trees), edit similarities are widely used, and there exists a few methods for learning them. However, these methods offer no theoretical guarantee as to the generalization performance and discriminative power of the resulting similarities. Recently, a theory of learning with good similarity functions was proposed. This new theory bridges the gap between the properties of a similarity function and its performance in classification. In this paper, we propose a novel edit similarity learning approach (GESL) driven by the idea of goodness, which allows us to derive generalization guarantees using the notion of uniform stability. We experimentally show that edit similarities learned with our method induce classification models that are both more accurate and sparser than those induced by the edit distance or edit similarities learned with a state-of-the-art method.

Domaines

Apprentissage [cs.LG]

Fichier principal

ecml11.pdf (223.34 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Marc Sebban : Connectez-vous pour contacter le contributeur

https://hal.science/hal-00608631

Soumis le : mercredi 7 janvier 2015-15:26:40

Dernière modification le : vendredi 24 mars 2023-14:53:00

Archivage à long terme le : vendredi 11 septembre 2015-01:30:43

Dates et versions

hal-00608631 , version 1 (07-01-2015)

Identifiants

HAL Id : hal-00608631 , version 1

Citer

Aurélien Bellet, Amaury Habrard, Marc Sebban. Learning Good Edit Similarities with Generalization Guarantees. European Conference on Machine Learning, Sep 2011, Athens, Greece. pp.188-203. ⟨hal-00608631⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

UNIV-ST-ETIENNE IOGS LIF CNRS UNIV-AMU LAHC PARISTECH LIS-LAB UDL

115 Consultations

617 Téléchargements

Learning Good Edit Similarities with Generalization Guarantees

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager