Similarity Learning for Provably Accurate Sparse Linear Classification

Aurélien Bellet; Amaury Habrard; Marc Sebban

Communication Dans Un Congrès Année : 2012

Similarity Learning for Provably Accurate Sparse Linear Classification

(1) , (1) , (1)

Aurélien Bellet

Fonction : Auteur
PersonId : 9877
IdHAL : aurelien-bellet
ORCID : 0000-0003-3440-1251
IdRef : 17653136X

Laboratoire Hubert Curien

Amaury Habrard

Fonction : Auteur
PersonId : 439
IdHAL : amaury-habrard
ORCID : 0000-0003-3038-9347
IdRef : 084103655

Laboratoire Hubert Curien

Marc Sebban

Fonction : Auteur
PersonId : 5203
IdHAL : marc-sebban
ORCID : 0000-0001-6851-169X
IdRef : 050802623

Laboratoire Hubert Curien

Résumé

In recent years, the crucial importance of metrics in machine learning algorithms has led to an increasing interest for optimizing distance and similarity functions. Most of the state of the art focus on learning Mahalanobis distances (requiring to fulfill a constraint of positive semi-definiteness) for use in a local k-NN algorithm. However, no theoretical link is established between the learned metrics and their performance in classification. In this paper, we make use of the formal framework of good similarities introduced by Balcan et al. to design an algorithm for learning a non PSD linear similarity optimized in a nonlinear feature space, which is then used to build a global linear classifier. We show that our approach has uniform stability and derive a generalization bound on the classification error. Experiments performed on various datasets confirm the effectiveness of our approach compared to stateof-the-art methods and provide evidence that (i) it is fast, (ii) robust to overfitting and (iii) produces very sparse classifiers.

Mots clés

Metric Learning

Domaines

Apprentissage [cs.LG]

Fichier principal

paper.pdf (232.05 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Marc Sebban : Connectez-vous pour contacter le contributeur

https://hal.science/hal-00708401

Soumis le : jeudi 21 juin 2012-12:01:30

Dernière modification le : vendredi 24 mars 2023-14:52:55

Archivage à long terme le : samedi 22 septembre 2012-02:21:30

Dates et versions

hal-00708401 , version 1 (21-06-2012)

Identifiants

HAL Id : hal-00708401 , version 1

Citer

Aurélien Bellet, Amaury Habrard, Marc Sebban. Similarity Learning for Provably Accurate Sparse Linear Classification. International Conference on Machine Learning, Jun 2012, United Kingdom. ⟨hal-00708401⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

UNIV-ST-ETIENNE IOGS CNRS LAHC PARISTECH UDL

277 Consultations

62 Téléchargements

Similarity Learning for Provably Accurate Sparse Linear Classification

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager