Modeling Inter and Intra-Class Relations in the Triplet Loss for Zero-Shot Learning

Recognizing unseen visual classes, i.e. classes for which no training data is available, is known as Zero-Shot Learning (ZSL). Some of the best-performing methods apply the triplet loss to seen classes to learn a mapping between visual representations of images and the attribute vectors that constitute class prototypes. They nevertheless make several implicit assumptions that limit their performance in real use cases, particularly on fine-grained datasets comprising a large number of classes. We identify three of these assumptions and put forward corresponding novel contributions to address them. Our approach takes into account both inter-class and intra-class relations, respectively by being more permissive with confusions between similar classes and by penalizing visual samples that are atypical of their class. The approach is tested on four datasets, including the large-scale ImageNet, and performs significantly better than recent methods, even generative methods based on more restrictive hypotheses.
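
To make the abstract's idea concrete, here is a minimal sketch of a triplet-style hinge loss between a projected image and class prototypes. It is not the paper's exact formulation: the linear projection `W`, the dot-product compatibility score, the per-class-pair `margins` table and the per-sample `weight` are all assumptions chosen for illustration. A smaller margin between two similar classes makes confusing them cheaper, and a weight below 1 reduces the influence of an image considered atypical of its class.

```python
def dot(u, v):
    """Compatibility score between two vectors (plain dot product)."""
    return sum(a * b for a, b in zip(u, v))


def project(W, x):
    """Map a visual feature vector x to attribute space with a linear map W.

    W is a list of rows, one row per attribute (hypothetical parameterization).
    """
    return [dot(row, x) for row in W]


def triplet_loss(W, x, label, prototypes, margins, weight=1.0):
    """Hinge-style triplet loss for a single image.

    W          : hypothetical linear projection (list of rows)
    x          : visual feature vector of the image
    label      : index of the image's true (seen) class
    prototypes : list of class attribute vectors, one per seen class
    margins    : margins[label][c] is the margin required against class c;
                 smaller values for similar classes are an illustrative assumption
    weight     : per-sample weight; values below 1 down-weight an image
                 judged atypical of its class (also an assumption)
    """
    z = project(W, x)
    positive_score = dot(z, prototypes[label])
    loss = 0.0
    for c, prototype in enumerate(prototypes):
        if c == label:
            continue  # the true class is the positive, not a negative
        negative_score = dot(z, prototype)
        loss += max(0.0, margins[label][c] + negative_score - positive_score)
    return weight * loss
```

With three classes where classes 0 and 1 have nearly identical prototypes, setting `margins[0][1]` to 0.1 instead of 1.0 lowers the penalty for an image of class 0 that scores almost as well against class 1, while the margin against the dissimilar class 2 stays strict.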

Domains

Artificial Intelligence [cs.AI]; Computer Vision and Pattern Recognition [cs.CV]; Information Retrieval [cs.IR]; Multimedia [cs.MM]

Main file

lecacheux19iccv.pdf (597.68 KB)

Origin: Files produced by the author(s)

Contributor: Hervé Le Borgne

https://hal.science/hal-02440364

Submitted on: Monday, January 20, 2020, 15:13:12

Last modified on: Wednesday, April 3, 2024, 10:20:13

Long-term archiving on: Tuesday, April 21, 2020, 12:51:56

Dates and versions

hal-02440364, version 1 (20-01-2020)

Identifiers

HAL Id: hal-02440364, version 1
DOI : 10.1109/ICCV.2019.01043

Cite

Yannick Le Cacheux, Hervé Le Borgne, Michel Crucianu. Modeling Inter and Intra-Class Relations in the Triplet Loss for Zero-Shot Learning. IEEE International Conference on Computer Vision, Oct 2019, Seoul, South Korea. ⟨10.1109/ICCV.2019.01043⟩. ⟨hal-02440364⟩

Collections

CEA CNAM CEA-DRF CEDRIC-CNAM HESAM
