Using Sentences as Semantic Representations in Large Scale Zero-Shot Learning

Yannick Le Cacheux; Hervé Le Borgne; Michel Crucianu

Communication Dans Un Congrès Année : 2020

Using Sentences as Semantic Representations in Large Scale Zero-Shot Learning

(1, 2) , (1) , (2)

1
2

Yannick Le Cacheux

Fonction : Auteur
PersonId : 177264
IdHAL : yannick-le-cacheux
ORCID : 0000-0002-9942-5927

Laboratoire d'Intégration des Systèmes et des Technologies

CEDRIC. Données complexes, apprentissage et représentations

Hervé Le Borgne

Fonction : Auteur
PersonId : 181478
IdHAL : herve-le-borgne
ORCID : 0000-0003-0520-8436
IdRef : 079208452

Laboratoire d'Intégration des Systèmes et des Technologies

Michel Crucianu

Fonction : Auteur
PersonId : 180351
IdHAL : michel-crucianu
ORCID : 0000-0001-8204-6843

CEDRIC. Données complexes, apprentissage et représentations

Résumé

Zero-shot learning aims to recognize instances of unseen classes, for which no visual instance is available during training, by learning mul-timodal relations between samples from seen classes and corresponding class semantic representations. These class representations usually consist of either attributes, which do not scale well to large datasets, or word embeddings, which lead to poorer performance. A good trade-off could be to employ short sentences in natural language as class descriptions. We explore different solutions to use such short descriptions in a ZSL setting and show that while simple methods cannot achieve very good results with sentences alone, a combination of usual word embeddings and sentences can significantly outperform current state-of-the-art 3 .

Domaines

Intelligence artificielle [cs.AI]

Fichier principal

2010.02959.pdf (818.83 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Michel CRUCIANU : Connectez-vous pour contacter le contributeur

https://hal.science/hal-03003689

Soumis le : vendredi 13 novembre 2020-12:43:50

Dernière modification le : mercredi 3 avril 2024-11:14:12

Dates et versions

hal-03003689 , version 1 (13-11-2020)

Identifiants

HAL Id : hal-03003689 , version 1

Citer

Yannick Le Cacheux, Hervé Le Borgne, Michel Crucianu. Using Sentences as Semantic Representations in Large Scale Zero-Shot Learning. ECCV 2020 workshop Transferring and adapting source knowledge in computer vision (TASK-CV), Aug 2020, Glasgow, United Kingdom. ⟨hal-03003689⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

CEA CNAM DRT CEA-DRF LIST CEDRIC-CNAM GS-ENGINEERING GS-COMPUTER-SCIENCE GS-SPORT-HUMAN-MOVEMENT HESAM

66 Consultations

55 Téléchargements

Using Sentences as Semantic Representations in Large Scale Zero-Shot Learning

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager