GAZE LATENT SUPPORT VECTOR MACHINE FOR IMAGE CLASSIFICATION

Xin Wang 1 Nicolas Thome 1 Matthieu Cord 1
1 MLIA - Machine Learning and Information Access
LIP6 - Laboratoire d'Informatique de Paris 6
Abstract : This paper deals with image categorization from weak supervision, e.g. global image labels. We propose to improve the region selection performed in latent variable models such as Latent Support Vector Machine (LSVM) by leveraging human eye movement features collected from an eye-tracker device. We introduce a new model, Gaze Latent Support Vector Machine (G-LSVM), whose region selection during training is biased toward regions with a large gaze density ratio. On this purpose, the training objective is enriched with a gaze loss, from which we derive a convex upper bound, leading to a Concave-Convex Procedure (CCCP) optimization scheme. Experiments show that G-LSVM significantly outperforms LSVM in both object detection and action recognition problems on PASCAL VOC 2012. We also show that our G-LSVM is even slightly better than a model trained from bounding box annotations, while gaze labels are much cheaper to collect.
Type de document :
Communication dans un congrès
IEEE International Conference on Image Processing (ICIP), Sep 2016, Phoenix, AZ, United States. 〈10.1109/ICIP.2016.7532354〉
Liste complète des métadonnées

Littérature citée [31 références]  Voir  Masquer  Télécharger

https://hal.archives-ouvertes.fr/hal-01342580
Contributeur : Xin Wang <>
Soumis le : mercredi 6 juillet 2016 - 13:26:15
Dernière modification le : samedi 24 novembre 2018 - 01:35:07

Fichier

GAZE_LATENT_SUPPORT_VECTOR_MAC...
Fichiers produits par l'(les) auteur(s)

Identifiants

Collections

Citation

Xin Wang, Nicolas Thome, Matthieu Cord. GAZE LATENT SUPPORT VECTOR MACHINE FOR IMAGE CLASSIFICATION. IEEE International Conference on Image Processing (ICIP), Sep 2016, Phoenix, AZ, United States. 〈10.1109/ICIP.2016.7532354〉. 〈hal-01342580〉

Partager

Métriques

Consultations de la notice

123

Téléchargements de fichiers

243