Compressed Fisher Vectors for Large-Scale Image Classification

Jorge Sanchez; Florent Perronnin; Thomas Mensink; Jakob Verbeek

Rapport (Rapport De Recherche) Année : 2013

Compressed Fisher Vectors for Large-Scale Image Classification

(1) , (2) , (2, 3) , (3)

1
2
3

Jorge Sanchez

Fonction : Auteur correspondant
PersonId : 935847

Connectez-vous pour contacter l'auteur

Facultad de Matemática, Astronomía y Física [Cordoba]

Florent Perronnin

Fonction : Auteur correspondant
PersonId : 928545

Connectez-vous pour contacter l'auteur

Xerox Research Centre Europe [Meylan]

Thomas Mensink

Fonction : Auteur correspondant

Xerox Research Centre Europe [Meylan]

Learning and recognition in vision

Jakob Verbeek

Fonction : Auteur correspondant
PersonId : 10676
IdHAL : verbeek
ORCID : 0000-0003-1419-1816
IdRef : 180998463

Connectez-vous pour contacter l'auteur

Learning and recognition in vision

Résumé

A standard approach to describe an image for image classification and image retrieval, is to extract a set of local patch descriptors, encode them into a high dimensional vector and pool them into an image-level signature. The most common patch encoding strategy consists in quantizing the local descriptors into a finite set of prototypical elements. This leads to the popular Bag-of-Visual words (BOV) representation. In this work, we propose to use the Fisher Kernel framework as an alternative patch encoding strategy: we describe patches by their deviation from an ''universal'' generative model. This representation, which we call Fisher Vector (FV) has many advantages: it is efficient to compute, it leads to excellent results even with costless linear classifiers, and it can be compressed with a minimal loss of accuracy using product quantization. We report experimental results on five standard datasets -- PASCAL VOC 2007, Caltech 256, SUN 397, ILSVRC 2010 and ImageNet10K -- with up to 9M images and 10K classes, showing state-of-the-art results with the FV framework.

Domaines

Apprentissage [cs.LG]

Fichier principal

RR-8209.pdf (639.17 Ko)

Screenshot.png (61.93 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Format : Figure, Image

Jakob Verbeek : Connectez-vous pour contacter le contributeur

https://inria.hal.science/hal-00779493

Soumis le : mardi 22 janvier 2013-12:01:29

Dernière modification le : mercredi 12 avril 2023-13:42:10

Archivage à long terme le : mardi 23 avril 2013-03:52:48

Dates et versions

hal-00779493 , version 1 (22-01-2013)

hal-00779493 , version 2 (27-05-2013)

hal-00779493 , version 3 (12-06-2013)

Identifiants

HAL Id : hal-00779493 , version 1

Citer

Jorge Sanchez, Florent Perronnin, Thomas Mensink, Jakob Verbeek. Compressed Fisher Vectors for Large-Scale Image Classification. [Research Report] RR-8209, 2013. ⟨hal-00779493v1⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

INRIA-RRRT

2443 Consultations

15093 Téléchargements

Compressed Fisher Vectors for Large-Scale Image Classification

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager