Rotation and translation covariant match kernels for image retrieval

Giorgos Tolias 1, Andrei Bursuc 1, Teddy Furon 1, Hervé Jégou 1
1 LinkMedia - Creating and exploiting explicit links between multimedia fragments
Inria Rennes – Bretagne Atlantique , IRISA-D6 - MEDIA ET INTERACTIONS
Abstract: Most image encodings achieve orientation invariance by aligning the patches to their dominant orientations, and translation invariance by completely ignoring patch position or by max-pooling. Albeit successful, such choices introduce too much invariance because they do not guarantee that the patches are rotated or translated consistently. In this paper, we propose a geometry-aware aggregation strategy, which jointly encodes the local descriptors together with their patch dominant angle or location. The geometric attributes are encoded in a continuous manner by leveraging explicit feature maps. Our technique is compatible with the generic match kernel formulation and can be employed along with several popular encoding methods, in particular Bag-of-Words, VLAD and the Fisher vector. The method is further combined with an efficient monomial embedding to provide a codebook-free method that aggregates local descriptors into a single vector representation. Invariance is achieved by efficiently estimating the similarity over multiple rotations or translations, which is offered by a simple trigonometric polynomial. This strategy is effective for image search, as shown by experiments performed on standard benchmarks for image and particular object retrieval, namely Holidays and Oxford buildings.
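The idea summarized in the abstract can be illustrated with a small sketch. It is not the authors' implementation: the function names, the Fourier weights `gammas`, and the plain sum aggregation are assumptions made for illustration. Each patch angle is mapped to a truncated Fourier feature vector, combined with its descriptor by an outer product, and summed over the image. The similarity between two images, as a function of a relative rotation, then becomes a trigonometric polynomial whose coefficients are computed once.

```python
import numpy as np

def angle_feature_map(theta, gammas):
    """Explicit feature map for angles (illustrative).

    The dot product of two mapped angles equals
    sum_i gammas[i] * cos(i * (theta1 - theta2)).
    """
    theta = np.asarray(theta, dtype=float)
    gammas = np.asarray(gammas, dtype=float)
    freqs = np.arange(1, len(gammas))
    w = np.sqrt(gammas[1:])
    const = np.full(theta.shape + (1,), np.sqrt(gammas[0]))
    cos_part = w * np.cos(theta[..., None] * freqs)
    sin_part = w * np.sin(theta[..., None] * freqs)
    return np.concatenate([const, cos_part, sin_part], axis=-1)

def aggregate(descs, thetas, gammas):
    """Sum over patches of outer(angle map, descriptor); shape (2N+1, D)."""
    return angle_feature_map(thetas, gammas).T @ descs

def similarity_over_rotations(VX, VY, deltas):
    """Similarity of X with Y rotated by each delta, as a trigonometric polynomial."""
    n_freq = (VX.shape[0] - 1) // 2
    c_x, s_x = VX[1:1 + n_freq], VX[1 + n_freq:]
    c_y, s_y = VY[1:1 + n_freq], VY[1 + n_freq:]
    a0 = VX[0] @ VY[0]                               # constant term
    a = np.sum(c_x * c_y + s_x * s_y, axis=1)        # cosine coefficients
    b = np.sum(s_x * c_y - c_x * s_y, axis=1)        # sine coefficients
    phase = np.outer(np.asarray(deltas, dtype=float), np.arange(1, n_freq + 1))
    return a0 + np.cos(phase) @ a + np.sin(phase) @ b
```

In this sketch the coefficients `a0`, `a` and `b` depend only on the two aggregated vectors, so evaluating many candidate rotations costs a few cosine and sine evaluations per rotation rather than a re-aggregation of the rotated image.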
Document type:
Journal article
Computer Vision and Image Understanding, Elsevier, 2015, pp.15. <10.1016/j.cviu.2015.06.007>
https://hal.archives-ouvertes.fr/hal-01168525
Contributor: Teddy Furon
Submitted on: Tuesday, 25 August 2015 - 16:24:55
Last modified on: Wednesday, 2 August 2017 - 10:11:49
Document(s) archived on: Thursday, 26 November 2015 - 14:04:06

File

ToliasBursucFuronJegou_CVIU201...
Files produced by the author(s)

Citation

Giorgos Tolias, Andrei Bursuc, Teddy Furon, Hervé Jégou. Rotation and translation covariant match kernels for image retrieval. Computer Vision and Image Understanding, Elsevier, 2015, pp.15. <10.1016/j.cviu.2015.06.007>. <hal-01168525>
