D. Lowe, Distinctive Image Features from Scale-Invariant Keypoints, International Journal of Computer Vision, vol.60, issue.2, pp.91-110, 2004.
DOI : 10.1023/B:VISI.0000029664.99615.94

H. Bay, A. Ess, T. Tuytelaars, and L. V. , Speeded-Up Robust Features (SURF), Computer Vision and Image Understanding, vol.110, issue.3, pp.346-359, 2008.
DOI : 10.1016/j.cviu.2007.09.014

T. Ojala, M. Pietikainen, and T. Maenpaa, Multiresolution gray-scale and rotation invariant texture classification with local binary patterns, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.24, issue.7, pp.971-987, 2002.
DOI : 10.1109/TPAMI.2002.1017623

J. Sivic and A. Zisserman, Video Google: a text retrieval approach to object matching in videos, Proceedings Ninth IEEE International Conference on Computer Vision, 2003.
DOI : 10.1109/ICCV.2003.1238663

J. Philbin, O. Chum, M. Isard, J. Sivic, and A. Zisserman, Object retrieval with large vocabularies and fast spatial matching, 2007 IEEE Conference on Computer Vision and Pattern Recognition, 2007.
DOI : 10.1109/CVPR.2007.383172

G. Tolias and Y. , Avrithis, Speeded-up, relaxed spatial matching, 2011.

O. Chum, J. Philbin, J. Sivic, M. Isard, and A. Zisserman, Total Recall: Automatic Query Expansion with a Generative Feature Model for Object Retrieval, 2007 IEEE 11th International Conference on Computer Vision, 2007.
DOI : 10.1109/ICCV.2007.4408891

G. Tolias and H. Jégou, Visual query expansion with or without geometry: Refining local descriptors by feature aggregation, Pattern Recognition, vol.47, issue.10
DOI : 10.1016/j.patcog.2014.04.007

URL : https://hal.archives-ouvertes.fr/hal-00971267

H. Jégou, M. Douze, and C. Schmid, Improving Bag-of-Features for Large Scale Image Search, International Journal of Computer Vision, vol.42, issue.3, pp.316-336, 2010.
DOI : 10.1007/s11263-009-0285-2

A. Mikulík, M. Perdoch, O. Chum, and J. Matas, Learning a Fine Vocabulary, 2010.
DOI : 10.1007/978-3-642-15558-1_1

H. Jégou, M. Douze, and C. Schmid, On the burstiness of visual elements, 2009 IEEE Conference on Computer Vision and Pattern Recognition, 2009.
DOI : 10.1109/CVPR.2009.5206609

G. Tolias, Y. Avrithis, and H. Jégou, To Aggregate or Not to aggregate: Selective Match Kernels for Image Search, 2013 IEEE International Conference on Computer Vision, 2013.
DOI : 10.1109/ICCV.2013.177

URL : https://hal.archives-ouvertes.fr/hal-00864684

J. Wang, J. Yang, F. L. Yu, T. Huang, and Y. , Gong, Localityconstrained linear coding for image classification, 2010.

F. Perronnin and C. R. Dance, Fisher Kernels on Visual Vocabularies for Image Categorization, 2007 IEEE Conference on Computer Vision and Pattern Recognition, 2007.
DOI : 10.1109/CVPR.2007.383266

H. Jégou, F. Perronnin, M. Douze, J. Sánchez, P. Pérez et al., Aggregating Local Image Descriptors into Compact Codes, Trans. PAMI, 2012.
DOI : 10.1109/TPAMI.2011.235

F. Perronnin, Y. Liu, J. Sánchez, and H. Poirier, Large-scale image retrieval with compressed Fisher vectors, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2010.
DOI : 10.1109/CVPR.2010.5540009

M. Charikar, Similarity estimation techniques from rounding algorithms, Proceedings of the thiry-fourth annual ACM symposium on Theory of computing , STOC '02, 2002.
DOI : 10.1145/509907.509965

H. Jégou, M. Douze, and C. Schmid, Product Quantization for Nearest Neighbor Search, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.33, issue.1, pp.117-128, 2011.
DOI : 10.1109/TPAMI.2010.57

S. Lazebnik, C. Schmid, and J. Ponce, Beyond Bags of Features: Spatial Pyramid Matching for Recognizing Natural Scene Categories, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, Volume 2 (CVPR'06), 2006.
DOI : 10.1109/CVPR.2006.68

URL : https://hal.archives-ouvertes.fr/inria-00548585

M. Douze, H. Jégou, H. Singh, L. Amsaleg, and C. Schmid, Evaluation of GIST descriptors for web-scale image search, Proceeding of the ACM International Conference on Image and Video Retrieval, CIVR '09, 2009.
DOI : 10.1145/1646396.1646421

URL : https://hal.archives-ouvertes.fr/inria-00394212

P. Koniusz, F. Yan, and K. Mikolajczyk, Comparison of mid-level feature coding approaches and pooling strategies in visual concept detection, Computer Vision and Image Understanding, vol.117, issue.5, pp.479-492, 2013.
DOI : 10.1016/j.cviu.2012.10.010

W. Zhao, H. Jégou, and G. Gravier, Oriented pooling for dense and nondense rotation-invariant features, 2013.
URL : https://hal.archives-ouvertes.fr/hal-00841590

L. Bo and C. Sminchisescu, Efficient match kernel between sets of features for visual recognition, 2009.

G. Tolias, T. Furon, and H. Jégou, Orientation Covariant Aggregation of Local Descriptors with Embeddings, pp.382-397, 2014.
DOI : 10.1007/978-3-319-10599-4_25

URL : https://hal.archives-ouvertes.fr/hal-01020823

H. Jégou, M. Douze, and C. Schmid, Hamming Embedding and Weak Geometric Consistency for Large Scale Image Search, 2008.
DOI : 10.1007/978-3-540-88682-2_24

L. Bo, X. Ren, and D. Fox, Kernel descriptors for visual recognition, 2010.

A. Vedaldi and A. Zisserman, Efficient Additive Kernels via Explicit Feature Maps, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.34, issue.3, pp.480-492, 2012.
DOI : 10.1109/TPAMI.2011.153

R. Arandjelovic and A. Zisserman, All About VLAD, 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013.
DOI : 10.1109/CVPR.2013.207

J. Krapac, J. Verbeek, and F. Jurie, Modeling spatial layout with fisher vectors for image categorization, 2011 International Conference on Computer Vision, pp.1487-1494, 2011.
DOI : 10.1109/ICCV.2011.6126406

URL : https://hal.archives-ouvertes.fr/inria-00612277

P. Koniusz and K. Mikolajczyk, Spatial Coordinate Coding to reduce histogram representations, Dominant Angle and Colour Pyramid Match, 2011 18th IEEE International Conference on Image Processing, pp.661-664, 2011.
DOI : 10.1109/ICIP.2011.6116639

J. Sánchez, F. Perronnin, and T. De-campos, Modeling the spatial layout of images beyond spatial pyramids, Pattern Recognition Letters, vol.33, issue.16, pp.2216-2223, 2012.
DOI : 10.1016/j.patrec.2012.07.019

P. Gosselin, N. Murray, H. Jégou, and F. Perronnin, Revisiting the Fisher vector for fine-grained classification, Pattern Recognition Letters, vol.49, pp.92-98, 2014.
DOI : 10.1016/j.patrec.2014.06.011

URL : https://hal.archives-ouvertes.fr/hal-01056223

L. Bo, K. Lai, X. Ren, and D. Fox, Object recognition with hierarchical kernel descriptors, CVPR 2011, 2011.
DOI : 10.1109/CVPR.2011.5995719

S. Lyu, Mercer kernels for object recognition with local features, 2005.

G. Csurka, C. R. Dance, L. Fan, J. Willamowski, and C. Bray, Visual categorization with bags of keypoints, ECCV Workshop Statistical Learning in Computer Vision, 2004.

D. Picard and P. Gosselin, Efficient image signatures and similarities using tensor products of local descriptors, Computer Vision and Image Understanding, vol.117, issue.6
DOI : 10.1016/j.cviu.2013.02.004

URL : https://hal.archives-ouvertes.fr/hal-00799074

P. Koniusz, F. Yan, P. Gosselin, and K. Mikolajczyk, Higher-order occurrence pooling on mid-and low-level features: Visual concept detection, 2013.
URL : https://hal.archives-ouvertes.fr/hal-00922524

J. C. Caseiro, J. Batista, and C. Sminchisescu, Semantic segmentation with second-order pooling, p.2012

M. Abramowitz and I. A. Stegun, Handbook of mathematical functions with formulas, graphs, and mathematical tables, of National Bureau of Standards Applied Mathematics Series, U.S. Government Printing Office, 1964.

F. Perronnin, J. Sánchez, and T. Mensink, Improving the Fisher Kernel for Large-Scale Image Classification, 2010.
DOI : 10.1007/978-3-642-15561-1_11

URL : https://hal.archives-ouvertes.fr/inria-00548630

Y. Zhang, Z. Jia, and T. Chen, Image retrieval with geometry-preserving visual phrases, CVPR 2011, pp.809-816, 2011.
DOI : 10.1109/CVPR.2011.5995528

J. F. Henriques, R. Caseiro, P. Martins, and J. Batista, High-Speed Tracking with Kernelized Correlation Filters, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.37, issue.3
DOI : 10.1109/TPAMI.2014.2345390

M. Perdoch, O. Chum, and J. Matas, Efficient representation of local geometry for large scale object retrieval, 2009 IEEE Conference on Computer Vision and Pattern Recognition, 2009.
DOI : 10.1109/CVPR.2009.5206529

T. Jaakkola and D. Haussler, Exploiting generative models in discriminative classifiers, 1998.

K. Chatfield, V. Lempitsky, A. Vedaldi, and A. Zisserman, The devil is in the details: an evaluation of recent feature encoding methods, Procedings of the British Machine Vision Conference 2011, 2011.
DOI : 10.5244/C.25.76

K. Mikolajczyk, T. Tuytelaars, C. Schmid, A. Zisserman, J. Matas et al., A Comparison of Affine Region Detectors, International Journal of Computer Vision, vol.65, issue.1-2, pp.43-72, 2005.
DOI : 10.1007/s11263-005-3848-x

URL : https://hal.archives-ouvertes.fr/inria-00548528

]. R. Arandjelovic and A. Zisserman, Three things everyone should know to improve object retrieval, 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012.
DOI : 10.1109/CVPR.2012.6248018

J. Delhumeau, P. Gosselin, H. Jégou, and P. Pérez, Revisiting the VLAD image representation, Proceedings of the 21st ACM international conference on Multimedia, MM '13, 2013.
DOI : 10.1145/2502081.2502171

URL : https://hal.archives-ouvertes.fr/hal-00840653

J. Philbin, O. Chum, M. Isard, J. Sivic, and A. Zisserman, Lost in quantization: Improving particular object retrieval in large scale image databases, 2008 IEEE Conference on Computer Vision and Pattern Recognition, 2008.
DOI : 10.1109/CVPR.2008.4587635

M. Douze and H. Jégou, The Yael Library, Proceedings of the ACM International Conference on Multimedia, MM '14, 2014.
DOI : 10.1145/2647868.2654892

URL : https://hal.archives-ouvertes.fr/hal-01020695

A. Torii, J. Sivic, T. Pajdla, and M. Okutomi, Visual place recognition with repetitive structures, 2013.
URL : https://hal.archives-ouvertes.fr/hal-00934288

H. Jégou and O. Chum, Negative Evidences and Co-occurences in Image Retrieval: The Benefit of PCA and Whitening, p.2012
DOI : 10.1007/978-3-642-33709-3_55

B. Safadi and G. Quenot, Descriptor optimization for multimedia indexing and retrieval, 2013.
URL : https://hal.archives-ouvertes.fr/hal-00981672

H. Jégou and A. Zisserman, Triangulation Embedding and Democratic Aggregation for Image Search, 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014.
DOI : 10.1109/CVPR.2014.417

M. Muja and D. G. Lowe, Fast approximate nearest neighbors with automatic algorithm configuration, 2009.