P. Agrawal, R. Girshick, and J. Malik, Analyzing the Performance of Multilayer Neural Networks for Object Recognition, ECCV, 2014.
DOI : 10.1007/978-3-319-10584-0_22

S. An, P. Peursum, W. Liu et al., Efficient algorithms for subwindow search in object detection and localization, CVPR, 2009.

R. Arandjelovic and A. Zisserman, Three things everyone should know to improve object retrieval, 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012.
DOI : 10.1109/CVPR.2012.6248018

R. Arandjelovic and A. Zisserman, All About VLAD, 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013.
DOI : 10.1109/CVPR.2013.207

R. Arandjelovic, P. Gronat, A. Torii et al., NetVLAD: CNN architecture for weakly supervised place recognition, arXiv, 2015.
URL : https://hal.archives-ouvertes.fr/hal-01242052

Y. Avrithis and Y. Kalantidis, Approximate Gaussian Mixtures for Large Scale Vocabularies, ECCV, 2012.
DOI : 10.1007/978-3-642-33712-3_2

Y. Avrithis and G. Tolias, Hough Pyramid Matching: Speeded-Up Geometry Re-ranking for Large Scale Image Retrieval, International Journal of Computer Vision, vol.107, issue.1, 2014.
DOI : 10.1145/1873951.1874019

H. Azizpour, A. S. Razavian, J. Sullivan et al., From generic to specific deep representations for visual recognition, 2015 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), 2015.
DOI : 10.1109/CVPRW.2015.7301270

A. Babenko and V. Lempitsky, Aggregating deep convolutional features for image retrieval, ICCV, 2015.

A. Babenko, A. Slesarev, A. Chigorin et al., Neural Codes for Image Retrieval, ECCV, 2014.
DOI : 10.1007/978-3-319-10590-1_38

J. Bentley, Programming Pearls, 1999.

Q. Chen, Z. Song, R. Feris et al., Efficient Maximum Appearance Search for Large-Scale Object Detection, 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013.
DOI : 10.1109/CVPR.2013.410

O. Chum, A. Mikulik, M. Perdoch, and J. Matas, Total recall II: Query expansion revisited, CVPR, 2011.
DOI : 10.1109/CVPR.2011.5995601

D. Qin, S. Gammeter, L. Bossard, T. Quack et al., Hello neighbor: Accurate object retrieval with k-reciprocal nearest neighbors, CVPR, 2011.

P. Dollár, Z. Tu, P. Perona et al., Integral Channel Features, Proceedings of the British Machine Vision Conference 2009, 2009.
DOI : 10.5244/C.23.91

J. Donahue, Y. Jia, O. Vinyals et al., DeCAF: A deep convolutional activation feature for generic visual recognition, arXiv, 2013.

R. Girshick, Fast R-CNN, 2015 IEEE International Conference on Computer Vision (ICCV), 2015.
DOI : 10.1109/ICCV.2015.169

R. Girshick, J. Donahue, T. Darrell et al., Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation, 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014.
DOI : 10.1109/CVPR.2014.81

Y. Gong, L. Wang, R. Guo et al., Multi-scale Orderless Pooling of Deep Convolutional Activation Features, ECCV, 2014.
DOI : 10.1007/978-3-319-10584-0_26

F. Iandola, M. Moskewicz, S. Karayev et al., DenseNet: Implementing efficient ConvNet descriptor pyramids, arXiv, 2014.

H. Jégou and O. Chum, Negative Evidences and Co-occurences in Image Retrieval: The Benefit of PCA and Whitening, ECCV, 2012.
DOI : 10.1007/978-3-642-33709-3_55

H. Jégou and A. Zisserman, Triangulation Embedding and Democratic Aggregation for Image Search, 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014.
DOI : 10.1109/CVPR.2014.417

H. Jégou, M. Douze, and C. Schmid, Improving Bag-of-Features for Large Scale Image Search, International Journal of Computer Vision, vol.87, issue.3, 2010.
DOI : 10.1007/s11263-009-0285-2

H. Jégou, F. Perronnin, M. Douze et al., Aggregating Local Image Descriptors into Compact Codes, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.34, issue.9, 2012.
DOI : 10.1109/TPAMI.2011.235

Y. Kalantidis, C. Mellina, and S. Osindero, Cross-Dimensional Weighting for Aggregated Deep Convolutional Features, 2015.
DOI : 10.1016/j.cviu.2013.12.002

A. Krizhevsky, I. Sutskever, and G. E. Hinton, ImageNet classification with deep convolutional neural networks, Communications of the ACM, vol.60, issue.6, 2012.
DOI : 10.1162/neco.2009.10-08-881

C. H. Lampert, Detecting objects in large image collections and videos by efficient subimage retrieval, 2009 IEEE 12th International Conference on Computer Vision, 2009.
DOI : 10.1109/ICCV.2009.5459359

C. H. Lampert, M. B. Blaschko, and T. Hofmann, Efficient Subwindow Search: A Branch and Bound Framework for Object Localization, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.31, issue.12, pp.2129-2142, 2009.
DOI : 10.1109/TPAMI.2009.144

Z. Lin and J. Brandt, A Local Bag-of-Features Model for Large-Scale Object Retrieval, ECCV, 2010.
DOI : 10.1007/978-3-642-15567-3_22

D. Lowe, Distinctive Image Features from Scale-Invariant Keypoints, International Journal of Computer Vision, vol.60, issue.2, pp.91-110, 2004.
DOI : 10.1023/B:VISI.0000029664.99615.94

A. Mikulik, M. Perdoch, O. Chum et al., Learning Vocabularies over a Fine Quantization, International Journal of Computer Vision, vol.30, issue.2, 2013.
DOI : 10.1109/TPAMI.2007.70755

M. Oquab, L. Bottou, I. Laptev et al., Learning and Transferring Mid-level Image Representations Using Convolutional Neural Networks, 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014.
DOI : 10.1109/CVPR.2014.222
URL : https://hal.archives-ouvertes.fr/hal-00911179

G. Papandreou, I. Kokkinos, and P. Savalle, Untangling local and global deformations in deep convolutional networks for image classification and sliding window detection, arXiv, 2014.
URL : https://hal.archives-ouvertes.fr/hal-01109289

J. Philbin, O. Chum, M. Isard et al., Object retrieval with large vocabularies and fast spatial matching, 2007 IEEE Conference on Computer Vision and Pattern Recognition, 2007.
DOI : 10.1109/CVPR.2007.383172

J. Philbin, O. Chum, M. Isard et al., Lost in quantization: Improving particular object retrieval in large scale image databases, 2008 IEEE Conference on Computer Vision and Pattern Recognition, 2008.
DOI : 10.1109/CVPR.2008.4587635

F. Radenović, H. Jégou, and O. Chum, Multiple measurements and joint dimensionality reduction for large scale image search with short vectors, ICMR, 2015.

A. S. Razavian, H. Azizpour, J. Sullivan et al., CNN Features Off-the-Shelf: An Astounding Baseline for Recognition, 2014 IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2014.
DOI : 10.1109/CVPRW.2014.131

A. S. Razavian, J. Sullivan, A. Maki et al., A baseline for visual instance retrieval with deep convolutional networks, arXiv, 2014.

S. Ren, K. He, R. Girshick et al., Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks, arXiv, 2015.
DOI : 10.1109/TPAMI.2016.2577031

X. Shen, Z. Lin, J. Brandt, and Y. Wu, Spatially-Constrained Similarity Measure for Large-Scale Object Retrieval, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.36, issue.6, pp.1229-1241, 2014.
DOI : 10.1109/TPAMI.2013.237

R. Sicre and F. Jurie, Discriminative part model for visual recognition, Computer Vision and Image Understanding, vol.141, pp.28-37, 2015.
DOI : 10.1016/j.cviu.2015.08.002
URL : https://hal.archives-ouvertes.fr/hal-01132389

K. Simonyan and A. Zisserman, Very deep convolutional networks for large-scale image recognition, 2014.

J. Sivic and A. Zisserman, Video Google: a text retrieval approach to object matching in videos, Proceedings Ninth IEEE International Conference on Computer Vision, 2003.
DOI : 10.1109/ICCV.2003.1238663

R. Tao, E. Gavves, C. Snoek et al., Locality in Generic Instance Search from One Example, 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014.
DOI : 10.1109/CVPR.2014.269

G. Tolias, Y. Avrithis, and H. Jégou, Image Search with Selective Match Kernels: Aggregation Across Single and Multiple Images, International Journal of Computer Vision, vol.103, issue.1, 2015.
DOI : 10.1007/978-3-642-33709-3_47
URL : https://hal.archives-ouvertes.fr/hal-01131898

J. Uijlings, K. van de Sande, T. Gevers et al., Selective Search for Object Recognition, International Journal of Computer Vision, vol.57, issue.1, pp.154-171, 2013.
DOI : 10.1023/B:VISI.0000013087.49260.fb

K. E. van de Sande, C. G. Snoek, and A. Smeulders, Fisher and VLAD with FLAIR, CVPR, 2014.

A. Vedaldi and K. Lenc, MatConvNet: Convolutional Neural Networks for MATLAB, arXiv, 2014.

P. Viola and M. Jones, Robust real-time object detection, IJCV, vol.4, pp.34-47, 2001.

L. Xie, Q. Tian, R. Hong, and B. Zhang, Image Classification and Retrieval are ONE, Proceedings of the 5th ACM on International Conference on Multimedia Retrieval, ICMR '15, 2015.
DOI : 10.1145/2393347.2393377

Z. Zhong, J. Zhu, S. Hoi et al., Fast Object Retrieval Using Direct Spatial Matching, IEEE Transactions on Multimedia, vol.17, issue.8, pp.1391-1397, 2015.
DOI : 10.1109/TMM.2015.2446201