A. Oliva and A. Torralba, Modeling the shape of the scene: A holistic representation of the spatial envelope, International Journal of Computer Vision, vol.42, issue.3, pp.145-175, 2001.
DOI : 10.1023/A:1011139631724

M. J. Swain and D. H. Ballard, Color indexing, International Journal of Computer Vision, vol.7, issue.1, pp.11-32, 1991.
DOI : 10.1007/BF00130487

B. S. Manjunath and W. Ma, Texture features for browsing and retrieval of image data, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.18, issue.8, pp.837-842, 1996.

S. Lazebnik, C. Schmid, and J. Ponce, Beyond Bags of Features: Spatial Pyramid Matching for Recognizing Natural Scene Categories, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, Volume 2 (CVPR'06), pp.2169-2178, 2006.
DOI : 10.1109/CVPR.2006.68

URL : https://hal.archives-ouvertes.fr/inria-00548585

J. Sivic and A. Zisserman, Video Google: a text retrieval approach to object matching in videos, Proceedings Ninth IEEE International Conference on Computer Vision, pp.1470-1477, 2003.
DOI : 10.1109/ICCV.2003.1238663

D. G. Lowe, Distinctive image features from scale-invariant keypoints, International Journal of Computer Vision, pp.91-110, 2004.
DOI : 10.1023/b:visi.0000029664.99615.94

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.14.4931

H. Bay, T. Tuytelaars, and L. Van Gool, SURF: Speeded up robust features, European Conference on Computer Vision, pp.404-417, 2006.
DOI : 10.1007/11744023_32

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.679.3046

R. Arandjelović and A. Zisserman, All about VLAD, IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp.1578-1585, 2013.

F. Perronnin, J. Sánchez, and T. Mensink, Improving the Fisher Kernel for Large-Scale Image Classification, European Conference on Computer Vision, pp.143-156, 2010.
DOI : 10.1007/978-3-642-15561-1_11

URL : https://hal.archives-ouvertes.fr/inria-00548630

P. H. Gosselin, N. Murray, H. Jégou, and F. Perronnin, Revisiting the Fisher vector for fine-grained classification, Pattern Recognition Letters, vol.49, pp.92-98, 2014.
DOI : 10.1016/j.patrec.2014.06.011

URL : https://hal.archives-ouvertes.fr/hal-01056223

Y. LeCun, L. Bottou, Y. Bengio, and P. Haffner, Gradient-based learning applied to document recognition, Proceedings of the IEEE, pp.2278-2324, 1998.

A. Krizhevsky, I. Sutskever, and G. E. Hinton, ImageNet classification with deep convolutional neural networks, Advances in Neural Information Processing Systems, pp.1106-1114, 2012.

M. D. Zeiler and R. Fergus, Visualizing and Understanding Convolutional Networks, European Conference on Computer Vision, pp.818-833, 2014.
DOI : 10.1007/978-3-319-10590-1_53

URL : http://arxiv.org/abs/1311.2901

R. B. Girshick, J. Donahue, T. Darrell, and J. Malik, Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation, 2014 IEEE Conference on Computer Vision and Pattern Recognition, pp.580-587, 2014.
DOI : 10.1109/CVPR.2014.81

K. Simonyan and A. Zisserman, Very deep convolutional networks for large-scale image recognition, 1409.

K. He, X. Zhang, S. Ren, and J. Sun, Delving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification, 2015 IEEE International Conference on Computer Vision (ICCV), 1502.
DOI : 10.1109/ICCV.2015.123

D. Picard, P. Gosselin, and M. Gaspard, Challenges in content-based image indexing of cultural heritage collections, Signal Processing Magazine, 2015.
URL : https://hal.archives-ouvertes.fr/hal-01164409

J. Wang, J. Yang, K. Yu, F. Lv, T. Huang et al., Locality-constrained Linear Coding for image classification, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp.3360-3367, 2010.
DOI : 10.1109/CVPR.2010.5540018