J. Sivic and A. Zisserman, Video Google: a text retrieval approach to object matching in videos, Proceedings Ninth IEEE International Conference on Computer Vision, pp.1470-1477, 2003.
DOI : 10.1109/ICCV.2003.1238663

J. Yang, K. Yu, Y. Gong, and T. Huang, Linear spatial pyramid matching using sparse coding for image classification, CVPR, 2009.

H. Goh, N. Thome, M. Cord, and J. Lim, Unsupervised and Supervised Visual Codes with Restricted Boltzmann Machines, Proceedings of the 12th European conference on Computer Vision - Volume Part V, pp.298-311
DOI : 10.1007/978-3-642-33715-4_22

URL : https://hal.archives-ouvertes.fr/hal-00816428

F. Perronnin and C. R. Dance, Fisher Kernels on Visual Vocabularies for Image Categorization, 2007 IEEE Conference on Computer Vision and Pattern Recognition, 2007.
DOI : 10.1109/CVPR.2007.383266

S. Eliza-fontes-de-avila, N. Thome, M. Cord, E. Valle, A. De-albuquerque et al., Bossa: Extended bow formalism for image classification, ICIP, pp.2909-2912, 2011.

S. Eliza-fontes-de-avila, N. Thome, M. Cord, E. Valle, A. De-albuquerque et al., Pooling in image representation: The visual codeword point of view, Computer Vision and Image Understanding, vol.117, issue.5, pp.453-465, 2013.
DOI : 10.1016/j.cviu.2012.09.007

S. Lazebnik, C. Schmid, and J. Ponce, Beyond Bags of Features: Spatial Pyramid Matching for Recognizing Natural Scene Categories, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, Volume 2 (CVPR'06), 2006.
DOI : 10.1109/CVPR.2006.68

URL : https://hal.archives-ouvertes.fr/inria-00548585

T. Serre, L. Wolf, S. Bileschi, M. Riesenhuber, and T. Poggio, Robust Object Recognition with Cortex-Like Mechanisms, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.29, issue.3, pp.411-426, 2007.
DOI : 10.1109/TPAMI.2007.56

C. Theriault, N. Thome, and M. Cord, Extended Coding and Pooling in the HMAX Model, IEEE Transactions on Image Processing, vol.22, issue.2, pp.764-777, 2013.
DOI : 10.1109/TIP.2012.2222900

URL : https://hal.archives-ouvertes.fr/hal-01185467

L. Li, H. Su, E. Xing, and L. Fei-fei, Object bank: A high-level image representation for scene classification & semantic feature sparsification, NIPS, 2010.

L. Torresani, M. Szummer, and A. Fitzgibbon, Efficient Object Category Recognition Using Classemes, ECCV, 2010.
DOI : 10.1007/978-3-642-15549-9_56

P. F. Felzenszwalb, R. B. Girshick, D. A. Mcallester, and D. Ramanan, Object Detection with Discriminatively Trained Part-Based Models, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.32, issue.9, 2010.
DOI : 10.1109/TPAMI.2009.167

D. Hoiem, A. Efros, and M. Hebert, Automatic photo pop-up, ACM Trans. Graph, vol.24, issue.3, 2005.
DOI : 10.1145/1186822.1073232

M. Everingham, L. Van-gool, C. Williams, J. Winn, and A. Zisserman, The Pascal Visual Object Classes (VOC) Challenge, International Journal of Computer Vision, vol.73, issue.2, 2007.
DOI : 10.1007/s11263-009-0275-4

C. Chang and C. Lin, LIBSVM, ACM Transactions on Intelligent Systems and Technology, vol.2, issue.3, 2011.
DOI : 10.1145/1961189.1961199

K. Chatfield, V. Lempitsky, A. Vedaldi, and A. Zisserman, The devil is in the details: an evaluation of recent feature encoding methods, Procedings of the British Machine Vision Conference 2011, 2011.
DOI : 10.5244/C.25.76

J. Sanchez, T. Perronnin, and . Decampos, Modeling the spatial layout of images beyond spatial pyramids, Pattern Recognition Letters, vol.33, issue.16, 2012.
DOI : 10.1016/j.patrec.2012.07.019

Y. Su and F. Jurie, Improving Image Classification Using Semantic Attributes, International Journal of Computer Vision, vol.72, issue.2, 2012.
DOI : 10.1007/s11263-012-0529-4

URL : https://hal.archives-ouvertes.fr/hal-00805996

D. Picard, M. Cord, and A. , Image Retrieval Over Networks: Active Learning Using Ant Algorithm, IEEE Transactions on Multimedia, vol.10, issue.7, pp.1356-1365, 2008.
DOI : 10.1109/TMM.2008.2004913

URL : https://hal.archives-ouvertes.fr/hal-00656363