G. Csurka, C. Dance, L. Fan, J. Willamowski, and C. Bray, Visual categorization with bags of keypoints, ECCV'04 workshop on Statistical Learning in Computer Vision, pp.59-74, 2004.

R. Fergus, P. Perona, and A. Zisserman, Object class recognition by unsupervised scale-invariant learning, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings., pp.3-264, 2003.
DOI : 10.1109/CVPR.2003.1211479
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.114.7863

T. Leung and J. Malik, Representing and recognizing the visual appearance of materials using three-dimensional textons, International Journal of Computer Vision, vol.43, issue.1, pp.29-44, 2001.
DOI : 10.1023/A:1011126920638

S. Agarwal, A. Awan, and D. Roth, Learning to detect objects in images via a sparse, part-based representation, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.26, issue.11, pp.1475-1490, 2004.
DOI : 10.1109/TPAMI.2004.108

R. Fergus, L. Fei-fei, P. Perona, and A. Zisserman, Learning object categories from Google's image search, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1, pp.1816-1823, 2005.
DOI : 10.1109/ICCV.2005.142
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.522.8976

K. Grauman and T. Darrell, Efficient Image Matching with Distributions of Local Invariant Features, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05), pp.5-627, 2005.
DOI : 10.1109/CVPR.2005.138

B. Leibe and B. Schiele, Interleaved Object Categorization and Segmentation, Procedings of the British Machine Vision Conference 2003, 2003.
DOI : 10.5244/C.17.78
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.103.3586

D. Lowe, Distinctive Image Features from Scale-Invariant Keypoints, International Journal of Computer Vision, vol.60, issue.2, pp.91-110, 2004.
DOI : 10.1023/B:VISI.0000029664.99615.94
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.14.4931

K. Mikolajczyk and C. Schmid, An Affine Invariant Interest Point Detector, In: ECCV, p.128, 2002.
DOI : 10.1007/3-540-47969-4_9
URL : https://hal.archives-ouvertes.fr/inria-00548252

J. Sivic and A. Zisserman, Video Google: a text retrieval approach to object matching in videos, Proceedings Ninth IEEE International Conference on Computer Vision, pp.3-1470, 2003.
DOI : 10.1109/ICCV.2003.1238663

M. Weber, M. Welling, and P. Perona, Unsupervised Learning of Models for Recognition, In: ECCV, vol.I, pp.18-32, 2000.
DOI : 10.1007/3-540-45054-8_2

F. Jurie and B. Triggs, Creating efficient codebooks for visual recognition, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1, 2005.
DOI : 10.1109/ICCV.2005.66
URL : https://hal.archives-ouvertes.fr/inria-00548511

J. Winn, A. Criminisi, and T. Minka, Object categorization by learned universal visual dictionary, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1, 2005.
DOI : 10.1109/ICCV.2005.171
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.93.8714

G. Bouchard and B. Triggs, Hierarchical Part-Based Visual Object Categorization, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05), pp.710-715, 2005.
DOI : 10.1109/CVPR.2005.174
URL : https://hal.archives-ouvertes.fr/inria-00548513

A. Agarwal and B. Triggs, Hyperfeatures ??? Multilevel Local Coding for Visual Recognition, In: ECCV, 2006.
DOI : 10.1007/11744023_3
URL : https://hal.archives-ouvertes.fr/inria-00548592

T. Joachims, Text categorization with Support Vector Machines: Learning with many relevant features, ECML-98, 10th European Conference on Machine Learning, pp.137-142, 1998.
DOI : 10.1007/BFb0026683
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.11.6124

W. Niblack, R. Barber, W. Equitz, M. Flickner, D. Glasman et al., The qbic project: Querying image by content using color, texture, and shape, SPIE, vol.1908, pp.173-187, 1993.

S. Lazebnik, C. Schmid, and J. Ponce, Affine-invariant local descriptors and neighborhood statistics for texture recognition, Proceedings Ninth IEEE International Conference on Computer Vision, pp.649-655, 2003.
DOI : 10.1109/ICCV.2003.1238409
URL : https://hal.archives-ouvertes.fr/inria-00548231

Y. Rubner, C. Tomasi, and L. Guibas, The earth mover's distance as a metric for image retrieval, International Journal of Computer Vision, vol.40, issue.2, pp.99-121, 2000.
DOI : 10.1023/A:1026543900054

K. Mikolajczyk, T. Tuytelaars, C. Schmid, A. Zisserman, J. Matas et al., A Comparison of Affine Region Detectors, International Journal of Computer Vision, vol.65, issue.1-2, pp.43-72, 2005.
DOI : 10.1007/s11263-005-3848-x
URL : https://hal.archives-ouvertes.fr/inria-00548528

T. Lindeberg, Detecting salient blob-like image structures and their scales with a scale-space primal sketch: A method for focus-of-attention, International Journal of Computer Vision, vol.8, issue.8, pp.283-318, 1993.
DOI : 10.1007/BF01469346

E. Nowak and F. Jurie, Vehicle Categorization: Parts for Speed and Accuracy, 2005 IEEE International Workshop on Visual Surveillance and Performance Evaluation of Tracking and Surveillance, 2005.
DOI : 10.1109/VSPETS.2005.1570926
URL : https://hal.archives-ouvertes.fr/inria-00548506

J. Zhang, M. Marszalek, S. Lazebnik, and C. Schmid, Local features and kernels for classifcation of texture and object categories: An in-depth study, 2005.