S. E. De-avila, N. Thome, M. Cord, E. Valle, A. De-albuquerque et al., Bossa: Extended bow formalism for image classification, ICIP, 2011.
URL : https://hal.archives-ouvertes.fr/hal-00625533

J. Yang, Y. Jiang, A. Hauptmann, and C. Ngo, Evaluating bag-of-visual-words representations in scene classification, Proceedings of the international workshop on Workshop on multimedia information retrieval , MIR '07, 2007.
DOI : 10.1145/1290082.1290111

J. Matas, O. Chum, M. Urban, and T. Pajdla, Robust widebaseline stereo from maximally stable extremal regions, Image and Vision Computing, vol.22, issue.10, 2004.

T. Kadir, A. Zisserman, and M. Brady, An Affine Invariant Salient Region Detector, 2004.
DOI : 10.1007/978-3-540-24670-1_18

D. Lowe, Object recognition from local scale-invariant features, Proceedings of the Seventh IEEE International Conference on Computer Vision, 1999.
DOI : 10.1109/ICCV.1999.790410

H. Bay, T. Tuytelaars, and L. Van-gool, Surf: Speeded up robust features, European Conference on Computer Vision (ECCV, 2006.

K. Mikolajczyk, T. Tuytelaars, C. Schmid, A. Zisserman, J. Matas et al., A Comparison of Affine Region Detectors, International Journal of Computer Vision, vol.65, issue.1-2, 2005.
DOI : 10.1007/s11263-005-3848-x

URL : https://hal.archives-ouvertes.fr/inria-00548528

V. Lepetit and P. Fua, Keypoint recognition using randomized trees, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.28, issue.9, 2006.
DOI : 10.1109/TPAMI.2006.188

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.183.8088

Y. Zhan and D. Shen, Automated Segmentation of 3D US Prostate Images Using Statistical Texture-Based Matching Method, Medical Image Computing and Computer-Assisted Intervention, 2003.
DOI : 10.1007/978-3-540-39899-8_84

A. Vedaldi and B. Fulkerson, Vlfeat, Proceedings of the international conference on Multimedia, MM '10, 2008.
DOI : 10.1145/1873951.1874249

R. Baeza-yates and B. Ribeiro-neto, Modern information retrieval, 1999.

S. Lazebnik, C. Schmid, and J. Ponce, Beyond Bags of Features: Spatial Pyramid Matching for Recognizing Natural Scene Categories, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, Volume 2 (CVPR'06), 2006.
DOI : 10.1109/CVPR.2006.68

URL : https://hal.archives-ouvertes.fr/inria-00548585

J. Van-gemert, C. Veenman, A. Smeulders, and J. Geusebroek, Visual word ambiguity Pattern Analysis and Machine Intelligence, IEEE Transactions on, vol.32, issue.7, pp.1271-1283, 2010.

J. Wang, J. Yang, K. Yu, F. Lv, T. Huang et al., Locality-constrained Linear Coding for image classification, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2010.
DOI : 10.1109/CVPR.2010.5540018

G. Csurka and F. Perronnin, Fisher vectors: Beyond bag-ofvisual-words image representations, Computer Vision, Imaging and Computer Graphics. Theory and Applications, 2011.

K. Grauman and T. Darrell, The pyramid match kernel: Efficient learning with sets of features, J. Mach. Learn. Res, vol.8, 2007.

K. E. Van-de-sande, T. Gevers, and C. G. Snoek, Evaluating Color Descriptors for Object and Scene Recognition, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.32, issue.9, 2010.
DOI : 10.1109/TPAMI.2009.154