B. S. Manjunath, Netra: a toolbox for navigating large image databases, ICIP, 1997.

J. Sivic, B. C. Russell, A. Efros, A. A. Zisserman, and W. Freeman, Discovering objects and their location in images, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1, 2005.
DOI : 10.1109/ICCV.2005.77

V. Vapnik, Statistical learning theory, 1998.

F. Perronnin and C. R. Dance, Fisher Kernels on Visual Vocabularies for Image Categorization, 2007 IEEE Conference on Computer Vision and Pattern Recognition, 2007.
DOI : 10.1109/CVPR.2007.383266

H. Goh, N. Thome, M. Cord, and J. Lim, Top-down regularization of deep belief networks, NIPS, pp.1878-1886, 2013.
URL : https://hal.archives-ouvertes.fr/hal-00947569

H. Goh, N. Thome, M. Cord, and J. Lim, Learning Deep Hierarchical Visual Feature Coding, IEEE Transactions on Neural Networks and Learning Systems, vol.25, issue.12, pp.2212-2225, 2014.
DOI : 10.1109/TNNLS.2014.2307532
URL : https://hal.archives-ouvertes.fr/hal-01185465

S. Eliza-fontes-de-avila, N. Thome, M. Cord, E. Valle, A. De-albuquerque et al., Pooling in Image Representation: the Visual Codeword Point of View, pp.453-465, 2013.

A. Krizhevsky, I. Sutskever, and G. E. Hinton, Imagenet classification with deep convolutional neural networks, NIPS, 2012.

A. Berg, J. Deng, and L. Fei-fei, Large scale visual recognition challenge 2010, 2010.

K. Fukushima, Neocognitron: A self-organizing neural network model for a mechanism of pattern recognition unaffected by shift in position, Biological Cybernetics, vol.40, issue.4, p.193202, 1980.
DOI : 10.1007/BF00344251

Y. Lecun, L. Bottou, Y. Bengio, and P. Haffner, Gradientbased learning applied to document recognition . proceedings, IEEE, vol.86, issue.11, p.22782324, 1998.

M. Chevalier, N. Thome, M. Cord, J. Fournier, G. Henaff et al., LR-CNN for fine-grained classification with varying resolution, 2015 IEEE International Conference on Image Processing (ICIP), 2015.
DOI : 10.1109/ICIP.2015.7351374
URL : https://hal.archives-ouvertes.fr/hal-01196958

T. Durand, N. Thome, and M. Cord, MANTRA: Minimum Maximum Latent Structural SVM for Image Classification and Ranking, 2015 IEEE International Conference on Computer Vision (ICCV), 2016.
DOI : 10.1109/ICCV.2015.311
URL : https://hal.archives-ouvertes.fr/hal-01343784

S. Avila, N. Thome, M. Cord, E. Valle, A. De et al., BOSSA: Extended bow formalism for image classification, 2011 18th IEEE International Conference on Image Processing, 2011.
DOI : 10.1109/ICIP.2011.6116268
URL : https://hal.archives-ouvertes.fr/hal-00625533

Y. Boureau, J. Ponce, and Y. Lecun, A theoretical analysis of feature pooling in visual recognition, 2010.

K. Simonyan and A. Zisserman, Very deep convolutional networks for large-scale image recognition, 2015.

C. Szegedy, W. Liu, Y. Jia, and P. Sermanet, Going deeper with convolutions, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2015.
DOI : 10.1109/CVPR.2015.7298594

B. Graham, Fractional max-pooling, 2015.

K. He, X. Zhang, S. Ren, and J. Sun, Delving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification, 2015 IEEE International Conference on Computer Vision (ICCV), 2015.
DOI : 10.1109/ICCV.2015.123

W. Shang, K. Sohn, D. Almeida, and H. Lee, Understanding and improving convolutional neural networks via concatenated rectified linear units, ICLM, 2016.

A. Bruno, D. J. Olshausen, and . Field, Emergence of simple-cell receptive field properties by learning a sparse code for natural images?, Nature, 1996.

G. E. Hinton, N. Srivastava, A. Krizhevsky, I. Sutskever, and R. R. Salakhutdinov, Improving neural networks by preventing co-adaptation of feature detectors, 2012.

D. Cires-an, U. Meier, and J. Schmidhuber, Multi-column deep neural networks for image classification, 2012.

R. Kumar-srivastava, K. Greff, and J. Schmidhuber, Training very deep networks, NIPS, 2015.