G. E. Hinton, S. Osindero, and Y. Teh, A Fast Learning Algorithm for Deep Belief Nets, Neural Computation, vol.18, issue.7, pp.1527-1554, 2006.
DOI : 10.1162/neco.2006.18.7.1527

Y. LeCun, Une procédure d'apprentissage pour réseau à seuil asymétrique (a learning scheme for asymmetric threshold networks), Cognitiva 85, 1985.

D. E. Rumelhart, G. E. Hinton, and R. J. Williams, Learning representations by back-propagating errors, Nature, vol.323, issue.6088, pp.533-536, 1986.
DOI : 10.1038/323533a0

Y. Bengio, Learning Deep Architectures for AI, Foundations and Trends in Machine Learning, vol.2, issue.1, pp.1-127, 2009.
DOI : 10.1561/2200000006

Y. Bengio, P. Lamblin, D. Popovici, and H. Larochelle, Greedy layer-wise training of deep networks, NIPS, 2006.

M. Ranzato, C. Poultney, S. Chopra, and Y. LeCun, Efficient learning of sparse representations with an energy-based model, NIPS, 2006.

P. Vincent, H. Larochelle, Y. Bengio, and P. Manzagol, Extracting and composing robust features with denoising autoencoders, Proceedings of the 25th international conference on Machine learning, ICML '08, 2008.
DOI : 10.1145/1390156.1390294

P. Smolensky, Information processing in dynamical systems: Foundations of harmony theory, Parallel Distributed Processing, pp.194-281, 1986.

G. E. Hinton, Training Products of Experts by Minimizing Contrastive Divergence, Neural Computation, vol.14, issue.8, pp.1771-1800, 2002.
DOI : 10.1162/089976602760128018

H. Lee, C. Ekanadham, and A. Ng, Sparse deep belief net model for visual area V2, NIPS, 2008.

H. Goh, N. Thome, and M. Cord, Biasing restricted Boltzmann machines to manipulate latent selectivity and sparsity, 2010.
URL : https://hal.archives-ouvertes.fr/hal-00716050

H. Larochelle and Y. Bengio, Classification using discriminative restricted Boltzmann machines, Proceedings of the 25th international conference on Machine learning, ICML '08, 2008.
DOI : 10.1145/1390156.1390224

N. Le Roux and Y. Bengio, Representational power of restricted Boltzmann machines and deep belief networks, Neural Computation, vol.20, pp.1631-1649, 2008.

I. Sutskever and G. E. Hinton, Learning multilevel distributed representations for high-dimensional sequences, AISTATS, 2007.

G. E. Hinton and R. Salakhutdinov, Reducing the Dimensionality of Data with Neural Networks, Science, vol.313, issue.5786, pp.504-507, 2006.
DOI : 10.1126/science.1127647

Y. LeCun, L. Bottou, Y. Bengio, and P. Haffner, Gradient-based learning applied to document recognition, Proceedings of the IEEE, vol.86, issue.11, pp.2278-2324, 1998.
DOI : 10.1109/5.726791

L. Fei-Fei, R. Fergus, and P. Perona, Learning generative visual models from few training examples: An incremental Bayesian approach tested on 101 object categories, Computer Vision and Image Understanding, vol.106, issue.1, pp.59-70, 2007.
DOI : 10.1016/j.cviu.2005.09.012

R. Salakhutdinov and G. E. Hinton, Learning a nonlinear embedding by preserving class neighbourhood structure, AISTATS, 2007.

L. Deng and D. Yu, Deep convex net: A scalable architecture for speech pattern classification, Interspeech, 2011.

D. C. Cireşan, U. Meier, and J. Schmidhuber, Multi-column deep neural networks for image classification, CVPR, 2012.

D. Lowe, Object recognition from local scale-invariant features, Proceedings of the Seventh IEEE International Conference on Computer Vision, 1999.
DOI : 10.1109/ICCV.1999.790410

Y. Boureau, F. Bach, Y. LeCun, and J. Ponce, Learning mid-level features for recognition, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2010.
DOI : 10.1109/CVPR.2010.5539963

H. Goh, N. Thome, M. Cord, and J. Lim, Unsupervised and Supervised Visual Codes with Restricted Boltzmann Machines, ECCV, 2012.
DOI : 10.1007/978-3-642-33715-4_22
URL : https://hal.archives-ouvertes.fr/hal-00816428

S. Lazebnik, C. Schmid, and J. Ponce, Beyond Bags of Features: Spatial Pyramid Matching for Recognizing Natural Scene Categories, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, Volume 2 (CVPR'06), 2006.
DOI : 10.1109/CVPR.2006.68
URL : https://hal.archives-ouvertes.fr/inria-00548585

S. Avila, N. Thome, M. Cord, E. Valle, and A. Araújo, Pooling in image representation: The visual codeword point of view, Computer Vision and Image Understanding, vol.117, issue.5, pp.453-465, 2013.
DOI : 10.1016/j.cviu.2012.09.007
URL : https://hal.archives-ouvertes.fr/hal-01172709

C. Theriault, N. Thome, and M. Cord, Extended Coding and Pooling in the HMAX Model, IEEE Transactions on Image Processing, vol.22, issue.2, 2013.
DOI : 10.1109/TIP.2012.2222900
URL : https://hal.archives-ouvertes.fr/hal-01185467

K. Sohn, D. Y. Jung, H. Lee, and A. O. Hero III, Efficient learning of sparse, distributed, convolutional feature representations for object recognition, ICCV, 2011.

K. Sohn, G. Zhou, C. Lee, and H. Lee, Learning and selecting features jointly with point-wise gated Boltzmann machines, ICML, 2013.