Y. Lecun, L. Bottou, Y. Bengio, and P. Haffner, Gradient-based learning applied to document recognition, Proceeding of the IEEE, 1998.

K. Chellapilla, S. Puri, and P. Simard, High performance convolutional neural networks for document processing, Proc. of the Int. Workshop on Frontiers in Handwriting Recognition (IWFHR'06), 2006.
URL : https://hal.archives-ouvertes.fr/inria-00112631

M. Delakis and C. Garcia, Text detection with convolutional neural networks, VISAPP 2008: Proceedings of the Third International Conference on Computer Vision Theory and Applications, vol.2, pp.290-294, 2008.

K. Elagouni, C. Garcia, F. Mamalet, and P. Sébillot, Text recognition in multimedia documents: a study of two neural-based OCRs using and avoiding character segmentation, IJDAR, vol.17, issue.1, pp.19-31, 2014.
URL : https://hal.archives-ouvertes.fr/hal-00867225

R. Vaillant, C. Monrocq, and Y. Le-cun, An original approach for the localization of objects in images, IEE Proc on Vision, Image, and Signal Processing, pp.245-250, 1994.

C. Garcia and M. Delakis, Convolutional face finder: A neural architecture for fast and robust face detection, IEEE Trans. Pattern Anal. Mach. Intell, vol.26, issue.11, pp.1408-1423, 2004.

M. Osadchy, Y. Lecun, M. L. Miller, and P. Perona, Synergistic face detection and pose estimation with energy-based model, Proc. of Advances in Neural Information Processing Systems (NIPS'05), 2005.

D. C. Ciresan, U. Meier, J. Masci, and J. Schmidhuber, Multi-column deep neural network for traffic sign classification, Neural Networks, vol.32, pp.333-338, 2012.

P. Sermanet, K. Kavukcuoglu, S. Chintala, and Y. Lecun, Pedestrian detection with unsupervised multi-stage feature learning, 2013 IEEE Conference on Computer Vision and Pattern Recognition, pp.3626-3633, 2013.

R. Hadsell, P. Sermanet, M. Scoffier, A. Erkan, K. Kavackuoglu et al., Learning long-range vision for autonomous off-road driving, Journal of Field Robotics, 2009.

P. Sermanet, S. Chintala, and Y. Lecun, Convolutional neural networks applied to house numbers digit classification, Proceedings of the 21st International Conference on Pattern Recognition, ICPR 2012, pp.3288-3291, 2012.

A. Krizhevsky, I. Sutskever, and G. E. Hinton, Imagenet classification with deep convolutional neural networks, Advances in Neural Information Processing Systems 25: 26th Annual Conference on Neural Information Processing Systems, pp.1106-1114, 2012.

K. Simonyan and A. Zisserman, Very deep convolutional networks for large-scale image recognition, CoRR, 2014.

D. E. Rumelhart, G. E. Hinton, and R. J. Williams, Learning representations by back-propagating errors, Nature, vol.323, pp.533-536, 1986.

P. Sermanet, D. Eigen, X. Zhang, M. Mathieu, R. Fergus et al., OverFeat: Integrated recognition, localization and detection using convolutional networks, CoRR, 2013.

P. K. Meher, S. Y. Park, B. K. Mohanty, K. S. Lim, and C. Yeo, Efficient integer DCT architectures for HEVC, IEEE Transactions on Circuits and Systems for Video Technology, vol.24, pp.168-178, 2014.

Y. Chen, S. Oraintara, and T. Nguyen, Video compression using integer DCT, Proceedings 2000 International Conference on Image Processing, vol.2, pp.844-845, 2000.

A. M. Joshi, V. Mishra, and R. M. Patrikar, Design of real-time video watermarking based on integer dct for H.264 encoder, International Journal of Electronics, vol.102, issue.1, pp.141-155, 2015.

V. A. Coutinho, R. J. Cintra, F. M. Bayer, P. A. Oliveira, R. S. Oliveira et al., Pruned discrete Tchebichef transform approximation for image compression, Circuits, Systems, and Signal Processing, 2018.

D. Scherer, H. Schulz, and S. Behnke, Accelerating large-scale convolutional neural networks with parallel graphics multiprocessors, Proceedings of theInternational Conference on Artificial Neural Networks, pp.82-91, 2010.

B. White and M. Elmasry, The digi-neocognitron: A digital neocognitron neural network model for VLSI, IEEE Transactions on Neural Networks, vol.3, issue.1, pp.73-85, 1992.

M. Marchesi, G. Orlando, F. Piazza, and A. Uncini, Fast neural networks without multipliers, IEEE Transactions on Neural Networks, vol.4, issue.1, pp.53-62, 1993.

H. Kwan and C. Tang, Multiplyerless multilayer feedforward neural network design suitable for continuous input-output mapping, Electronic Letters, vol.29, issue.14, pp.1259-1260, 1993.

J. Vincent and D. Myers, Weight dithering and wordlength selection for digital backpropagation networks, BT Technology Journal, vol.10, issue.3, pp.124-133, 1992.

P. Simard and H. P. Graf, Backpropagation without multiplication, Proceedings of theAnnual Conference on Neural Information Processing Systems, pp.232-239, 1994.

S. Draghici, On the capabilities of neural networks using limited precision weights, Neural Networks, vol.15, issue.3, pp.395-414, 2002.

E. L. Machado, C. J. Miosso, R. Borries, M. Coutinho, P. De-azevedo et al., Computational cost reduction in learned transform classifications, 2015.

M. Courbariaux, Y. Bengio, and J. David, BinaryConnect: Training deep neural networks with binary weights during propagations, Proceedings of theAnnual Conference on Neural Information Processing Systems, 2015.

M. Kim and S. Paris, Bitwise neural networks, ICML Workshop on Resource-Efficient Machine Learning, 2015.

F. Mamalet, S. Roux, and C. Garcia, Real-time video convolutional face finder on embedded platforms, EURASIP J. Emb. Sys, 2007.

N. Farrugia, F. Mamalet, S. Roux, F. Yang, and M. Paindavoine, Fast and robust face detection on a parallel optimized architecture implemented on FPGA, IEEE Trans. Circuits Syst. Video Techn, vol.19, issue.4, pp.597-602, 2009.
URL : https://hal.archives-ouvertes.fr/hal-00640765

C. Farabet, B. Martini, P. Akselrod, S. Talay, Y. Lecun et al., Hardware accelerated convolutional neural networks for synthetic vision systems, International Symposium on Circuits and Systems (ISCAS 2010), pp.257-260, 2010.

S. Chakradhar, M. Sankaradas, V. Jakkula, and S. Cadambi, A dynamically configurable coprocessor for convolutional neural networks, Proceedings of the International Symposium on Computer Architecture, pp.247-257, 2010.

T. Chen, Z. Du, N. Sun, J. Wang, C. Wu et al., DianNao: A small-footprint high-throughput accelerator for ubiquitous machine-learning, Proceedings of the International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS), pp.269-284, 2014.

C. Zhang, P. Li, G. Sun, Y. Guan, B. Xiao et al., Optimizing FPGA-based accelerator design for deep convolutional neural networks, Proceedings of the ACM/SIGDA International Symposium on Field-Programmable Gate Arrays -FPGA, pp.161-170, 2015.

F. Mamalet and C. Garcia, Simplifying convnets for fast learning, Artificial Neural Networks and Machine Learning -ICANN 2012 -22nd International Conference on Artificial Neural Networks, pp.58-65, 2012.
URL : https://hal.archives-ouvertes.fr/hal-01353039

M. Mathieu, M. Henaff, and Y. Lecun, Fast training of convolutional networks through FFTs, Proceedings of theInternational Conference on Learning Representations, 2014.

V. Vanhoucke, A. Senior, and M. Mao, Improving the speed of neural networks on CPUs, Proceedings of the Deep Learning and Unsupervised Feature Learning NIPS Workshop, 2011.

K. Osawa and R. Yokota, Evaluating the compression efficiency of the filters in convolutional neural networks, Artificial Neural Networks and Machine Learning -ICANN 2017, vol.10614, pp.459-466, 2017.

J. Xue, J. Li, and Y. Gong, Restructuring of deep neural network acoustic models with singular value decomposition, 2013.

T. N. Sainath, B. Kingsbury, V. Sindhwani, E. Arisoy, and B. Ramabhadran, Low-rank matrix factorization for deep neural network training with high-dimensional output targets, ICASSP, pp.6655-6659, 2013.

M. Denil, B. Shakibi, L. Dinh, and N. De-freitas, Predicting parameters in deep learning, Proceedings of theAnnual Conference on Neural Information Processing Systems, 2013.

E. Denton, W. Zaremba, and J. Bruna, Exploiting linear structure within convolutional networks for efficient evaluation, Proceedings of theAnnual Conference on Neural Information Processing Systems, 2014.

Z. Yang, M. Moczulski, M. Denil, N. D. Freitas, A. Smola et al., Deep Fried Convnets, Proceedings of theInternational Conference on Computer Vision, 2014.

M. Jaderberg, A. Vedaldi, and A. Zisserman, Speeding up convolutional neural networks with low rank expansions, Proceedings of theBritish Machine Vision Conference, 2014.

V. Lebedev, Y. Ganin, M. Rakhuba, I. Oseledets, and V. Lempitsky, Speeding up convolutional neural networks using fine-tuned CP decomposition, Proceedings of theInternational Conference on Learning Representations, 2015.

M. Courbariaux, Y. Bengio, and J. David, Binaryconnect: Training deep neural networks with binary weights during propagations, Advances in Neural Information Processing Systems, pp.3123-3131, 2015.

M. Courbariaux and Y. Bengio, Binarynet: Training deep neural networks with weights and activations constrained to +1 or ?1, CoRR, 2016.

M. Rastegari, V. Ordonez, J. Redmon, and A. Farhadi, XNOR-Net: Imagenet classification using binary convolutional neural networks, CoRR, 2016.

M. Courbariaux, Y. Bengio, and J. David, Low precision arithmetic for deep learning, CoRR, 2014.

D. Miyashita, E. H. Lee, and B. Murmann, Convolutional neural networks using logarithmic data representation, CoRR, 2016.

T. H. Cormen, C. E. Leiserson, R. L. Rivest, and C. Stein, Introduction to Algorithms, 2009.

S. Haykin, Neural Networks, 1999.

R. J. Cintra, An integer approximation method for discrete sinusoidal transforms, Circuits, Systems, and Signal Processing, vol.30, issue.6, pp.1481-1501, 2011.

P. A. Oliveira, R. J. Cintra, F. M. Bayer, S. Kulasekera, and A. Madanayake, A discrete Tchebichef transform approximation for image and video coding, IEEE Signal Processing Letters, vol.22, pp.1137-1141, 2015.

G. A. Seber, A Matrix Handbook for Statisticians, 2007.

C. J. Tablada, F. M. Bayer, and R. J. Cintra, A class of DCT approximations based on the Feig-Winograd algorithm, Signal Processing, vol.113, pp.38-51, 2015.

M. T. Tommiska, Efficient digital implementation of the sigmoid function for reprogrammable logic, IEE Proceedings -Computers and Digital Techniques, vol.150, pp.403-411, 2003.

M. Zhang, S. Vassiliadis, and J. G. Delgado-frias, Sigmoid generators for neural computing using piecewise approximations, IEEE Transactions on Computers, vol.45, pp.1045-1049, 1996.

K. Basterretxea, J. M. Tarela, and I. Del-campo, Approximation of sigmoid function and the derivative for hardware implementation of artificial neurons, IEE Proceedings -Circuits, Devices and Systems, vol.151, pp.18-24, 2004.

S. Bouguezel, M. O. Ahmad, and M. N. Swamy, A low-complexity parametric transform for image compression, IEEE International Symposium on Circuits and Systems, pp.2145-2148, 2011.

S. Bouguezel, M. O. Ahmad, and M. N. Swamy, A multiplicationfree transform for image compression, 2nd International Conference on Signals, Circuits and Systems, pp.1-4, 2008.

S. Bouguezel, M. O. Ahmad, and M. N. Swamy, Low-complexity 8x8 transform for image compression, Electronics Letters, vol.44, pp.1249-1250, 2008.

V. Britanak, P. Yip, and K. R. Rao, Discrete Cosine and Sine Transforms, 2007.

N. Ahmed, T. Natarajan, and K. R. Rao, Discrete cosine transform, IEEE Transactions on Computers, issue.23, pp.90-93, 1974.

A. G. Dempster and M. D. Macleod, Constant integer multiplication using minimum adders, IEE Proceedings -Circuits, Devices and Systems, vol.141, pp.407-413, 1994.

R. J. Cintra, F. M. Bayer, and C. J. Tablada, Low-complexity 8-point DCT approximations based on integer functions, Signal Processing, vol.99, pp.201-214, 2014.

F. M. Bayer and R. J. Cintra, DCT-like transform for image compression requires 14 additions only, Electronics Letters, vol.48, pp.919-921, 2012.

J. Lee and S. Leyffer, Mixed Integer Nonlinear Programming, The IMA Volumes in Mathematics and its Applications, 2011.

C. H. Papadimitriou and K. Steiglitz, Combinatorial Optimization: Algorithms and Complexity, 1998.

D. Bienstock and G. Nemhauser, Integer Programming and Combinatorial Optimization, Lecture Notes in Computer Science, 2004.

J. K. Lenstra and A. H. Kan, Computational complexity of discrete optimization problems, Annals of Discrete Mathematics, vol.4, pp.121-140, 1979.

. R-core-team, R: A Language and Environment for Statistical Computing. R Foundation for Statistical Computing, 2013.

N. Megiddo, Linear programming in linear time when the dimension is fixed, Journal of the ACM (JACM), vol.31, issue.1, pp.114-127, 1984.

W. Burger and M. Burge, Digital Image Processing: An Algorithmic Introduction Using Java. Texts in Computer Science, 2016.

Y. Lecun, Generalization and network designs strategies, 1989.

Y. Lecun, Generalization and network design strategies, Proceedings of the International Conference Connectionism in Perspective, pp.10-13, 1988.

M. Zhang, S. Vassiliadis, and J. G. Delgado-frias, Sigmoid generators for neural computing using piecewise approximations, IEEE Transactions on Computers, vol.45, pp.1045-1049, 1996.

D. Larkin, A. Kinane, V. Muresan, and N. O'connor, An efficient hardware architecture for a neural network activation function generator, Advances in Neural Networks -ISNN, pp.1319-1327, 2006.

O. Temam, A defect-tolerant accelerator for emerging high-performance applications, Proceedings of the International Symposium on Computer Architecture, pp.356-367, 2012.

J. Schlessman, Approximation of the sigmoid function and its derivative using a minimax approach, 2002.

C. Alippi and G. Storti-gajani, Simple approximation of sigmoidal functions: realistic design of digital neural networks capable of learning, Proc. of IEEE Int. Symp. on Circuits and Systems, pp.1505-1508, 1991.

H. Amin, K. M. Curtis, and B. R. Hayes-gill, Piecewise linear approximation applied to nonlinear function of a neural network, IEE Proc. Circuits, Devices Sys, vol.144, issue.6, pp.313-317, 1997.

R. E. Blahut, Fast Algorithms for Digital Signal Processing, 2010.

U. S. Potluri, A. Madanayake, R. J. Cintra, F. M. Bayer, S. Kulasekera et al., Improved 8-point approximate DCT for image and video compression requiring only 14 additions, IEEE Transactions on Circuits and Systems I, vol.61, pp.1727-1740, 2014.

M. Mathias, R. Benenson, M. Pedersoli, and L. Van-gool, Face detection without bells and whistles, Proceesings of the European Conference on Computer Vision, pp.720-735, 2014.

D. C. Ciresan, U. Meier, and J. Schmidhuber, Multi-column deep neural networks for image classification, Proceedings of the International Conference onComputer Vision and Pattern Recognition, 2012.

V. Jain and E. Learned-miller, FDDB: A benchmark for face detection in unconstrained settings, 2010.

X. Zhu and D. Ramanan, Face detection, pose estimation, and landmark localization in the wild, Proceedings of the International Conference onComputer Vision and Pattern Recognition, 2012.

J. Yan, X. Zhang, Z. Lei, and S. Li, Face detection by structural models, Image and Vision Computing, vol.32, issue.10, pp.790-799, 2014.

Y. Lecun, C. Cortes, and C. J. Burges, The MNIST database of handwritten digits, 2015.