J. Andén and S. Mallat, Deep scattering spectrum, IEEE Transactions on Signal Processing, vol.62, issue.16, pp.4114-4128, 2014.

R. Balestriero, H. Glotin, and R. Baraniuk, Semisupervised learning enabled by multiscale deep neural network inversion, 2018.

, Spline Filters For End-to-End Deep Learning Brown, J. C. Calculation of a constant q spectral transform, Journal of the Acoustical Society of America, vol.89, issue.1, pp.425-434, 1991.

E. Cakir, E. C. Ozan, and T. Virtanen, Filterbank learning for deep neural network based polyphonic sound event detection, Neural Networks (IJCNN), 2016 International Joint Conference on, pp.3399-3406, 2016.

R. W. Clough, Original formulation of the finite element method. Finite Elements in Analysis and Design, vol.7, pp.89-101, 1990.

T. Cohen and M. Welling, Group equivariant convolutional networks, International Conference on Machine Learning, pp.2990-2999, 2016.

R. Cosentino, R. Balestriero, A. , and B. , Best basis selection using sparsity driven multi-family wavelet transform, IEEE Global Conference on Signal and Information Processing (GlobalSIP), pp.252-256, 2016.

R. Cosentino, R. Balestriero, R. Baraniuk, P. , and A. , Overcomplete frame thresholding for acoustic scene analysis, 2017.

W. Dai, C. Dai, S. Qu, J. Li, and S. Das, Very deep convolutional neural networks for raw waveforms, Acoustics, Speech and Signal Processing (ICASSP), pp.421-425, 2017.

H. Glotin, J. Ricard, and R. Balestriero, Fast chirplet transform injects priors in deep learning of animal calls and speech, International Conference on Learning Representations (ICLR, 2017.

T. Grill and J. Schlüter, Two convolutional neural networks for bird detection in audio signals, Proceedings of the 25th European Signal Processing Conference (EUSIPCO), 2017.

C. A. Hall and W. W. Meyer, Optimal error bounds for cubic spline interpolation, Journal of Approximation Theory, vol.16, issue.2, pp.105-122, 1976.

F. J. Harris, On the use of windows for harmonic analysis with the discrete Fourier transform, Proceedings of the IEEE, vol.66, issue.1, pp.51-83, 1978.

K. He, X. Zhang, S. Ren, and J. Sun, Deep residual learning for image recognition, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp.770-778, 2016.

S. Jaffard, Y. Meyer, R. , and R. , Wavelets: Tools for Science and Technology, Other Titles in Applied Mathematics. Society for Industrial and Applied Mathematics, 2001.

A. Krizhevsky, I. Sutskever, and G. E. Hinton, Imagenet classification with deep convolutional neural networks, Advances in neural information processing systems, pp.1097-1105, 2012.

Y. Lecun, Y. Bengio, and G. Hinton, Deep learning. Nature, vol.521, pp.436-444, 2015.

M. K. Leung, H. Y. Xiong, L. J. Lee, and B. J. Frey, Deep learning of the tissue-regulated splicing code. Bioinformatics, vol.30, pp.121-129, 2014.

V. Lostanlen, Opérateurs convolutionnels dans le plan temps-fréquence, 2017.

S. Mallat, A Wavelet Tour of Signal Processing, 1999.

A. Megahed, A. M. Moussa, H. Elrefaie, and Y. Marghany, Selection of a suitable mother wavelet for analyzing power system fault transients, Power, Energy Society General Meeting-Conversion, Delivery of Electrical Energy in the 21st Century, pp.1-7, 2008.

Y. Meyer, Wavelets-Algorithms and Applications. WaveletsAlgorithms and applications Society for Industrial and Applied Mathematics Translation, vol.1, 1993.

B. A. Olshausen and D. J. Field, Emergence of simple-cell receptive field properties by learning a sparse code for natural images, Nature, vol.381, issue.6583, p.607, 1996.

D. A. Ramli and H. Jaafar, Peak finding algorithm to improve syllable segmentation for noisy bioacoustic sound signals, Procedia Computer Science, vol.96, pp.100-109, 2016.

T. N. Sainath, R. J. Weiss, A. Senior, K. W. Wilson, and O. Vinyals, Learning the speech front-end with raw waveform cldnns, Sixteenth Annual Conference of the International Speech Communication Association, 2015.

I. J. Schoenberg, On interpolation by spline functions and its minimal properties, On Approximation Theory, pp.109-129, 1964.

R. Serizel, V. Bisot, S. Essid, R. , and G. , Acoustic features for environmental sound analysis, Computational Analysis of Sound Scenes and Events, pp.71-101, 2018.
URL : https://hal.archives-ouvertes.fr/hal-01575619

, Spline Filters For End-to-End Deep Learning

C. A. Shera, J. J. Guinan, and A. J. Oxenham, Revised estimates of human cochlear tuning from otoacoustic and behavioral measurements, Proceedings of the National Academy of Sciences, vol.99, issue.5, pp.3318-3323, 2002.

D. Stowell and M. D. Plumbley, An open dataset for research on audio field recording archives: freefield1010. CoRR, abs/1309, vol.5275, 2013.

G. Trigeorgis, F. Ringeval, R. Brueckner, E. Marchi, M. A. Nicolaou et al., Adieu features? end-to-end speech emotion recognition using a deep convolutional recurrent network, Acoustics, Speech and Signal Processing (ICASSP), pp.5200-5204, 2016.

M. Trone, H. Glotin, R. Balestriero, and D. E. Bonnett, Enhanced feature extraction using the morlet transform on 1 mhz recordings reveals the complex nature of amazon river dolphin (inia geoffrensis) clicks, Journal of the Acoustical Society of America, vol.138, issue.3, pp.1904-1904, 2015.

M. A. Unser, Ten good reasons for using spline wavelets, Wavelet Applications in Signal and Image Processing V, vol.3169, pp.422-432, 1997.

C. Xu, C. Wang, and W. Liu, Nonstationary vibration signal analysis using wavelet-based time-frequency filter and Wigner-Ville distribution, Journal of Vibration and Acoustics, vol.138, issue.5, p.51009, 2016.

N. Zeghidour, N. Usunier, I. Kokkinos, T. Schatz, G. Synnaeve et al., Learning filterbanks from raw speech for phone recognition, 2017.