G. Tzanetakis and P. Cook, Musical genre classification of audio signals, IEEE Transactions on Speech and Audio Processing, vol.10, issue.5, pp.293-302, 2002.

Y. M. Costa, L. S. Oliveira, and C. N. Silla, An evaluation of convolutional neural networks for music classification using spectrograms, Applied Soft Computing, vol.52, pp.28-38, 2017.

M. Wu and J. R. Jang, Combining acoustic and multilevel visual features for music genre classification, ACM Trans. Multimedia Comput. Commun. Appl, vol.12, issue.1, 2015.

L. Nanni, Y. M. Costa, A. Lumini, M. Y. Kim, and S. R. Baek, Combining visual and acoustic features for music genre classification, Expert Syst. Appl, vol.45, pp.108-117, 2016.

J. Pons, T. Lidy, and X. Serra, Experimenting with musically motivated convolutional neural networks, 14th International Workshop on Content-based Multimedia Indexing, 2016.

T. Lidy and A. Schindler, Parallel convolutional neural networks for music genre and mood classification, Tech. Rep, 2016.

F. Mouret, Personalized Music Recommendation Based on Audio Features, INP ENSEEIHT, 2016.

W. Zhang, W. Lei, X. Xu, and X. Xing, Improved music genre classification with convolutional neural networks, pp.3304-3308, 2016.

O. Lartillot, P. Toiviainen, and T. Eerola, A MATLAB toolbox for music information retrieval, Data Analysis, Machine Learning and Applications, ser. Studies in Classification, Data Analysis, and Knowledge Organization

P. Laukka, P. Juslin, and R. Bresin, A dimensional approach to vocal expression of emotion, Cognition and Emotion, vol.19, issue.5, pp.633-653, 2005.

A. Gray and J. Markel, A spectral-flatness measure for studying the autocorrelation method of linear prediction of speech analysis, IEEE Transactions on Acoustics, Speech, and Signal Processing, vol.22, issue.3, pp.207-217, 1974.

C. E. Shannon and W. Weaver, The mathematical theory of communication, vol.29, 1949.
URL : https://hal.archives-ouvertes.fr/hal-01774215

W. A. Sethares, Tuning, timbre, spectrum, scale, 2005.

C. Harte, M. Sandler, and M. Gasser, Detecting harmonic change in musical audio, Proceedings of the 1st ACM Workshop on Audio and Music Computing Multimedia, pp.21-26, 2007.

X. Glorot, A. Bordes, and Y. Bengio, Deep sparse rectifier neural networks, Proceedings of the Fourteenth International Conference on Artificial Intelligence and Statistics (AISTATS-11), vol.15, pp.315-323, 2011.
URL : https://hal.archives-ouvertes.fr/hal-00752497

B. L. Sturm, An analysis of the GTZAN music genre dataset, Proceedings of the Second International ACM Workshop on Music Information Retrieval with User-centered and Multimodal Strategies, ser. MIRUM '12, pp.7-12, 2012.

B. Efron, Bootstrap methods: Another look at the jackknife, Ann. Statist, vol.7, issue.1, pp.1-26, 1979.