A. Krizhevsky, I. Sutskever, and G. E. Hinton, Imagenet classification with deep convolutional neural networks, Advances in Neural Information Processing Systems 25, pp.1097-1105, 2012.

C. Szegedy, W. Liu, Y. Jia, P. Sermanet, S. Reed et al., Going deeper with convolutions, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2014.
DOI : 10.1109/CVPR.2015.7298594

Y. Baveye, E. Dellandrea, C. Chamaret, and L. Chen, LIRIS-ACCEDE: A Video Database for Affective Content Analysis, IEEE Transactions on Affective Computing, vol.6, issue.1, pp.43-55, 2015.
DOI : 10.1109/TAFFC.2015.2396531

URL : https://hal.archives-ouvertes.fr/hal-01375518

J. Deng, W. Dong, R. Socher, L. Li, K. Li et al., Imagenet: A large-scale hierarchical image database, IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp.248-255, 2009.

A. Hanjalic and L. Xu, Affective video content representation and modeling, IEEE Transactions on Multimedia, vol.7, issue.1, pp.143-154, 2005.
DOI : 10.1109/TMM.2004.840618

S. Zhang, Q. Huang, S. Jiang, W. Gao, and Q. Tian, Affective Visualization and Retrieval for Music Video, IEEE Transactions on Multimedia, vol.12, issue.6, pp.510-522, 2010.
DOI : 10.1109/TMM.2010.2059634

N. Malandrakis, A. Potamianos, G. Evangelopoulos, and A. Zlatintsi, A supervised approach to movie emotion tracking, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp.2376-2379, 2011.
DOI : 10.1109/ICASSP.2011.5946961

H. Drucker, C. J. Burges, L. Kaufman, A. Smola, and V. Vapnik, Support vector regression machines Advances in neural information processing systems, pp.155-161, 1997.

F. Weninger, F. Eyben, B. W. Schuller, M. Mortillaro, and K. R. Scherer, On the Acoustics of Emotion in Audio: What Speech, Music, and Sound have in Common, Frontiers in Psychology, vol.4, pp.1664-1078, 2013.
DOI : 10.3389/fpsyg.2013.00292

I. Kanluan, M. Grimm, and K. Kroschel, Audio-visual emotion recognition using an emotion space concept, 16th European Signal Processing Conference, 2008.

S. Zhang, Q. Tian, Q. Huang, W. Gao, and S. Li, Utilizing affective analysis for efficient movie browsing, 2009 16th IEEE International Conference on Image Processing (ICIP), pp.1853-1856, 2009.
DOI : 10.1109/ICIP.2009.5413590

L. Canini, S. Benini, and R. Leonardi, Affective Recommendation of Movies Based on Selected Connotative Features, IEEE Transactions on Circuits and Systems for Video Technology, pp.636-647, 2013.
DOI : 10.1109/TCSVT.2012.2211935

M. Nicolaou, H. Gunes, and M. Pantic, Continuous Prediction of Spontaneous Affect from Multiple Cues and Modalities in Valence-Arousal Space, IEEE Transactions on Affective Computing, vol.2, issue.2, pp.92-105, 2011.
DOI : 10.1109/T-AFFC.2011.9

Y. Lecun, B. Boser, J. S. Denker, D. Henderson, R. E. Howard et al., Backpropagation Applied to Handwritten Zip Code Recognition, Neural Computation, vol.1, issue.4, pp.541-551, 1989.
DOI : 10.1007/BF00133697

M. Matsugu, K. Mori, Y. Mitari, and Y. Kaneda, Subject independent facial expression recognition with robust face detection using a convolutional neural network, Neural Networks, vol.16, issue.5-6, pp.555-559, 2003.
DOI : 10.1016/S0893-6080(03)00115-1

S. E. Kahou, C. Pal, X. Bouthillier, P. Froumenty, R. Gülçehre et al., Combining modality specific deep neural networks for emotion recognition in video, Proceedings of the 15th ACM on International Conference on Multimodal Interaction, ser. ICMI '13, pp.543-550, 2013.

R. Cowie, M. Sawey, C. Doherty, J. Jaimovich, C. Fyans et al., Gtrace: General Trace Program Compatible with EmotionML, 2013 Humaine Association Conference on Affective Computing and Intelligent Interaction, pp.709-710, 2013.
DOI : 10.1109/ACII.2013.126

M. M. Bradley and P. J. Lang, Measuring emotion: The self-assessment manikin and the semantic differential, Journal of Behavior Therapy and Experimental Psychiatry, vol.25, issue.1, pp.49-59, 1994.
DOI : 10.1016/0005-7916(94)90063-9

K. R. Scherer, Appraisal considered as a process of multi-level sequential checking, " in Appraisal processes in emotion: Theory, Methods, pp.92-120, 2001.

A. Metallinou and S. Narayanan, Annotation and processing of continuous emotional attributes: Challenges and opportunities, 2013 10th IEEE International Conference and Workshops on Automatic Face and Gesture Recognition (FG), pp.1-8, 2013.
DOI : 10.1109/FG.2013.6553804

S. Mariooryad and C. Busso, Analysis and Compensation of the Reaction Lag of Evaluators in Continuous Emotional Annotations, 2013 Humaine Association Conference on Affective Computing and Intelligent Interaction, p.2013
DOI : 10.1109/ACII.2013.21

M. Nicolaou, H. Gunes, and M. Pantic, Continuous Prediction of Spontaneous Affect from Multiple Cues and Modalities in Valence-Arousal Space, IEEE Transactions on Affective Computing, vol.2, issue.2, pp.92-105, 2011.
DOI : 10.1109/T-AFFC.2011.9

M. Soleymani, M. N. Caro, E. M. Schmidt, C. Sha, and Y. Yang, 1000 songs for emotional analysis of music, Proceedings of the 2nd ACM international workshop on Crowdsourcing for multimedia, CrowdMM '13, pp.1-6, 2013.
DOI : 10.1145/2506364.2506365

J. J. Randolph, Free-marginal multirater kappa (multirater ? f ree ): An alternative to fleiss fixed-marginal multirater kappa, Paper presented at the Joensuu University Learning and Instruction Symposium, 2005.

J. R. Landis and G. G. Koch, The Measurement of Observer Agreement for Categorical Data, Biometrics, vol.33, issue.1, pp.159-174, 1977.
DOI : 10.2307/2529310

Y. Jia, E. Shelhamer, J. Donahue, S. Karayev, J. Long et al., Caffe, Proceedings of the ACM International Conference on Multimedia, MM '14, 2014.
DOI : 10.1145/2647868.2654889

N. Srivastava, G. Hinton, A. Krizhevsky, I. Sutskever, and R. Salakhutdinov, Dropout: A simple way to prevent neural networks from overfitting, The Journal of Machine Learning Research, vol.15, issue.1, pp.1929-1958, 2014.