HAL will be down for maintenance from Friday, June 10 at 4pm through Monday, June 13 at 9am. More information
Skip to Main content Skip to Navigation
Conference papers

An Occam’s Razor View on Learning Audiovisual Emotion Recognition with Small Training Sets

Valentin Vielzeuf 1, 2 Corentin Kervadec 1 Stéphane Pateux 1 Alexis Lechervy 2 Frédéric Jurie 2
2 Equipe Image - Laboratoire GREYC - UMR6072
GREYC - Groupe de Recherche en Informatique, Image et Instrumentation de Caen
Abstract : This paper presents a light-weight and accurate deep neural model for audiovisual emotion recognition. To design this model, the authors followed a philosophy of simplicity, drastically limiting the number of parameters to learn from the target datasets, always choosing the simplest earning methods: i) transfer learning and low-dimensional space embedding allows to reduce the dimensionality of the representations. ii) The isual temporal information is handled by a simple score-per-frame selection process, averaged across time. iii) A simple frame selection echanism is also proposed to weight the images of a sequence. iv) The fusion of the different modalities is performed at prediction level (late usion). We also highlight the inherent challenges of the AFEW dataset and the difficulty of model selection with as few as 383 validation equences. The proposed real-time emotion classifier achieved a state-of-the-art accuracy of 60.64 % on the test set of AFEW, and ranked 4th at he Emotion in the Wild 2018 challenge.
Complete list of metadata

Contributor : Corentin Kervadec Connect in order to contact the contributor
Submitted on : Monday, August 6, 2018 - 11:03:16 AM
Last modification on : Wednesday, November 3, 2021 - 5:12:15 AM
Long-term archiving on: : Wednesday, November 7, 2018 - 1:34:49 PM


Files produced by the author(s)


  • HAL Id : hal-01854019, version 1
  • ARXIV : 1808.02668


Valentin Vielzeuf, Corentin Kervadec, Stéphane Pateux, Alexis Lechervy, Frédéric Jurie. An Occam’s Razor View on Learning Audiovisual Emotion Recognition with Small Training Sets. ICMI (EmotiW) 2018, Oct 2018, Boulder, Colorado, United States. ⟨hal-01854019⟩



Record views


Files downloads