Modout: Learning to Fuse Face and Gesture Modalities with Stochastic Regularization

Abstract : Model selection methods based on stochastic regularization such as Dropout have been widely used in deep learning due to their simplicity and effectiveness. The standard Dropout method treats all units, visible or hidden, in the same way, thus ignoring any \emph{a priori} information related to grouping or structure. Such structure is present in multi-modal learning applications such as affect analysis and gesture recognition, where subsets of units may correspond to individual modalities. In this paper we describe Modout, a model selection method based on stochastic regularization, which is particularly useful in the multi-modal setting. Different from previous methods, it is capable of learning whether or when to fuse two modalities in a layer, which is usually considered to be an architectural hyper-parameter by deep learning researchers and practitioners. Modout is evaluated on one synthetic and two real multi-modal datasets. The results indicate improved performance compared to other stochastic regularization methods. The result on the Montalbano dataset shows that learning a fusion structure by Modout is on par with a state-of-the-art carefully designed architecture.
Type de document :
Communication dans un congrès
International Conference on Automatic Face and Gesture Recognition, May 2017, Washington D.C., United States. Face and Gestures, 2017
Liste complète des métadonnées

https://hal.archives-ouvertes.fr/hal-01444614
Contributeur : Christian Wolf <>
Soumis le : mardi 24 janvier 2017 - 11:27:01
Dernière modification le : vendredi 10 novembre 2017 - 01:20:24

Identifiants

  • HAL Id : hal-01444614, version 1

Collections

Citation

Fan Li, Natalia Neverova, Christian Wolf, Graham W. Taylor. Modout: Learning to Fuse Face and Gesture Modalities with Stochastic Regularization . International Conference on Automatic Face and Gesture Recognition, May 2017, Washington D.C., United States. Face and Gestures, 2017. 〈hal-01444614〉

Partager

Métriques

Consultations de la notice

246