Pattern Recognition Letters Learning Off-line vs. On-line Models of Interactive Multimodal Behaviors with Recurrent Neural Networks

Duc Canh Nguyen; Gérard Bailly; Frédéric Elisei

doi:10.1016/j.patrec.2017.09.033

Journal article, Pattern Recognition Letters, Year: 2017

Duc Canh Nguyen

Role: Author

GIPSA - Cognitive Robotics, Interactive Systems, & Speech Processing

Gérard Bailly

Role: Author
PersonId : 444
IdHAL : gerard-bailly
ORCID : 0000-0002-6053-0818
IdRef : 033792135

GIPSA - Cognitive Robotics, Interactive Systems, & Speech Processing

Frédéric Elisei

Role: Author
PersonId : 17769
IdHAL : frederic-elisei
ORCID : 0000-0002-1295-3445

GIPSA-Services

GIPSA - Cognitive Robotics, Interactive Systems, & Speech Processing

Abstract

Human interactions are driven by multi-level perception-action loops. Interactive behavioral models are typically built with rule-based methods or with statistical approaches such as Hidden Markov Models (HMMs) and Dynamic Bayesian Networks (DBNs). In this paper, we present our multimodal interactive data and a behavioral model based on recurrent neural networks, namely Long Short-Term Memory (LSTM) and Bidirectional LSTM (BiLSTM) models. The speech, gaze and gestures of two subjects involved in a collaborative task are modeled jointly. The results show that the proposed deep neural networks are more effective than the conventional statistical methods at generating appropriate overt actions for both on-line and off-line prediction tasks.
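The distinction between on-line and off-line prediction can be illustrated with a small sketch. It is not the authors' implementation: it assumes the usual reading that a unidirectional LSTM only needs past frames (so it can run on-line), while a BiLSTM also reads the sequence backwards and therefore needs the whole interaction (off-line). The scalar cell, the fixed `WEIGHTS` values and the input sequences are hypothetical, chosen only to keep the example self-contained.

```python
import math

def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))

# Hypothetical fixed weights for a scalar LSTM cell: (w_x, w_h, bias) per gate.
WEIGHTS = {
    "i": (0.5, 0.1, 0.0),   # input gate
    "f": (0.3, -0.2, 1.0),  # forget gate
    "o": (0.4, 0.2, 0.0),   # output gate
    "g": (0.9, 0.3, 0.0),   # candidate cell value
}

def lstm_step(x, h, c):
    """One step of a scalar LSTM cell; returns the new (h, c)."""
    def pre(gate):
        w_x, w_h, b = WEIGHTS[gate]
        return w_x * x + w_h * h + b
    i, f, o = sigmoid(pre("i")), sigmoid(pre("f")), sigmoid(pre("o"))
    g = math.tanh(pre("g"))
    c = f * c + i * g
    h = o * math.tanh(c)
    return h, c

def run_lstm(xs):
    """Causal (on-line) pass: the output at frame t uses xs[0..t] only."""
    h, c, out = 0.0, 0.0, []
    for x in xs:
        h, c = lstm_step(x, h, c)
        out.append(h)
    return out

def run_bilstm(xs):
    """Off-line pass: pair each forward output with a backward-pass output.

    Simplification: both directions share WEIGHTS here; a real BiLSTM
    learns separate weights for each direction.
    """
    fwd = run_lstm(xs)
    bwd = run_lstm(xs[::-1])[::-1]
    return list(zip(fwd, bwd))

observed = [0.2, -0.1, 0.7, 0.4]
altered = observed[:2] + [-0.9, 1.5]   # same past, different future

# On-line: the first two outputs do not depend on later frames.
print(run_lstm(observed)[:2] == run_lstm(altered)[:2])      # True
# Off-line: the first two outputs change when the future changes.
print(run_bilstm(observed)[:2] == run_bilstm(altered)[:2])  # False
```

The two checks at the end show the practical consequence: the unidirectional outputs for the first frames are unaffected by what happens later, whereas the bidirectional outputs for the same frames depend on the rest of the sequence.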

Keywords

co-verbal behavior; multi-task RNN; multimodal behavior; behavioral models; Face-to-face interaction; Bi-directional LSTM; LSTM

Domains

Machine Learning [stat.ML]

Main file

dan_PRL2017_R2_v2.pdf (887.19 KB)

Origin: Files produced by the author(s)

Contributor: Gérard Bailly

https://hal.science/hal-01609535

Submitted on: Tuesday, October 3, 2017, 17:28:47

Last modified on: Thursday, April 4, 2024, 21:36:57

Dates and versions

hal-01609535 , version 1 (03-10-2017)

Identifiers

HAL Id : hal-01609535 , version 1
DOI : 10.1016/j.patrec.2017.09.033

Cite

Duc Canh Nguyen, Gérard Bailly, Frédéric Elisei. Pattern Recognition Letters Learning Off-line vs. On-line Models of Interactive Multimodal Behaviors with Recurrent Neural Networks. Pattern Recognition Letters, 2017, 100, pp.29-36. ⟨10.1016/j.patrec.2017.09.033⟩. ⟨hal-01609535⟩

Collections

UGA CNRS GIPSA GIPSA-DPC GIPSA-CRISSP
