G. Bailly, S. Raidt, and F. Elisei, Gaze, conversational agents and face-to-face communication, Speech Communication, vol.52, issue.6, pp.598-612, 2010.
DOI : 10.1016/j.specom.2010.02.015

URL : https://hal.archives-ouvertes.fr/hal-00480335

G. Bailly, Boucles de perception-action et interaction face-à-face. Revue fran\ccaise de linguistique appliquée 13, pp.121-131, 2009.
URL : https://hal.archives-ouvertes.fr/hal-00343411

S. Banerjee and A. Rudnicky, Using simple speech? based features to detect the state of a meeting and the roles of the meeting participants, 2004.

S. Baron-cohen, Mind Reading: The Interactive Guide to Emotions, 2004.

J. Bloit and X. Rodet, Short-time Viterbi for online HMM decoding: Evaluation on a real-time phone recognition task, 2008 IEEE International Conference on Acoustics, Speech and Signal Processing, pp.2121-2124, 2008.
DOI : 10.1109/ICASSP.2008.4518061

URL : https://hal.archives-ouvertes.fr/hal-01161222

J. Cassell, H. Vilhjalmsson, and T. Bickmore, BEAT, Proceedings of the 28th annual conference on Computer graphics and interactive techniques , SIGGRAPH '01, 2001.
DOI : 10.1145/383259.383315

C. Chiu and S. Marsella, Gesture Generation with Low-dimensional Embeddings, Proceedings of the and Multi-agent Systems, International Foundation for Autonomous Agents and Multiagent Systems, pp.781-788, 2014.

M. Dunham and K. Murphy, Probabilistic modeling toolkit for Matlab/Octave

D. Gatica-perez, Analyzing group interactions in conversations: a review. Multisensor Fusion and Integration for Intelligent Systems, IEEE International Conference on, pp.41-46, 2006.

D. Gatica-perez, Automatic nonverbal analysis of social interaction in small groups: A review, Image and Vision Computing, vol.27, issue.12, pp.1775-1787, 2009.
DOI : 10.1016/j.imavis.2009.01.004

M. Hall, E. Frank, G. Holmes, B. Pfahringer, P. Reutemann et al., The WEKA data mining software, ACM SIGKDD Explorations Newsletter, vol.11, issue.1, pp.10-18, 2009.
DOI : 10.1145/1656274.1656278

S. Kopp, B. Jung, N. Lessmann, and I. Wachsmuth, Max -A Multimodal Assistant in Virtual Reality Construction, p.11, 2003.

S. Kopp, B. Krenn, and S. Marsella, Towards a Common Framework for Multimodal Generation: The Behavior Markup Language, INTERNATIONAL CONFERENCE ON IN?LIGENT VIRTUAL AGENTS, pp.21-23, 2006.
DOI : 10.1007/11821830_17

B. Krenn and H. Pirker, Defining the gesticon: Language and gesture coordination for interacting embodied agents, Proc. of the AISB-2004 Symposium on Language, Speech and Gesture for Expressive Characters, pp.107-115, 2004.

B. Krenn, The NECA project: Net environments for embodied emotional conversational agents, Proc. of Workshop on emotionally rich virtual worlds with emotion synthesis at the 8th International Conference on 3D Web Technology (Web3D), 2003.

J. L. Lakin, V. E. Jefferis, C. M. Cheng, C. , and T. L. , The Chameleon Effect as Social Glue: Evidence for the Evolutionary Significance of Nonconscious Mimicry, Journal of Nonverbal Behavior, vol.27, issue.3, pp.145-162, 2003.
DOI : 10.1023/A:1025389814290

Q. A. Le and C. Pelachaud, Generating Co-speech Gestures for the Humanoid Robot NAO through BML
DOI : 10.1007/978-3-642-34182-3_21

URL : https://hal.archives-ouvertes.fr/hal-01113951

I. E. Efthimiou, G. Kouroupetroglou, and S. Fotinea, Gesture and Sign Language in Human-Computer Interaction and Embodied Communication, pp.228-237, 2012.
DOI : 10.1007/978-3-642-34182-3

V. Levenshtein, Binary Codes Capable of Correcting Deletions, Insertions and Reversals. Soviet Physics Doklady, vol.10, issue.8, pp.707-710, 1966.

A. Mihoub, G. Bailly, and C. Wolf, Social Behavior Modeling Based on Incremental Discrete Hidden Markov Models, Human Behavior Understanding, pp.172-183, 2013.
DOI : 10.1007/978-3-319-02714-2_15

URL : https://hal.archives-ouvertes.fr/hal-00851903

K. Otsuka, H. Sawada, Y. , and J. , Automatic inference of cross-modal nonverbal interactions in multiparty conversations, Proceedings of the ninth international conference on Multimodal interfaces , ICMI '07, pp.255-262, 2007.
DOI : 10.1145/1322192.1322237

K. Otsuka, Multimodal Conversation Scene Analysis for Understanding People???s Communicative Behaviors in Face-to-Face Meetings, pp.171-179, 2011.
DOI : 10.1016/0378-2166(87)90181-0

K. Otsuka, Conversation Scene Analysis [Social Sciences], IEEE Signal Processing Magazine, vol.28, issue.4, pp.127-131, 2011.
DOI : 10.1109/MSP.2011.941100

D. C. Richardson, R. Dale, and K. Shockley, Synchrony and swing in conversation: coordination, temporal dynamics, and communication, Embodied Communication in Humans and Machines, pp.75-94, 2008.
DOI : 10.1093/acprof:oso/9780199231751.003.0004

S. Scherer, S. Marsella, and G. Stratou, Perception Markup Language: Towards a Standardized Representation of Perceived Nonverbal Behaviors, Intelligent Virtual Agents, pp.455-463, 2012.
DOI : 10.1007/978-3-642-33197-8_47

P. Spanger, M. Yasuhara, R. Iida, and T. Tokunaga, Using extra linguistic information for generating demonstrative pronouns in a situated collaboration task, Proceedings of PreCogSci 2009: Production of Referring Expressions: Bridging the gap between computational and empirical approaches to reference, 2009.

M. Thiebaux, S. Marsella, A. N. Marshall, and M. Kallmann, Smartbody: Behavior realization for embodied conversational agents, Proceedings of the 7th international joint conference on Autonomous agents and multiagent systems, pp.151-158, 2008.

A. Vinciarelli, M. Pantic, and D. Heylen, Bridging the Gap between Social Animal and Unsocial Machine: A Survey of Social Signal Processing, IEEE Transactions on Affective Computing, vol.3, issue.1, pp.69-87, 2012.
DOI : 10.1109/T-AFFC.2011.27

D. Zhang, D. Gatica-perez, S. Bengio, and I. Mccowan, Modeling individual and group actions in meetings with layered HMMs. Multimedia, IEEE Transactions on, vol.8, issue.3, pp.509-520, 2006.