L. Girin, J. Schwartz, and G. Feng, Audio-visual enhancement of speech in noise, The Journal of the Acoustical Society of America, vol.109, issue.6, pp.3007-3020, 2001.
DOI : 10.1121/1.1358887

I. Almajai and B. Milner, Visually Derived Wiener Filters for Speech Enhancement, IEEE Transactions on Audio, Speech, and Language Processing, vol.19, issue.6, pp.1642-1651, 2011.
DOI : 10.1109/TASL.2010.2096212

D. Segev, Y. Y. Schechner, and M. Elad, Examplebased cross-modal denoising, IEEE Conference on Computer Vision and Pattern Recognition, pp.486-493, 2012.
DOI : 10.1109/cvpr.2012.6247712

URL : https://hal.archives-ouvertes.fr/hal-00706031

B. Rivet, L. Girin, and C. Jutten, Mixing Audiovisual Speech Processing and Blind Source Separation for the Extraction of Speech Signals From Convolutive Mixtures, IEEE Transactions on Audio, Speech and Language Processing, vol.15, issue.1, pp.96-108, 2007.
DOI : 10.1109/TASL.2006.872619

URL : https://hal.archives-ouvertes.fr/hal-00174100

S. M. Naqvi, Y. Miao, and J. A. Chambers, A Multimodal Approach to Blind Source Separation of Moving Sources, IEEE Journal of Selected Topics in Signal Processing, vol.4, issue.5, pp.895-910, 2010.
DOI : 10.1109/JSTSP.2010.2057198

M. Heckmann, F. Berthommier, and K. Kroschel, Noise Adaptive Stream Weighting in Audio-Visual Speech Recognition, EURASIP Journal on Advances in Signal Processing, vol.2002, issue.11, pp.1260-1273, 2002.
DOI : 10.1155/S1110865702206150

G. Potamianos, C. Neti, G. Gravier, A. Garg, and A. W. Senior, Recent advances in the automatic recognition of audiovisual speech, Proceedings of the IEEE, vol.91, issue.9, pp.1306-1326, 2003.
DOI : 10.1109/JPROC.2003.817150

J. Barker and S. Xu, Energetic and Informational Masking Effects in an Audiovisual Speech Recognition System, IEEE Transactions on Audio, Speech, and Language Processing, vol.17, issue.3, pp.446-458, 2009.
DOI : 10.1109/TASL.2008.2011534

A. Deleforge and R. Horaud, 2D sound-source localization on the binaural manifold, 2012 IEEE International Workshop on Machine Learning for Signal Processing, 2012.
DOI : 10.1109/MLSP.2012.6349784

URL : https://hal.archives-ouvertes.fr/hal-00768657

H. Viste and G. Evangelista, On the use of spatial cues to improve binaural source separation, Proc. Int. Conf. on Digital Audio Effects, pp.209-213, 2003.

M. I. Mandel, R. J. Weiss, and D. P. Ellis, Model-Based Expectation-Maximization Source Separation and Localization, IEEE Transactions on Audio, Speech, and Language Processing, vol.18, issue.2, pp.382-394, 2010.
DOI : 10.1109/TASL.2009.2029711

J. Woodruff and D. Wang, Binaural Localization of Multiple Sources in Reverberant and Noisy Environments, Audio, Speech, and Language Processing, pp.1503-1512, 2012.
DOI : 10.1109/TASL.2012.2183869

V. Khalidov, F. Forbes, and R. Horaud, Alignment of binocular-binaural data using a moving audio-visual target, 2013 IEEE 15th International Workshop on Multimedia Signal Processing (MMSP), 2013.
DOI : 10.1109/MMSP.2013.6659295

URL : https://hal.archives-ouvertes.fr/hal-00861482

X. Alameda-pineda and R. Horaud, A Geometric Approach to Sound Source Localization from Time-Delay Estimates, IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol.22, issue.6, pp.1082-1095, 2014.
DOI : 10.1109/TASLP.2014.2317989

URL : https://hal.archives-ouvertes.fr/hal-00910081

A. Deleforge, F. Forbes, and R. Horaud, High-dimensional regression with gaussian mixtures and partially-latent response variables, Statistics and Computing, vol.19, issue.11, 2014.
DOI : 10.1007/s11222-014-9461-5

URL : https://hal.archives-ouvertes.fr/hal-01107604

P. Viola and M. J. Jones, Robust Real-Time Face Detection, International Journal of Computer Vision, vol.57, issue.2, pp.137-154, 2004.
DOI : 10.1023/B:VISI.0000013087.49260.fb

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.102.9805

J. S. Garofolo, L. F. Lamel, W. M. Fisher, J. G. Fiscus, and D. S. Pallett, The DARPA TIMIT acousticphonetic continuous speech corpus CD-ROM, National Institute of Standards and Technology, 1993.

P. Aarabi, Self-localizing dynamic microphone arrays, IEEE Transactions on Systems, Man and Cybernetics, Part C (Applications and Reviews), vol.32, issue.4, pp.474-484, 2002.
DOI : 10.1109/TSMCB.2002.804369

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.4.6131