D. Wang and G. J. Brown, Computational Auditory Scene Analysis: Principles, Algorithms and Applications, 2006.
DOI : 10.1109/9780470043387

A. Deleforge and R. P. Horaud, The cocktail party robot, Proceedings of the seventh annual ACM/IEEE international conference on Human-Robot Interaction, HRI '12, pp.431-438, 2012.
DOI : 10.1145/2157689.2157834
URL : https://hal.archives-ouvertes.fr/hal-00768668

E. C. Cherry, Some Experiments on the Recognition of Speech, with One and with Two Ears, The Journal of the Acoustical Society of America, vol.25, issue.5, pp.975-979, 1953.
DOI : 10.1121/1.1907229

S. Haykin and Z. Chen, The Cocktail Party Problem, Neural Computation, vol.31, issue.2, pp.1875-1902, 2005.
DOI : 10.1016/0378-5955(91)90148-3

J. C. Middlebrooks and D. M. Green, Sound Localization by Human Listeners, Annual Review of Psychology, vol.42, issue.1, pp.135-159, 1991.
DOI : 10.1146/annurev.ps.42.020191.001031

P. M. Hofman and A. J. Van-opstal, Spectro-temporal factors in two-dimensional human sound localization, The Journal of the Acoustical Society of America, vol.103, issue.5, pp.2634-2648, 1998.
DOI : 10.1121/1.422784

R. Liu and Y. Wang, Azimuthal source localization using interaural coherence in a robotic dog: modeling and application, Robotica, vol.4, issue.07, pp.1013-1020, 2010.
DOI : 10.1121/1.1791872

X. Alameda-pineda and R. P. Horaud, Geometrically constrained robust time delay estimation using non-coplanar microphone arrays, European Signal Processing Conference, pp.1309-1313, 2012.
URL : https://hal.archives-ouvertes.fr/hal-00768763

M. I. Mandel, D. P. Ellis, and T. Jebara, An EM algorithm for localizing multiple sound sources in reverberant environments, Neural Information Processing Systems, pp.953-960, 2007.

A. Deleforge and R. P. Horaud, A Latently Constrained Mixture Model for Audio Source Separation and Localization, The Tenth International Conference on Latent Variable Analysis and Signal Separation, pp.372-379, 2012.
DOI : 10.1109/TSP.2004.828896
URL : https://hal.archives-ouvertes.fr/hal-00768660

J. Woodruff and D. Wang, Binaural Localization of Multiple Sources in Reverberant and Noisy Environments, IEEE Transactions on Audio, Speech, and Language Processing, vol.20, issue.5, pp.1503-1512, 2012.
DOI : 10.1109/TASL.2012.2183869

O. Y?lmaz and S. Rickard, Blind Separation of Speech Mixtures via Time-Frequency Masking, IEEE Transactions on Signal Processing, vol.52, issue.7, pp.1830-1847, 2004.
DOI : 10.1109/TSP.2004.828896

H. Viste and G. Evangelista, On the use of spatial cues to improve binaural source separation, International Conference on Digital Audio Effects, pp.209-213, 2003.

A. R. Kullaib, M. Mualla, and D. Vernon, 2D Binaural Sound Localization: for Urban Search and Rescue Robotics, Mobile Robotics, pp.423-435, 2009.
DOI : 10.1142/9789814291279_0053

F. Keyrouz, W. Maier, and K. Diepold, Robotic Localization and Separation of Concurrent Sound Sources using Self-Splitting Competitive Learning, 2007 IEEE Symposium on Computational Intelligence in Image and Signal Processing, pp.340-345, 2007.
DOI : 10.1109/CIISP.2007.369192

J. Hörnstein, M. Lopes, J. Santos-victor, and F. Lacerda, Sound Localization for Humanoid Robots - Building Audio-Motor Maps based on the HRTF, 2006 IEEE/RSJ International Conference on Intelligent Robots and Systems, pp.1170-1176, 2006.
DOI : 10.1109/IROS.2006.281849

P. M. Hofman, J. G. Van-riswick, and A. J. Van-opstal, Relearning sound localization with new ears, The Journal of the Acoustical Society of America, vol.105, issue.2, pp.417-421, 1998.
DOI : 10.1121/1.424942

B. A. Wright and Y. Zhang, A review of learning with normal and altered sound-localization cues in human adults, International Journal of Audiology, vol.117, issue.sup1, pp.92-98, 2006.
DOI : 10.3109/01050398509045933

M. Aytekin, C. F. Moss, and J. Z. Simon, A Sensorimotor Approach to Sound Localization, Neural Computation, vol.79, issue.2, pp.603-635, 2008.
DOI : 10.1523/JNEUROSCI.0199-04.2004
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.211.7426

H. Poincaré and G. B. Trans, The foundations of science; Science and hypothesis, the value of science, science and method, Halsted, 1905.

J. K. O-'regan and A. Noe, A sensorimotor account of vision and visual consciousness, Behavioral and Brain Sciences, vol.24, issue.05, pp.939-1031, 2001.
DOI : 10.1017/S0140525X01000115

R. Held and A. Hein, Movement-produced stimulation in the development of visually guided behavior., Journal of Comparative and Physiological Psychology, vol.56, issue.5, pp.872-876, 1963.
DOI : 10.1037/h0040546

A. Deleforge and R. P. Horaud, 2D sound-source localization on the binaural manifold, 2012 IEEE International Workshop on Machine Learning for Signal Processing, pp.1-6, 2012.
DOI : 10.1109/MLSP.2012.6349784
URL : https://hal.archives-ouvertes.fr/hal-00768657

A. Deleforge, F. Forbes, and R. P. Horaud, Variational EM for binaural sound-source separation and localization, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing, pp.76-80, 2013.
DOI : 10.1109/ICASSP.2013.6637612
URL : https://hal.archives-ouvertes.fr/hal-00823453

S. T. Roweis, One microphone source separation, Advances in Neural Information Processing Systems, pp.793-799, 2000.

S. Bensaid, A. Schutz, and D. T. Slock, Single Microphone Blind Audio Source Separation Using EM-Kalman Filter and Short+Long Term AR Modeling, Latent Variable Analysis and Signal Separation, pp.106-113, 2010.
DOI : 10.1007/978-3-642-15995-4_14

P. Comon and C. Jutten, Handbook of Blind Source Separation, Independent Component Analysis and Applications, 2010.
URL : https://hal.archives-ouvertes.fr/hal-00460653

N. Q. Duong, E. Vincent, and R. Gribonval, Under-Determined Reverberant Audio Source Separation Using a Full-Rank Spatial Covariance Model, IEEE Transactions on Audio, Speech, and Language Processing, vol.18, issue.7, pp.1830-1840, 2010.
DOI : 10.1109/TASL.2010.2050716
URL : https://hal.archives-ouvertes.fr/inria-00435807

M. I. Mandel, R. J. Weiss, and D. P. Ellis, Model-Based Expectation-Maximization Source Separation and Localization, IEEE Transactions on Audio, Speech, and Language Processing, vol.18, issue.2, pp.382-394, 2010.
DOI : 10.1109/TASL.2009.2029711

R. Rosipal and N. Krämer, Overview and recent advances in partial least squares Subspace, Latent Structure and Feature Selection, pp.34-51, 2006.
DOI : 10.1007/11752790_2
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.85.7735

K. C. Li, Sliced Inverse Regression for Dimension Reduction, Journal of the American Statistical Association, vol.13, issue.414, pp.316-327, 1991.
DOI : 10.1214/aos/1176345514
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.452.161

H. M. Wu, Kernel Sliced Inverse Regression with Applications to Classification, Journal of Computational and Graphical Statistics, vol.17, issue.3, pp.590-610, 2008.
DOI : 10.1198/106186008X345161

R. D. De-veaux, Mixtures of linear regressions, Computational Statistics & Data Analysis, vol.8, issue.3, pp.227-245, 1989.
DOI : 10.1016/0167-9473(89)90043-1

L. Xu, M. I. Jordan, and G. E. Hinton, An alternative model for mixtures of experts, Proc. of the Neural Information Processing Systems (NIPS) conference, pp.633-640, 1995.

A. Kain and M. Macon, Spectral voice conversion for text-to-speech synthesis, Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181), pp.285-288, 1998.
DOI : 10.1109/ICASSP.1998.674423

Y. Stylianou, O. Cappé, and E. Moulines, Continuous probabilistic transform for voice conversion, IEEE Transactions on Speech and Audio Processing, vol.6, issue.2, pp.131-142, 1998.
DOI : 10.1109/89.661472

T. Toda, A. Black, and K. Tokuda, Statistical mapping between articulatory movements and acoustic spectrum using a Gaussian mixture model, Speech Communication, vol.50, issue.3, pp.215-227, 2008.
DOI : 10.1016/j.specom.2007.09.001

Y. Qiao and N. Minematsu, Mixture of Probabilistic Linear Regressions: A unified view of GMM-based mapping techiques, 2009 IEEE International Conference on Acoustics, Speech and Signal Processing, pp.3913-3916, 2009.
DOI : 10.1109/ICASSP.2009.4960483

A. Deleforge, F. Forbes, and R. P. Horaud, Highdimensional regression with gaussian mixtures and partially-latent response variables, arxiv:1308, p.2302, 2013.
DOI : 10.1007/s11222-014-9461-5
URL : http://arxiv.org/abs/1308.2302

M. Otani, T. Hirahara, and S. Ise, Numerical study on source-distance dependency of head-related transfer functions, The Journal of the Acoustical Society of America, vol.125, issue.5, pp.3253-61, 2009.
DOI : 10.1121/1.3111860

J. S. Garofolo, L. F. Lamel, and W. M. Fisher, The darpa timit acoustic-phonetic continuous speech corpus cdrom, 1993.

R. Talmon, I. Cohen, and S. Gannot, Supervised source localization using diffusion kernels, 2011 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), pp.245-248, 2011.
DOI : 10.1109/ASPAA.2011.6082267

Z. Zhang and H. Zha, Principal Manifolds and Nonlinear Dimensionality Reduction via Tangent Space Alignment, SIAM Journal on Scientific Computing, vol.26, issue.1, pp.313-338, 2005.
DOI : 10.1137/S1064827502419154
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.211.9957

M. Beal and Z. Ghahramani, The variational Bayesian EM algorithm for incomplete data: with application to scoring graphical model structures, Bayesian Statistics, vol.7, pp.453-464, 2003.

P. Aarabi, Self-localizing dynamic microphone arrays, IEEE Transactions on Systems, Man and Cybernetics, Part C (Applications and Reviews), vol.32, issue.4, pp.474-484, 2002.
DOI : 10.1109/TSMCB.2002.804369
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.4.6131

J. Mouba and S. Marchand, A source localization/separation/respatialization system based on unsupervised classification of interaural cues
URL : https://hal.archives-ouvertes.fr/hal-00307889