S. Rickard and O. Yilmaz, On the approximate W-disjoint orthogonality of speech, IEEE International Conference on Acoustics, Speech and Signal Processing, pp.529-532, 2002.

O. Yilmaz and S. Rickard, Blind Separation of Speech Mixtures via Time-Frequency Masking, IEEE Transactions on Signal Processing, vol.52, issue.7, pp.1830-1847, 2004.
DOI : 10.1109/TSP.2004.828896

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.158.1318

M. I. Mandel, R. J. Weiss, and D. P. Ellis, Model-Based Expectation-Maximization Source Separation and Localization, IEEE Transactions on Audio, Speech, and Language Processing, vol.18, issue.2, pp.382-394, 2010.
DOI : 10.1109/TASL.2009.2029711

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.216.1061

M. Raspaud, H. Viste, and G. Evangelista, Binaural Source Localization by Joint Estimation of ILD and ITD, IEEE Transactions on Audio, Speech, and Language Processing, vol.18, issue.1, pp.68-77, 2010.
DOI : 10.1109/TASL.2009.2023644

T. May, S. Van-de-par, and A. Kohlrausch, A Probabilistic Model for Robust Localization Based on a Binaural Auditory Front-End, IEEE Transactions on Audio, Speech, and Language Processing, vol.19, issue.1, pp.1-13, 2011.
DOI : 10.1109/TASL.2010.2042128

J. Traa and P. Smaragdis, Multichannel Source Separation and Tracking With RANSAC and Directional Statistics, IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol.22, issue.12, pp.2233-2243, 2014.
DOI : 10.1109/TASLP.2014.2365701

H. Araki, R. Sawada, S. Mukai, and . Makino, Underdetermined blind sparse source separation for arbitrarily arranged multiple sensors, Signal Processing, vol.87, issue.8, pp.1833-1847, 2007.
DOI : 10.1016/j.sigpro.2007.02.003

S. Winter, W. Kellermann, H. Sawada, and S. Makino, MAP-Based Underdetermined Blind Source Separation of Convolutive Mixtures by Hierarchical Clustering and -Norm Minimization, EURASIP Journal on Advances in Signal Processing, vol.2007, issue.1, pp.81-81, 2007.
DOI : 10.1109/TSP.2003.822284

S. Arberet, R. Gribonval, and F. Bimbot, A Robust Method to Count and Locate Audio Sources in a Multichannel Underdetermined Mixture, IEEE Transactions on Signal Processing, vol.58, issue.1, pp.121-133, 2010.
DOI : 10.1109/TSP.2009.2030854

URL : https://hal.archives-ouvertes.fr/inria-00305435

O. Schwartz and S. Gannot, Speaker Tracking Using Recursive EM Algorithms, IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol.22, issue.2, pp.392-402, 2014.
DOI : 10.1109/TASLP.2013.2292361

Y. Dorfan and S. Gannot, Tree-Based Recursive Expectation-Maximization Algorithm for Localization of Acoustic Sources, IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol.23, issue.10, pp.1692-1703, 2015.
DOI : 10.1109/TASLP.2015.2444654

S. Gannot, D. Burshtein, and E. Weinstein, Signal enhancement using beamforming and nonstationarity with applications to speech, IEEE Transactions on Signal Processing, vol.49, issue.8, pp.1614-1626, 2001.
DOI : 10.1109/78.934132

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.455.2627

T. G. Dvorkind and S. Gannot, Time difference of arrival estimation of speech source in a noisy and reverberant environment, Signal Processing, vol.85, issue.1, pp.177-204, 2005.
DOI : 10.1016/j.sigpro.2004.09.014

X. Li, L. Girin, R. Horaud, and S. Gannot, Estimation of relative transfer function in the presence of stationary noise based on segmental power spectral density matrix subtraction, 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp.320-324, 2015.
DOI : 10.1109/ICASSP.2015.7177983

URL : https://hal.archives-ouvertes.fr/hal-01119186

X. Li, R. Horaud, L. Girin, and S. Gannot, Local relative transfer function for sound source localization, 2015 23rd European Signal Processing Conference (EUSIPCO), pp.399-403, 2015.
DOI : 10.1109/EUSIPCO.2015.7362413

URL : https://hal.archives-ouvertes.fr/hal-01163675

I. Cohen, Relative Transfer Function Identification Using Speech Signals, IEEE Transactions on Speech and Audio Processing, vol.12, issue.5, pp.451-459, 2004.
DOI : 10.1109/TSA.2004.832975

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.625.8191

A. Deleforge and F. Forbes, Rectified binaural ratio: A complex T-distributed feature for robust sound localization, 2016 24th European Signal Processing Conference (EUSIPCO), pp.1257-1261, 2016.
DOI : 10.1109/EUSIPCO.2016.7760450

URL : https://hal.archives-ouvertes.fr/hal-01372337

A. Deleforge, S. Gannot, and W. Kellermann, Towards a generalization of relative transfer functions to more than one source, 2015 23rd European Signal Processing Conference (EUSIPCO), pp.419-423, 2015.
DOI : 10.1109/EUSIPCO.2015.7362417

R. Y. Litovsky, H. S. Colburn, W. A. Yost, and S. J. Guzman, The precedence effect, The Journal of the Acoustical Society of America, vol.106, issue.4, pp.1633-1654, 1999.
DOI : 10.1121/1.427914

J. Woodruff and D. Wang, Binaural Localization of Multiple Sources in Reverberant and Noisy Environments, IEEE Transactions on Audio, Speech, and Language Processing, vol.20, issue.5, pp.1503-1512, 2012.
DOI : 10.1109/TASL.2012.2183869

C. Faller and J. Merimaa, Source localization in complex listening situations: Selection of binaural cues based on interaural coherence, The Journal of the Acoustical Society of America, vol.116, issue.5, pp.3075-3089, 2004.
DOI : 10.1121/1.1791872

S. Mohan, M. E. Lockwood, M. L. Kramer, and D. L. Jones, Localization of multiple acoustic sources with small arrays using a coherence test, The Journal of the Acoustical Society of America, vol.123, issue.4, pp.2136-2147, 2008.
DOI : 10.1121/1.2871597

O. Nadiri and B. Rafaely, Localization of Multiple Speakers under High Reverberation using a Spherical Microphone Array and the Direct-Path Dominance Test, IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol.22, issue.10, pp.1494-1505, 2014.
DOI : 10.1109/TASLP.2014.2337846

X. Li, L. Girin, R. Horaud, and S. Gannot, Estimation of the Direct-Path Relative Transfer Function for Supervised Sound-Source Localization, IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol.24, issue.11, pp.2171-2186, 2016.
DOI : 10.1109/TASLP.2016.2598319

URL : https://hal.archives-ouvertes.fr/hal-01349691

]. X. Li, L. Girin, F. Badeig, and R. Horaud, Reverberant sound localization with a robot head based on direct-path relative transfer function, 2016 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), pp.2819-2826, 2016.
DOI : 10.1109/IROS.2016.7759437

URL : https://hal.archives-ouvertes.fr/hal-01349771

Y. Avargel and I. Cohen, System Identification in the Short-Time Fourier Transform Domain With Crossband Filtering, IEEE Transactions on Audio, Speech and Language Processing, vol.15, issue.4, pp.1305-1319, 2007.
DOI : 10.1109/TASL.2006.889720

R. Talmon, I. Cohen, and S. Gannot, Relative Transfer Function Identification Using Convolutive Transfer Function Approximation, IEEE Transactions on Audio, Speech, and Language Processing, vol.17, issue.4, pp.546-555, 2009.
DOI : 10.1109/TASL.2008.2009576

Y. Avargel and I. Cohen, On Multiplicative Transfer Function Approximation in the Short-Time Fourier Transform Domain, IEEE Signal Processing Letters, vol.14, issue.5, pp.337-340, 2007.
DOI : 10.1109/LSP.2006.888292

O. Schwartz, Y. Dorfan, E. Habets, and S. Gannot, Multi-speaker DOA estimation in reverberation conditions using expectation-maximization, 2016 IEEE International Workshop on Acoustic Signal Enhancement (IWAENC), 2016.
DOI : 10.1109/IWAENC.2016.7602897

B. Laufer-goldshtein, R. Talmon, and S. Gannot, Semi-Supervised Sound Source Localization Based on Manifold Regularization, IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol.24, issue.8, pp.1393-1407, 2016.
DOI : 10.1109/TASLP.2016.2555085

URL : http://arxiv.org/pdf/1508.03148

A. Deleforge, V. Drouard, L. Girin, and R. Horaud, Mapping sounds onto images using binaural spectrograms, European Signal Processing Conference, pp.2470-2474, 2014.

A. Deleforge, F. Forbes, and R. Horaud, Acoustic Space Learning for Sound-Source Separation and Localization on Binaural Manifolds, International Journal of Neural Systems, vol.7, issue.01, 2015.
DOI : 10.1109/TSA.2005.858005

URL : https://hal.archives-ouvertes.fr/hal-00960796

A. Deleforge, R. Horaud, Y. Y. Schechner, and L. Girin, Co-Localization of Audio Sources in Images Using Binaural Features and Locally-Linear Regression, IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol.23, issue.4, pp.718-731, 2015.
DOI : 10.1109/TASLP.2015.2405475

URL : https://hal.archives-ouvertes.fr/hal-01112834

H. Sawada, R. Mukai, S. Araki, and S. Makino, A Robust and Precise Method for Solving the Permutation Problem of Frequency-Domain Blind Source Separation, IEEE Transactions on Speech and Audio Processing, vol.12, issue.5, pp.530-538, 2004.
DOI : 10.1109/TSA.2004.832994

H. Sawada, S. Araki, R. Mukai, and S. Makino, Grouping Separated Frequency Components by Estimating Propagation Model Parameters in Frequency-Domain Blind Source Separation, IEEE Transactions on Audio, Speech and Language Processing, vol.15, issue.5, pp.1592-1604, 2007.
DOI : 10.1109/TASL.2007.899218

M. A. Figueiredo and A. K. Jain, Unsupervised learning of finite mixture models, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.24, issue.3, pp.381-396, 2002.
DOI : 10.1109/34.990138

J. Rousseau and K. Mengersen, Asymptotic behaviour of the posterior distribution in overfitted mixture models, Journal of the Royal Statistical Society: Series B (Statistical Methodology), vol.6, issue.5, pp.689-710, 2011.
DOI : 10.1214/aos/1176344136

URL : https://hal.archives-ouvertes.fr/hal-00641475

G. Malsiner-walli, S. Frühwirth-schnatter, and B. Grün, Model-based clustering based on sparse finite Gaussian mixtures, Statistics and Computing, vol.17, issue.2, pp.303-324, 2016.
DOI : 10.1093/bioinformatics/17.10.977

URL : https://link.springer.com/content/pdf/10.1007%2Fs11222-014-9500-2.pdf

C. Bishop, Pattern Recognition and Machine Learning, 2007.

H. Ishwaran, L. F. James, and J. Sun, Bayesian Model Selection in Finite Mixtures by Marginal Density Decompositions, Journal of the American Statistical Association, vol.96, issue.456, pp.1316-1332, 2001.
DOI : 10.1198/016214501753382255

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.7.1864

D. Malioutov, M. , and A. S. Willsky, A sparse signal reconstruction perspective for source localization with sensor arrays, IEEE Transactions on Signal Processing, vol.53, issue.8, pp.3010-3022, 2005.
DOI : 10.1109/TSP.2005.850882

A. Asaei, M. Golbabaee, H. Bourlard, and V. Cevher, Structured Sparsity Models for Reverberant Speech Separation, IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol.22, issue.3, pp.620-633, 2014.
DOI : 10.1109/TASLP.2013.2297012

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.645.4836

A. L. Yuille and A. Rangarajan, The Concave-Convex Procedure, Neural Computation, vol.39, issue.4, pp.915-936, 2003.
DOI : 10.1162/08997660260028674

A. J. Smola, S. Vishwanathan, and T. Hofmann, Kernel methods for missing variables, International Workshop on Artificial Intelligence and Statistics, pp.325-332, 2005.

S. Boyd and L. Vandenberghe, Convex optimization, 2004.

T. Lipp and S. Boyd, Variations and extension of the convex???concave procedure, Optimization and Engineering, vol.27, issue.3, pp.263-287, 2016.
DOI : 10.1023/A:1011841821203

A. Gilloire and M. Vetterli, Adaptive filtering in subbands with critical sampling: analysis, experiments, and application to acoustic echo cancellation, IEEE Transactions on Signal Processing, vol.40, issue.8, pp.1862-1875, 1992.
DOI : 10.1109/78.149989

G. Xu, H. Liu, L. Tong, and T. Kailath, A least-squares approach to blind channel identification, IEEE Transactions on signal processing, vol.43, issue.12, pp.2982-2993, 1995.

D. G. Manolakis, V. K. Ingle, and S. M. Kogon, Statistical and adaptive signal processing: spectral estimation, signal modeling, adaptive filtering, and array processing, 2005.

D. Campbell, The roomsim user guide (v3. 3), 2004.

W. G. Gardner and K. D. Martin, HRTF measurements of a KEMAR, The Journal of the Acoustical Society of America, vol.97, issue.6, pp.3907-3908, 1995.
DOI : 10.1121/1.412407

J. S. Garofolo, L. F. Lamel, W. M. Fisher, J. G. Fiscus, D. S. Pallett et al., Getting started with the DARPA TIMIT CD-ROM: An acoustic phonetic continuous speech database, National Institute of Standards and Technology (NIST), vol.107, 1988.

H. W. Loellmann, H. Barfuss, A. Deleforge, S. Meier, and W. Kellermann, Challenges in acoustic signal enhancement for human-robot communication, Proceedings of Speech Communication, pp.1-4, 2014.

M. I. Mandel and J. P. Barker, Multichannel Spatial Clustering for Robust Far-Field Automatic Speech Recognition in Mismatched Conditions, Interspeech 2016, pp.1991-1995, 2016.
DOI : 10.21437/Interspeech.2016-1275

J. H. Dibiase, H. F. Silverman, and M. S. Brandstein, Robust Localization in Reverberant Rooms, Microphone Arrays, pp.157-180, 2001.
DOI : 10.1007/978-3-662-04619-7_8

H. Do, H. F. Silverman, and Y. Yu, A Real-Time SRP-PHAT Source Location Implementation using Stochastic Region Contraction(SRC) on a Large-Aperture Microphone Array, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '07, pp.121-124, 2007.
DOI : 10.1109/ICASSP.2007.366631

S. Gannot, degree (summa cum laude) from the Technion Israel Institute of Technology, Haifa, Israel in 1986 and the M.Sc. (cum laude) and Ph respectively, all in Electrical Engineering In 2001 he held a post-doctoral position at the department of Electrical Engineering (ESAT-SISTA) at K.U.Leuven, Belgium. From 2002 to 2003 he held a research and teaching position at the Faculty of Electrical Engineering, Technion-Israel Institute of Technology, Haifa, Israel. Currently, he is a Full Professor at the Faculty of Engineering, Bar-Ilan University, Israel, where he is heading the Speech and Signal Processing laboratory and the Signal Processing Track. Prof. Gannot is the recipient of Bar-Ilan University outstanding lecturer award for 2010 and 2014. He is also a co-recipient of seven best paper awards, Prof. Gannot has served as an Associate Editor of the EURASIP Journal of Advances in Signal Processing in 2003-2012, and as an Editor of several special issues on Multi-microphone Speech Processing of the same journal. He has also served as a guest editor of ELSEVIER Speech Communication and Signal Processing journals. Prof. Gannot has served as an Associate Editor of IEEE Transactions on Speech, Audio and Language Processing, 1995.