N. Duong, E. Vincent, and R. Gribonval, Under-Determined Reverberant Audio Source Separation Using a Full-Rank Spatial Covariance Model, IEEE Transactions on Audio, Speech, and Language Processing, vol.18, issue.7, pp.1830-1840, 2010.
DOI : 10.1109/TASL.2010.2050716

URL : https://hal.archives-ouvertes.fr/inria-00435807

S. Gannot, E. Vincent, S. Markovich-Golan, and A. Ozerov, A Consolidated Perspective on Multimicrophone Speech Enhancement and Source Separation, IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol.25, issue.4, pp.692-730, 2017.
DOI : 10.1109/TASLP.2016.2647702

URL : https://hal.archives-ouvertes.fr/hal-01414179

S. Leglaive, R. Badeau, and G. Richard, Multichannel Audio Source Separation With Probabilistic Reverberation Priors, IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol.24, issue.12, pp.2453-2465, 2016.
DOI : 10.1109/TASLP.2016.2614140

URL : https://hal.archives-ouvertes.fr/hal-01370051

D. Kounades-Bastian, L. Girin, X. Alameda-Pineda, S. Gannot, and R. Horaud, A Variational EM Algorithm for the Separation of Time-Varying Convolutive Audio Mixtures, IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol.24, issue.8, pp.1408-1423, 2016.
DOI : 10.1109/TASLP.2016.2554286

URL : https://hal.archives-ouvertes.fr/hal-01301762

C. Févotte, N. Bertin, and J. Durrieu, Nonnegative Matrix Factorization with the Itakura-Saito Divergence: With Application to Music Analysis, Neural Computation, vol.21, issue.3, pp.793-830, 2009.
DOI : 10.1162/neco.2008.04-08-771

K. Adiloğlu and E. Vincent, Variational Bayesian Inference for Source Separation and Robust Feature Extraction, IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol.24, issue.10, 2016.

X. A. Miró, S. Bozonnet, N. Evans, C. Fredouille, G. Friedland et al., Speaker Diarization: A Review of Recent Research, IEEE Transactions on Audio, Speech, and Language Processing, vol.20, issue.2, pp.356-371, 2012.
DOI : 10.1109/TASL.2011.2125954

D. Vijayasenan, F. Valente, and H. Bourlard, Multistream speaker diarization of meetings recordings beyond MFCC and TDOA features, Speech Communication, 2012.
DOI : 10.1016/j.specom.2011.07.001

A. Ozerov, C. Févotte, and M. Charbit, Factorial Scaled Hidden Markov Model for polyphonic audio representation and source separation, 2009 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2009.
DOI : 10.1109/ASPAA.2009.5346527

URL : https://hal.archives-ouvertes.fr/inria-00553336

T. Higuchi and H. Kameoka, Unified approach for audio source separation with multichannel factorial HMM and DOA mixture model, 2015 23rd European Signal Processing Conference (EUSIPCO), 2015.
DOI : 10.1109/EUSIPCO.2015.7362743

URL : https://zenodo.org/record/38902/files/1570099211.pdf

Y. Oualil and D. Klakow, Multiple concurrent speaker short-term tracking using a Kalman filter bank, IEEE International Conference on Acoustics, Speech and Signal Processing, 2014.
DOI : 10.1109/icassp.2014.6853836

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.638.8306

B. Kleijn and F. Lim, Robust and low-complexity blind source separation for meeting rooms, 2017 Hands-free Speech Communications and Microphone Arrays (HSCMA), 2017.
DOI : 10.1109/HSCMA.2017.7895581

D. Kounades-Bastian, L. Girin, X. Alameda-Pineda, S. Gannot, and R. Horaud, An EM algorithm for joint source separation and diarisation of multichannel convolutive speech mixtures, IEEE Int.

M. Kowalski, E. Vincent, and R. Gribonval, Beyond the narrowband approximation: Wideband convex methods for under-determined reverberant audio source separation, IEEE Transactions on Audio, Speech, and Language Processing, vol.18, issue.7, pp.1818-1829, 2010.

S. Arberet, A. Ozerov, N. Q. Duong, E. Vincent, R. Gribonval et al., Nonnegative matrix factorization and spatial covariance model for underdetermined reverberant audio source separation, IEEE Int. Conf. Info. Sciences, Signal Process, 2010.
DOI : 10.1109/isspa.2010.5605570

URL : https://hal.archives-ouvertes.fr/inria-00541436

N. Sturmel, A. Liutkus, J. Pinel, L. Girin, S. Marchand et al., Linear mixing models for active listening of music productions in realistic studio conditions, Convention of the Audio Eng. Society (AES), 2012.
URL : https://hal.archives-ouvertes.fr/hal-00790783

F. Neeser and J. Massey, Proper complex random processes with applications to information theory, IEEE Transactions on Information Theory, vol.39, issue.4, pp.1293-1302, 1993.
DOI : 10.1109/18.243446

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.408.1107

P. Smaragdis and J. Brown, Non-negative matrix factorization for polyphonic music transcription, 2003 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2003.
DOI : 10.1109/ASPAA.2003.1285860

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.475.7518

C. Bishop, Pattern Recognition and Machine Learning, 2006.

C. Févotte, Majorization-minimization algorithm for smooth Itakura-Saito nonnegative matrix factorization, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2011.
DOI : 10.1109/ICASSP.2011.5946898

J. S. Garofolo, L. F. Lamel, W. M. Fisher, J. G. Fiscus, D. S. Pallett et al., TIMIT acoustic-phonetic continuous speech corpus, Linguistic Data Consortium, 1993.

C. Hummersone, R. Mason, and T. Brookes, A comparison of computational precedence models for source separation in reverberant environments, J. Audio Eng. Soc., vol.61, issue.7/8, pp.508-520, 2013.

Y. Dorfan and S. Gannot, Tree-Based Recursive Expectation-Maximization Algorithm for Localization of Acoustic Sources, IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol.23, issue.10, pp.1692-1703, 2015.
DOI : 10.1109/TASLP.2015.2444654

A. Ozerov and C. Févotte, Multichannel Nonnegative Matrix Factorization in Convolutive Mixtures for Audio Source Separation, IEEE Transactions on Audio, Speech, and Language Processing, vol.18, issue.3, pp.550-563, 2010.
DOI : 10.1109/TASL.2009.2031510

J. Traa, D. Wingate, N. Stein, and P. Smaragdis, Robust Source Localization and Enhancement With a Probabilistic Steered Response Power Model, IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol.24, issue.3, pp.493-503, 2016.
DOI : 10.1109/TASLP.2015.2512499

X. Li, L. Girin, and R. Horaud, Audio source separation based on convolutive transfer function and frequency-domain lasso optimization, 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2017.
DOI : 10.1109/ICASSP.2017.7952214

URL : https://hal.archives-ouvertes.fr/hal-01430754

I. Gebru, S. Ba, X. Li, and R. Horaud, Audio-Visual Speaker Diarization Based on Spatiotemporal Bayesian Fusion, IEEE Transactions on Pattern Analysis and Machine Intelligence, 2017.
DOI : 10.1109/TPAMI.2017.2648793

URL : https://hal.archives-ouvertes.fr/hal-01413403