E. Vincent, M. Jafari, S. A. Abdallah, M. D. Plumbley, and M. E. Davies, Probabilistic Modeling Paradigms for Audio Source Separation, Machine Audition: Principles, Algorithms and Systems. IGI Global, pp.162-185, 2010.
DOI : 10.4018/978-1-61520-919-4.ch007

URL : https://hal.archives-ouvertes.fr/inria-00544016

H. Attias, New EM algorithms for source separation and deconvolution, Proc. IEEE International Conference on Acoustics, Speech, and Signal Processing, pp.297-300, 2003.

D. Pham, C. Servì, and H. Boumaraf, Blind separation of speech mixtures based on nonstationarity, Seventh International Symposium on Signal Processing and Its Applications, 2003. Proceedings., pp.73-76, 2003.
DOI : 10.1109/ISSPA.2003.1224818

S. A. Abdallah and M. D. Plumbley, Polyphonic transcription by nonnegative sparse coding of power spectra, Proc. 5th International Symposium Music Information Retrieval, pp.318-325, 2004.

C. Févotte and J. Cardoso, Maximum likelihood approach for blind audio source separation using time-frequency Gaussian source models, IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2005., pp.78-81, 2005.
DOI : 10.1109/ASPAA.2005.1540173

L. Benaroya, F. Bimbot, and R. Gribonval, Audio source separation with a single sensor, IEEE Transactions on Audio, Speech and Language Processing, vol.14, issue.1, pp.191-199, 2006.
DOI : 10.1109/TSA.2005.854110

URL : https://hal.archives-ouvertes.fr/inria-00544949

A. Ozerov, P. Philippe, F. Bimbot, and R. Gribonval, Adaptation of Bayesian Models for Single-Channel Source Separation and its Application to Voice/Music Separation in Popular Songs, IEEE Transactions on Audio, Speech and Language Processing, vol.15, issue.5, pp.1564-1578, 2007.
DOI : 10.1109/TASL.2007.899291

URL : https://hal.archives-ouvertes.fr/inria-00544774

R. Blouet, G. Rapaport, I. Cohen, and C. Févotte, Evaluation of several strategies for single sensor speech/music separation, 2008 IEEE International Conference on Acoustics, Speech and Signal Processing, pp.37-40, 2008.
DOI : 10.1109/ICASSP.2008.4517540

C. Févotte, N. Bertin, and J. Durrieu, Nonnegative Matrix Factorization with the Itakura-Saito Divergence: With Application to Music Analysis, Neural Computation, vol.14, issue.3, pp.793-830, 2009.
DOI : 10.1016/j.sigpro.2007.01.024

E. Vincent, S. Arberet, and R. Gribonval, Underdetermined Instantaneous Audio Source Separation via Local Gaussian Modeling, Proc. Int. Conf. on Independent Component Analysis and Blind Source Separation, pp.775-782, 2009.
DOI : 10.1109/TSP.2004.828896

URL : https://hal.archives-ouvertes.fr/hal-00482223

S. Arberet, A. Ozerov, R. Gribonval, and F. Bimbot, Blind Spectral-GMM Estimation for Underdetermined Instantaneous Audio Source Separation, Proc. Int. Conf. on Independent Component Analysis and Blind Source Separation, pp.751-758, 2009.
DOI : 10.1109/TSP.2004.828896

URL : https://hal.archives-ouvertes.fr/hal-00482287

A. Ozerov, C. Févotte, and M. Charbit, Factorial Scaled Hidden Markov Model for polyphonic audio representation and source separation, 2009 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, pp.121-124, 2009.
DOI : 10.1109/ASPAA.2009.5346527

URL : https://hal.archives-ouvertes.fr/inria-00553336

A. Ozerov and C. Févotte, Multichannel Nonnegative Matrix Factorization in Convolutive Mixtures for Audio Source Separation, IEEE Transactions on Audio, Speech, and Language Processing, vol.18, issue.3, pp.550-563, 2010.
DOI : 10.1109/TASL.2009.2031510

E. Vincent, N. Bertin, and R. Badeau, Adaptive Harmonic Spectral Decomposition for Multiple Pitch Estimation, IEEE Transactions on Audio, Speech, and Language Processing, vol.18, issue.3, pp.528-537, 2010.
DOI : 10.1109/TASL.2009.2034186

URL : https://hal.archives-ouvertes.fr/inria-00544094

N. Bertin, R. Badeau, and E. Vincent, Enforcing Harmonicity and Smoothness in Bayesian Non-Negative Matrix Factorization Applied to Polyphonic Music Transcription, IEEE Transactions on Audio, Speech, and Language Processing, vol.18, issue.3, pp.538-549, 2010.
DOI : 10.1109/TASL.2010.2041381

URL : https://hal.archives-ouvertes.fr/inria-00557088

J. L. Durrieu, G. Richard, B. David, and C. Févotte, Source/Filter Model for Unsupervised Main Melody Extraction From Polyphonic Audio Signals, IEEE Transactions on Audio, Speech, and Language Processing, vol.18, issue.3, pp.564-575, 2010.
DOI : 10.1109/TASL.2010.2041114

S. Arberet, A. Ozerov, N. Duong, E. Vincent, R. Gribonval et al., Nonnegative matrix factorization and spatial covariance model for under-determined reverberant audio source separation, 10th International Conference on Information Science, Signal Processing and their Applications (ISSPA 2010), pp.1-4, 2010.
DOI : 10.1109/ISSPA.2010.5605570

URL : https://hal.archives-ouvertes.fr/inria-00541436

N. Q. Duong, E. Vincent, and R. Gribonval, Under-Determined Reverberant Audio Source Separation Using Local Observed Covariance and Auditory-Motivated Time-Frequency Representation, 9th International Conference on Latent Variable Analysis and Signal Separation (LVA/ICA'10), pp.73-80, 2010.
DOI : 10.1007/978-3-642-15995-4_10

URL : https://hal.archives-ouvertes.fr/inria-00541868

A. P. Dempster, N. M. Laird, and D. B. Rubin, Maximum likelihood from incomplete data via the EM algorithm, Journal of the Royal Statistical Society. Series B (Methodological), vol.39, pp.1-38, 1977.

A. Ozerov, C. Févotte, R. Blouet, and J. Durrieu, Multichannel nonnegative tensor factorization with structured constraints for userguided audio source separation, IEEE International Conference on Acoustics, Speech, and Signal Processing, pp.257-260, 2011.
URL : https://hal.archives-ouvertes.fr/inria-00564851

J. Cardoso, M. Le-jeune, J. Delabrouille, M. Betoule, G. Fitzgerald et al., Component separation with flexible models ? Application to multichannel astrophysical observations Extended nonnegative tensor factorisation models for musical sound source separation, IEEE Journal of Selected Topics in Signal Processing Computational Intelligence and Neuroscience, vol.223, issue.5, pp.735-746, 2008.

A. Ozerov, E. Vincent, and F. Bimbot, A General Modular Framework for Audio Source Separation, 9th International Conference on Latent Variable Analysis and Signal Separation (LVA/ICA'10), pp.33-40, 2010.
DOI : 10.1007/978-3-642-15995-4_5

URL : https://hal.archives-ouvertes.fr/inria-00553504

F. Hlawatsch and G. F. Boudreaux-bartels, Linear and quadratic time-frequency signal representations, IEEE Signal Processing Magazine, vol.9, issue.2, pp.21-67, 1992.
DOI : 10.1109/79.127284

O. Yilmaz and S. Rickard, Blind Separation of Speech Mixtures via Time-Frequency Masking, IEEE Transactions on Signal Processing, vol.52, issue.7, pp.1830-1847, 2004.
DOI : 10.1109/TSP.2004.828896

H. Sawada, S. Araki, R. Mukai, and S. Makino, Grouping Separated Frequency Components by Estimating Propagation Model Parameters in Frequency-Domain Blind Source Separation, IEEE Transactions on Audio, Speech and Language Processing, vol.15, issue.5, pp.1592-1604, 2007.
DOI : 10.1109/TASL.2007.899218

S. Araki, A. Ozerov, V. Gowreesunker, H. Sawada, F. Theis et al., The 2010 Signal Separation Evaluation Campaign (SiSEC2010): Audio Source Separation, 9th International Conference on Latent Variable Analysis and Signal Separation (LVA/ICA'10), pp.114-122, 2010.
DOI : 10.1007/978-3-642-15995-4_15

URL : https://hal.archives-ouvertes.fr/inria-00553385

E. Vincent, S. Araki, and P. Bofilld, The 2008 Signal Separation Evaluation Campaign: A Community-Based Approach to Large-Scale Evaluation, Proc. Int. Conf. on Independent Component Analysis and Signal Separation, pp.734-741, 2009.
DOI : 10.1109/TASL.2007.899176

URL : https://hal.archives-ouvertes.fr/inria-00544168

E. Moulines, J. Cardoso, and E. Gassiat, Maximum likelihood for blind separation and deconvolution of noisy signals using mixture models, 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing, pp.3617-3620, 1997.
DOI : 10.1109/ICASSP.1997.604649

T. Yoshioka, T. Nakatani, M. Miyoshi, and H. Okuno, Blind Separation and Dereverberation of Speech Mixtures by Joint Optimization, IEEE Transactions on Audio, Speech, and Language Processing, vol.19, issue.1, pp.69-84, 2010.
DOI : 10.1109/TASL.2010.2045183

P. Smaragdis, Non-negative Matrix Factor Deconvolution; Extraction of Multiple Sound Sources from Monophonic Inputs, Fifth International Conference on Independent Component Analysis, pp.494-499, 2004.
DOI : 10.1007/978-3-540-30110-3_63

T. Virtanen, Monaural Sound Source Separation by Nonnegative Matrix Factorization With Temporal Continuity and Sparseness Criteria, IEEE Transactions on Audio, Speech and Language Processing, vol.15, issue.3, pp.1066-1074, 2007.
DOI : 10.1109/TASL.2006.885253

A. Klapuri, Analysis of musical instrument sounds by source-filterdecay model, Proc. IEEE Int. Conf. Acoustics, Speech and Signal Processing, pp.53-56, 2007.

I. Lee, T. Kim, and T. Lee, Independent vector analysis for convolutive blind speech separation, " in Blind speech separation, pp.169-192, 2007.

S. J. Rennie, J. R. Hershey, and P. A. Olsen, Efficient model-based speech separation and denoising using non-negative subspace analysis, 2008 IEEE International Conference on Acoustics, Speech and Signal Processing, pp.1833-1836, 2008.
DOI : 10.1109/ICASSP.2008.4517989

S. T. Roweis, One microphone source separation, Advances in Neural Information Processing Systems 13, pp.793-799, 2000.

M. I. Mandel, R. J. Weiss, and D. Ellis, Model-Based Expectation-Maximization Source Separation and Localization, IEEE Transactions on Audio, Speech, and Language Processing, vol.18, issue.2, pp.382-394, 2010.
DOI : 10.1109/TASL.2009.2029711

J. Cardoso, The three easy routes to independent component analysis; contrasts and geometry, Proc. Int. Conf. on Independent Component Analysis and Blind Source Separation (ICA'01), pp.1-6, 2001.

H. Kameoka, T. Nishimoto, and S. Sagayama, A Multipitch Analyzer Based on Harmonic Temporal Structured Clustering, IEEE Transactions on Audio, Speech and Language Processing, vol.15, issue.3, pp.982-994, 2007.
DOI : 10.1109/TASL.2006.885248

R. Hennequin, R. Badeau, and B. David, NMF With Time–Frequency Activations to Model Nonstationary Audio Events, IEEE Transactions on Audio, Speech, and Language Processing, vol.19, issue.4, pp.744-753, 2011.
DOI : 10.1109/TASL.2010.2062506

Y. Meron and K. Hirose, Separation of singing and piano sounds, Proc. Int. Conf. on Spoken Language Processing, 1998.

L. R. Rabiner, A tutorial on hidden Markov models and selected applications in speech recognition, Proceedings of the IEEE, pp.257-286, 1989.

P. O. Hoyer, Non-negative matrix factorization with sparseness constraints, Journal of Machine Learning Research, vol.5, pp.1457-1469, 2004.

J. Eggert and E. Körner, Sparse coding and NMF, 2004 IEEE International Joint Conference on Neural Networks (IEEE Cat. No.04CH37541), pp.2529-2533, 2004.
DOI : 10.1109/IJCNN.2004.1381036

E. Vincent, R. Gribonval, and C. Fevotte, Performance measurement in blind audio source separation, IEEE Transactions on Audio, Speech and Language Processing, vol.14, issue.4, pp.1462-1469, 2006.
DOI : 10.1109/TSA.2005.858005

URL : https://hal.archives-ouvertes.fr/inria-00544230

S. Arberet, R. Gribonval, and F. Bimbot, A Robust Method to Count and Locate Audio Sources in a Multichannel Underdetermined Mixture, IEEE Transactions on Signal Processing, vol.58, issue.1, pp.121-133, 2010.
DOI : 10.1109/TSP.2009.2030854

URL : https://hal.archives-ouvertes.fr/inria-00305435

C. Blandin, E. Vincent, and A. Ozerov, Multi-source TDOA estimation using SNR-based angular spectra, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp.2616-2619, 2011.
DOI : 10.1109/ICASSP.2011.5947021

URL : https://hal.archives-ouvertes.fr/inria-00566706

E. Vincent, Complex nonconvex lp norm minimization for underdetermined source separation, Proc. Int. Conf. on Independent Component Analysis and Blind Source Separation (ICA'07), pp.430-437, 2007.
URL : https://hal.archives-ouvertes.fr/inria-00544203

O. Gillet and G. Richard, Transcription and Separation of Drum Signals From Polyphonic Music, IEEE Transactions on Audio, Speech, and Language Processing, vol.16, issue.3, pp.529-540, 2008.
DOI : 10.1109/TASL.2007.914120

M. Goto, H. Hashiguchi, T. Nishimura, and R. Oka, RWC music database: Music genre database and musical instrument sound databases, 5th International Symposium on Music Information Retrieval (ISMIR), pp.229-230, 2004.

A. Ozerov and E. Vincent, Using the FASST source separation toolbox for noise robust speech recognition, International Workshop on Machine Listening in Multisource Environments, pp.86-87, 2011.
URL : https://hal.archives-ouvertes.fr/inria-00598734

R. Weiss and D. Ellis, Speech separation using speaker-adapted eigenvoice speech models, Computer Speech & Language, vol.24, issue.1, pp.16-29, 2010.
DOI : 10.1016/j.csl.2008.03.003

G. Grindlay and D. Ellis, Multi-voice polyphonic music transcription using eigeninstruments, 2009 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, pp.53-56, 2009.
DOI : 10.1109/ASPAA.2009.5346514

L. Benaroya, R. Blouet, C. Févotte, and I. Cohen, Single sensor source separation using multiple-window STFT representation, Proc. International Workshop on Acoustic Echo and Noise Control (IWAENC'06), pp.12-14, 2006.