D. Stowell, D. Giannoulis, E. Benetos, M. Lagrange, and M. D. Plumbley, Detection and Classification of Acoustic Scenes and Events, IEEE Transactions on Multimedia, vol.17, issue.10, pp.1733-1746, 2015.
DOI : 10.1109/TMM.2015.2428998

URL : https://hal.archives-ouvertes.fr/hal-01253912

A. J. Eronen, V. T. Peltonen, J. T. Tuomi, A. P. Klapuri, S. Fagerlund et al., Audio-based context recognition, IEEE Transactions on Audio, Speech and Language Processing, vol.14, issue.1, pp.321-329, 2006.
DOI : 10.1109/TSA.2005.854103

T. Heittola, A. Mesaros, A. Eronen, and T. Virtanen, Contextdependent sound event detection, EURASIP Journal on Audio , Speech, and Music Processing, vol.2013, 2013.

J. Dennis, H. D. Tran, and E. S. Chng, Overlapping sound event recognition using local spectrogram features and the generalised hough transform, Pattern Recognition Letters, vol.34, issue.9, pp.1085-1093, 2013.
DOI : 10.1016/j.patrec.2013.02.015

J. F. Gemmeke, L. Vuegen, P. Karsmakers, B. Vanrumste, and H. Van-hamme, An exemplar-based NMF approach to audio event detection, 2013 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2013.
DOI : 10.1109/WASPAA.2013.6701847

E. Cakir, T. Heittola, H. Huttunen, and T. Virtanen, Polyphonic sound event detection using multi label deep neural networks, 2015 International Joint Conference on Neural Networks (IJCNN), 2015.
DOI : 10.1109/IJCNN.2015.7280624

D. Giannoulis, E. Benetos, D. Stowell, M. Rossignol, M. Lagrange et al., Detection and classification of acoustic scenes and events: An IEEE AASP challenge, 2013 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2013.
DOI : 10.1109/WASPAA.2013.6701819

URL : https://hal.archives-ouvertes.fr/hal-01123765

L. Vuegen, B. Van-den-broeck, P. Karsmakers, J. F. Gemmeke, B. Vanrumste et al., An MFCC-GMM approach for event detection and classification, IEEE AASP DCASE Challenge, 2013.

E. Benetos and T. Weyde, An efficient temporally-constrained probabilistic model for multiple-instrument music transcription, 16th International Society for Music Information Retrieval Conference (ISMIR), pp.701-707, 2015.

C. V. Cotton and D. P. Ellis, Spectral vs. spectro-temporal features for acoustic event detection, 2011 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), pp.69-72, 2011.
DOI : 10.1109/ASPAA.2011.6082331

A. Mesaros, T. Heittola, and A. Klapuri, Latent semantic analysis in sound event detection, European Signal Processing Conference, pp.1307-1311, 2011.

E. Benetos, M. Lagrange, and S. Dixon, Characterisation of acoustic scenes using a temporally-constrained shift-invariant model, 15th International Conference on Digital Audio Effects (DAFx), pp.317-323, 2012.

J. F. Gemmeke, T. Virtanen, A. Hurmalainen, M. Shashanka, B. Raj et al., Exemplar-Based Sparse Representations for Noise Robust Automatic Speech Recognition, Probabilistic latent variable models as nonnegative factorizations, pp.2067-2080, 2008.
DOI : 10.1109/TASL.2011.2112350

B. C. Moore, Frequency analysis and masking, " in Hearing ? Handbook of Perception and Cognition, pp.161-205, 1995.

E. Vincent, N. Bertin, and R. Badeau, Adaptive Harmonic Spectral Decomposition for Multiple Pitch Estimation, IEEE Transactions on Audio, Speech, and Language Processing, vol.18, issue.3, pp.528-537, 2010.
DOI : 10.1109/TASL.2009.2034186

URL : https://hal.archives-ouvertes.fr/inria-00544094

A. P. Dempster, N. M. Laird, and D. B. Rubin, Maximum likelihood from incomplete data via the EM algorithm, Journal of the Royal Statistical Society, vol.39, issue.1, pp.1-38, 1977.

G. Mysore, A Non-negative Framework for Joint Modeling of Spectral Structure and Temporal Dynamics in Sound Mixtures, 2010.

L. R. Rabiner, A tutorial on hidden Markov models and selected applications in speech recognition, Proceedings of the IEEE, pp.257-286, 1989.

M. Lagrange, G. Lafay, M. Rossignol, E. Benetos, and A. , An evaluation framework for event detection using a morphological model of acoustic scenes ArXiv e-prints, p.150200141, 2015.