E. Cakir, T. Heittola, H. Huttunen, and T. Virtanen, Polyphonic sound event detection using multi label deep neural networks, Proc. IJCNN, pp.1-7, 2015.

G. Parascandolo, H. Huttunen, and T. Virtanen, Recurrent neural networks for polyphonic sound event detection in real life recordings, Proc. ICASSP, pp.6440-6444, 2016.

X. Xia, R. Togneri, F. Sohel, and D. Huang, Framewise dynamic threshold based polyphonic acoustic event detection, Proc. Interspeech, pp.474-478, 2017.

Q. Kong, I. Sobieraj, W. Wang, and M. Plumbley, Deep neural network baseline for DCASE challenge, 2016.

L. Ballan, A. Bazzica, M. Bertini, A. D. Bimbo, and G. Serra, Deep networks for audio event classification in soccer videos, Proc. ICME, pp.474-477, 2009.

D. Barchiesi, D. Giannoulis, D. Stowell, and M. D. Plumbley, Acoustic scene classification: Classifying environments from the sounds they produce, IEEE Signal Processing Magazine, vol.32, issue.3, pp.16-34, 2015.

A. Harma, M. F. Mckinney, and J. Skowronek, Automatic surveillance of the acoustic activity in our living environment, Proc. Multimedia and Expo, 2005.

Y. Lecun, Y. Bengio, and G. Hinton, Deep learning, nature, vol.521, issue.7553, pp.436-444, 2015.

T. Virtanen, M. D. Plumbley, and D. Ellis, Computational analysis of sound scenes and events, 2018.

J. F. Gemmeke, D. P. Ellis, D. Freedman, A. Jansen, W. Lawrence et al., Audio set: An ontology and human-labeled dataset for audio events, Proc. ICASSP, pp.776-780, 2017.

Y. Guo, M. Xu, J. Wu, Y. Wang, and K. Hoashi, Multiscale convolutional recurrent neural network with ensemble method for weakly labeled sound event detection, DCASE Challenge, 2018.

L. Jiakai, Mean Teacher Convolution System for DCASE, DCASE Challenge, vol.4, 2018.

R. Serizel, N. Turpault, H. Eghbal-zadeh, and A. P. Shah, Large-scale weakly labeled semi-supervised sound event detection in domestic environments, Proc. DCASE, Woking, pp.19-23, 2018.
URL : https://hal.archives-ouvertes.fr/hal-01850270

K. Koutini, H. Eghbal-zadeh, and G. Widmer, Iterative knowledge distillation in r-cnns for weakly-labeled semisupervised sound event detection, DCASE Challenge, Woking, 2018.

R. Harb and F. Pernkopf, Sound event detection using weakly labeled semi-supervised data with gcrnns, vat and self-adaptative label refinement, DCASE Challenge, 2018.

Y. Hou and S. Li, Semi-supervised sound event detection with convolutional recurrent neural network using weakly labelled data, DCASE Challenge, 2018.

Y. L. Liu, J. Yan, Y. Song, and J. Du, USTC-NELSLIP System For Dcase 2018 Challenge Task 4, Challenge, Woking, Tech. Rep, 2018.

R. Serizel, N. Turpault, H. Eghbal-zadeh, and A. Shah, Large-scale weakly labeled semi-supervised sound event detection in domestic environments, Proc. DCASE, 2018.
URL : https://hal.archives-ouvertes.fr/hal-01850270

J. Salamon, B. Mcfee, P. Li, and J. P. Bello, Multiple instance learning for sound event detection, DCASE Challenge, 2017.

L. Cances, T. Pellegrini, and P. Guyot, Sound event detection from weak annotations: weighted GRU versus multi-instance learning, DCASE Challenge, 2018.

T. Pellegrini and L. Cances, Cosine-similarity penalty to discriminate sound classes in weakly-supervised sound event detection, Proc. IJCNN, 2019.

A. Mesaros, T. Heittola, T. Virtanen, A. Mesaros, T. Heittola et al., Metrics for Polyphonic Sound Event Detection, Applied Sciences, vol.6, issue.6, p.162, 2016.

L. Cances, T. Pellegrini, and P. Guyot, Multi task learning and post processing optimization for sound event detection, DCASE Challenge, 2019.