J. Sueur, A. Farina, C. Bobryk, D. Llusia, J. Mcwilliam et al., Ecology and acoustics: emergent properties from community to landscape, 2014.

B. C. Pijanowski, A. Farina, S. H. Gage et al., What is soundscape ecology? An introduction and overview of an emerging new science, Landscape Ecology, vol.26, issue.9, pp.1213-1232, 2011.

S. R. Ness, H. Symonds, P. Spong, and G. Tzanetakis, The Orchive: Data mining a massive bioacoustic archive, International Workshop on Machine Learning for Bioacoustics, 2013.

D. Stowell and M. D. Plumbley, Large-scale analysis of frequency modulation in birdsong databases, Methods in Ecology and Evolution, issue.11, 2013.

T. Heittola, A. Mesaros, A. Eronen, and T. Virtanen, Context-dependent sound event detection, EURASIP Journal on Audio, Speech, and Music Processing, vol.2013, issue.1, 2013.

R. Radhakrishnan, A. Divakaran, and P. Smaragdis, Audio analysis for surveillance applications, IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, pp.158-161, 2005.
DOI : 10.1109/ASPAA.2005.1540194

T. H. Park, J. Turner, M. Musick, J. H. Lee, C. Jacoby et al., Sensing urban soundscapes, EDBT/ICDT Workshops, pp.375-382, 2014.

D. Stowell, D. Giannoulis, E. Benetos, M. Lagrange, and M. D. Plumbley, Detection and Classification of Acoustic Scenes and Events, IEEE Transactions on Multimedia, vol.17, issue.10, pp.1733-1746, 2015.
DOI : 10.1109/TMM.2015.2428998
URL : https://hal.archives-ouvertes.fr/hal-01253912

L. Rabiner and B. Juang, Fundamentals of Speech Recognition, 1993.

M. Müller, Information Retrieval for Music and Motion, 2007.
DOI : 10.1007/978-3-540-74048-3

T. Heittola, A. Mesaros, A. Eronen, and T. Virtanen, Context-dependent sound event detection, EURASIP Journal on Audio, Speech, and Music Processing, 2013.
DOI : 10.1109/89.365379

J. Salamon, C. Jacoby, and J. P. Bello, A dataset and taxonomy for urban sound research, Proc. 22nd ACM International Conference on Multimedia, pp.1041-1044, 2014.

D. Giannoulis, D. Stowell, E. Benetos, M. Rossignol, M. Lagrange et al., A database and challenge for acoustic scene classification and event detection, Proceedings of the European Signal Processing Conference (EUSIPCO), 2013.
URL : https://hal.archives-ouvertes.fr/hal-01123764

E. Benetos, G. Lafay, M. Lagrange, and M. D. Plumbley, Detection of overlapping acoustic events using a temporally-constrained probabilistic model, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp.6450-6454, 2016.
DOI : 10.1109/ICASSP.2016.7472919
URL : https://hal.archives-ouvertes.fr/hal-01255074

D. Giannoulis, E. Benetos, D. Stowell, M. Rossignol, M. Lagrange et al., Detection and classification of acoustic scenes and events: An IEEE AASP challenge, 2013 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2013.
DOI : 10.1109/WASPAA.2013.6701819
URL : https://hal.archives-ouvertes.fr/hal-01123765

R. Stiefelhagen, K. Bernardin, R. Bowers, R. T. Rose, M. Michel et al., The CLEAR 2007 evaluation, Multimodal Technologies for Perception of Humans: International Evaluation Workshops CLEAR, pp.3-34, 2007.

J. Barker, E. Vincent, N. Ma, H. Christensen, and P. Green, The PASCAL CHiME speech separation and recognition challenge, Computer Speech and Language, vol.27, issue.3, 2013.
URL : https://hal.archives-ouvertes.fr/hal-00646370

E. Vincent, J. Barker, S. Watanabe, J. Le-roux, F. Nesta et al., The second ‘chime’ speech separation and recognition challenge: Datasets, tasks and baselines, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing, 2013.
DOI : 10.1109/ICASSP.2013.6637622

J. Barker, R. Marxer, E. Vincent, and S. Watanabe, The third ‘CHiME’ speech separation and recognition challenge: Dataset, task and baselines, 2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU), pp.504-511, 2015.
DOI : 10.1109/ASRU.2015.7404837
URL : https://hal.archives-ouvertes.fr/hal-01211376

L. Cristoforetti, M. Ravanelli, M. Omologo, A. Sosi, A. Abad et al., The DIRHA simulated corpus, 9th International Conference on Language Resources and Evaluation (LREC), pp.2629-2634, 2014.

K. Kinoshita, M. Delcroix, S. Gannot, E. Habets, R. Haeb-umbach et al., A summary of the REVERB challenge: state-of-the-art and remaining challenges in reverberant speech processing research, EURASIP Journal on Advances in Signal Processing, vol.87, issue.7, 2016.
DOI : 10.1186/s13634-016-0306-6

I. Nelken and A. de Cheveigné, An ear for statistics, Nature Neuroscience, vol.16, issue.4, pp.381-382, 2013.
DOI : 10.1038/nn.3360

S. Spors, H. Teutsch, A. Kuntz, and R. Rabenstein, Sound Field Synthesis, Audio Signal Processing for Next-Generation Multimedia Communication Systems, pp.323-344, 2004.
DOI : 10.1007/1-4020-7769-6_12

D. N. Zotkin, R. Duraiswami, and L. S. Davis, Rendering Localized Spatial Audio in a Virtual Auditory Space, IEEE Transactions on Multimedia, vol.6, issue.4, pp.553-564, 2004.
DOI : 10.1109/TMM.2004.827516

C. Verron, M. Aramaki, R. Kronland-martinet, and G. Pallone, A 3-D Immersive Synthesizer for Environmental Sounds, IEEE Transactions on Audio, Speech, and Language Processing, vol.18, issue.6, pp.1550-1561, 2010.
DOI : 10.1109/TASL.2009.2037402
URL : https://hal.archives-ouvertes.fr/hal-00462544

D. Schwarz, State of the art in sound texture synthesis, Proc. Digital Audio Effects (DAFx), pp.221-231, 2011.
URL : https://hal.archives-ouvertes.fr/hal-01161296

J. H. Mcdermott and E. P. Simoncelli, Sound Texture Perception via Statistics of the Auditory Periphery: Evidence from Sound Synthesis, Neuron, vol.71, issue.5, pp.926-940, 2011.
DOI : 10.1016/j.neuron.2011.06.032

R. Turner and M. Sahani, Modeling natural sounds with modulation cascade processes, Advances in neural information processing systems, pp.1545-1552, 2008.

A. S. Bregman, Auditory scene analysis: The perceptual organization of sound, 1994.

J. S. Snyder and C. Alain, Toward a neurophysiological theory of auditory stream segregation, Psychological Bulletin, vol.133, issue.5, p.780, 2007.

V. Ciocca, The auditory organization of complex sounds, Frontiers in Bioscience: a journal and virtual library, pp.148-169, 2007.

R. P. Carlyon, How the brain separates sounds, Trends in Cognitive Sciences, vol.8, issue.10, pp.465-471, 2004.

J. A. Ballas and J. H. Howard, Interpreting the language of environmental sounds, Environment and Behavior, vol.19, issue.1, pp.91-114, 1987.

I. Nelken and O. Bar-yosef, Neurons and objects: the case of auditory cortex, Frontiers in Neuroscience, vol.2, issue.1, p.107, 2008.
DOI : 10.3389/neuro.01.009.2008

D. Dubois, C. Guastavino, and M. Raimbault, A cognitive approach to urban soundscapes: Using verbal data to access everyday life auditory categories, Acta Acustica united with Acustica, vol.92, issue.6, pp.865-874, 2006.

M. Raimbault and D. Dubois, Urban soundscapes: Experiences and knowledge, Cities, vol.22, issue.5, pp.339-350, 2005.
DOI : 10.1016/j.cities.2005.05.003
URL : https://hal.archives-ouvertes.fr/halshs-00204325

M. Niessen, C. Cance, and D. Dubois, Categories for soundscape: toward a hybrid classification, INTER-NOISE and NOISE-CON Congress and Conference Proceedings, pp.5816-5829, 2010.

C. Guastavino, The ideal urban soundscape: Investigating the sound quality of French cities, Acta Acustica united with Acustica, vol.92, pp.945-951, 2006.

B. Gygi and V. Shafiro, The incongruency advantage for environmental sounds presented in natural auditory scenes., Journal of Experimental Psychology: Human Perception and Performance, vol.37, issue.2, p.551, 2011.
DOI : 10.1037/a0020671

M. E. Niessen, L. Van-maanen, and T. C. Andringa, Disambiguating sound through context, International Journal of Semantic Computing, vol.2, issue.03, pp.327-341, 2008.

O. Houix, G. Lemaitre, N. Misdariis, P. Susini, and I. Urdapilleta, A lexical analysis of environmental sound categories., Journal of Experimental Psychology: Applied, vol.18, issue.1, pp.52-80, 2012.
DOI : 10.1037/a0026240
URL : https://hal.archives-ouvertes.fr/hal-00714662

F. Guyot, M. Castellengo, and B. Fabre, A study of the categorization of an everyday sound set, in Catégorisation et Cognition: De la Perception au Discours, pp.41-58, 1997.

B. Gygi, G. R. Kidd, and C. S. Watson, Similarity and categorization of environmental sounds, Perception & Psychophysics, vol.69, issue.6, pp.839-855, 2007.
DOI : 10.3758/BF03193921

M. M. Marcell, D. Borella, M. Greene, E. Kerr, and S. Rogers, Confrontation naming of environmental sounds, Journal of Clinical and Experimental Neuropsychology, vol.22, issue.6, pp.830-864, 2000.

N. J. Vanderveer, Ecological acoustics: Human perception of environmental sounds, 1979.

E. Rosch and B. B. Lloyd, Cognition and categorization, pp.27-48, 1978.

W. W. Gaver, What in the world do we hear?: An ecological approach to auditory event perception, Ecological Psychology, vol.5, issue.1, pp.1-29, 1993.

A. L. Brown, J. Kang, and T. Gjestland, Towards standardization in soundscape preference assessment, Applied Acoustics, vol.72, issue.6, pp.387-392, 2011.
DOI : 10.1016/j.apacoust.2011.01.001

M. Southworth, The sonic environment of cities, Environment and behavior, 1969.

V. Maffiolo, Semantic and acoustic characterization of urban environmental sound quality, 1999.

J. H. Mcdermott, M. Schemitsch, and E. P. Simoncelli, Summary statistics in auditory perception, Nature Neuroscience, vol.16, issue.4, pp.493-498, 2013.

N. Saint-arnaud, Classification of sound textures, Massachusetts Institute of Technology, 1995.

N. Saint-arnaud and K. Popat, Analysis and synthesis of sound textures, Readings in Computational Auditory Scene Analysis, Citeseer, 1995.

T. R. Agus, S. J. Thorpe, and D. Pressnitzer, Rapid Formation of Robust Auditory Memories: Insights from Noise, Neuron, vol.66, issue.4, pp.610-618, 2010.
DOI : 10.1016/j.neuron.2010.04.014
URL : https://hal.archives-ouvertes.fr/hal-00488683

R. Stiefelhagen, K. Bernardin, R. Bowers, J. Garofolo, D. Mostefa et al., The CLEAR 2006 Evaluation, Multimodal Technologies for Perception of Humans, pp.1-44, 2007.
DOI : 10.1007/978-3-540-69568-4_1

S. Davis and P. Mermelstein, Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences, IEEE Transactions on Acoustics, Speech, and Signal Processing, vol.28, issue.4, pp.357-366, 1980.
DOI : 10.1109/TASSP.1980.1163420

L. Rabiner, A tutorial on hidden markov models and selected applications in speech recognition, Proceedings of the IEEE, pp.257-286, 1989.

S. Chauhan, S. Phadke, and C. Sherland, Event detection and classification, 2013.

A. Diment, T. Heittola, and T. Virtanen, Sound event detection for office live and office synthetic AASP challenge, IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), 2013.

A. Diment, T. Heittola, and T. Virtanen, Sound event detection for office live and office synthetic AASP challenge, 2013.

J. F. Gemmeke, L. Vuegen, P. Karsmakers, B. Vanrumste et al., An exemplar-based NMF approach to audio event detection, IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), 2013.

J. F. Gemmeke, L. Vuegen, B. Vanrumste, and H. Van-hamme, An exemplar-based NMF approach to audio event detection, 2013 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2013.
DOI : 10.1109/WASPAA.2013.6701847

G. Roma, W. Nogueira, and P. Herrera, Recurrence quantification analysis features for environmental sound recognition, 2013 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2013.
DOI : 10.1109/WASPAA.2013.6701890

W. Nogueira, G. Roma, and P. Herrera, Automatic event classification using front end single channel noise reduction, MFCC features and a support vector machine classifier, 2013.

M. E. Niessen, T. L. Van-kasteren, and A. Merentitis, Hierarchical modeling using automated sub-clustering for sound event recognition, IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), pp.1-4, 2013.

M. E. Niessen, T. L. Van-kasteren, and A. Merentitis, Hierarchical sound event detection, 2013.

J. Schröder, N. Moritz, M. R. Schädler, B. Cauchi, K. Adiloglu et al., On the use of spectro-temporal features for the IEEE AASP challenge ‘detection and classification of acoustic scenes and events’, 2013 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2013.
DOI : 10.1109/WASPAA.2013.6701868

J. Schröder, B. Cauchi, M. R. Schädler, N. Moritz, K. Adiloglu et al., Acoustic event detection using signal enhancement and spectro-temporal feature extraction, 2013.

L. Vuegen, B. Van-den-broeck, P. Karsmakers, J. F. Gemmeke, B. Vanrumste et al., An MFCC-GMM approach for event detection and classification, 2013.