J. Ballas, Common factors in the identification of an assortment of brief everyday sounds Journal of experimental psychology: human perception and performance, pp.250-8473838, 1993.

B. Gygi, G. Kidd, and C. Watson, Spectral-temporal factors in the identification of environmental sounds, The Journal of the Acoustical Society of America, vol.115, issue.3, pp.1252-15058346, 2004.
DOI : 10.1121/1.1635840

G. Felsen and Y. Dan, A natural approach to studying vision, Nature Neuroscience, vol.431, issue.12, pp.1643-1649, 2005.
DOI : 10.1038/nn1608

C. Suied and I. Viaud-delmon, Auditory-Visual Object Recognition Time Suggests Specific Processing for Animal Sounds, PLoS ONE, vol.106, issue.4, 2009.
DOI : 10.1371/journal.pone.0005256.t001
URL : https://hal.archives-ouvertes.fr/hal-01107100

K. Robinson and R. Patterson, The stimulus duration required to identify vowels, their octave, and their pitch chroma, The Journal of the Acoustical Society of America, vol.98, issue.4, pp.1858-65, 1995.
DOI : 10.1121/1.414405

K. Robinson and R. Patterson, The duration required to identify the instrument, the octave, or the pitch chroma of a musical note. Music Perception, pp.1-15, 1995.

C. Suied, T. Agus, S. Thorpe, N. Mesgarani, and D. Pressnitzer, Auditory gist: Recognition of very short sounds from timbre cues, The Journal of the Acoustical Society of America, vol.135, issue.3, pp.1380-91, 2014.
DOI : 10.1121/1.4863659

L. Romanski, B. Tian, J. Fritz, M. Mishkin, P. Goldman-rakic et al., Dual streams of auditory afferents target multiple domains in the primate prefrontal cortex, Nature Neuroscience, vol.2, issue.12, pp.1131-1137, 1999.
DOI : 10.1038/16056

D. Lucia, M. Clarke, S. Murray, and M. , A Temporal Hierarchy for Conspecific Vocalization Discrimination in Humans, Journal of Neuroscience, vol.30, issue.33, pp.11210-11231, 2010.
DOI : 10.1523/JNEUROSCI.2239-10.2010

P. Belin, R. Zatorre, P. Lafaille, P. Ahad, and B. Pike, Voice-selective areas in human auditory cortex, Nature, vol.403, issue.6767, pp.309-321, 2000.
DOI : 10.1038/35002078

J. Lewis, J. Brefczynski, R. Phinney, J. Janik, and E. Deyoe, Distinct Cortical Pathways for Processing Tool versus Animal Sounds, Journal of Neuroscience, vol.25, issue.21, pp.5148-58, 2005.
DOI : 10.1523/JNEUROSCI.0419-05.2005

A. Leaver and J. Rauschecker, Cortical Representation of Natural Complex Sounds: Effects of Acoustic Features and Auditory Object Category, Journal of Neuroscience, vol.30, issue.22, pp.7604-7616, 2010.
DOI : 10.1523/JNEUROSCI.0296-10.2010

N. Staeren, H. Renvall, D. Martino, F. Goebel, R. Formisano et al., Sound Categories Are Represented as Distributed Patterns in the Human Auditory Cortex, Current Biology, vol.19, issue.6, pp.498-502, 2009.
DOI : 10.1016/j.cub.2009.01.066

M. Moerel, D. Martino, F. Formisano, and E. , Processing of Natural Sounds in Human Auditory Cortex: Tonotopy, Spectral Tuning, and Relation to Voice Sensitivity, Journal of Neuroscience, vol.32, issue.41, pp.14205-14221, 2012.
DOI : 10.1523/JNEUROSCI.1388-12.2012

B. Giordano, S. Mcadams, R. Zatorre, N. Kriegeskorte, and P. Belin, Abstract Encoding of Auditory Objects in Cortical Activity Patterns, Cerebral Cortex, vol.23, issue.9, pp.2025-2062, 2013.
DOI : 10.1093/cercor/bhs162

C. Altmann, O. Doehrmann, and J. Kaiser, Selectivity for Animal Vocalizations in the Human Auditory Cortex, Cerebral Cortex, vol.17, issue.11, pp.2601-2609, 2007.
DOI : 10.1093/cercor/bhl167

R. Santoro, M. Moerel, D. Martino, F. Goebel, R. Ugurbil et al., Encoding of Natural Sounds at Multiple Spectral and Temporal Resolutions in the Human Auditory Cortex, PLoS Computational Biology, vol.83, issue.270, p.24391486, 2014.
DOI : 10.1371/journal.pcbi.1003412.s009

K. Patil, D. Pressnitzer, S. Shamma, and M. Elhilali, Music in our ears: the biological bases of musical timbre perception, PLoS computational biology, vol.8, issue.11, p.23133363, 2012.

E. Smith and M. Lewicki, Efficient auditory coding, Nature, vol.105, issue.7079, pp.978-82, 2006.
DOI : 10.1038/nature04485

T. Hromadka and A. Zador, Representations in auditory cortex Current opinion in neurobiology, pp.430-433, 2009.

R. Shannon, F. Zeng, V. Kamath, J. Wygonski, and M. Ekelid, Speech Recognition with Primarily Temporal Cues, Science, vol.270, issue.5234, pp.303-307, 1995.
DOI : 10.1126/science.270.5234.303

R. Remez, P. Rubin, D. Pisoni, and T. Carrell, Speech perception without traditional speech cues, Science, vol.212, issue.4497, pp.947-956, 1981.
DOI : 10.1126/science.7233191

C. Suied, A. Drémeau, D. Pressnitzer, and L. Daudet, Auditory sketches: Sparse representations of sounds based on perceptual models. From Sounds to Music and Emotions, pp.154-70, 2013.

J. Grey, Multidimensional perceptual scaling of musical timbres, The Journal of the Acoustical Society of America, vol.61, issue.5, pp.1270-1277, 1977.
DOI : 10.1121/1.381428

S. Mcadams, S. Winsberg, S. Donnadieu, D. Soete, G. Krimphoff et al., Perceptual scaling of synthesized musical timbres: Common dimensions, specificities, and latent subject classes, Psychological Research, vol.72, issue.2, pp.177-92, 1995.
DOI : 10.1007/BF00419633
URL : https://hal.archives-ouvertes.fr/hal-00828647

T. Elliott, L. Hamilton, and F. Theunissen, Acoustic structure of the five perceptual dimensions of timbre in orchestral instrument tones, The Journal of the Acoustical Society of America, vol.133, issue.1, pp.389-404, 2013.
DOI : 10.1121/1.4770244

J. Krimphoff, S. Mcadams, and S. Winsberg, Caract??risation du timbre des sons complexes.II. Analyses acoustiques et quantification psychophysique, Le Journal de Physique IV, vol.04, issue.C5, pp.5-625, 1994.
DOI : 10.1051/jp4:19945134

T. Chi, P. Ru, and S. Shamma, Multiresolution spectrotemporal analysis of complex sounds, The Journal of the Acoustical Society of America, vol.118, issue.2, pp.887-16158645, 2005.
DOI : 10.1121/1.1945807

P. Boersma and D. Weenink, Praat: doing phonetics by computer [Computer program]. Version 5.4.14, retrieved 24, 2015.

X. Yang, K. Wang, and S. Shamma, Auditory representations of acoustic signals. Information Theory, IEEE Transactions on, vol.38, issue.2, pp.824-863, 1992.
DOI : 10.1109/18.119739
URL : http://drum.lib.umd.edu/bitstream/1903/5064/1/TR_91-16.pdf

N. Macmillan and C. Creelman, Detection Theory: A User's Guide Lawrence Erlbaum Associates, 2005.

L. Decarlo, On a signal detection approach to -alternative forced choice with bias, with maximum likelihood and Bayesian approaches to estimation, Journal of Mathematical Psychology, vol.56, issue.3, pp.196-207, 2012.
DOI : 10.1016/j.jmp.2012.02.004

T. Agus, C. Suied, S. Thorpe, and D. Pressnitzer, Fast recognition of musical sounds based on timbre, The Journal of the Acoustical Society of America, vol.131, issue.5, pp.4124-4157, 2012.
DOI : 10.1121/1.3701865
URL : https://hal.archives-ouvertes.fr/hal-00706659

B. Moore, Temporal integration and context effects in hearing, Journal of Phonetics, vol.31, issue.3-4, pp.3-4563, 2003.
DOI : 10.1016/S0095-4470(03)00011-1

M. Plumbley, T. Blumensath, L. Daudet, R. Gribonval, and M. Davies, Sparse Representations in Audio and Music: From Coding to Source Separation, Proceedings of the IEEE, vol.98, issue.6, pp.995-1005, 2010.
DOI : 10.1109/JPROC.2009.2030345
URL : https://hal.archives-ouvertes.fr/inria-00489524

A. Liberman and I. Mattingly, A specialization for speech perception, Science, vol.243, issue.4890, pp.489-94, 1989.
DOI : 10.1126/science.2643163

P. Belin, Voice processing in human and non-human primates, Philosophical Transactions of the Royal Society B: Biological Sciences, vol.204, issue.4395, pp.2091-107, 1476.
DOI : 10.1126/science.108805
URL : http://www.ncbi.nlm.nih.gov/pmc/articles/PMC1764839

C. Du, C. Du, and H. , expérience contrôle de l'étude du temps de traitement auditif visait à établir la durée minimale de présentation nécessaire pour la reconnaissance de stimuli auditifs courts et présentés individuellement, Comme détaillé précédemment, chaque catégorie (voix ou instruments) était composée de quatre sources sonores origi- nales

. Mcadams, 2013), tandis que l'inuence des composantes périodiques et apériodiques du signal quantiée par le HNR a notamment été mis en évidence dans des études d'imagerie sur l'encodage de sons naturels par le cortex auditif, 1995.

&. Hacker and . Ratcli, la table proposée par les auteurs donne les valeurs de d pour des valeurs de proportion correcte de 0 à 1, et pour diérents nombres d'alternatives. La proportion correcte correspond dans ce cas à la somme des, 1979.

&. Hacker and . Ratcli, adapté par catégorie : pour chaque catégorie, on calcule une proportion correcte qui correspond au taux de détections et rejets corrects, Puis on reporte cette valeur dans le tableau de Hacker & Ratcli, 1979.

. Isnard, le biais est calculé d'après la méthode de DeCarlo (2012), puis réinjecté pour faire le calcul de sensibilité par catégorie, 2016.

R. Adolphs, R. Gosselin, F. Buchanan, T. W. Tranel, D. Schyns et al., A mechanism for impaired fear recognition after amygdala damage, Nature, vol.11, issue.7021, p.4336872, 2005.
DOI : 10.1023/A:1016374617369

T. R. Agus, C. Suied, S. J. Thorpe, and D. Pressnitzer, Fast recognition of musical sounds based on timbre, The Journal of the Acoustical Society of America, vol.131, issue.5, p.131412433, 2012.
DOI : 10.1121/1.3701865
URL : https://hal.archives-ouvertes.fr/hal-00706659

A. J. Ahumada and J. Lovell, Stimulus Features in Signal Detection, The Journal of the Acoustical Society of America, vol.49, issue.6B, p.4917511756, 1971.
DOI : 10.1121/1.1912577

A. J. Ahumada, R. Marken, and A. Sandusky, Time and frequency analyses of auditory signal detection, The Journal of the Acoustical Society of America, vol.57, issue.2, p.385390, 1975.
DOI : 10.1121/1.380453

F. Attneave, Some informational aspects of visual perception., Psychological Review, vol.61, issue.3, p.183, 1954.
DOI : 10.1037/h0054663
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.300.5892

J. A. Ballas, Common factors in the identification of an assortment of brief everyday sounds., Journal of Experimental Psychology: Human Perception and Performance, vol.19, issue.2, p.250, 1993.
DOI : 10.1037/0096-1523.19.2.250

H. B. Barlow, Possible Principles Underlying the Transformations of Sensory Messages, Sensory Communications, p.217234, 1961.
DOI : 10.7551/mitpress/9780262518420.003.0013

B. Bathellier, L. Ushakova, and S. Rumpel, Discrete Neocortical Dynamics Predict Behavioral Categorization of Sounds, Neuron, vol.76, issue.2, p.76435449, 2012.
DOI : 10.1016/j.neuron.2012.07.008

P. Belin, Voice processing in human and non-human primates, Philosophical Transactions of the Royal Society B: Biological Sciences, vol.204, issue.4395, p.3612091107, 1476.
DOI : 10.1126/science.108805
URL : http://www.ncbi.nlm.nih.gov/pmc/articles/PMC1764839

R. Belin, P. Fecteau, S. Bedard, and C. , Thinking the voice: neural correlates of voice perception, Trends in Cognitive Sciences, vol.8, issue.3, p.12935, 2004.
DOI : 10.1016/j.tics.2004.01.008

P. Belin and M. Grosbras, Before Speech: Cerebral Voice Processing in Infants, Neuron, vol.65, issue.6, p.65733735, 2010.
DOI : 10.1016/j.neuron.2010.03.018
URL : http://doi.org/10.1016/j.neuron.2010.03.018

P. Belin, R. J. Zatorre, and P. Ahad, Human temporal-lobe response to vocal sounds, Cognitive Brain Research, vol.13, issue.1, p.1726, 2002.
DOI : 10.1016/S0926-6410(01)00084-2

P. Belin, R. J. Zatorre, P. Lafaille, P. Ahad, and B. Pike, Voice-selective areas in human auditory cortex, Nature, issue.6767, p.403309312, 2000.

A. A. Benasich, J. J. Thomas, N. Choudhury, and P. H. Leppanen, The importance of rapid auditory processing abilities to early language development: Evidence from converging methodologies, Developmental Psychobiology, vol.8, issue.Suppl., p.40278292, 2002.
DOI : 10.1002/dev.10032

P. E. Bestelmeyer, J. Rouger, L. M. Debruine, and P. Belin, Auditory adaptation in vocal aect perception, Cognition, vol.117, issue.2, p.217223, 2010.
DOI : 10.1016/j.cognition.2010.08.008

E. Bigand, C. Delbe, Y. Gerard, and B. Tillmann, Categorization of Extremely Brief Auditory Stimuli: Domain-Specific or Domain-General Processes?, PLoS ONE, vol.1060, issue.10, p.27024, 2011.
DOI : 10.1371/journal.pone.0027024.s002
URL : http://doi.org/10.1371/journal.pone.0027024

S. Bleeck, T. Ives, and R. D. Patterson, Aim-mat : the auditory image model in matlab, Acta Acustica United with Acustica, vol.90, issue.4, p.781787, 2004.

P. Boersma, Accurate short-term analysis of the fundamental frequency and the harmonics-to-noise ratio of a sampled sound, Proceedings of the institute of phonetic sciences, p.97110, 1993.

P. Boersma and D. Weenink, Praat : doing phonetics by computer [computer program]. version 5.4.12, retrieved 10 july, 2015.

P. Boulez, Timbre and composition ??? timbre and language, Contemporary Music Review, vol.2, issue.1, p.161171, 1987.
DOI : 10.1080/07494468708567057

D. S. Brungart and B. D. Simpson, Improving multitalker speech communication with advanced audio displays, 2005.

S. Buat, J. Plantier, C. Roumes, and J. Lorenceau, Repetition blindness for natural images of objects with viewpoint changes, Front Psychol, vol.3, p.622, 2012.

S. Buat, J. Plantier, C. Roumes, and J. Lorenceau, Repetition blindness for natural images of objects with viewpoint changes, Frontiers in psychology, vol.3, 2013.

A. Caclin, Interactions et independances entre dimensions du timbre des sons complexes : approche psychophysique et electrophysiologique chez l, 2004.

A. Caclin, S. Mcadams, B. K. Smith, and S. Winsberg, Acoustic correlates of timbre space dimensions: A confirmatory study using synthetic tones, The Journal of the Acoustical Society of America, vol.118, issue.1, p.471, 2005.
DOI : 10.1121/1.1929229

J. D. Carroll and J. Chang, Analysis of individual dierences in multidimensional scaling via an n-way generalization of "eckart-young" decomposition, Psychometrika, issue.3, p.35283319, 1970.

M. Carron, Methodes et outils pour denir et vehiculer une identite sonore : application au design sonore identitaire de la marque SNCF, 2016.

P. Cavanagh, The artist as neuroscientist, Nature, vol.6, issue.7031, p.301307, 2005.
DOI : 10.1073/pnas.96.4.1680

I. Charest, C. R. Pernet, G. A. Rousselet, I. Quinones, M. Latinus et al., Electrophysiological evidence for an early processing of human voices, BMC Neuroscience, vol.10, issue.1, p.127, 2009.
DOI : 10.1186/1471-2202-10-127

T. Chi, P. Ru, and S. A. Shamma, Multiresolution spectrotemporal analysis of complex sounds, The Journal of the Acoustical Society of America, vol.118, issue.2, p.887, 2005.
DOI : 10.1121/1.1945807

S. Chon and S. Mcadams, Exploring blending as a function of timbre saliency, Proceedings of the 12th International Conference of Music Perception and Cognition, 2012.

S. H. Chon, K. Schwartzbach, . Smith, and S. Mcadams, Eect of timbre on melody recognition in three-voice counterpoint music, Proceedings of the Sound and Music Computing Conference 2013, 2013.

M. M. Chun and M. C. Potter, A two-stage model for multiple target detection in rapid serial visual presentation., Journal of Experimental Psychology: Human Perception and Performance, vol.21, issue.1, p.109, 1995.
DOI : 10.1037/0096-1523.21.1.109

M. Clark, D. Luce, R. Abrams, H. Schlossberg, and J. Rome, Preliminary experiments on the aural signicance of parts of tones of orchestral instruments and on choral tones, Journal of the Audio Engineering Society, vol.11, issue.1, p.4554, 1963.

M. Cooke, A glimpsing model of speech perception in noise, The Journal of the Acoustical Society of America, vol.119, issue.3, p.15621573, 2006.
DOI : 10.1121/1.2166600

B. Craven, A table of d' for m-alternative odd-man-out forced-choice procedures, Perception & psychophysics, vol.51, issue.4, p.379385, 1992.

W. Creel, P. C. Boomsliter, and S. R. Powers, Sensations of tone as perceptual forms., Psychological Review, vol.77, issue.6, p.77534, 1970.
DOI : 10.1037/h0029943

R. Cusack and R. P. Carlyon, Perceptual asymetries in audition., Journal of Experimental Psychology: Human Perception and Performance, vol.29, issue.3, p.713725, 2003.
DOI : 10.1037/0096-1523.29.3.713

F. Cutzu and S. Edelman, Faithful representation of similarities among three-dimensional shapes in human vision., Proceedings of the National Academy of Sciences, vol.93, issue.21, p.931204612050, 1996.
DOI : 10.1073/pnas.93.21.12046

H. Dai and C. Micheyl, Psychophysical reverse correlation with multiple response alternatives., Journal of Experimental Psychology: Human Perception and Performance, vol.36, issue.4, p.97693, 2010.
DOI : 10.1037/a0017171
URL : http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3158580

D. Lucia, M. Clarke, S. Murray, and M. M. , A temporal hierarchy for conspecic vocalization discrimination in humans, The Journal of Neuroscience, issue.33, p.301121011221, 2010.

L. T. Decarlo, On a signal detection approach to -alternative forced choice with bias, with maximum likelihood and bayesian approaches to estimation Optimizing sound features for cortical neurons, Journal of Mathematical Psychology Science, vol.56, issue.35368, p.28014391444, 1998.

D. D. Dirks and D. Bower, Eect of forward and backward masking on speech intelligibility, The Journal of the Acoustical Society of America, issue.4B, p.4710031008, 1970.

F. C. Donders, On the speed of mental processes, Acta Psychologica, vol.30, p.412431, 1969.
DOI : 10.1016/0001-6918(69)90065-1

M. F. Dorman, P. C. Loizou, and D. Rainey, Speech intelligibility as a function of the number of channels of stimulation for signal processors using sine-wave and noise-band outputs, The Journal of the Acoustical Society of America, vol.102, issue.4, p.24032411, 1997.
DOI : 10.1121/1.419603

J. Duncan, S. Martens, and R. Ward, Within but not between sensory modalities, Nature, vol.387, p.809, 1997.

R. Efron, The minimum duration of a perception, Neuropsychologia, vol.8, issue.1, p.5763, 1970.
DOI : 10.1016/0028-3932(70)90025-4

R. Efron, The relationship between the duration of a stimulus and the duration of a perception, Neuropsychologia, vol.8, issue.1, p.3755, 1970.
DOI : 10.1016/0028-3932(70)90024-2

D. Ehresman and D. L. Wessel, Perception of timbral analogies, 1978.

M. Elad, Sparse and Redundant Representation Modeling—What Next?, IEEE Signal Processing Letters, vol.19, issue.12, p.922928, 2012.
DOI : 10.1109/LSP.2012.2224655

P. B. Elliot, Tables of d Signal detection and recognition by human observers, 1964.

C. A. Elliott, Attacks and releases as factors in instrument identication, Journal of Research in Music Education, vol.23, issue.1, p.3540, 1975.
DOI : 10.2307/3345201

L. L. Elliott, Development of Auditory Narrow???Band Frequency Contours, The Journal of the Acoustical Society of America, vol.42, issue.1, 1967.
DOI : 10.1121/1.1910543

T. M. Elliott, L. S. Hamilton, and F. E. Theunissen, Acoustic structure of the ve perceptual dimensions of timbre in orchestral instrument tones, J Acoust Soc Am, vol.133, issue.1, p.389404, 2013.

D. S. Emmerich, J. L. Gray, C. S. Watson, and D. C. Tanis, Response latency, condence, and rocs in auditory signal detection, Perception & Psychophysics, vol.11, issue.1, p.6572, 1972.
DOI : 10.3758/bf03212686

D. M. Ennis and M. O-'mahony, Probabilistic models for sequential taste effects in triadic choice., Journal of Experimental Psychology: Human Perception and Performance, vol.21, issue.5, p.1088, 1995.
DOI : 10.1037/0096-1523.21.5.1088

S. Fecteau, J. L. Armony, Y. Joanette, and P. Belin, Is voice processing species-specic in human auditory cortex ? an fmri study, Neuroimage, vol.23, issue.3, p.8408, 2004.

G. Felsen and Y. Dan, A natural approach to studying vision, Nature Neuroscience, vol.431, issue.12, p.16436, 2005.
DOI : 10.1038/nn1608

D. J. Field, Relations between the statistics of natural images and the response properties of cortical cells, Journal of the Optical Society of America A, vol.4, issue.12, p.23792394, 1987.
DOI : 10.1364/JOSAA.4.002379

E. Formisano, D. Martino, F. Bonte, M. Goebel, and R. , "Who" Is Saying "What"? Brain-Based Decoding of Human Voice and Speech, Science, vol.435, issue.17, p.3229703, 2008.
DOI : 10.1073/pnas.94.17.9440

M. Frenkel, G. F. Sherman, K. A. Bashan, A. M. Galaburda, and J. J. Loturco, Neocortical ectopias are associated with attenuated neurophysiological responses to rapidly changing auditory stimuli, NeuroReport, vol.11, issue.3, p.11575579, 2000.
DOI : 10.1097/00001756-200002280-00029

B. L. Giordano, J. Mcdonnell, and S. Mcadams, Hearing living symbols and nonliving icons: Category specificities in the cognitive processing of environmental sounds, Brain and Cognition, vol.73, issue.1, p.719, 2010.
DOI : 10.1016/j.bandc.2010.01.005

B. R. Glasberg and B. C. Moore, A model of loudness applicable to timevarying sounds, Journal of the Audio Engineering Society, vol.50, issue.5, p.331342, 2002.

F. Gosselin and P. G. Schyns, Bubbles: a technique to reveal the use of information in recognition tasks, Vision Research, vol.41, issue.17, p.4122612271, 2001.
DOI : 10.1016/S0042-6989(01)00097-9

F. Gosselin and P. G. Schyns, RAP: a new framework for visual categorization, Trends in Cognitive Sciences, vol.6, issue.2, p.7077, 2002.
DOI : 10.1016/S1364-6613(00)01838-6

F. Gosselin and P. G. Schyns, Superstitious Perceptions Reveal Properties of Internal Representations, Psychological Science, vol.39, issue.5, p.505509, 2003.
DOI : 10.1038/369395a0

F. Gosselin and P. G. Schyns, No troubles with bubbles: a reply to Murray and Gold, Vision Research, vol.44, issue.5, p.471477, 2004.
DOI : 10.1016/j.visres.2003.10.007
URL : http://doi.org/10.1016/j.visres.2003.10.007

F. Gosselin and P. G. Schyns, Bubbles : A user's guide. Building object categories in developmental time, p.91106, 2005.

U. Goswami, Sensory theories of developmental dyslexia: three challenges for research, Nature Reviews Neuroscience, vol.7, issue.1, p.4354, 2015.
DOI : 10.1016/j.cub.2013.01.044

R. Goto, M. Hashiguchi, H. Nishimura, T. Oka, and R. , Rwc music database : Music genre database and musical instrument sound database, ISMIR, p.229230, 2003.

G. W. Gray, Phonemic microtomy: The minimum duration of perceptible speech sounds, Speech Monographs, vol.9, issue.1, p.7590, 1942.
DOI : 10.1080/00335632609379646

D. Green and J. Swets, Signal detection theory and psychophysics, p.889, 1966.

D. M. Green, Consistency of auditory detection judgments., Psychological Review, vol.71, issue.5, p.392, 1964.
DOI : 10.1037/h0044520

J. M. Grey, An exploration of musical timbre, 1975.

J. M. Grey, Multidimensional perceptual scaling of musical timbres, The Journal of the Acoustical Society of America, vol.61, issue.5, p.6112701277, 1977.
DOI : 10.1121/1.381428

J. M. Grey and J. W. Gordon, Perceptual effects of spectral modifications on musical timbres, The Journal of the Acoustical Society of America, vol.63, issue.5, pp.1493-1500, 1978.
DOI : 10.1121/1.381843

J. M. Grey and J. A. Moorer, Perceptual evaluations of synthesized musical instrument tones, The Journal of the Acoustical Society of America, vol.62, issue.2, pp.454-462, 1977.
DOI : 10.1121/1.381508

T. D. Griths and J. D. Warren, What is an auditory object ?, Nature Reviews Neuroscience, vol.5, issue.11, p.887892, 2004.

B. Gygi, G. R. Kidd, and C. S. Watson, Spectral-temporal factors in the identification of environmental sounds, The Journal of the Acoustical Society of America, vol.115, issue.3, p.1252, 2004.
DOI : 10.1121/1.1635840

B. Gygi, G. R. Kidd, and C. S. Watson, Similarity and categorization of environmental sounds, Perception & Psychophysics, vol.47, issue.6, p.69839855, 2007.
DOI : 10.3758/BF03193921

M. J. Hacker and R. Ratcli, A revised table of d??? for M-alternative forced choice, Perception & Psychophysics, vol.26, issue.2, p.168170, 1979.
DOI : 10.3758/BF03208311

A. R. Halpern, R. J. Zatorre, M. Bouard, and J. A. Johnson, Behavioral and neural correlates of perceived and imagined musical timbre, Neuropsychologia, vol.42, issue.9, p.42128192, 2004.
DOI : 10.1016/j.neuropsychologia.2003.12.017

S. Handel and M. L. Erickson, A rule of thumb : The bandwidth for timbre invariance is one octave. Music Perception, An Interdisciplinary Journal, vol.19, issue.1, p.121126, 2001.

S. Harding, M. Cooke, and P. Konig, Auditory Gist Perception: An Alternative to Attentional Selection of Auditory Streams?, International Workshop on Attention in Cognitive Systems, p.399416, 2007.
DOI : 10.1007/978-3-540-77343-6_26

R. Hari, Illusory directional hearing in humans, Neuroscience Letters, vol.189, issue.1, p.2930, 1995.
DOI : 10.1016/0304-3940(95)11443-Z

R. Hari and P. Kiesila, Decit of temporal auditory processing in dyslexic adults, Neuroscience letters, vol.205, issue.2, p.138140, 1996.

R. Hari and H. Renvall, Impaired processing of rapid stimulus sequences in dyslexia, Trends in Cognitive Sciences, vol.5, issue.12, p.525532, 2001.
DOI : 10.1016/S1364-6613(00)01801-5

J. V. Haxby, M. I. Gobbini, M. L. Furey, A. Ishai, J. L. Schouten et al., Distributed and Overlapping Representations of Faces and Objects in Ventral Temporal Cortex, Science, vol.293, issue.5539, p.29324252430, 2001.
DOI : 10.1126/science.1063736

H. Helmholtz, On the sensation of tone as a physiological basis for the theory of music, AJ Ellis, Trans.), 1895.

T. Hromadka, M. R. Deweese, and A. M. Zador, Sparse Representation of Sounds in the Unanesthetized Auditory Cortex, PLoS Biology, vol.426, issue.1, p.16, 2008.
DOI : 10.1371/journal.pbio.0060016.sd002

T. Hromadka and A. M. Zador, Representations in auditory cortex, Current Opinion in Neurobiology, vol.19, issue.4, p.430433, 2009.
DOI : 10.1016/j.conb.2009.07.009

V. Isnard, M. Taou, I. Viaud-delmon, and C. Suied, Auditory Sketches: Very Sparse Representations of Sounds Are Still Recognizable, PLOS ONE, vol.361, issue.1476, p.150313, 2016.
DOI : 10.1371/journal.pone.0150313.s002
URL : https://hal.archives-ouvertes.fr/hal-01286047

L. Itti, C. Koch, and E. Niebur, A model of saliency-based visual attention for rapid scene analysis, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.20, issue.11, p.2012541259, 1998.
DOI : 10.1109/34.730558

P. Iverson and C. L. Krumhansl, Isolating the dynamic attributes of musical timbre, The Journal of the Acoustical Society of America, vol.94, issue.5, p.25952603, 1993.

T. L. Jernigan, J. R. Hesselink, E. Sowell, and P. A. Tallal, Cerebral Structure on Magnetic Resonance Imaging in Language- and Learning-Impaired Children, Archives of Neurology, vol.48, issue.5, p.539545, 1991.
DOI : 10.1001/archneur.1991.00530170103028

I. S. Johnsrude, R. J. Zatorre, B. A. Milner, and A. C. Evans, Lefthemisphere specialization for the processing of acoustic transients, NeuroReport, vol.8, issue.7, p.17611765, 1997.

M. Joos, Acoustic Phonetics, Language, vol.24, issue.2, p.5136, 1948.
DOI : 10.2307/522229

J. H. Kaas and T. A. Hackett, What' and 'where' processing in auditory cortex, Nature neuroscience, vol.2, issue.12, 1999.

N. Kanwisher, J. Mcdermott, and M. M. Chun, The fusiform face area : a module in human extrastriate cortex specialized for face perception, The Journal of neuroscience, issue.11, p.1743024311, 1997.

J. T. Kaplan and M. Iacoboni, Listen to my actions!, Behavioral and Brain Sciences, vol.28, issue.02, p.135136, 2005.
DOI : 10.1017/S0140525X05330032

J. T. Kaplan and M. Iacoboni, Multimodal action representation in human left ventral premotor cortex, Cognitive Processing, vol.24, issue.14, p.103113, 2007.
DOI : 10.1007/s10339-007-0165-z

A. Kapoor and J. B. Allen, Perceptual eects of plosive feature modication, 2012.
DOI : 10.1121/1.3665991
URL : http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3283903

S. G. Karadogan, J. Larsen, M. Syskindpedersen, and J. B. Boldt, Robust isolated speech recognition using binary masks, Signal Processing Conference 18th European, 2010.

R. Kawahara, H. Matsui, and H. , Auditory morphing based on an elastic perceptual distance metric in an interference-free time-frequency representation, 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03)., 2003.
DOI : 10.1109/ICASSP.2003.1198766

C. Kayser, C. I. Petkov, M. Lippert, and N. K. Logothetis, Mechanisms for Allocating Auditory Attention: An Auditory Saliency Map, Current Biology, vol.15, issue.21, p.1519437, 2005.
DOI : 10.1016/j.cub.2005.09.040
URL : http://doi.org/10.1016/j.cub.2005.09.040

R. A. Kendall, E. C. Carterette, and J. M. Hajda, Perceptual and acoustical features of natural and synthetic orchestral instrument tones. Music Perception, An Interdisciplinary Journal, vol.16, issue.3, p.327363, 1999.
DOI : 10.2307/40285796

C. Keysers, D. Xiao, P. Foldiak, and D. I. Perrett, The Speed of Sight, Journal of Cognitive Neuroscience, vol.74, issue.1, 2001.
DOI : 10.3758/BF03210089

A. J. King and I. Nelken, Unraveling the principles of auditory cortical processing: can we learn from the visual system?, Nature Neuroscience, vol.129, issue.6, pp.698-701, 2009.
DOI : 10.1038/nn.2308

M. D. Klein and J. A. Stolz, Looking and listening: A comparison of intertrial repetition effects in visual and auditory search tasks, Attention, Perception, & Psychophysics, vol.39, issue.5, p.77, 2015.
DOI : 10.3758/s13414-015-0908-3

K. P. Kording, P. Konig, and D. J. Klein, Learning of sparse auditory receptive elds, Proceedings of the International Joint Conference on Neural Networks (IJCNN), p.11031108, 2002.
DOI : 10.1109/ijcnn.2002.1007648
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.23.7180

N. Kriegeskorte, M. Mur, and P. A. Bandettini, Representational similarity analysis ??? connecting the branches of systems neuroscience, Frontiers in systems neuroscience, p.4, 2008.
DOI : 10.3389/neuro.06.004.2008

K. V. Kriegstein and A. Giraud, Distinct functional substrates along the right superior temporal sulcus for the processing of voices, NeuroImage, vol.22, issue.2, p.948955, 2004.
DOI : 10.1016/j.neuroimage.2004.02.020

C. L. Krumhansl, Why is musical timbre so hard to understand. Structure and perception of electroacoustic sound and music, p.4353, 1989.

P. Lakatos, Z. Pincze, K. G. Fu, D. C. Javitt, G. Karmos et al., Timing of pure tone and noise-evoked responses in macaque auditory cortex, NeuroReport, vol.16, issue.9, p.16933937, 2005.
DOI : 10.1097/00001756-200506210-00011

P. Lakatos, A. S. Shah, K. H. Knuth, I. Ulbert, G. Karmos et al., An Oscillatory Hierarchy Controlling Neuronal Excitability and Stimulus Processing in the Auditory Cortex, Journal of Neurophysiology, vol.94, issue.3, p.19041911, 2005.
DOI : 10.1152/jn.00263.2005

M. Latinus, P. Mcaleer, P. E. Bestelmeyer, and P. Belin, Norm-Based Coding of Voice Identity in Human Auditory Cortex, Current Biology, vol.23, issue.12, pp.1075-1080, 2013.
DOI : 10.1016/j.cub.2013.04.055

A. M. Leaver and J. P. Rauschecker, Cortical Representation of Natural Complex Sounds: Effects of Acoustic Features and Auditory Object Category, Journal of Neuroscience, vol.30, issue.22, p.30760412, 2010.
DOI : 10.1523/JNEUROSCI.0296-10.2010

G. Lemaitre, P. Susini, S. Winsberg, and S. Mcadams, Perceptively based design of new car horn sounds, 2003.

G. Lemaitre, P. Susini, S. Winsberg, S. Mcadams, and B. Letinturier, The sound quality of car horns : a psychoacoustical study of timbre, Acta acustica united with Acustica, issue.3, p.93457468, 2007.
URL : https://hal.archives-ouvertes.fr/hal-01106365

D. A. Leopold, I. V. Bondar, and M. A. Giese, Norm-based face encoding by single neurons in the monkey inferotemporal cortex, Nature, vol.22, issue.7102, pp.442572-575, 2006.
DOI : 10.1038/nature04951

D. A. Levy, R. Granot, and S. Bentin, Processing specicity for human voice stimuli : electrophysiological evidence, Neuroreport, issue.12, p.1226532657, 2001.
DOI : 10.1097/00001756-200108280-00013

M. S. Lewicki, Efficient coding of natural sounds, Nature Neuroscience, vol.5, issue.4, pp.356-63, 2002.
DOI : 10.1038/nn831
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.386.3036

J. W. Lewis, Cortical Networks Related to Human Use of Tools, The Neuroscientist, vol.1, issue.12, p.211231, 2006.
DOI : 10.1177/1073858406288327

J. W. Lewis, J. A. Brefczynski, R. E. Phinney, J. J. Janik, and E. A. Deyoe, Distinct Cortical Pathways for Processing Tool versus Animal Sounds, Journal of Neuroscience, vol.25, issue.21, p.25514858, 2005.
DOI : 10.1523/JNEUROSCI.0419-05.2005

J. W. Lewis, W. J. Talkington, N. A. Walker, G. A. Spirou, A. Jajosky et al., Human Cortical Organization for Processing Vocalizations Indicates Representation of Harmonic Structure as a Signal Attribute, Journal of Neuroscience, vol.29, issue.7, p.29228396, 2009.
DOI : 10.1523/JNEUROSCI.4145-08.2009

F. Li, A. Menon, and J. B. Allen, A psychoacoustic method to nd the perceptual cues of stop consonants in natural speech, J Acoust Soc Am, vol.127, issue.4, p.2599610, 2010.

A. M. Liberman and I. G. Mattingly, A specialization for speech perception, Science, vol.243, issue.4890, p.243489494, 1989.
DOI : 10.1126/science.2643163

M. Livingstone and D. Hubel, Segregation of form, color, movement, and depth: anatomy, physiology, and perception, Science, vol.240, issue.4853, 1988.
DOI : 10.1126/science.3283936

S. G. Lomber and S. Malhotra, Double dissociation of 'what' and 'where' processing in auditory cortex, Nature Neuroscience, vol.18, issue.5, p.609616, 2008.
DOI : 10.1016/j.neuroimage.2004.01.014

R. D. Luce, Detection and recognition, pages 103189, 1963.

R. Lyon, A computational model of ltering, detection, and compression in the cochlea, Acoustics, Speech, and Signal Processing, IEEE International Conference on ICASSP'82, p.12821285, 1982.

R. Lyon, Computational models of neural auditory processing, ICASSP '84. IEEE International Conference on Acoustics, Speech, and Signal Processing, 1984.
DOI : 10.1109/ICASSP.1984.1172756

R. Lyon and S. Shamma, Auditory Representations of Timbre and Pitch, p.221270, 1996.
DOI : 10.1007/978-1-4612-4070-9_6

R. F. Lyon, A. G. Katsiamis, and E. M. Drakakis, History and future of auditory lter models, Proceedings of 2010 IEEE International Symposium on Circuits and Systems, p.38093812, 2010.

R. F. Lyon and C. Mead, An analog electronic cochlea, IEEE Transactions on Acoustics, Speech, and Signal Processing, vol.36, issue.7, p.3611191134, 1988.
DOI : 10.1109/29.1639
URL : http://authors.library.caltech.edu/53125/1/388884.pdf

O. Macherey and A. Delpierre, Perception of Musical Timbre by Cochlear Implant Listeners, Ear and Hearing, vol.34, issue.4, p.426, 2013.
DOI : 10.1097/AUD.0b013e31827535f8
URL : https://hal.archives-ouvertes.fr/hal-01328923

N. Macmillan and C. Creelman, Detection theory : A user's guide lawrence erlbaum associates, 2005.

P. P. Maeder, R. A. Meuli, M. Adriani, A. Bellmann, E. Fornari et al., Distinct pathways involved in sound recognition and localization : a human fMRI study, Neuroimage, vol.14, issue.4, p.802816, 2001.

M. Mandel, Learning an intelligibility map of individual utterances, 2013 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, p.14, 2013.
DOI : 10.1109/WASPAA.2013.6701835

M. I. Mandel, S. E. Yoho, and E. W. Healy, Generalizing time-frequency importance functions across noises, talkers, and phonemes, Fifteenth Annual Conference of the International Speech Communication Association, 2014.

M. Mangini and I. Biederman, Making the ineable explicit : estimating the information employed for face classications, Cognitive Science, vol.28, issue.2, p.209226, 2004.

J. Marozeau, A. Decheveigne, S. Mcadams, and S. Winsberg, The dependency of timbre on fundamental frequency, The Journal of the Acoustical Society of America, vol.114, issue.5, p.29462957, 2003.
DOI : 10.1121/1.1618239
URL : https://hal.archives-ouvertes.fr/tel-00008742

D. W. Massaro, Preperceptual auditory images., Journal of Experimental Psychology, vol.85, issue.3, p.411, 1970.
DOI : 10.1037/h0029712

D. W. Massaro, Effect of masking tone duration on preperceptual auditory images., Journal of Experimental Psychology, vol.87, issue.1, p.146, 1971.
DOI : 10.1037/h0030302

D. W. Massaro, Preperceptual images, processing time, and perceptual units in auditory perception., Psychological Review, vol.79, issue.2, p.124, 1972.
DOI : 10.1037/h0032264
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.468.6614

D. W. Massaro, Stimulus information vs processing time in auditory pattern recognition, Perception & Psychophysics, vol.7, issue.1, p.5056, 1972.
DOI : 10.3758/BF03212841

D. W. Massaro, Backward recognition masking, The Journal of the Acoustical Society of America, vol.58, issue.5, p.10591065, 1975.
DOI : 10.1121/1.380765

D. W. Massaro, Rate of perceptual processing, Psychological Research, vol.39, issue.3, p.277283, 1977.
DOI : 10.1007/BF00309292

S. Mcadams, Reconnaissance de sources et d'evenements sonores. Penser les sons, Psychologie cognitive de l'audition, p.157214, 1994.
URL : https://hal.archives-ouvertes.fr/hal-01105638

S. Mcadams and J. Cunible, Perception of Timbral Analogies, Philosophical Transactions of the Royal Society B: Biological Sciences, vol.336, issue.1278, p.336383389, 1278.
DOI : 10.1098/rstb.1992.0072
URL : https://hal.archives-ouvertes.fr/hal-01105548

S. Mcadams, P. Susini, N. Misdariis, and S. Winsberg, Multidimensional characterisation of perceptual and preference judgements of vehicle and environmental noises, Euro-Noise 98, 1998.
URL : https://hal.archives-ouvertes.fr/hal-01105508

R. Mcdermott and H. J. , Music perception with cochlear implants : a review. Trends in amplication, p.4982, 2004.

J. H. Mcdermott, M. Schemitsch, and E. P. Simoncelli, Summary statistics in auditory perception, Nature Neuroscience, vol.9, issue.4, p.4938, 2013.
DOI : 10.1121/1.3001672

J. H. Mcdermott and E. P. Simoncelli, Sound Texture Perception via Statistics of the Auditory Periphery: Evidence from Sound Synthesis, Neuron, vol.71, issue.5, p.7192640, 2011.
DOI : 10.1016/j.neuron.2011.06.032

R. Meddis, Simulation of mechanical to neural transduction in the auditory receptor, The Journal of the Acoustical Society of America, vol.79, issue.3, p.702711, 1986.
DOI : 10.1121/1.393460

R. Meddis, Simulation of auditory???neural transduction: Further studies, The Journal of the Acoustical Society of America, vol.83, issue.3, 1988.
DOI : 10.1121/1.396050

R. Meddis and M. J. Hewitt, Virtual pitch and phase sensitivity of a computer model of the auditory periphery. I: Pitch identification, The Journal of the Acoustical Society of America, vol.89, issue.6, p.28662882, 1991.
DOI : 10.1121/1.400725

R. Meddis and M. J. Hewitt, Virtual pitch and phase sensitivity of a computer model of the auditory periphery. II: Phase sensitivity, The Journal of the Acoustical Society of America, vol.89, issue.6, p.28832894, 1991.
DOI : 10.1121/1.400726

R. Meddis, M. J. Hewitt, and T. M. Shackleton, Implementation details of a computation model of the inner hair???cell auditory???nerve synapse, The Journal of the Acoustical Society of America, vol.87, issue.4, p.18131816, 1990.
DOI : 10.1121/1.399379

J. R. Miller and E. C. Carterette, Perceptual space for musical structures, The Journal of the Acoustical Society of America, vol.58, issue.3, 1975.
DOI : 10.1121/1.380719

A. Minard, P. Susini, N. Misdariis, G. Lemaitre, S. Mcadams et al., Environmental sound description : a meta-analysis of timbre perception, 2008.
URL : https://hal.archives-ouvertes.fr/hal-01106294

N. Misdariis, A. Minard, P. Susini, G. Lemaitre, S. Mcadams et al., Environmental sound perception : Metadescription and modeling based on independent primary studies, Speech, and Music Processing, p.126, 2010.
DOI : 10.1186/1687-4722-2010-362013
URL : https://hal.archives-ouvertes.fr/hal-00560335

M. Moerel, D. Martino, F. Formisano, and E. , Processing of Natural Sounds in Human Auditory Cortex: Tonotopy, Spectral Tuning, and Relation to Voice Sensitivity, Journal of Neuroscience, vol.32, issue.41, p.321420514216, 2012.
DOI : 10.1523/JNEUROSCI.1388-12.2012

B. C. Moore, An introduction to the psychology of hearing, 2012.

B. C. Moore, Temporal integration and context eects in hearing, Journal of Phonetics, pp.31-34, 2003.
DOI : 10.1016/s0095-4470(03)00011-1

B. B. Murdock, The serial position effect of free recall., Journal of Experimental Psychology, vol.64, issue.5, p.482, 1962.
DOI : 10.1037/h0045106

R. F. Murray, Classication images : A review, J Vis, vol.11, issue.5, 2011.
DOI : 10.1167/11.5.2

R. F. Murray and J. M. Gold, Troubles with bubbles, Vision Research, vol.44, issue.5, p.461470, 2004.
DOI : 10.1016/j.visres.2003.10.006
URL : http://doi.org/10.1016/j.visres.2003.10.006

A. Narayanan and D. Wang, Robust speech recognition from binary masks, The Journal of the Acoustical Society of America, vol.128, issue.5, 2010.
DOI : 10.1121/1.3497358

D. Navon, Forest before trees: The precedence of global features in visual perception, Cognitive Psychology, vol.9, issue.3, p.353383, 1977.
DOI : 10.1016/0010-0285(77)90012-3

I. Nelken, Processing of complex stimuli and natural scenes in the auditory cortex, Current Opinion in Neurobiology, vol.14, issue.4, p.47480, 2004.
DOI : 10.1016/j.conb.2004.06.005

I. Nelken and A. De-cheveigne, An ear for statistics, Nature Neuroscience, vol.66, issue.4, 2013.
DOI : 10.1038/nn.3360

I. Nelken, Y. Rotman, and O. B. Yosef, Responses of auditory-cortex neurons to structural features of natural sounds, Nature, issue.6715, p.397154157, 1999.

P. Neri and D. J. Heeger, Spatiotemporal mechanisms for detecting and identifying image features in human vision, Nature Neuroscience, vol.5, issue.8, p.8126, 2002.
DOI : 10.1038/nn886

S. Norman-haignere, N. G. Kanwisher, and J. H. Mcdermott, Distinct Cortical Pathways for Music and Speech Revealed by Hypothesis-Free Voxel Decomposition, Neuron, vol.88, issue.6, p.8812811296, 2015.
DOI : 10.1016/j.neuron.2015.11.035
URL : http://www.ncbi.nlm.nih.gov/pmc/articles/PMC4740977

F. Occelli, C. Suied, D. Pressnitzer, J. M. Edeline, and B. Gourevitch, A Neural Substrate for Rapid Timbre Recognition? Neural and Behavioral Discrimination of Very Brief Acoustic Vowels, Cerebral Cortex, vol.26, issue.6, 2015.
DOI : 10.1093/cercor/bhv071
URL : https://hal.archives-ouvertes.fr/hal-01165596

O. Connor, K. N. Petkov, C. I. Sutter, and M. L. , Adaptive Stimulus Optimization for Auditory Cortical Neurons, Journal of Neurophysiology, vol.94, issue.6, pp.4051-4067, 2005.
DOI : 10.1152/jn.00046.2005

A. Oliva and P. G. Schyns, Coarse blobs or ne edges ? Evidence that information diagnosticity changes the perception of complex visual stimuli, Cognitive psychology, vol.34, p.72107, 1997.

A. Oliva and A. Torralba, Chapter 2 Building the gist of a scene: the role of global image features in recognition, Progress in brain research, vol.155, p.2336, 2006.
DOI : 10.1016/S0079-6123(06)55002-2

B. A. Olshausen and D. J. Field, Emergence of simple-cell receptive eld properties by learning a sparse code for natural images, NATURE, vol.381, p.13, 1996.

B. A. Olshausen and D. J. Field, Sparse coding with an overcomplete basis set : A strategy employed by v1 ? Vision research, p.33113325, 1997.
DOI : 10.1016/s0042-6989(97)00169-7
URL : http://doi.org/10.1016/s0042-6989(97)00169-7

B. A. Olshausen and D. J. Field, Sparse coding of sensory inputs, Current Opinion in Neurobiology, vol.14, issue.4, p.481487, 2004.
DOI : 10.1016/j.conb.2004.07.007

B. A. Olshausen and K. N. O-'connor, A new window on sound, Nature Neuroscience, vol.5, issue.4, p.292294, 2002.
DOI : 10.1038/nn0402-292

R. Overath, T. Kumar, S. Stewart, L. Von-kriegstein, K. Cusack et al., Cortical Mechanisms for the Segregation and Representation of Acoustic Textures, Journal of Neuroscience, vol.30, issue.6, p.3020706, 2010.
DOI : 10.1523/JNEUROSCI.5378-09.2010

E. Parizet, E. Guyader, and V. Nosulenko, Analysis of car door closing sound quality, Applied Acoustics, vol.69, issue.1, p.1222, 2008.
DOI : 10.1016/j.apacoust.2006.09.004
URL : https://hal.archives-ouvertes.fr/hal-00849046

K. Patil, D. Pressnitzer, S. Shamma, and M. Elhilali, Music in Our Ears: The Biological Bases of Musical Timbre Perception, PLoS Computational Biology, vol.9, issue.11, p.1002759, 2012.
DOI : 10.1371/journal.pcbi.1002759.t002

R. D. Patterson, Auditory images. How complex sounds are represented in the auditory system., THE JOURNAL OF THE ACOUSTICAL SOCIETY OF JAPAN (E), vol.21, issue.4, p.183, 2000.
DOI : 10.1250/ast.21.183

R. D. Patterson, M. H. Allerhand, and C. Giguere, Time???domain modeling of peripheral auditory processing: A modular architecture and a software platform, The Journal of the Acoustical Society of America, vol.98, issue.4, p.18901894, 1995.
DOI : 10.1121/1.414456

R. D. Patterson, K. Robinson, J. Holdsworth, D. Mckeown, C. Zhang et al., Complex sounds and auditory images. Auditory physiology and perception, p.429446, 1992.
DOI : 10.1016/b978-0-08-041847-6.50054-x
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.333.2780

G. Peeters, A large set of audio features for sound description (similarity and classication) in the cuidado project, 2004.

G. E. Peterson, The signicance of various portions of the wave length in the minimum duration necessary for the recognition of vowel sounds. Unpublished doctoral dissertation, 1939.

D. Russo and F. , Separate neural systems for processing action-or nonaction-related sounds, Neuroimage, vol.24, issue.3, p.852861, 2005.

R. Plomp and R. , Timbre as a multidimensional attribute of complex tones. Frequency analysis and periodicity detection in hearing, p.397414, 1970.

R. Plomp, Auditory analysis and timbre perception. Auditory analysis and perception of speech, p.722, 1975.
DOI : 10.1016/b978-0-12-248550-3.50005-2

R. Plomp, Fysikaliska motsvarigheter till klanfarg hos stationara ljud. Var horsel och musiken, 1979.

R. Plomp and H. Steeneken, Eect of phase on the timbre of complex tones, 1969.

M. D. Plumbley, T. Blumensath, L. Daudet, R. Gribonval, and M. E. Davies, Sparse Representations in Audio and Music: From Coding to Source Separation, Proceedings of the IEEE, p.9951005, 2010.
DOI : 10.1109/JPROC.2009.2030345
URL : https://hal.archives-ouvertes.fr/inria-00489524

E. Ponsot, P. Susini, and S. Meunier, A robust asymmetry in loudness between rising-and falling-intensity tones, Atten Percept Psychophys, vol.77907, issue.20, 2015.
DOI : 10.3758/s13414-014-0824-y
URL : https://hal.archives-ouvertes.fr/hal-01228813

M. I. Posner, Chronometric explorations of mind, 1978.

M. C. Potter, Short-term conceptual memory for pictures., Journal of Experimental Psychology: Human Learning & Memory, vol.2, issue.5, p.509, 1976.
DOI : 10.1037/0278-7393.2.5.509
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.323.9453

R. L. Powell and O. Tosi, Vowel Recognition Threshold as a Function of Temporal Segmentations, Journal of Speech Language and Hearing Research, vol.13, issue.4, p.715724, 1970.
DOI : 10.1044/jshr.1304.715

D. Pressnitzer, T. Agus, and C. Suied, Acoustic Timbre Recognition, p.16, 2013.
DOI : 10.1007/978-1-4614-6675-8_98
URL : https://hal.archives-ouvertes.fr/hal-01165596

D. Pressnitzer, R. D. Patterson, and K. Krumbholz, The lower limit of melodic pitch, The Journal of the Acoustical Society of America, vol.109, issue.5, pp.2074-2084, 2001.
DOI : 10.1121/1.1359797
URL : https://hal.archives-ouvertes.fr/hal-01105693

Y. Qi and R. E. Hillman, Temporal and spectral estimations of harmonicsto-noise ratio in human voice signals, The Journal of the Acoustical Society of America, vol.102, issue.1, p.537543, 1997.

F. Ramus, Dyslexia: Talk of two theories, Nature, vol.42, issue.6845, p.412393395, 2001.
DOI : 10.1038/35086683
URL : https://hal.archives-ouvertes.fr/hal-00242898

F. Ramus, S. White, and U. Frith, Weighing the evidence between competing theories of dyslexia, Developmental Science, vol.682, issue.3, p.265269, 2006.
DOI : 10.1111/j.1467-9817.2004.00223.x

R. Rasch and R. Plomp, The perception of musical tones. The psychology of music, p.89112, 1982.

J. P. Rauschecker, Cortical processing of complex sounds, Current Opinion in Neurobiology, vol.8, issue.4, p.516521, 1998.
DOI : 10.1016/S0959-4388(98)80040-8

J. P. Rauschecker and S. K. Scott, Maps and streams in the auditory cortex: nonhuman primates illuminate human speech processing, Nature Neuroscience, vol.168, issue.6, p.12718724, 2009.
DOI : 10.1038/nn.2331

J. P. Rauschecker and B. Tian, Mechanisms and streams for processing of "what" and "where" in auditory cortex, Proceedings of the National Academy of Sciences, p.971180011806, 2000.
DOI : 10.1038/35002078

J. E. Raymond, K. L. Shapiro, and K. M. Arnell, Temporary suppression of visual processing in an rsvp task : An attentional blink ? Journal of experimental psychology : Human perception and performance, p.849, 1992.

C. Redies, A universal model of esthetic perception based on the sensory coding of natural stimuli, Spatial vision, vol.21, issue.1, p.97117, 2007.

R. E. Remez, J. S. Pardo, R. L. Piorkowski, and P. E. Rubin, On the Bistability of Sine Wave Analogues of Speech, Psychological Science, vol.8, issue.1, pp.24-29, 2001.
DOI : 10.1111/1467-9280.00305

R. E. Remez, P. E. Rubin, D. B. Pisoni, and T. D. Carrell, Speech perception without traditional speech cues, Science, vol.212, issue.4497, p.212947949, 1981.
DOI : 10.1126/science.7233191

R. E. Remez and E. F. Thomas, Early recognition of speech, Wiley Interdisciplinary Reviews: Cognitive Science, vol.101, issue.2, p.213223, 2013.
DOI : 10.1002/wcs.1213

K. Robinson and R. D. Patterson, The Duration Required to Identify the Instrument, the Octave, or the Pitch Chroma of a Musical Note, Music Perception: An Interdisciplinary Journal, vol.13, issue.1, p.115, 1995.
DOI : 10.2307/40285682

K. Robinson and R. D. Patterson, The stimulus duration required to identify vowels, their octave, and their pitch chroma, The Journal of the Acoustical Society of America, vol.98, issue.4, p.18581865, 1995.
DOI : 10.1121/1.414405

D. Rocchesso, D. A. Mauro, and S. D. Monache, miMic, Proceedings of the TEI '16: Tenth International Conference on Tangible, Embedded, and Embodied Interaction, TEI '16, p.357364, 2016.
DOI : 10.1145/2839462.2839467

E. T. Rolls and M. J. Tovee, Sparseness of the neuronal representation of stimuli in the primate temporal visual cortex, Journal of Neurophysiology, vol.73, issue.2, p.713726, 1995.

E. T. Rolls, M. J. Tovee, and S. Panzeri, The Neurophysiology of Backward Visual Masking: Information Analysis, Journal of Cognitive Neuroscience, vol.71, issue.3, p.300311, 1999.
DOI : 10.1016/S0301-0082(96)00054-8

L. M. Romanski, B. Tian, J. Fritz, M. Mishkin, P. S. Goldman-rakic et al., Dual streams of auditory aerents target multiple domains in the primate prefrontal cortex, Nature neuroscience, vol.2, issue.12, p.11311136, 1999.

E. Rosch, C. Mervis, W. Gray, D. Johnson, and P. Boyes-braem, Basic objects in natural categories, Cognitive Psychology, vol.8, p.382439, 1976.
DOI : 10.1037/e666602011-017
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.149.3392

H. Sakoe and S. Chiba, Dynamic programming algorithm optimization for spoken word recognition, IEEE transactions on acoustics, speech, and signal processing, vol.26, issue.1, p.4349, 1978.

S. Samson, R. J. Zatorre, and J. O. Ramsay, Multidimensional scaling of synthetic musical timbre : Perception of spectral and temporal characteristics, 1997.

R. Santoro, R. Moerel, M. , D. Martino, F. Goebel et al., Encoding of Natural Sounds at Multiple Spectral and Temporal Resolutions in the Human Auditory Cortex, PLoS Computational Biology, vol.83, issue.270, p.1003412, 2014.
DOI : 10.1371/journal.pcbi.1003412.s009

J. Schwartz and P. Tallal, Rate of acoustic change may underlie hemispheric specialization for speech perception, Science, vol.207, issue.4437, p.13801381, 1980.
DOI : 10.1126/science.7355297

O. Schwartz and E. P. Simoncelli, Natural signal statistics and sensory gain control, Nature neuroscience, vol.4, issue.8, p.819825, 2001.

P. G. Schyns, L. Bonnar, and F. Gosselin, Show Me the Features! Understanding Recognition From the Use of Visual Information, Psychological Science, vol.13, issue.5, p.402409, 2002.
DOI : 10.1111/1467-9280.00472

H. Scurto, G. Lemaitre, J. Francoise, F. Voisin, F. Bevilacqua et al., Combining gestures and vocalizations to imitate sounds, The Journal of the Acoustical Society of America, vol.138, issue.3, p.17801780, 2015.
DOI : 10.1121/1.4933639
URL : https://hal.archives-ouvertes.fr/hal-01255934

S. Sene, A joint synchrony/mean-rate model of auditory speech processing, Journal of Phonetics, vol.16, p.5576, 1988.

K. J. Seymour, S. Mcdonald, J. Cliord, and C. W. , Failure of colour and contrast polarity identication at threshold for detection of motion and global form, Vision Res, issue.12, p.4915928, 2009.

S. Shamma, On the role of space and time in auditory processing, Trends in Cognitive Sciences, vol.5, issue.8, p.340348, 2001.
DOI : 10.1016/S1364-6613(00)01704-6

S. A. Shamma, Speech processing in the auditory system I: The representation of speech sounds in the responses of the auditory nerve, The Journal of the Acoustical Society of America, vol.78, issue.5, p.7816121621, 1985.
DOI : 10.1121/1.392799

S. A. Shamma, Speech processing in the auditory system II: Lateral inhibition and the central processing of speech evoked activity in the auditory nerve, The Journal of the Acoustical Society of America, vol.78, issue.5, p.7816221632, 1985.
DOI : 10.1121/1.392800

A. J. Simpson, M. J. Terrell, and J. D. Reiss, A practical step-by-step guide to the time-varying loudness model of Moore, 1997.

N. C. Singh and F. E. Theunissen, Modulation spectra of natural sounds and ethological theories of auditory processing, The Journal of the Acoustical Society of America, vol.114, issue.6, p.33943411, 2003.
DOI : 10.1121/1.1624067

M. Slaney, Auditory toolbox. Interval Research Corporation, 1998.

M. Slaney and R. F. Lyon, Apple hearing demo reel, 1991.

E. C. Smith and M. S. Lewicki, Ecient auditory coding, Nature, issue.7079, p.43997882, 2006.

J. E. Smith, Simple algorithms for m-alternative forced-choice calculations . Attention, Perception, Psychophysics, issue.1, p.319596, 1982.
DOI : 10.3758/bf03206208

M. L. Smith, G. W. Cottrell, F. Gosselin, and P. G. Schyns, Transmitting and Decoding Facial Expressions, Psychological Science, vol.17, issue.3, p.1849, 2005.
DOI : 10.1016/j.cub.2003.09.038
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.517.617

R. Smith, Z. M. Delgutte, B. Oxenham, and A. J. , Chimaeric sounds reveal dichotomies in auditory perception, Nature, vol.9, issue.6876, p.4168790, 2002.
DOI : 10.1038/416087a
URL : http://www.ncbi.nlm.nih.gov/pmc/articles/PMC2268248

H. Stanislaw and N. Todorov, Calculation of signal detection theory measures . Behavior research methods, instruments, & computers, p.31137149, 1999.

S. Subramaniam, I. Biederman, and S. Madigan, Accurate identication but no priming and chance recognition memory for pictures in rsvp sequences, 2000.

C. Y. Suen and M. P. Beddoes, Discrimination of vowel sounds of very short duration, Perception & Psychophysics, vol.18, issue.6, p.417419, 1972.
DOI : 10.3758/BF03206282

N. Suga, Philosophy and stimulus design for neuroethology of complexsound processing, Philosophical Transactions of the Royal Society B : Biological Sciences, p.336423428, 1278.

N. Suga, Y. Zhang, and J. Yan, Sharpening of frequency tuning by inhibition in the thalamic auditory nucleus of the mustached bat, Journal of Neurophysiology, vol.77, issue.4, p.20982114, 1997.

C. Suied, T. R. Agus, S. J. Thorpe, and D. Pressnitzer, Processing of Short Auditory Stimuli: The Rapid Audio Sequential Presentation Paradigm (RASP), p.443451, 2013.
DOI : 10.1007/978-1-4614-1590-9_49
URL : https://hal.archives-ouvertes.fr/hal-00839051

C. Suied, A. Dremeau, D. Pressnitzer, and L. Daudet, Auditory Sketches: Sparse Representations of Sounds Based on Perceptual Models, p.154170, 2013.
DOI : 10.1007/978-3-642-41248-6_9

R. Suied, C. Susini, P. Mcadams, S. Patterson, and R. D. , Why are natural sounds detected faster than pips?, The Journal of the Acoustical Society of America, vol.127, issue.3, p.10510, 2010.
DOI : 10.1121/1.3310196
URL : https://hal.archives-ouvertes.fr/hal-01106520

P. Susini, S. Mcadams, and S. Winsberg, Caracterisation perceptive des bruits de vehicules, CFA : Congres Francais d'Acoustique, p.543546, 1997.
URL : https://hal.archives-ouvertes.fr/hal-01105462

P. Susini, S. Mcadams, and S. Winsberg, A multidimensional technique for sound quality assessment, Acta acustica united with Acustica, vol.85, issue.5, p.650656, 1999.
URL : https://hal.archives-ouvertes.fr/hal-01105599

P. Susini, N. Misdariis, S. Winsberg, and S. Mcadams, Caracterisation perceptive de bruits, Acoustique et Techniques, vol.13, issue.4, p.1115, 1998.
URL : https://hal.archives-ouvertes.fr/hal-01105462

P. Tallal, A dierent view of "auditory processing factors in language disorders, Journal of Speech and Hearing Disorders, vol.40, issue.3, p.413414, 1975.

P. Tallal, Auditory temporal perception, phonics, and reading disabilities in children, Brain and Language, vol.9, issue.2, p.182198, 1980.
DOI : 10.1016/0093-934X(80)90139-X

P. Tallal, Temporal or phonetic processing decit in dyslexia ? That is the question, Applied Psycholinguistics, vol.5, issue.02, p.167169, 1984.

P. Tallal, Opinion: Improving language and literacy is a matter of time, Nature Reviews Neuroscience, vol.43, issue.9, p.721728, 2004.
DOI : 10.1016/S0301-0511(00)00052-1

P. Tallal and M. Piercy, Defects of Non-Verbal Auditory Perception in Children with Developmental Aphasia, Nature, vol.7, issue.5390, 1973.
DOI : 10.1038/241468a0

R. Tallal, P. Piercy, and M. , Developmental aphasia: Rate of auditory processing and selective impairment of consonant perception, Neuropsychologia, vol.12, issue.1, p.8393, 1974.
DOI : 10.1016/0028-3932(74)90030-X

F. E. Theunissen and J. E. Elie, Neural processing of natural sounds, Nature Reviews Neuroscience, vol.59, issue.6, p.35566, 2014.
DOI : 10.1523/JNEUROSCI.2042-08.2009

F. E. Theunissen, K. Sen, and A. J. Doupe, Spectral-temporal receptive elds of nonlinear auditory neurons obtained using natural sounds, The Journal of Neuroscience, vol.20, issue.6, p.23152331, 2000.

S. Thorpe, D. Fize, and C. Marlot, Speed of processing in the human visual system, nature, issue.6582, p.381520522, 1996.

B. Tian, D. Reser, A. Durham, A. Kustov, and J. P. Rauschecker, Functional Specialization in Rhesus Monkey Auditory Cortex, Science, vol.292, issue.5515, p.292290, 2001.
DOI : 10.1126/science.1058911

S. Tremblay, F. Vachon, and D. M. Jones, Attentional and perceptual sources of the auditory attentional blink, Perception & Psychophysics, vol.27, issue.2, p.195208, 2005.
DOI : 10.3758/BF03206484

T. Tsuchida and G. W. Cottrell, Auditory saliency using natural statistics, 2012.

F. Vachon, S. Tremblay, R. W. Hughes, and D. M. Jones, Capturing and Unmasking the Mask in the Auditory Attentional Blink, Experimental Psychology, vol.57, issue.5, pp.346-53, 2010.
DOI : 10.1027/1618-3169/a000041

L. Varnet, K. Knoblauch, F. Meunier, and M. Hoen, Using auditory classication images for the identication of ne acoustic cues used in speech perception, Front Hum Neurosci, vol.7, p.865, 2013.

R. Venezia, J. H. Hickok, G. Richards, and V. M. , Auditory ???bubbles???: Efficient classification of the spectrotemporal modulations essential for speech intelligibility, The Journal of the Acoustical Society of America, vol.140, issue.2, pp.1072-1088, 2016.
DOI : 10.1121/1.4960544

W. E. Vinje and J. L. Gallant, Sparse Coding and Decorrelation in Primary Visual Cortex During Natural Vision, Science, vol.287, issue.5456, p.28712731276, 2000.
DOI : 10.1126/science.287.5456.1273
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.456.2467

M. S. Vitevitch, Change deafness: The inability to detect changes between two voices., Journal of Experimental Psychology: Human Perception and Performance, vol.29, issue.2, p.333, 2003.
DOI : 10.1037/0096-1523.29.2.333
URL : http://www.ncbi.nlm.nih.gov/pmc/articles/PMC2553696

R. F. Voss and J. Clarke, ???1/fnoise??? in music and speech, Nature, vol.56, issue.5533, pp.317-318, 1975.
DOI : 10.1038/258317a0

R. F. Voss and J. Clarke, ??????1/f noise?????? in music: Music from 1/f noise, The Journal of the Acoustical Society of America, vol.63, issue.1, p.258263, 1978.
DOI : 10.1121/1.381721

K. M. Walker, J. K. Bizley, A. J. King, and J. W. Schnupp, Multiplexed and Robust Representations of Sound Features in Auditory Cortex, Journal of Neuroscience, vol.31, issue.41, p.311456514576, 2011.
DOI : 10.1523/JNEUROSCI.2074-11.2011

D. Wang, On Ideal Binary Mask As the Computational Goal of Auditory Scene Analysis, p.181197, 2005.
DOI : 10.1007/0-387-22794-6_12

D. Wang, U. Kjems, M. S. Pedersen, J. B. Boldt, and T. Lunner, Speech perception of noise with binary gains, The Journal of the Acoustical Society of America, vol.124, issue.4, p.23032307, 2008.

K. Wang and S. Shamma, Self-normalization and noise-robustness in early auditory representations, IEEE transactions on speech and audio processing, p.421435, 1994.

J. D. Warren and T. D. Griths, Distinct mechanisms for processing spatial sequences and pitch sequences in the human auditory brain, The journal of neuroscience, vol.23, issue.13, p.57995804, 2003.

J. D. Warren, S. K. Scott, C. J. Price, and T. D. Griths, Human brain mechanisms for the early analysis of voices, NeuroImage, vol.31, issue.3, p.31138997, 2006.
DOI : 10.1016/j.neuroimage.2006.01.034

L. Wedin and G. Goude, DIMENSION ANALYSIS OF THE PERCEPTION OF INSTRUMENTAL TIMBRE, Scandinavian Journal of Psychology, vol.10, issue.1, p.228240, 1972.
DOI : 10.1121/1.1909632

D. L. Wessel, Psychoacoustics and music : A report from michigan state university, PACE : Bulletin of the Computer Arts Society, vol.30, p.12, 1973.

D. L. Wessel, Timbre Space as a Musical Control Structure, Computer Music Journal, vol.3, issue.2, p.4552, 1979.
DOI : 10.2307/3680283

S. Winsberg and J. D. Carroll, A quasi-nonmetric method for multidimensional scaling VIA an extended euclidean model, Psychometrika, vol.48, issue.2, p.217229, 1989.
DOI : 10.1007/BF02294516

S. Winsberg and G. De-soete, A latent class approach to tting the weighted euclidean model, clascal, Psychometrika, vol.58, issue.2, p.315330, 1993.

D. L. Woods and C. Alain, Feature processing during high-rate auditory selective attention, Perception & Psychophysics, vol.15, issue.4, p.391402, 1993.
DOI : 10.3758/BF03206782

D. L. Woods, C. Alain, D. Covarrubias, and O. Zaidel, Frequencyrelated dierences in the speed of human auditory processing, Hearing research, vol.66, issue.1, p.4652, 1993.

S. M. Woolley, T. E. Fremouw, A. Hsu, and F. E. Theunissen, Tuning for spectro-temporal modulations as a mechanism for auditory discrimination of natural sounds, Nature Neuroscience, vol.15, issue.10, p.13719, 2005.
DOI : 10.1038/nn831

X. Yang, K. Wang, and S. A. Shamma, Auditory representations of acoustic signals. Information Theory, IEEE Transactions on, vol.38, issue.2, p.824839, 1992.

W. A. Yost, Psychoacoustics : A brief historical overview, p.4653, 2015.

M. P. Young and S. Yamane, Sparse population coding of faces in the inferotemporal cortex, Science, vol.256, issue.5061, p.25613271331, 1992.
DOI : 10.1126/science.1598577

E. W. Références-yund, A. Uno, and D. L. Woods, Preattentive control of serial auditory processing in dichotic listening, Brain and language, vol.66, issue.3, p.358376, 1999.

R. J. Zatorre, M. Bouard, and P. Belin, Sensitivity to Auditory Object Features in Human Temporal Neocortex, Journal of Neuroscience, vol.24, issue.14, pp.243637-3642, 2004.
DOI : 10.1523/JNEUROSCI.5458-03.2004

Y. Zhang, N. Suga, and J. Yan, Corticofugal modulation of frequency processing in bat auditory system, Nature, issue.6636, p.387900903, 1997.

E. Zwicker and B. Scharf, A model of loudness summation., Psychological Review, vol.72, issue.1, p.3, 1965.
DOI : 10.1037/h0021703