F. Janet, . Werker, C. Richard, and . Tees, Influences on infant speech processing: Toward a new synthesis Annual review of psychology 50, Bibliography, vol.1, pp.509-535, 1999.

K. Patricia, E. Kuhl, A. Stevens, T. Hayashi, S. Deguchi et al., Infants show a facilitation effect for native language phonetic perception between 6 and 12 months, Developmental science 9, pp.13-21, 2006.

B. De-boer, K. Patricia, and . Kuhl, Investigating the role of infant-directed speech with a computer model, Acoustics Research Letters Online, vol.4, issue.4, pp.129-134, 2003.
DOI : 10.1121/1.1613311

H. Michael and . Coen, Self-supervised acquisition of vowels in American English, Proc. National Conference On Artificial Intelligence, 2006.

B. Gauthier, R. Shi, and Y. Xu, Learning phonetic categories by tracking movements, Cognition, vol.103, issue.1, pp.80-106, 2007.
DOI : 10.1016/j.cognition.2006.03.002
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.126.4801

K. Gautam, . Vallabha, L. James, F. Mcclelland, . Pons et al., Unsupervised learning of vowel categories from infant-directed speech, Proceedings of the National Academy of Sciences, pp.13273-13278, 2007.

K. Miyazawa, H. Kikuchi, and R. Mazuka, Unsupervised learning of vowels from continuous speech based on self-organized phoneme acquisition model, Proc. INTERSPEECH, 2010.

K. Miyazawa, H. Miura, H. Kikuchi, and R. Mazuka, The Multi Timescale Phoneme Acquisition Model of the Self-Organizing Based on the Dynamic Features, Proc. INTERSPEECH, 2011.

F. Adriaans and D. Swingley, Distributional learning of vowel categories is supported by prosody in infant-directed speech, Proc. CogSci. 2012 (cit

C. Jones, F. Meakins, and S. Muawiyath, Learning Vowel Categories From Maternal Speech in Gurindji Kriol, Language Learning, vol.103, issue.4, pp.1052-1078, 2012.
DOI : 10.1111/j.1467-9922.2012.00725.x

B. Dillon, E. Dunbar, and W. Idsardi, A Single-Stage Approach to Learning Phonological Categories: Insights From Inuktitut, Cognitive Science, vol.107, issue.2-3, pp.344-377, 2013.
DOI : 10.1111/cogs.12008

H. Naomi, . Feldman, L. Thomas, S. Griffiths, . Goldwater et al., A role for the developing lexicon in phonetic category acquisition, In: Psychological review, vol.1204, p.751, 2013.

H. Rasilo, O. Räsänen, K. Unto, and . Laine, Feedback and imitation by a caregiver guides a virtual infant to learn native phonemes and the skill of speech inversion, Speech Communication, vol.55, issue.9, pp.909-931, 2013.
DOI : 10.1016/j.specom.2013.05.002

S. Frank, N. Feldman, and S. Goldwater, Weak semantic context helps phonetic learning in a model of infant language acquisition, Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
DOI : 10.3115/v1/P14-1101

M. David, . Green, A. John, and . Swets, Signal detection theory and psychophysics, 1966.

S. Boucheron, O. Bousquet, and G. Lugosi, Theory of Classification: a Survey of Some Recent Advances, ESAIM: probability and statistics 9, pp.323-375, 2005.
DOI : 10.1051/ps:2005018
URL : https://hal.archives-ouvertes.fr/hal-00017923

U. Von-luxburg, M. Belkin, and O. Bousquet, Consistency of spectral clustering, The Annals of Statistics, vol.36, issue.2, pp.555-586, 2008.
DOI : 10.1214/009053607000000640

P. Mermelstein, Distance measures for speech recognition, psychological and instrumental, Pattern recognition and artificial intelligence, vol.116, issue.78, pp.91-103, 1976.

K. Taras and . Vintsyuk, Speech discrimination by dynamic programming, Cybernetics and Systems Analysis, vol.41, issue.133, pp.52-57, 1968.

B. Douglas, J. M. Paul, and . Baker, The design for the Wall Street Journal-based CSR corpus, Proc. Workshop on Speech and Natural Language, pp.357-362, 1992.

J. Trevor, R. J. Hastie, . Tibshirani, H. Jerome, and . Friedman, The elements of statistical learning: data mining, inference, and prediction, 2009.

J. Shi and J. Malik, Normalized cuts and image segmentation, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.228, issue.32, pp.888-905, 2000.

K. Hamid and . Seifoddini, Single linkage versus average linkage clustering in machine cells formation applications, In: Computers & Industrial Engineering, vol.163, pp.419-426, 1989.

L. Hubert and P. Arabie, Comparing partitions, Journal of Classification, vol.78, issue.1, pp.193-218, 1985.
DOI : 10.1007/BF01908075

T. Zhang, Statistical behavior and consistency of classification methods based on convex risk minimization, The Annals of Statistics, vol.32, issue.1, pp.56-85, 2004.
DOI : 10.1214/aos/1079120130

D. Arthur and S. Vassilvitskii, k-means++: The advantages of careful seeding, Proc. ACM-SIAM symposium on discrete algorithms, 2007.

F. Cérou and A. Guyader, Nearest neighbor classification in infinite dimension, ESAIM: Probability and Statistics, vol.10, pp.340-355, 2006.
DOI : 10.1051/ps:2006014

S. Kullback, A. Richard, and . Leibler, On Information and Sufficiency, The annals of mathematical statistics, pp.79-86, 1951.
DOI : 10.1214/aoms/1177729694

M. Mohri, A. Rostamizadeh, and A. Talwalkar, Foundations of machine learning, p.2012

D. David and . Lewis, Naive (Bayes) at Forty: The Independence Assumption in Information Retrieval, Proc. European Conference on Machine Learning, 1998.

L. Devroye, L. Györfi, and G. Lugosi, A Probabilistic Theory of Pattern Recognition, 1996.
DOI : 10.1007/978-1-4612-0711-5

O. Shamir and N. Tishby, Stability and model selection in k-means clustering, Machine learning 80, pp.2-3, 2010.
DOI : 10.1007/s10994-010-5177-8

L. Ulrike-von, Clustering Stability: An Overview, Machine Learning, vol.23, pp.235-274, 2009.

A. Michael, S. Carlin, A. Thomas, H. Jansen, and . Hermansky, Rapid Evaluation of Speech Representations for Spoken Term Discovery, Proc. INTERSPEECH. 2011 (cit. on pp. 55, pp.65-68

T. Schatz, V. Peddinti, F. Bach, A. Jansen, H. Hermansky et al., Evaluating speech features with the Minimal-Pair ABX task: Analysis of the classical MFC/PLP pipeline, Proc. INTERSPEECH. 2013 (cit. on pp. 55, pp.68-71
URL : https://hal.archives-ouvertes.fr/hal-00918599

J. Davis and M. Goadrich, The relationship between Precision-Recall and ROC curves, Proceedings of the 23rd international conference on Machine learning , ICML '06, 2006.
DOI : 10.1145/1143844.1143874
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.98.4362

T. Schatz, X. Cao, A. Kolesnikova, T. Bergvelt, J. Wright et al., Articulation Index LSCP LDC2015S12

M. Versteegh, R. Thiolliere, T. Schatz, X. N. Cao, X. Anguera et al., The zero resource speech challenge, Proc. INTERSPEECH. 2015 (cit, pp.58-151, 2015.
DOI : 10.1016/j.procs.2016.04.031

E. Kotsoni, M. De-haan, H. Mark, and . Johnson, Categorical Perception of Facial Expressions by 7-Month-Old Infants, Perception, vol.48, issue.1, pp.1115-1125, 2001.
DOI : 10.1068/p3155

P. Scott, . Johnson, E. Erin, and . Hannon, Handbook of child psychology and developmental science Perceptual development. Seventh, p.2015

M. Jean, L. Mandler, and . Mcdonough, Concept formation in infancy, Cognitive development, vol.83, pp.291-318, 1993.

M. Jean, L. Mandler, and . Mcdonough, On developing a knowledge base in infancy, In: Developmental psychology, vol.346, p.1274, 1998.

C. Paul, . Quinn, H. Mark, and . Johnson, Global-Before-Basic Object Categorization in Connectionist Networks and 2-Month-Old Infants, In: Infancy, vol.1, issue.1, pp.31-46, 2000.

B. Elsner, S. Jeschonek, and S. Pauen, Event-related potentials for 7-month-olds??? processing of animals and furniture items, Developmental Cognitive Neuroscience, vol.3, pp.53-60, 2013.
DOI : 10.1016/j.dcn.2012.09.002

M. Molina, G. A. Van-de-walle, K. Condry, and E. S. Spelke, The Animate-Inanimate Distinction in Infancy: Developing Sensitivity to Constraints on Human Actions, Journal of Cognition and Development, vol.83, issue.4, pp.399-426, 2004.
DOI : 10.1016/S0163-6383(99)00007-7

D. Poulin-dubois, A. Lepage, and D. Ferland, Infants' concept of animacy, Cognitive Development, vol.11, issue.1, pp.19-36, 1996.
DOI : 10.1016/S0885-2014(96)90026-X

A. Aguiar and R. Baillargeon, 2.5-Month-Old Infants' Reasoning about When Objects Should and Should Not Be Occluded, Cognitive psychology 39, pp.116-157, 1999.
DOI : 10.1006/cogp.1999.0717
URL : https://hal.archives-ouvertes.fr/hal-01281646

A. Aguiar and R. Baillargeon, Developments in young infants' reasoning about occluded objects, Cognitive psychology 45, pp.267-336, 2002.
DOI : 10.1016/S0010-0285(02)00005-1

Y. Luo and R. Baillargeon, When the ordinary seems unexpected: evidence for incremental physical knowledge in young infants, Cognition 95, pp.297-328, 2005.
DOI : 10.1016/j.cognition.2004.01.010

J. Susan, A. L. Hespos, . Ferry, J. Lance, and . Rips, Five-month-old infants have different expectations for solids and liquids, Psychological Science, vol.205, pp.603-611, 2009.

V. Izard, C. Sann, S. Elizabeth, A. Spelke, and . Streri, Newborn infants perceive abstract numbers, Proceedings of the National Academy of Sciences, pp.10382-10385, 2009.
DOI : 10.1073/pnas.0812142106
URL : http://www.ncbi.nlm.nih.gov/pmc/articles/PMC2700913

Y. Lecun, Y. Bengio, and G. Hinton, Deep learning, Nature, vol.9, issue.7553, pp.436-444, 2015.
DOI : 10.1007/s10994-013-5335-x

M. Jordan and T. Mitchell, Machine learning: Trends, perspectives, and prospects, Science, vol.349, issue.6245, pp.255-260, 2015.
DOI : 10.1126/science.aaa8415

A. Jansen, E. Dupoux, S. Goldwater, M. Johnson, S. Khudanpur et al., A summary of the 2012 JH CLSP Workshop on zero resource speech technologies and models of early language acquisition

P. Kuhl and A. Meltzoff, The bimodal perception of speech in infancy, Science, vol.218, issue.4577, pp.1138-1141, 1982.
DOI : 10.1126/science.7146899

L. Michelle, J. F. Patterson, and . Werker, Two-month-old infants match phonetic information in lips and voice, In: Developmental Science, vol.62, pp.191-196, 2003.

J. Maye, J. F. Werker, and L. Gerken, Infant sensitivity to distributional information can affect phonetic discrimination, Cognition, vol.82, issue.3, pp.101-111, 2002.
DOI : 10.1016/S0010-0277(01)00157-3

J. Maye, J. Daniel, . Weiss, N. Richard, and . Aslin, Statistical phonetic learning in infants: facilitation and feature generalization, Developmental Science, vol.37, issue.Supplement 1, pp.122-134, 2008.
DOI : 10.1111/j.1467-7687.2007.00653.x
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.130.1984

A. Katherine, F. Yoshida, J. Pons, J. F. Maye, and . Werker, Distributional phonetic learning at 10 months of age, pp.420-433, 2010.

T. Teinonen, N. Richard, P. Aslin, G. Alku, and . Csibra, Visual speech contributes to phonetic learning in 6-month-old infants, Cognition, vol.108, issue.3, pp.850-855, 2008.
DOI : 10.1016/j.cognition.2008.05.009

A. Cristia, Fine-grained variation in caregivers??? /s/ predicts their infants??? /s/ category, The Journal of the Acoustical Society of America, vol.129, issue.5, pp.3271-3280, 2011.
DOI : 10.1121/1.3562562

J. Emily, J. S. Jones, and . Herbert, The effect of learning experiences and context on infant imitation and generalization, In: Infancy, vol.136, pp.596-619, 2008.

M. Derek, . Houston, W. Peter, and . Jusczyk, Infants' long-term memory for the sound patterns of words and voices, In: Journal of Experimental Psychology: Human Perception and Performance, vol.296, p.1143, 2003.

H. Henny, Y. , and J. F. Werker, Learning words' sounds before learning how words sound: 9-month-olds use distinct objects as cues to categorize speech information, Cognition, vol.1132, pp.234-243, 2009.

N. Mani and S. Schneider, Speaker identity supports phonetic category learning., Journal of Experimental Psychology: Human Perception and Performance, vol.39, issue.3, p.623, 2013.
DOI : 10.1037/a0030402

C. Ngon, A. Martin, E. Dupoux, D. Cabrol, M. Dutat et al., (Non)words, (non)words, (non)words: evidence for a protolexicon during the first year of life, Developmental Science, vol.50, issue.1, pp.24-34, 2013.
DOI : 10.1111/j.1467-7687.2012.01189.x

K. Hirsh-pasek, D. G. Kemler-nelson, W. Peter, K. W. Jusczyk, B. Cassidy et al., Clauses are perceptual units for young infants, Cognition, vol.26, issue.3
DOI : 10.1016/S0010-0277(87)80002-1

W. Peter, K. Jusczyk, D. G. Hirsh-pasek, L. J. Kemler-nelson, A. Kennedy et al., Perception of acoustic correlates of major phrasal units by young infants, Cognitive psychology, vol.242, pp.252-293, 1992.

J. Myers, W. Peter, D. G. Jusczyk, A. L. Kemler-nelson-charles-luce, K. Woodward et al., Infants' sensitivity to word boundaries in fluent speech, Journal of Child Language, vol.54, issue.01, pp.1-30, 1996.
DOI : 10.1016/0885-2308(87)90004-0

H. Naomi, E. B. Feldman, . Myers, S. Katherine, . White et al., Word-level information influences phonetic learning in adults and infants, Cognition, vol.1273, pp.427-438, 2013.

E. Bergelson and D. Swingley, At 6-9 months, human infants know the meanings of many common nouns, Proceedings of the National Academy of Sciences, pp.3253-3258, 2012.
DOI : 10.1073/pnas.1113380109

E. Bergelson and D. Swingley, Early Word Comprehension in Infants: Replication and Extension " . In: Language Learning and Development ahead-of-print (2014), pp.1-12

H. Henny-yeung, M. Lawrence, J. F. Chen, and . Werker, Referential labeling can facilitate phonetic learning in infancy, pp.1036-1049, 2014.

H. Henny, Y. , and T. Nazzi, Object labeling influences infant phonetic learning and generalization, Cognition 132, pp.151-163, 2014.

A. Fernald, Four-month-old infants prefer to listen to motherese, Infant Behavior and Development, vol.8, issue.2, pp.181-195, 1985.
DOI : 10.1016/S0163-6383(85)80005-9

M. Scaife, S. Jerome, and . Bruner, The capacity for joint visual attention in the infant, Nature, vol.27, issue.5489, 1975.
DOI : 10.1038/253265a0

G. Butterworth, Pointing is the royal road to language for babies In: Pointing: Where language, culture, and cognition meet, pp.9-33, 2003.

G. Butterworth, Joint visual attention in infancy, Theories of infant development, pp.317-354, 2004.
DOI : 10.1002/9780470996348.ch8

K. Patricia, F. Kuhl, H. Tsao, and . Liu, Foreign-language experience in infancy: Effects of short-term exposure and social interaction on phonetic learning, Proceedings of the National Academy of Sciences, pp.9096-9101, 2003.

H. Douglas, A. G. Whalen, Q. Levitt, and . Wang, Intonational differences between the reduplicative babbling of French-and English-learning infants, Journal of Child Language, vol.18, pp.3-501, 1991.

L. Bénédicte-de-boysson-bardies, C. Sagart, and . Durand, Discernible differences in the babbling of infants according to target language, Journal of child language, vol.11, issue.01, pp.1-15, 1984.

H. Henny, Y. , and J. F. Werker, Lip movements affect infants' audiovisual speech perception, Psychological Science, vol.245, pp.603-612, 2013.

K. Patricia, . Kuhl, N. Andrew, and . Meltzoff, Infant vocalizations in response to speech: Vocal imitation and developmental change, In: The journal of the Acoustical Society of America, vol.1004, pp.2425-2438, 1996.

H. Michael, J. A. Goldstein, and . Schwade, Social feedback to infants' babbling facilitates rapid phonological learning, In: Psychological Science, vol.195, pp.515-523, 2008.

T. Nazzi and F. Ramus, Perception and acquisition of linguistic rhythm by infants, Speech Communication, vol.41, issue.1, pp.233-243, 2003.
DOI : 10.1016/S0167-6393(02)00106-1
URL : https://hal.archives-ouvertes.fr/hal-00242908

A. Jansen and K. Church, Towards Unsupervised Training of Speaker Independent Acoustic Models, Proc. INTERSPEECH. 2011 (cit. on pp. 66, pp.69-71

A. Jansen, S. Thomas, and H. Hermansky, Weak top-down constraints for unsupervised acoustic model training, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing, pp.69-71
DOI : 10.1109/ICASSP.2013.6639241

A. Jansen, S. Thomas, and H. Hermansky, Intrinsic Spectral Analysis for Zero and High Resource Speech Recognition, Proc. INTERSPEECH. 2012 (cit, pp.67-71

H. Hermansky, J. David, and . Broad, The effective second formant F2' and the vocal tract front-cavity, International Conference on Acoustics, Speech, and Signal Processing, 1989.
DOI : 10.1109/ICASSP.1989.266468

C. Lee and J. Glass, A nonparametric Bayesian approach to acoustic model discovery, Proc. ACL. 2012 (cit. on pp. 67, p.71

H. Kamper, W. Wang, and K. Livescu, Deep convolutional acoustic word embeddings using word-pair side information, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), p.68, 2015.
DOI : 10.1109/ICASSP.2016.7472619
URL : http://arxiv.org/abs/1510.01032

L. Deng, G. Hinton, and B. Kingsbury, New types of deep neural network learning for speech recognition and related applications: an overview, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing, p.68
DOI : 10.1109/ICASSP.2013.6639344

L. Tóth, Combining time- and frequency-domain convolution in convolutional neural network-based phone recognition, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), p.68
DOI : 10.1109/ICASSP.2014.6853584

D. Snyder, D. Garcia-romero, and D. Povey, Time delay deep neural networkbased universal background models for speaker recognition, Proc. ASRU. 2015 (cit, p.68
DOI : 10.1109/asru.2015.7404779

K. Vijayan, P. R. Reddy, and K. Murty, Significance of analytic phase of speech signals in speaker verification, Speech Communication, vol.81, issue.151, pp.54-71, 2016.
DOI : 10.1016/j.specom.2016.02.005

T. Schatz, V. Peddinti, X. Cao, R. Francis, H. Bach et al., Evaluating speech features with the minimal-pair ABX task (II): resistance to noise, Proc. INTERSPEECH. 2014 (cit, pp.68-71

G. Synnaeve, T. Schatz, and E. Dupoux, Phonetics embedding learning with side information, 2014 IEEE Spoken Language Technology Workshop (SLT), pp.69-71
DOI : 10.1109/SLT.2014.7078558

M. Versteegh, X. Anguera, A. Jansen, and E. Dupoux, The Zero Resource Speech Challenge 2015: Proposed Approaches and Results, Procedia Computer Science, vol.81, issue.151, pp.67-72, 2016.
DOI : 10.1016/j.procs.2016.04.031

A. Mark, K. Pitt, E. Johnson, S. Hume, W. Kiesling et al., The Buckeye corpus of conversational speech: Labeling conventions and a test of transcriber reliability, Speech Communication, vol.451, issue.130, pp.89-95, 2005.

J. Nic, . Vries, H. Marelie, J. Davel, . Badenhorst et al., A smartphone-based ASR data collection tool for under-resourced languages, Speech communication, vol.56, pp.119-131, 2014.

R. Thiollière, E. Dunbar, G. Synnaeve, M. Versteegh, and E. Dupoux, A Hybrid Dynamic Time Warping-Deep Neural Network Architecture for Unsupervised Acoustic Modeling, Proc. INTERSPEECH. 2015 (cit, pp.70-71

D. Renshaw, H. Kamper, A. Jansen, and S. Goldwater, A comparison of neural network methods for unsupervised representation learning on the Zero Resource Speech Challenge, Proc. INTERSPEECH. 2015 (cit, pp.70-71

N. Zeghidour, G. Synnaeve, M. Versteegh, and E. Dupoux, A deep scattering spectrum ??? Deep Siamese network pipeline for unsupervised acoustic modeling, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
DOI : 10.1109/ICASSP.2016.7472622

J. Andén and S. Mallat, Deep Scattering Spectrum, IEEE Transactions on Signal Processing, vol.62, issue.16, pp.4114-4128, 2014.
DOI : 10.1109/TSP.2014.2326991

L. Badino, A. Mereta, and L. Rosasco, Discovering Discrete Subword Units with Binarized Autoencoders and Hidden-Markov-Model Encoders, Proc. IN- TERSPEECH. 2015 (cit, pp.70-71

H. Chen, C. Leung, L. Xie, B. Ma, and H. Li, Parallel Inference of Dirichlet Process Gaussian Mixture Models for Unsupervised Acoustic Modeling: A Feasibility Study, Proc. INTERSPEECH. 2015 (cit, pp.70-71

M. Heck, S. Sakti, and S. Nakamura, Unsupervised Linear Discriminant Analysis for Supporting DPGMM Clustering in the Zero Resource Scenario, Procedia Computer Science, vol.81, issue.151, pp.73-79, 2016.
DOI : 10.1016/j.procs.2016.04.032

C. Chung, C. Tsai, H. Lu, C. Liu, H. Lee et al., An iterative deep learning framework for unsupervised discovery of speech features and linguistic units with applications on spoken term detection, 2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU), p.71
DOI : 10.1109/ASRU.2015.7404801

N. Macmillan and C. Creelman, Detection Theory: A User's Guide Lawrence Erlbaum Associates, pp.73-76, 2005.

Y. Yao, S. Tilsen, L. Ronald, K. Sprouse, and . Johnson, Automated measurement of vowel formants in the buckeye corpus, 2010.

N. Ryant, J. Yuan, and M. Liberman, Automating phonetic measurement: The case of voice onset time, Proc. Meetings on Acoustics. 2013 (cit, p.78

H. Hermansky, Perceptual linear predictive (PLP) analysis of speech, The Journal of the Acoustical Society of America, vol.87, issue.4, pp.1738-1752, 1990.
DOI : 10.1121/1.399423

R. Schluter, H. Bezrukov, H. Wagner, and . Ney, Gammatone Features and Feature Combination for Large Vocabulary Speech Recognition, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '07, 2007.
DOI : 10.1109/ICASSP.2007.366996

G. Hinton, L. Deng, D. Yu, E. George, A. Dahl et al., Deep Neural Networks for Acoustic Modeling in Speech Recognition: The Shared Views of Four Research Groups, IEEE Signal Processing Magazine, vol.29, issue.6, pp.82-97, 2012.
DOI : 10.1109/MSP.2012.2205597

A. Martin, T. Schatz, M. Versteegh, K. Miyazawa, R. Mazuka et al., Mothers Speak Less Clearly to Infants Than to Adults A Comprehensive Test of the Hyperarticulation Hypothesis, In: Psychological science, vol.263, issue.120, pp.341-347, 2015.

A. Cristia, Input to Language: The Phonetics and Perception of Infant-Directed Speech, Language and Linguistics Compass, vol.7, issue.4, pp.157-170, 2013.
DOI : 10.1111/lnc3.12015

Y. Igarashi, K. Ken-'ya-nishikawa, R. Tanaka, and . Mazuka, Phonological theory informs the analysis of intonational exaggeration in Japanese infant-directed speech, The Journal of the Acoustical Society of America, vol.134, issue.2, pp.1283-1294, 2013.
DOI : 10.1121/1.4812755

R. Mazuka, Y. Igarashi, and K. Nishikawa, Input for learning Japanese: RIKEN Japanese Mother-Infant Conversation Corpus, pp.11-15, 2006.

B. Mcmurray, K. A. Kovack-lesh, D. Goodwin, and W. Mcechron, Infant directed speech and the development of speech perception: Enhancing development or an unintended consequence?, Cognition, vol.129, issue.2, pp.362-378, 2013.
DOI : 10.1016/j.cognition.2013.07.015

T. Benders, Mommy is only happy! Dutch mothers??? realisation of speech sounds in infant-directed speech expresses emotion, not didactic intent, Infant Behavior and Development, vol.36, issue.4, pp.847-862, 2013.
DOI : 10.1016/j.infbeh.2013.09.001

N. Ramírez-esparza, A. García-sierra, and P. K. Kuhl, Look who's talking: speech style and social context in language input to infants are linked to concurrent and future speech development, Developmental Science, vol.316, issue.6, pp.880-891, 2014.
DOI : 10.1111/desc.12172

A. Weisleder and A. Fernald, Talking to children matters early language experience strengthens processing and builds vocabulary, Psychological Science, vol.2411, pp.2143-2152, 2013.

P. Ladefoged and K. Johnson, A course in phonetics

K. Maekawa, Corpus of Spontaneous Japanese: Its design and evaluation, Proc

R. Smits, Evidence for hierarchical categorization of coarticulated phonemes., Journal of Experimental Psychology: Human Perception and Performance, vol.27, issue.5, p.1145, 2001.
DOI : 10.1037/0096-1523.27.5.1145

F. Hönig, G. Stemmer, C. Hacker, and F. Brugnara, Revising Perceptual Linear Prediction (PLP), Proc. INTERSPEECH. 2005 (cit. on pp. 91, p.98

D. Peter, . Eimas, R. Einar, . Siqueland, W. Peter et al., Speech perception in infants, Science, vol.171, pp.303-306, 1971.

D. Peter and . Eimas, Auditory and linguistic processing of cues for place of articulation by infants, In: Perception & Psychophysics, vol.163, pp.513-521, 1974.

D. Peter and . Eimas, Auditory and phonetic coding of the cues for speech: Discrimination of the [rl] distinction by young infants, Perception & Psychophysics, vol.185, issue.151, pp.341-347, 1975.

A. Lynn and . Streeter, Language perception of 2-mo-old infants shows effects of both innate mechanisms and experience, In: Nature, 1976.

E. Sandra and . Trehub, The discrimination of foreign speech contrasts by infants and adults, pp.466-472, 1976.

N. Richard, . Aslin, B. David, . Pisoni, L. Beth et al., Discrimination of voice onset time by human infants: New findings and implications for the effects of early experience, p.1135, 1981.

K. Patricia, . Kuhl, D. James, and . Miller, Discrimination of auditory target dimensions in the presence or absence of variation in a second dimension by infants, Perception & Psychophysics, vol.313, pp.279-292, 1982.

F. Janet, . Werker, C. Richard, and . Tees, Cross-language speech perception: Evidence for perceptual reorganization during the first year of life, pp.49-63, 1984.

J. Bertoncini, R. Bijeljac-babic, E. Sheila, J. Blumstein, and . Mehler, Discrimination in neonates of very short CVs, The Journal of the Acoustical Society of America, vol.82, issue.1, pp.31-37, 1987.
DOI : 10.1121/1.395570

J. Bertoncini, R. Bijeljac-babic, W. Peter, . Jusczyk, J. Lori et al., An investigation of young infants' perceptual representations of speech sounds., Journal of Experimental Psychology: General, vol.117, issue.1, p.21, 1988.
DOI : 10.1037/0096-3445.117.1.21

L. Polka, F. Janet, and . Werker, Developmental changes in perception of nonnative vowel contrasts., Journal of Experimental Psychology: Human Perception and Performance, vol.20, issue.2, p.421, 1994.
DOI : 10.1037/0096-1523.20.2.421

T. Tsushima, O. Takizawa, M. Sasaki, S. Shiraki, K. Nishi et al., Discrimination of English/rl/and/wy/by Japanese infants at 6-12 months: language-specific developmental changes in speech perception abilities, Proc. ICSLP, 1994.

A. Kujala, M. Huotilainen, M. Hotakainen, M. Lennes, L. Parkkonen et al., Speech-sound discrimination in neonates as measured with MEG, NeuroReport, vol.15, issue.13, pp.2089-2092, 2004.
DOI : 10.1097/00001756-200409150-00018

K. Patricia and . Kuhl, Theoretical contributions of tests on animals to the special-mechanisms debate in speech, In: Experimental biology, vol.453, pp.233-265, 1985.

L. Randy, . Diehl, J. Andrew, L. L. Lotto, and . Holt, Speech perception, Annu. Rev. Psychol, vol.55, pp.149-179, 2004.

R. Tincoff, M. Hauser, F. Tsao, G. Spaepen, F. Ramus et al., The role of speech rhythm in language discrimination: further tests with a non-human primate, Developmental Science, vol.37, issue.2, pp.26-35, 2005.
DOI : 10.1037//0735-7036.115.3.258
URL : https://hal.archives-ouvertes.fr/hal-00260024

L. Lori, . Holt, J. Andrew, and . Lotto, Speech perception within an auditory cognitive science framework, In: Current directions in psychological science, vol.171, pp.42-46, 2008.

K. Patricia, . Kuhl, D. James, and . Miller, Speech perception by the chinchilla: Voicedvoiceless distinction in alveolar plosive consonants, In: Science, vol.1904209, pp.69-72, 1975.

K. Patricia, . Kuhl, D. James, and . Miller, Speech perception by the chinchilla: Identification functions for synthetic VOT stimuli, In: The Journal of the Acoustical Society of America, vol.633, pp.905-917, 1978.

K. Patricia and . Kuhl, Discrimination of speech by nonhuman animals: Basic auditory sensitivities conducive to the perception of speech-sound categories, In: The Journal of the Acoustical Society of America, vol.702, pp.340-349, 1981.

K. Patricia, D. M. Kuhl, and . Padden, Enhanced discriminability at the phonetic boundaries for the voicing feature in macaques, Perception & Psychophysics, vol.326, pp.542-550, 1982.

K. Patricia, D. M. Kuhl, and . Padden, Enhanced discriminability at the phonetic boundaries for the place feature in macaques, In: The Journal of the Acoustical Society of America, vol.733, pp.1003-1010, 1983.

W. Fitch, The evolution of speech: a comparative review, Trends in cognitive sciences 4, pp.258-267, 2000.
DOI : 10.1016/S1364-6613(00)01494-7

P. Lieberman, The evolution of human speech, Current Anthropology, vol.481, pp.39-66, 2007.

B. Ann, W. Butler, and . Hodos, Comparative vertebrate neuroanatomy: evolution and adaptation, 2005.

J. Schnupp, I. Nelken, and A. King, Auditory neuroscience: Making sense of sound, 2011.

C. Evan, . Smith, S. Michael, and . Lewicki, Efficient auditory coding, Nature, vol.4397079, pp.978-982, 2006.

R. Jenny, . Saffran, F. Janet, L. A. Werker, and . Werner, The infant's auditory world: Hearing, speech, and the beginnings of language, Handbook of child psychology, 2006.

P. W. Daniel and . Ellis, PLP and RASTA (and MFCC, and inversion) in Matlab

S. Thomas, S. Ganapathy, and H. Hermansky, Spectro-temporal features for automatic speech recognition using linear prediction in spectral domain

M. Athineos, P. Daniel, and . Ellis, Frequency-domain linear prediction for temporal features, 2003 IEEE Workshop on Automatic Speech Recognition and Understanding (IEEE Cat. No.03EX721), p.97
DOI : 10.1109/ASRU.2003.1318451
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.112.2563

. Hynek, N. Hermansky, and . Morgan, RASTA processing of speech, IEEE Transactions on Speech and Audio Processing, vol.2, issue.107, pp.578-589, 1994.

C. Mead and . Killion, Revised estimate of minimum audible pressure: Where is the "missing 6 dB, The Journal of the Acoustical Society of America, vol.635, issue.101, pp.1501-1508, 1978.

Y. Suzuki and H. Takeshima, Equal-loudness-level contours for pure tones, The Journal of the Acoustical Society of America, vol.116, issue.2, pp.918-933, 2004.
DOI : 10.1121/1.1763601

G. Fant, J. Liljencrants, and Q. Lin, A four-parameter model of glottal flow, In: STL-QPSR, vol.4, pp.1-13, 1985.

J. Kreiman, R. Bruce, N. Gerratt, and . Antoñanzas-barroso, Measures of the Glottal Source Spectrum, Journal of Speech Language and Hearing Research, vol.50, issue.3, pp.595-610, 2007.
DOI : 10.1044/1092-4388(2007/042)

C. Brian, . Moore, R. Brian, and . Glasberg, A revised model of loudness perception applied to cochlear hearing loss, Hearing research, vol.1881, pp.70-88, 2004.

D. Byrne, H. Dillon, K. Tran, S. Arlinger, K. Wilbraham et al., An international comparison of long???term average speech spectra, The Journal of the Acoustical Society of America, vol.96, issue.4, pp.2108-2120, 1994.
DOI : 10.1121/1.410152

G. Fant, Acoustic theory of speech production: with calculations based on X-ray studies of Russian articulations, 1971.
DOI : 10.1515/9783110873429

M. Mohan, S. , and J. Schroeter, Speech production models and their digital implementations, The Digital Signal Processing Handbook, 1997.

S. Shamma and C. Lorenzi, On the balance of envelope and temporal fine structure in the encoding of speech in the early auditory system, The Journal of the Acoustical Society of America, vol.133, issue.5, pp.2818-2833, 2013.
DOI : 10.1121/1.4795783

K. Graeme, . Yates, M. Ian, D. Winter, and . Robertson, Basilar membrane nonlinearity determines auditory nerve rate-intensity functions and cochlear dynamic range, Hearing research, vol.453, issue.169, pp.203-219, 1990.

L. Charles, . Philips, M. John, E. Parr, and . Riskin, Signals, systems, and transforms, 1995.

E. De-boer and H. , On cochlear encoding: Potentialities and limitations of the reverse-correlation technique, The Journal of the Acoustical Society of America, vol.63, issue.1, pp.115-135, 1978.
DOI : 10.1121/1.381704

J. Pickles, An introduction to the physiology of hearing, 2008.

B. Moore, An introduction to the psychology of hearing), 2004.

R. Patterson, I. Nimmo-smith, J. Holdsworth, and P. Rice, An efficient auditory filterbank based on the gammatone function, Proc. Meeting of the IOC Speech Group on Auditory Modelling at RSRE, 1987.

E. Zwicker and E. Terhardt, Analytical expressions for critical???band rate and critical bandwidth as a function of frequency, The Journal of the Acoustical Society of America, vol.68, issue.5, pp.1523-1525, 1980.
DOI : 10.1121/1.385079

S. Smith-stevens, J. Volkmann, B. Edwin, and . Newman, A Scale for the Measurement of the Psychological Magnitude Pitch, The Journal of the Acoustical Society of America, vol.8, issue.3, pp.185-190, 1937.
DOI : 10.1121/1.1915893

D. Donald and . Greenwood, A cochlear frequency-position function for several species -29 years later, The Journal of the Acoustical Society of America, vol.876, pp.2592-2605, 1990.

C. Brian, . Moore, R. Brian, and . Glasberg, A revision of Zwicker's loudness model, Acta Acustica united with Acustica 82, pp.335-345, 1996.

H. Josh, . Mcdermott, P. Eero, and . Simoncelli, Sound texture perception via statistics of the auditory periphery: Evidence from sound synthesis, In: Neuron, vol.715, pp.926-940, 2011.

P. Joris, C. Schreiner, and A. Rees, Neural Processing of Amplitude-Modulated Sounds, Physiological reviews 84, pp.541-577, 2004.
DOI : 10.1152/physrev.00029.2003

V. Robert, F. Shannon, V. Zeng, J. Kamath, M. Wygonski et al., Speech recognition with primarily temporal cues, In: Science, vol.2705234, pp.303-304, 1995.

M. Zachary, B. Smith, . Delgutte, J. Andrew, and . Oxenham, Chimaeric sounds reveal dichotomies in auditory perception, Nature, vol.4166876, pp.87-90, 2002.

M. Steven, . Schimmel, E. Les, and . Atlas, Coherent Envelope Detection for Modulation Filtering of Speech, Proc. ICASSP. 2005 (cit

E. Richard, M. Turner, and . Sahani, Demodulation as probabilistic inference, IEEE Transactions on Audio, Speech, and Language Processing, vol.198, issue.121, pp.2398-2411, 2011.

G. Sell and M. Slaney, Solving Demodulation as an Optimization Problem, IEEE Transactions on Audio, Speech, and Language Processing, vol.18, issue.8, pp.2051-2066, 2010.
DOI : 10.1109/TASL.2010.2041108
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.407.2965

T. Chi, P. Ru, A. Shihab, and . Shamma, Multiresolution spectrotemporal analysis of complex sounds, The Journal of the Acoustical Society of America, vol.118, issue.2, pp.887-906, 2005.
DOI : 10.1121/1.1945807

S. Muhammad, . Zilany, C. Ian, . Bruce, C. Paul et al., A phenomenological model of the synapse between the inner hair cell and auditory nerve: long-term adaptation with power-law dynamics, The Journal of the Acoustical Society of America, vol.1265, pp.2390-2412, 2009.

C. Brian and . Moore, Temporal integration and context effects in hearing, Journal of Phonetics, vol.313, pp.563-574, 2003.

E. Zwicker, ???Negative Afterimage??? in Hearing, The Journal of the Acoustical Society of America, vol.36, issue.12, pp.2413-2415, 1964.
DOI : 10.1121/1.1919373

M. Unser, On the approximation of the discrete Karhunen-Loeve transform for stationary processes, Signal Processing, vol.7, issue.3, pp.231-249, 1984.
DOI : 10.1016/0165-1684(84)90002-1

K. Anil and . Jain, A fast Karhunen-Loeve transform for a class of random processes, p.42860, 1976.

K. Anil and . Jain, A sinusoidal family of unitary transforms, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.4, pp.356-365, 1979.

F. Peter and . Macneilage, Motor control of serial ordering of speech, In: Psychological review, vol.773, p.182, 1970.

R. Kumaresan and A. Rao, Model-based approach to envelope and positive instantaneous frequency estimation of signals with speech applications, The Journal of the Acoustical Society of America, vol.105, issue.3, pp.1912-1924, 1999.
DOI : 10.1121/1.426727

C. Kim, M. Richard, and . Stern, Power-normalized cepstral coefficients (PNCC) for robust speech recognition, Proc. ICASSP. 2012
DOI : 10.1109/icassp.2012.6288820

A. Jansen and P. Niyogi, Intrinsic Spectral Analysis, IEEE Transactions on Signal Processing, vol.61, issue.7, pp.1698-1710, 2013.
DOI : 10.1109/TSP.2013.2238931

A. Enrique and . Lopez-poveda, Spectral processing by the peripheral auditory system: facts and models, In: International review of neurobiology, vol.70, pp.7-48, 2004.

L. Morten, . Jepsen, D. Stephan, T. Ewert, and . Dau, A computational model of human auditory signal processing and perception, In: The Journal of the Acoustical Society of America, vol.1241, pp.422-438, 2008.

W. Strange, Speech perception and linguistic experience: Issues in cross-language research, p.123, 1995.

A. Cutler, Native listening: Language experience and the recognition of spoken words, p.2012

H. Goto, Auditory perception by normal Japanese adults of the sounds ???L??? and ???R???, Neuropsychologia, vol.9, issue.3, pp.317-323, 1971.
DOI : 10.1016/0028-3932(71)90027-3

K. Miyawaki, J. James, W. Jenkins, A. M. Strange, R. Liberman et al., An effect of linguistic experience: The discrimination of [r] and [l] by native speakers of Japanese and English, Perception & Psychophysics, vol.18, issue.5, pp.331-340, 1975.
DOI : 10.3758/BF03211209

T. Catherine and . Best, The emergence of native-language phonological influences in infants: A perceptual assimilation model " . In: The development of speech perception: The transition from speech sounds to spoken words, pp.224-146, 1994.

T. Catherine and . Best, A Direct Realist View of Cross-Language Speech Perception In: Speech Perception and Linguistic Experience: Issues in Cross-Language Research, pp.124-146, 1995.

T. Catherine, . Best, W. Gerald, . Mcroberts, M. Nomathemba et al., Examination of perceptual reorganization for nonnative speech contrasts: Zulu click discrimination by English-speaking adults and infants, In: Journal of Experimental Psychology: Human perception and performance, vol.143, issue.145, pp.345-146, 1988.

E. James and . Flege, Second language speech learning: Theory, findings, and problems " . In: Speech perception and linguistic experience: Issues in cross-language research, pp.233-277, 1995.

E. James and . Flege, Age of learning and second language speech " . In: Second language acquisition and the critical period hypothesis, pp.101-131, 1999.

K. Patricia, P. Kuhl, and . Iverson, Chapter 4: Linguistic Experience and the " Perceptual Magnet Effect In: Speech perception and linguistic experience: Issues in cross-language research, pp.121-154, 1995.

K. Patricia, . Kuhl, T. Barbara, S. Conboy, D. Coffey-corina et al., Phonetic learning as a pathway to language: new data and native language magnet theory expanded (NLM-e), In: Philosophical Transactions of the Royal Society B: Biological Sciences, vol.363, issue.126, pp.1493-979, 2008.

H. Naomi, . Feldman, L. Thomas, . Griffiths, L. James et al., The influence of categories on perception: explaining the perceptual magnet effect as optimal statistical inference, In: Psychological review, vol.1164, p.752, 2009.

L. Bonnasse-gahot and J. Nadal, Neural coding of categories: information efficiency and optimal population codes, Journal of Computational Neuroscience, vol.256, issue.1, pp.169-187, 2008.
DOI : 10.1007/s10827-007-0071-5

C. Cortes and V. Vapnik, Support-vector networks, Machine Learning, vol.1, issue.3, pp.273-297, 1995.
DOI : 10.1007/BF00994018

W. Strange, M. Hisagi, R. Akahane-yamada, and R. Kubo, Crosslanguage perceptual similarity predicts categorial discrimination of American vowels by naive Japanese listeners, The Journal of the Acoustical Society of America, vol.1304, issue.135, pp.226-231, 2011.

W. Strange, . Ocke-schwen-bohn, A. Sonja, K. Trent, and . Nishi, Acoustic and perceptual similarity of North German and American English vowels, The Journal of the Acoustical Society of America, vol.115, issue.4, pp.1791-1807, 2004.
DOI : 10.1121/1.1687832

J. Gong, M. Cooke, and G. Lecumberri, Towards a quantitative model of Mandarin Chinese perception of English consonants, Proc. NewSounds 2010, p.146, 2010.

L. Terry and . Gottfried, Effects of consonant context on the perception of French vowels, Journal of Phonetics, vol.122, issue.134, pp.91-114, 1984.

S. Arthur, L. Abramson, and . Lisker, Discriminability along the voicing continuum: Cross-language tests, Proc. International Congress of Phonetic Sciences, 1970.

K. Connie, . So, T. Catherine, and . Best, Cross-language perception of non-native tonal contrasts: Effects of native phonological and phonetic influences, Language and speech 53, pp.273-293, 2010.

T. Schultz, Globalphone: a multilingual speech and text database developed at karlsruhe university, Proc. INTERSPEECH, 2002.

N. T. Vu and T. Schultz, Vietnamese large vocabulary continuous speech recognition, 2009 IEEE Workshop on Automatic Speech Recognition & Understanding, 2009.
DOI : 10.1109/ASRU.2009.5373424
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.187.1443

D. Povey, A. Ghoshal, G. Boulianne, L. Burget, O. Glembek et al., The Kaldi speech recognition toolkit, Proc. Workshop on Automatic Speech Recognition and Understanding, 2011.

P. Ghahremani, B. Babaali, D. Povey, K. Riedhammer, and S. Khudanpur, A pitch extraction algorithm tuned for automatic speech recognition, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
DOI : 10.1109/ICASSP.2014.6854049

N. Ogasawara, Acoustic Analysis of Voice-Onset Time in Taiwan Mandarin and

N. Ogasawara, Production and perception of voice onset time cues in spoken Japanese and Taiwan Mandarin., The Journal of the Acoustical Society of America, vol.129, issue.4, pp.2419-2419, 2011.
DOI : 10.1121/1.3587893

Y. Lee, A. Douglas, . Vakoch, H. Lee, and . Wurm, Tone perception in Cantonese and Mandarin: A cross-linguistic comparison, Journal of Psycholinguistic Research, vol.19, issue.5, pp.527-542, 1996.
DOI : 10.1007/BF01758181

W. Chiao, B. Kabak, and B. Braun, When more is less: Non-native perception of level tone contrasts, Proc. Psycholinguistic Representation of Tone Conference, 2011.

M. Hisagi, L. Valerie, W. Shafer, E. S. Strange, and . Sussman, Perception of a Japanese vowel length contrast by Japanese and American English listeners: Behavioral and electrophysiological measures, Brain Research, vol.1360, issue.142, pp.1360-89, 2010.
DOI : 10.1016/j.brainres.2010.08.092

M. Hisagi and W. Strange, Perception of Japanese Temporally-cued Contrasts by American English Listeners, Language and Speech, vol.54, issue.2, pp.241-264, 2011.
DOI : 10.1177/0023830910397499

G. Jia, W. Strange, Y. Wu, J. Collado, and Q. Guan, Perception and production of English vowels by Mandarin speakers: Age-related differences vary with amount of L2 exposure, The Journal of the Acoustical Society of America, vol.119, issue.2, pp.1118-1130, 2006.
DOI : 10.1121/1.2151806

J. Pylkkönen and M. Kurimo, Duration modeling techniques for continuous speech recognition, Proc. INTERSPEECH, 2004.

G. Synnaeve and E. Dupoux, Weakly supervised multi-embeddings learning of acoustic models, 2014.

K. Pappagari-raghavendra-reddy, K. Vijayan, and . Murty, Analysis of features from analytic representation of speech using MP-ABX measures, Proc. INTERSPEECH. 2015

G. Synnaeve and E. Dupoux, A Temporal Coherence Loss Function for Learning Unsupervised Acoustic Embeddings, Procedia Computer Science, vol.81, pp.95-100, 2016.
DOI : 10.1016/j.procs.2016.04.035

M. J. Carbajal, R. Fér, and E. Dupoux, Modeling language discrimination in infants using i-vector representations, Proc. CogSci. 2016 (submitted) (cit, p.151

S. Vladimir, Y. Korolyuk, and . Borovskich, Theory of U-statistics, p.153, 2013.

C. Bell, D. Blackwell, and L. Breiman, On the Completeness of Order Statistics, The Annals of Mathematical Statistics, pp.794-797, 1960.
DOI : 10.1214/aoms/1177705808

J. Pfanzagl, Parametric statistical theory, 1994.
DOI : 10.1515/9783110889765

W. Hoeffding, Probability inequalities for sums of bounded random variables, Journal of the American statistical association, vol.58301, pp.13-30, 1963.

A. Miguel, E. Arcones, and . Gine, On the bootstrap of U and V statistics, The Annals of Statistics, pp.655-674, 1992.

W. Aad, . Van, and . Vaart, Asymptotic statistics, 2000.

L. Cohen, Time-frequency analysis, 1995.

B. Gold, C. Oppenheim, and . Rader, Theory and implementation of the discrete Hilbert transform, Proc. Symposium on Computer Processing in Communications, Polytechnic Institute of Brooklyn, 1969.

H. Duifhuis, Cochlear mechanics: introduction to a time domain analysis of the nonlinear cochlea, pp.2012-167
DOI : 10.1007/978-1-4419-6117-4

A. Bell, A Resonance Approach to Cochlear Mechanics, PLoS ONE, vol.39, issue.11, p.47918, 2012.
DOI : 10.1371/journal.pone.0047918.g010
URL : http://doi.org/10.1371/journal.pone.0047918

A. Recio-spinoso, S. William, and . Rhode, Fast Waves at the Base of the Cochlea, PLOS ONE, vol.12, issue.9, p.129556, 2015.
DOI : 10.1371/journal.pone.0129556.g006