Influences on infant speech processing: Toward a new synthesis Annual review of psychology 50, Bibliography, vol.1, pp.509-535, 1999. ,
Infants show a facilitation effect for native language phonetic perception between 6 and 12 months, Developmental science 9, pp.13-21, 2006. ,
Investigating the role of infant-directed speech with a computer model, Acoustics Research Letters Online, vol.4, issue.4, pp.129-134, 2003. ,
DOI : 10.1121/1.1613311
Self-supervised acquisition of vowels in American English, Proc. National Conference On Artificial Intelligence, 2006. ,
Learning phonetic categories by tracking movements, Cognition, vol.103, issue.1, pp.80-106, 2007. ,
DOI : 10.1016/j.cognition.2006.03.002
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.126.4801
Unsupervised learning of vowel categories from infant-directed speech, Proceedings of the National Academy of Sciences, pp.13273-13278, 2007. ,
Unsupervised learning of vowels from continuous speech based on self-organized phoneme acquisition model, Proc. INTERSPEECH, 2010. ,
The Multi Timescale Phoneme Acquisition Model of the Self-Organizing Based on the Dynamic Features, Proc. INTERSPEECH, 2011. ,
Distributional learning of vowel categories is supported by prosody in infant-directed speech, Proc. CogSci. 2012 (cit ,
Learning Vowel Categories From Maternal Speech in Gurindji Kriol, Language Learning, vol.103, issue.4, pp.1052-1078, 2012. ,
DOI : 10.1111/j.1467-9922.2012.00725.x
A Single-Stage Approach to Learning Phonological Categories: Insights From Inuktitut, Cognitive Science, vol.107, issue.2-3, pp.344-377, 2013. ,
DOI : 10.1111/cogs.12008
A role for the developing lexicon in phonetic category acquisition, In: Psychological review, vol.1204, p.751, 2013. ,
Feedback and imitation by a caregiver guides a virtual infant to learn native phonemes and the skill of speech inversion, Speech Communication, vol.55, issue.9, pp.909-931, 2013. ,
DOI : 10.1016/j.specom.2013.05.002
Weak semantic context helps phonetic learning in a model of infant language acquisition, Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) ,
DOI : 10.3115/v1/P14-1101
Signal detection theory and psychophysics, 1966. ,
Theory of Classification: a Survey of Some Recent Advances, ESAIM: probability and statistics 9, pp.323-375, 2005. ,
DOI : 10.1051/ps:2005018
URL : https://hal.archives-ouvertes.fr/hal-00017923
Consistency of spectral clustering, The Annals of Statistics, vol.36, issue.2, pp.555-586, 2008. ,
DOI : 10.1214/009053607000000640
Distance measures for speech recognition, psychological and instrumental, Pattern recognition and artificial intelligence, vol.116, issue.78, pp.91-103, 1976. ,
Speech discrimination by dynamic programming, Cybernetics and Systems Analysis, vol.41, issue.133, pp.52-57, 1968. ,
The design for the Wall Street Journal-based CSR corpus, Proc. Workshop on Speech and Natural Language, pp.357-362, 1992. ,
The elements of statistical learning: data mining, inference, and prediction, 2009. ,
Normalized cuts and image segmentation, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.228, issue.32, pp.888-905, 2000. ,
Single linkage versus average linkage clustering in machine cells formation applications, In: Computers & Industrial Engineering, vol.163, pp.419-426, 1989. ,
Comparing partitions, Journal of Classification, vol.78, issue.1, pp.193-218, 1985. ,
DOI : 10.1007/BF01908075
Statistical behavior and consistency of classification methods based on convex risk minimization, The Annals of Statistics, vol.32, issue.1, pp.56-85, 2004. ,
DOI : 10.1214/aos/1079120130
k-means++: The advantages of careful seeding, Proc. ACM-SIAM symposium on discrete algorithms, 2007. ,
Nearest neighbor classification in infinite dimension, ESAIM: Probability and Statistics, vol.10, pp.340-355, 2006. ,
DOI : 10.1051/ps:2006014
On Information and Sufficiency, The annals of mathematical statistics, pp.79-86, 1951. ,
DOI : 10.1214/aoms/1177729694
Foundations of machine learning, p.2012 ,
Naive (Bayes) at Forty: The Independence Assumption in Information Retrieval, Proc. European Conference on Machine Learning, 1998. ,
A Probabilistic Theory of Pattern Recognition, 1996. ,
DOI : 10.1007/978-1-4612-0711-5
Stability and model selection in k-means clustering, Machine learning 80, pp.2-3, 2010. ,
DOI : 10.1007/s10994-010-5177-8
Clustering Stability: An Overview, Machine Learning, vol.23, pp.235-274, 2009. ,
Rapid Evaluation of Speech Representations for Spoken Term Discovery, Proc. INTERSPEECH. 2011 (cit. on pp. 55, pp.65-68 ,
Evaluating speech features with the Minimal-Pair ABX task: Analysis of the classical MFC/PLP pipeline, Proc. INTERSPEECH. 2013 (cit. on pp. 55, pp.68-71 ,
URL : https://hal.archives-ouvertes.fr/hal-00918599
The relationship between Precision-Recall and ROC curves, Proceedings of the 23rd international conference on Machine learning , ICML '06, 2006. ,
DOI : 10.1145/1143844.1143874
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.98.4362
Articulation Index LSCP LDC2015S12 ,
The zero resource speech challenge, Proc. INTERSPEECH. 2015 (cit, pp.58-151, 2015. ,
DOI : 10.1016/j.procs.2016.04.031
Categorical Perception of Facial Expressions by 7-Month-Old Infants, Perception, vol.48, issue.1, pp.1115-1125, 2001. ,
DOI : 10.1068/p3155
Handbook of child psychology and developmental science Perceptual development. Seventh, p.2015 ,
Concept formation in infancy, Cognitive development, vol.83, pp.291-318, 1993. ,
On developing a knowledge base in infancy, In: Developmental psychology, vol.346, p.1274, 1998. ,
Global-Before-Basic Object Categorization in Connectionist Networks and 2-Month-Old Infants, In: Infancy, vol.1, issue.1, pp.31-46, 2000. ,
Event-related potentials for 7-month-olds??? processing of animals and furniture items, Developmental Cognitive Neuroscience, vol.3, pp.53-60, 2013. ,
DOI : 10.1016/j.dcn.2012.09.002
The Animate-Inanimate Distinction in Infancy: Developing Sensitivity to Constraints on Human Actions, Journal of Cognition and Development, vol.83, issue.4, pp.399-426, 2004. ,
DOI : 10.1016/S0163-6383(99)00007-7
Infants' concept of animacy, Cognitive Development, vol.11, issue.1, pp.19-36, 1996. ,
DOI : 10.1016/S0885-2014(96)90026-X
2.5-Month-Old Infants' Reasoning about When Objects Should and Should Not Be Occluded, Cognitive psychology 39, pp.116-157, 1999. ,
DOI : 10.1006/cogp.1999.0717
URL : https://hal.archives-ouvertes.fr/hal-01281646
Developments in young infants' reasoning about occluded objects, Cognitive psychology 45, pp.267-336, 2002. ,
DOI : 10.1016/S0010-0285(02)00005-1
When the ordinary seems unexpected: evidence for incremental physical knowledge in young infants, Cognition 95, pp.297-328, 2005. ,
DOI : 10.1016/j.cognition.2004.01.010
Five-month-old infants have different expectations for solids and liquids, Psychological Science, vol.205, pp.603-611, 2009. ,
Newborn infants perceive abstract numbers, Proceedings of the National Academy of Sciences, pp.10382-10385, 2009. ,
DOI : 10.1073/pnas.0812142106
URL : http://www.ncbi.nlm.nih.gov/pmc/articles/PMC2700913
Deep learning, Nature, vol.9, issue.7553, pp.436-444, 2015. ,
DOI : 10.1007/s10994-013-5335-x
Machine learning: Trends, perspectives, and prospects, Science, vol.349, issue.6245, pp.255-260, 2015. ,
DOI : 10.1126/science.aaa8415
A summary of the 2012 JH CLSP Workshop on zero resource speech technologies and models of early language acquisition ,
The bimodal perception of speech in infancy, Science, vol.218, issue.4577, pp.1138-1141, 1982. ,
DOI : 10.1126/science.7146899
Two-month-old infants match phonetic information in lips and voice, In: Developmental Science, vol.62, pp.191-196, 2003. ,
Infant sensitivity to distributional information can affect phonetic discrimination, Cognition, vol.82, issue.3, pp.101-111, 2002. ,
DOI : 10.1016/S0010-0277(01)00157-3
Statistical phonetic learning in infants: facilitation and feature generalization, Developmental Science, vol.37, issue.Supplement 1, pp.122-134, 2008. ,
DOI : 10.1111/j.1467-7687.2007.00653.x
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.130.1984
Distributional phonetic learning at 10 months of age, pp.420-433, 2010. ,
Visual speech contributes to phonetic learning in 6-month-old infants, Cognition, vol.108, issue.3, pp.850-855, 2008. ,
DOI : 10.1016/j.cognition.2008.05.009
Fine-grained variation in caregivers??? /s/ predicts their infants??? /s/ category, The Journal of the Acoustical Society of America, vol.129, issue.5, pp.3271-3280, 2011. ,
DOI : 10.1121/1.3562562
The effect of learning experiences and context on infant imitation and generalization, In: Infancy, vol.136, pp.596-619, 2008. ,
Infants' long-term memory for the sound patterns of words and voices, In: Journal of Experimental Psychology: Human Perception and Performance, vol.296, p.1143, 2003. ,
Learning words' sounds before learning how words sound: 9-month-olds use distinct objects as cues to categorize speech information, Cognition, vol.1132, pp.234-243, 2009. ,
Speaker identity supports phonetic category learning., Journal of Experimental Psychology: Human Perception and Performance, vol.39, issue.3, p.623, 2013. ,
DOI : 10.1037/a0030402
(Non)words, (non)words, (non)words: evidence for a protolexicon during the first year of life, Developmental Science, vol.50, issue.1, pp.24-34, 2013. ,
DOI : 10.1111/j.1467-7687.2012.01189.x
Clauses are perceptual units for young infants, Cognition, vol.26, issue.3 ,
DOI : 10.1016/S0010-0277(87)80002-1
Perception of acoustic correlates of major phrasal units by young infants, Cognitive psychology, vol.242, pp.252-293, 1992. ,
Infants' sensitivity to word boundaries in fluent speech, Journal of Child Language, vol.54, issue.01, pp.1-30, 1996. ,
DOI : 10.1016/0885-2308(87)90004-0
Word-level information influences phonetic learning in adults and infants, Cognition, vol.1273, pp.427-438, 2013. ,
At 6-9 months, human infants know the meanings of many common nouns, Proceedings of the National Academy of Sciences, pp.3253-3258, 2012. ,
DOI : 10.1073/pnas.1113380109
Early Word Comprehension in Infants: Replication and Extension " . In: Language Learning and Development ahead-of-print (2014), pp.1-12 ,
Referential labeling can facilitate phonetic learning in infancy, pp.1036-1049, 2014. ,
Object labeling influences infant phonetic learning and generalization, Cognition 132, pp.151-163, 2014. ,
Four-month-old infants prefer to listen to motherese, Infant Behavior and Development, vol.8, issue.2, pp.181-195, 1985. ,
DOI : 10.1016/S0163-6383(85)80005-9
The capacity for joint visual attention in the infant, Nature, vol.27, issue.5489, 1975. ,
DOI : 10.1038/253265a0
Pointing is the royal road to language for babies In: Pointing: Where language, culture, and cognition meet, pp.9-33, 2003. ,
Joint visual attention in infancy, Theories of infant development, pp.317-354, 2004. ,
DOI : 10.1002/9780470996348.ch8
Foreign-language experience in infancy: Effects of short-term exposure and social interaction on phonetic learning, Proceedings of the National Academy of Sciences, pp.9096-9101, 2003. ,
Intonational differences between the reduplicative babbling of French-and English-learning infants, Journal of Child Language, vol.18, pp.3-501, 1991. ,
Discernible differences in the babbling of infants according to target language, Journal of child language, vol.11, issue.01, pp.1-15, 1984. ,
Lip movements affect infants' audiovisual speech perception, Psychological Science, vol.245, pp.603-612, 2013. ,
Infant vocalizations in response to speech: Vocal imitation and developmental change, In: The journal of the Acoustical Society of America, vol.1004, pp.2425-2438, 1996. ,
Social feedback to infants' babbling facilitates rapid phonological learning, In: Psychological Science, vol.195, pp.515-523, 2008. ,
Perception and acquisition of linguistic rhythm by infants, Speech Communication, vol.41, issue.1, pp.233-243, 2003. ,
DOI : 10.1016/S0167-6393(02)00106-1
URL : https://hal.archives-ouvertes.fr/hal-00242908
Towards Unsupervised Training of Speaker Independent Acoustic Models, Proc. INTERSPEECH. 2011 (cit. on pp. 66, pp.69-71 ,
Weak top-down constraints for unsupervised acoustic model training, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing, pp.69-71 ,
DOI : 10.1109/ICASSP.2013.6639241
Intrinsic Spectral Analysis for Zero and High Resource Speech Recognition, Proc. INTERSPEECH. 2012 (cit, pp.67-71 ,
The effective second formant F2' and the vocal tract front-cavity, International Conference on Acoustics, Speech, and Signal Processing, 1989. ,
DOI : 10.1109/ICASSP.1989.266468
A nonparametric Bayesian approach to acoustic model discovery, Proc. ACL. 2012 (cit. on pp. 67, p.71 ,
Deep convolutional acoustic word embeddings using word-pair side information, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), p.68, 2015. ,
DOI : 10.1109/ICASSP.2016.7472619
URL : http://arxiv.org/abs/1510.01032
New types of deep neural network learning for speech recognition and related applications: an overview, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing, p.68 ,
DOI : 10.1109/ICASSP.2013.6639344
Combining time- and frequency-domain convolution in convolutional neural network-based phone recognition, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), p.68 ,
DOI : 10.1109/ICASSP.2014.6853584
Time delay deep neural networkbased universal background models for speaker recognition, Proc. ASRU. 2015 (cit, p.68 ,
DOI : 10.1109/asru.2015.7404779
Significance of analytic phase of speech signals in speaker verification, Speech Communication, vol.81, issue.151, pp.54-71, 2016. ,
DOI : 10.1016/j.specom.2016.02.005
Evaluating speech features with the minimal-pair ABX task (II): resistance to noise, Proc. INTERSPEECH. 2014 (cit, pp.68-71 ,
Phonetics embedding learning with side information, 2014 IEEE Spoken Language Technology Workshop (SLT), pp.69-71 ,
DOI : 10.1109/SLT.2014.7078558
The Zero Resource Speech Challenge 2015: Proposed Approaches and Results, Procedia Computer Science, vol.81, issue.151, pp.67-72, 2016. ,
DOI : 10.1016/j.procs.2016.04.031
The Buckeye corpus of conversational speech: Labeling conventions and a test of transcriber reliability, Speech Communication, vol.451, issue.130, pp.89-95, 2005. ,
A smartphone-based ASR data collection tool for under-resourced languages, Speech communication, vol.56, pp.119-131, 2014. ,
A Hybrid Dynamic Time Warping-Deep Neural Network Architecture for Unsupervised Acoustic Modeling, Proc. INTERSPEECH. 2015 (cit, pp.70-71 ,
A comparison of neural network methods for unsupervised representation learning on the Zero Resource Speech Challenge, Proc. INTERSPEECH. 2015 (cit, pp.70-71 ,
A deep scattering spectrum ??? Deep Siamese network pipeline for unsupervised acoustic modeling, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) ,
DOI : 10.1109/ICASSP.2016.7472622
Deep Scattering Spectrum, IEEE Transactions on Signal Processing, vol.62, issue.16, pp.4114-4128, 2014. ,
DOI : 10.1109/TSP.2014.2326991
Discovering Discrete Subword Units with Binarized Autoencoders and Hidden-Markov-Model Encoders, Proc. IN- TERSPEECH. 2015 (cit, pp.70-71 ,
Parallel Inference of Dirichlet Process Gaussian Mixture Models for Unsupervised Acoustic Modeling: A Feasibility Study, Proc. INTERSPEECH. 2015 (cit, pp.70-71 ,
Unsupervised Linear Discriminant Analysis for Supporting DPGMM Clustering in the Zero Resource Scenario, Procedia Computer Science, vol.81, issue.151, pp.73-79, 2016. ,
DOI : 10.1016/j.procs.2016.04.032
An iterative deep learning framework for unsupervised discovery of speech features and linguistic units with applications on spoken term detection, 2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU), p.71 ,
DOI : 10.1109/ASRU.2015.7404801
Detection Theory: A User's Guide Lawrence Erlbaum Associates, pp.73-76, 2005. ,
Automated measurement of vowel formants in the buckeye corpus, 2010. ,
Automating phonetic measurement: The case of voice onset time, Proc. Meetings on Acoustics. 2013 (cit, p.78 ,
Perceptual linear predictive (PLP) analysis of speech, The Journal of the Acoustical Society of America, vol.87, issue.4, pp.1738-1752, 1990. ,
DOI : 10.1121/1.399423
Gammatone Features and Feature Combination for Large Vocabulary Speech Recognition, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '07, 2007. ,
DOI : 10.1109/ICASSP.2007.366996
Deep Neural Networks for Acoustic Modeling in Speech Recognition: The Shared Views of Four Research Groups, IEEE Signal Processing Magazine, vol.29, issue.6, pp.82-97, 2012. ,
DOI : 10.1109/MSP.2012.2205597
Mothers Speak Less Clearly to Infants Than to Adults A Comprehensive Test of the Hyperarticulation Hypothesis, In: Psychological science, vol.263, issue.120, pp.341-347, 2015. ,
Input to Language: The Phonetics and Perception of Infant-Directed Speech, Language and Linguistics Compass, vol.7, issue.4, pp.157-170, 2013. ,
DOI : 10.1111/lnc3.12015
Phonological theory informs the analysis of intonational exaggeration in Japanese infant-directed speech, The Journal of the Acoustical Society of America, vol.134, issue.2, pp.1283-1294, 2013. ,
DOI : 10.1121/1.4812755
Input for learning Japanese: RIKEN Japanese Mother-Infant Conversation Corpus, pp.11-15, 2006. ,
Infant directed speech and the development of speech perception: Enhancing development or an unintended consequence?, Cognition, vol.129, issue.2, pp.362-378, 2013. ,
DOI : 10.1016/j.cognition.2013.07.015
Mommy is only happy! Dutch mothers??? realisation of speech sounds in infant-directed speech expresses emotion, not didactic intent, Infant Behavior and Development, vol.36, issue.4, pp.847-862, 2013. ,
DOI : 10.1016/j.infbeh.2013.09.001
Look who's talking: speech style and social context in language input to infants are linked to concurrent and future speech development, Developmental Science, vol.316, issue.6, pp.880-891, 2014. ,
DOI : 10.1111/desc.12172
Talking to children matters early language experience strengthens processing and builds vocabulary, Psychological Science, vol.2411, pp.2143-2152, 2013. ,
A course in phonetics ,
Corpus of Spontaneous Japanese: Its design and evaluation, Proc ,
Evidence for hierarchical categorization of coarticulated phonemes., Journal of Experimental Psychology: Human Perception and Performance, vol.27, issue.5, p.1145, 2001. ,
DOI : 10.1037/0096-1523.27.5.1145
Revising Perceptual Linear Prediction (PLP), Proc. INTERSPEECH. 2005 (cit. on pp. 91, p.98 ,
Speech perception in infants, Science, vol.171, pp.303-306, 1971. ,
Auditory and linguistic processing of cues for place of articulation by infants, In: Perception & Psychophysics, vol.163, pp.513-521, 1974. ,
Auditory and phonetic coding of the cues for speech: Discrimination of the [rl] distinction by young infants, Perception & Psychophysics, vol.185, issue.151, pp.341-347, 1975. ,
Language perception of 2-mo-old infants shows effects of both innate mechanisms and experience, In: Nature, 1976. ,
The discrimination of foreign speech contrasts by infants and adults, pp.466-472, 1976. ,
Discrimination of voice onset time by human infants: New findings and implications for the effects of early experience, p.1135, 1981. ,
Discrimination of auditory target dimensions in the presence or absence of variation in a second dimension by infants, Perception & Psychophysics, vol.313, pp.279-292, 1982. ,
Cross-language speech perception: Evidence for perceptual reorganization during the first year of life, pp.49-63, 1984. ,
Discrimination in neonates of very short CVs, The Journal of the Acoustical Society of America, vol.82, issue.1, pp.31-37, 1987. ,
DOI : 10.1121/1.395570
An investigation of young infants' perceptual representations of speech sounds., Journal of Experimental Psychology: General, vol.117, issue.1, p.21, 1988. ,
DOI : 10.1037/0096-3445.117.1.21
Developmental changes in perception of nonnative vowel contrasts., Journal of Experimental Psychology: Human Perception and Performance, vol.20, issue.2, p.421, 1994. ,
DOI : 10.1037/0096-1523.20.2.421
Discrimination of English/rl/and/wy/by Japanese infants at 6-12 months: language-specific developmental changes in speech perception abilities, Proc. ICSLP, 1994. ,
Speech-sound discrimination in neonates as measured with MEG, NeuroReport, vol.15, issue.13, pp.2089-2092, 2004. ,
DOI : 10.1097/00001756-200409150-00018
Theoretical contributions of tests on animals to the special-mechanisms debate in speech, In: Experimental biology, vol.453, pp.233-265, 1985. ,
Speech perception, Annu. Rev. Psychol, vol.55, pp.149-179, 2004. ,
The role of speech rhythm in language discrimination: further tests with a non-human primate, Developmental Science, vol.37, issue.2, pp.26-35, 2005. ,
DOI : 10.1037//0735-7036.115.3.258
URL : https://hal.archives-ouvertes.fr/hal-00260024
Speech perception within an auditory cognitive science framework, In: Current directions in psychological science, vol.171, pp.42-46, 2008. ,
Speech perception by the chinchilla: Voicedvoiceless distinction in alveolar plosive consonants, In: Science, vol.1904209, pp.69-72, 1975. ,
Speech perception by the chinchilla: Identification functions for synthetic VOT stimuli, In: The Journal of the Acoustical Society of America, vol.633, pp.905-917, 1978. ,
Discrimination of speech by nonhuman animals: Basic auditory sensitivities conducive to the perception of speech-sound categories, In: The Journal of the Acoustical Society of America, vol.702, pp.340-349, 1981. ,
Enhanced discriminability at the phonetic boundaries for the voicing feature in macaques, Perception & Psychophysics, vol.326, pp.542-550, 1982. ,
Enhanced discriminability at the phonetic boundaries for the place feature in macaques, In: The Journal of the Acoustical Society of America, vol.733, pp.1003-1010, 1983. ,
The evolution of speech: a comparative review, Trends in cognitive sciences 4, pp.258-267, 2000. ,
DOI : 10.1016/S1364-6613(00)01494-7
The evolution of human speech, Current Anthropology, vol.481, pp.39-66, 2007. ,
Comparative vertebrate neuroanatomy: evolution and adaptation, 2005. ,
Auditory neuroscience: Making sense of sound, 2011. ,
Efficient auditory coding, Nature, vol.4397079, pp.978-982, 2006. ,
The infant's auditory world: Hearing, speech, and the beginnings of language, Handbook of child psychology, 2006. ,
PLP and RASTA (and MFCC, and inversion) in Matlab ,
Spectro-temporal features for automatic speech recognition using linear prediction in spectral domain ,
Frequency-domain linear prediction for temporal features, 2003 IEEE Workshop on Automatic Speech Recognition and Understanding (IEEE Cat. No.03EX721), p.97 ,
DOI : 10.1109/ASRU.2003.1318451
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.112.2563
RASTA processing of speech, IEEE Transactions on Speech and Audio Processing, vol.2, issue.107, pp.578-589, 1994. ,
Revised estimate of minimum audible pressure: Where is the "missing 6 dB, The Journal of the Acoustical Society of America, vol.635, issue.101, pp.1501-1508, 1978. ,
Equal-loudness-level contours for pure tones, The Journal of the Acoustical Society of America, vol.116, issue.2, pp.918-933, 2004. ,
DOI : 10.1121/1.1763601
A four-parameter model of glottal flow, In: STL-QPSR, vol.4, pp.1-13, 1985. ,
Measures of the Glottal Source Spectrum, Journal of Speech Language and Hearing Research, vol.50, issue.3, pp.595-610, 2007. ,
DOI : 10.1044/1092-4388(2007/042)
A revised model of loudness perception applied to cochlear hearing loss, Hearing research, vol.1881, pp.70-88, 2004. ,
An international comparison of long???term average speech spectra, The Journal of the Acoustical Society of America, vol.96, issue.4, pp.2108-2120, 1994. ,
DOI : 10.1121/1.410152
Acoustic theory of speech production: with calculations based on X-ray studies of Russian articulations, 1971. ,
DOI : 10.1515/9783110873429
Speech production models and their digital implementations, The Digital Signal Processing Handbook, 1997. ,
On the balance of envelope and temporal fine structure in the encoding of speech in the early auditory system, The Journal of the Acoustical Society of America, vol.133, issue.5, pp.2818-2833, 2013. ,
DOI : 10.1121/1.4795783
Basilar membrane nonlinearity determines auditory nerve rate-intensity functions and cochlear dynamic range, Hearing research, vol.453, issue.169, pp.203-219, 1990. ,
Signals, systems, and transforms, 1995. ,
On cochlear encoding: Potentialities and limitations of the reverse-correlation technique, The Journal of the Acoustical Society of America, vol.63, issue.1, pp.115-135, 1978. ,
DOI : 10.1121/1.381704
An introduction to the physiology of hearing, 2008. ,
An introduction to the psychology of hearing), 2004. ,
An efficient auditory filterbank based on the gammatone function, Proc. Meeting of the IOC Speech Group on Auditory Modelling at RSRE, 1987. ,
Analytical expressions for critical???band rate and critical bandwidth as a function of frequency, The Journal of the Acoustical Society of America, vol.68, issue.5, pp.1523-1525, 1980. ,
DOI : 10.1121/1.385079
A Scale for the Measurement of the Psychological Magnitude Pitch, The Journal of the Acoustical Society of America, vol.8, issue.3, pp.185-190, 1937. ,
DOI : 10.1121/1.1915893
A cochlear frequency-position function for several species -29 years later, The Journal of the Acoustical Society of America, vol.876, pp.2592-2605, 1990. ,
A revision of Zwicker's loudness model, Acta Acustica united with Acustica 82, pp.335-345, 1996. ,
Sound texture perception via statistics of the auditory periphery: Evidence from sound synthesis, In: Neuron, vol.715, pp.926-940, 2011. ,
Neural Processing of Amplitude-Modulated Sounds, Physiological reviews 84, pp.541-577, 2004. ,
DOI : 10.1152/physrev.00029.2003
Speech recognition with primarily temporal cues, In: Science, vol.2705234, pp.303-304, 1995. ,
Chimaeric sounds reveal dichotomies in auditory perception, Nature, vol.4166876, pp.87-90, 2002. ,
Coherent Envelope Detection for Modulation Filtering of Speech, Proc. ICASSP. 2005 (cit ,
Demodulation as probabilistic inference, IEEE Transactions on Audio, Speech, and Language Processing, vol.198, issue.121, pp.2398-2411, 2011. ,
Solving Demodulation as an Optimization Problem, IEEE Transactions on Audio, Speech, and Language Processing, vol.18, issue.8, pp.2051-2066, 2010. ,
DOI : 10.1109/TASL.2010.2041108
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.407.2965
Multiresolution spectrotemporal analysis of complex sounds, The Journal of the Acoustical Society of America, vol.118, issue.2, pp.887-906, 2005. ,
DOI : 10.1121/1.1945807
A phenomenological model of the synapse between the inner hair cell and auditory nerve: long-term adaptation with power-law dynamics, The Journal of the Acoustical Society of America, vol.1265, pp.2390-2412, 2009. ,
Temporal integration and context effects in hearing, Journal of Phonetics, vol.313, pp.563-574, 2003. ,
???Negative Afterimage??? in Hearing, The Journal of the Acoustical Society of America, vol.36, issue.12, pp.2413-2415, 1964. ,
DOI : 10.1121/1.1919373
On the approximation of the discrete Karhunen-Loeve transform for stationary processes, Signal Processing, vol.7, issue.3, pp.231-249, 1984. ,
DOI : 10.1016/0165-1684(84)90002-1
A fast Karhunen-Loeve transform for a class of random processes, p.42860, 1976. ,
A sinusoidal family of unitary transforms, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.4, pp.356-365, 1979. ,
Motor control of serial ordering of speech, In: Psychological review, vol.773, p.182, 1970. ,
Model-based approach to envelope and positive instantaneous frequency estimation of signals with speech applications, The Journal of the Acoustical Society of America, vol.105, issue.3, pp.1912-1924, 1999. ,
DOI : 10.1121/1.426727
Power-normalized cepstral coefficients (PNCC) for robust speech recognition, Proc. ICASSP. 2012 ,
DOI : 10.1109/icassp.2012.6288820
Intrinsic Spectral Analysis, IEEE Transactions on Signal Processing, vol.61, issue.7, pp.1698-1710, 2013. ,
DOI : 10.1109/TSP.2013.2238931
Spectral processing by the peripheral auditory system: facts and models, In: International review of neurobiology, vol.70, pp.7-48, 2004. ,
A computational model of human auditory signal processing and perception, In: The Journal of the Acoustical Society of America, vol.1241, pp.422-438, 2008. ,
Speech perception and linguistic experience: Issues in cross-language research, p.123, 1995. ,
Native listening: Language experience and the recognition of spoken words, p.2012 ,
Auditory perception by normal Japanese adults of the sounds ???L??? and ???R???, Neuropsychologia, vol.9, issue.3, pp.317-323, 1971. ,
DOI : 10.1016/0028-3932(71)90027-3
An effect of linguistic experience: The discrimination of [r] and [l] by native speakers of Japanese and English, Perception & Psychophysics, vol.18, issue.5, pp.331-340, 1975. ,
DOI : 10.3758/BF03211209
The emergence of native-language phonological influences in infants: A perceptual assimilation model " . In: The development of speech perception: The transition from speech sounds to spoken words, pp.224-146, 1994. ,
A Direct Realist View of Cross-Language Speech Perception In: Speech Perception and Linguistic Experience: Issues in Cross-Language Research, pp.124-146, 1995. ,
Examination of perceptual reorganization for nonnative speech contrasts: Zulu click discrimination by English-speaking adults and infants, In: Journal of Experimental Psychology: Human perception and performance, vol.143, issue.145, pp.345-146, 1988. ,
Second language speech learning: Theory, findings, and problems " . In: Speech perception and linguistic experience: Issues in cross-language research, pp.233-277, 1995. ,
Age of learning and second language speech " . In: Second language acquisition and the critical period hypothesis, pp.101-131, 1999. ,
Chapter 4: Linguistic Experience and the " Perceptual Magnet Effect In: Speech perception and linguistic experience: Issues in cross-language research, pp.121-154, 1995. ,
Phonetic learning as a pathway to language: new data and native language magnet theory expanded (NLM-e), In: Philosophical Transactions of the Royal Society B: Biological Sciences, vol.363, issue.126, pp.1493-979, 2008. ,
The influence of categories on perception: explaining the perceptual magnet effect as optimal statistical inference, In: Psychological review, vol.1164, p.752, 2009. ,
Neural coding of categories: information efficiency and optimal population codes, Journal of Computational Neuroscience, vol.256, issue.1, pp.169-187, 2008. ,
DOI : 10.1007/s10827-007-0071-5
Support-vector networks, Machine Learning, vol.1, issue.3, pp.273-297, 1995. ,
DOI : 10.1007/BF00994018
Crosslanguage perceptual similarity predicts categorial discrimination of American vowels by naive Japanese listeners, The Journal of the Acoustical Society of America, vol.1304, issue.135, pp.226-231, 2011. ,
Acoustic and perceptual similarity of North German and American English vowels, The Journal of the Acoustical Society of America, vol.115, issue.4, pp.1791-1807, 2004. ,
DOI : 10.1121/1.1687832
Towards a quantitative model of Mandarin Chinese perception of English consonants, Proc. NewSounds 2010, p.146, 2010. ,
Effects of consonant context on the perception of French vowels, Journal of Phonetics, vol.122, issue.134, pp.91-114, 1984. ,
Discriminability along the voicing continuum: Cross-language tests, Proc. International Congress of Phonetic Sciences, 1970. ,
Cross-language perception of non-native tonal contrasts: Effects of native phonological and phonetic influences, Language and speech 53, pp.273-293, 2010. ,
Globalphone: a multilingual speech and text database developed at karlsruhe university, Proc. INTERSPEECH, 2002. ,
Vietnamese large vocabulary continuous speech recognition, 2009 IEEE Workshop on Automatic Speech Recognition & Understanding, 2009. ,
DOI : 10.1109/ASRU.2009.5373424
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.187.1443
The Kaldi speech recognition toolkit, Proc. Workshop on Automatic Speech Recognition and Understanding, 2011. ,
A pitch extraction algorithm tuned for automatic speech recognition, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) ,
DOI : 10.1109/ICASSP.2014.6854049
Acoustic Analysis of Voice-Onset Time in Taiwan Mandarin and ,
Production and perception of voice onset time cues in spoken Japanese and Taiwan Mandarin., The Journal of the Acoustical Society of America, vol.129, issue.4, pp.2419-2419, 2011. ,
DOI : 10.1121/1.3587893
Tone perception in Cantonese and Mandarin: A cross-linguistic comparison, Journal of Psycholinguistic Research, vol.19, issue.5, pp.527-542, 1996. ,
DOI : 10.1007/BF01758181
When more is less: Non-native perception of level tone contrasts, Proc. Psycholinguistic Representation of Tone Conference, 2011. ,
Perception of a Japanese vowel length contrast by Japanese and American English listeners: Behavioral and electrophysiological measures, Brain Research, vol.1360, issue.142, pp.1360-89, 2010. ,
DOI : 10.1016/j.brainres.2010.08.092
Perception of Japanese Temporally-cued Contrasts by American English Listeners, Language and Speech, vol.54, issue.2, pp.241-264, 2011. ,
DOI : 10.1177/0023830910397499
Perception and production of English vowels by Mandarin speakers: Age-related differences vary with amount of L2 exposure, The Journal of the Acoustical Society of America, vol.119, issue.2, pp.1118-1130, 2006. ,
DOI : 10.1121/1.2151806
Duration modeling techniques for continuous speech recognition, Proc. INTERSPEECH, 2004. ,
Weakly supervised multi-embeddings learning of acoustic models, 2014. ,
Analysis of features from analytic representation of speech using MP-ABX measures, Proc. INTERSPEECH. 2015 ,
A Temporal Coherence Loss Function for Learning Unsupervised Acoustic Embeddings, Procedia Computer Science, vol.81, pp.95-100, 2016. ,
DOI : 10.1016/j.procs.2016.04.035
Modeling language discrimination in infants using i-vector representations, Proc. CogSci. 2016 (submitted) (cit, p.151 ,
Theory of U-statistics, p.153, 2013. ,
On the Completeness of Order Statistics, The Annals of Mathematical Statistics, pp.794-797, 1960. ,
DOI : 10.1214/aoms/1177705808
Parametric statistical theory, 1994. ,
DOI : 10.1515/9783110889765
Probability inequalities for sums of bounded random variables, Journal of the American statistical association, vol.58301, pp.13-30, 1963. ,
On the bootstrap of U and V statistics, The Annals of Statistics, pp.655-674, 1992. ,
Asymptotic statistics, 2000. ,
Time-frequency analysis, 1995. ,
Theory and implementation of the discrete Hilbert transform, Proc. Symposium on Computer Processing in Communications, Polytechnic Institute of Brooklyn, 1969. ,
Cochlear mechanics: introduction to a time domain analysis of the nonlinear cochlea, pp.2012-167 ,
DOI : 10.1007/978-1-4419-6117-4
A Resonance Approach to Cochlear Mechanics, PLoS ONE, vol.39, issue.11, p.47918, 2012. ,
DOI : 10.1371/journal.pone.0047918.g010
URL : http://doi.org/10.1371/journal.pone.0047918
Fast Waves at the Base of the Cochlea, PLOS ONE, vol.12, issue.9, p.129556, 2015. ,
DOI : 10.1371/journal.pone.0129556.g006