P. Hallé and A. Cristia, Global and detailed speech representations in early language acquisition, Speech production and perception: Planning and dynamics, pp.11-38, 2012.

O. Räsänen, G. Doyle, and M. C. Frank, Pre-linguistic segmentation of speech into syllable-like units, Cognition, vol.171, pp.130-150, 2018.

P. K. Kuhl, Early language acquisition: cracking the speech code, Nature Reviews Neuroscience, vol.5, issue.11, pp.831-843, 2004.

J. A. Tourville and F. H. Guenther, The DIVA model: A neural theory of speech acquisition and production, Language and Cognitive Processes, vol.26, issue.7, pp.952-981, 2011.

B. J. Kröger, J. Kannampuzha, and E. Kaufmann, Associative learning and self-organization as basic principles for simulating speech acquisition, speech production, and speech perception, EPJ Nonlinear Biomedical Physics, vol.2, issue.1, pp.1-28, 2014.

S. Peperkamp and E. Dupoux, Learning the mapping from surface to underlying representations in an artificial language, Laboratory Phonology, vol.9, pp.315-338, 2007.

C. Moulin-Frier, S. M. Nguyen, and P. Oudeyer, Self-organization of early vocal development in infants and machines: the role of intrinsic motivation, Frontiers in Psychology, vol.4, issue.1006, pp.1-20, 2013.
URL : https://hal.archives-ouvertes.fr/hal-00927940

N. H. Feldman, T. L. Griffiths, S. Goldwater, and J. L. Morgan, A role for the developing lexicon in phonetic category acquisition, Psychological Review, vol.120, issue.4, pp.751-778, 2013.

A. K. Philippsen, R. F. Reinhart, and B. Wrede, Learning how to speak: Imitation-based refinement of syllable production in an articulatory-acoustic model, The 4th Joint IEEE International Conference on Development and Learning and on Epigenetic Robotics, pp.195-200, 2014.

C. Moulin-Frier, J. Diard, J. Schwartz, and P. Bessière, COSMO ("Communicating about Objects using Sensory-Motor Operations"): A Bayesian modeling framework for studying speech communication and the emergence of phonological systems, Journal of Phonetics, vol.53, pp.5-41, 2015.
URL : https://hal.archives-ouvertes.fr/hal-01230175

C. Moulin-Frier, R. Laurent, P. Bessière, J. Schwartz, and J. Diard, Adverse conditions improve distinguishability of auditory, motor, and perceptuo-motor theories of speech perception: An exploratory Bayesian modelling study, Language and Cognitive Processes, vol.27, issue.7-8, pp.1240-1263, 2012.

R. Laurent, M. Barnaud, J. Schwartz, P. Bessière, and J. Diard, The complementary roles of auditory and motor information evaluated in a Bayesian perceptuo-motor model of speech perception, Psychological Review, vol.124, issue.5, pp.572-602, 2017.
URL : https://hal.archives-ouvertes.fr/hal-01484383

M. Barnaud, P. Bessière, J. Diard, and J. Schwartz, Reanalyzing neurocognitive data on the role of the motor system in speech perception within COSMO, a Bayesian perceptuo-motor model of speech communication, Brain and Language, 2017.
URL : https://hal.archives-ouvertes.fr/hal-01669961

M. Barnaud, J. Diard, P. Bessière, and J. Schwartz, Assessing Idiosyncrasies in a Bayesian Model of Speech Communication, Proceedings of Interspeech, pp.2080-2084, 2016.
URL : https://hal.archives-ouvertes.fr/hal-01371722

P. Bessière, E. Mazer, J. M. Ahuactzin, and K. Mekhnacha, Bayesian Programming, 2013.

E. Gilet, J. Diard, and P. Bessière, Bayesian action-perception computational model: Interaction of production and recognition of cursive letters, PLoS ONE, vol.6, issue.6, p.20387, 2011.
URL : https://hal.archives-ouvertes.fr/hal-00645868

S. Maeda, Compensatory articulation during speech: Evidence from the analysis and synthesis of vocal-tract shapes using an articulatory model, Speech Production and Speech Modelling, pp.131-149, 1990.

J. Schwartz, L. Boë, P. Badin, and T. R. Sawallis, Grounding stop place systems in the perceptuo-motor substance of speech: On the universality of the labial-coronal-velar stop series, Journal of Phonetics, vol.40, issue.1, pp.20-36, 2012.
URL : https://hal.archives-ouvertes.fr/hal-00640400

P. Messum and I. S. Howard, Creating the cognitive form of phonological units: The speech sound correspondence problem in infancy could be solved by mirrored vocal interactions rather than by imitation, Journal of Phonetics, vol.53, pp.125-140, 2015.

J. Maye, D. J. Weiss, and R. N. Aslin, Statistical phonetic learning in infants: Facilitation and feature generalization, Developmental Science, vol.11, issue.1, pp.122-134, 2008.

B. de Boer and P. K. Kuhl, Investigating the role of infant-directed speech with a computer model, Acoustics Research Letters Online, vol.4, issue.4, pp.129-134, 2003.

B. McMurray, R. N. Aslin, and J. C. Toscano, Statistical learning of phonetic categories: insights from a computational approach, Developmental Science, vol.12, issue.3, pp.369-378, 2009.

G. K. Vallabha, J. L. McClelland, F. Pons, J. F. Werker, and S. Amano, Unsupervised learning of vowel categories from infant-directed speech, Proceedings of the National Academy of Sciences, vol.104, issue.33, pp.13273-13278, 2007.

B. Varadarajan, S. Khudanpur, and E. Dupoux, Unsupervised learning of acoustic sub-word units, Proceedings of the 46th Annual Meeting of the Association for Computational Linguistics on Human Language Technologies. Association for Computational Linguistics, pp.165-168, 2008.
DOI : 10.3115/1557690.1557736
URL : http://dl.acm.org/ft_gateway.cfm?id=1557736&type=pdf

M. Barnaud, Modélisation bayésienne du développement conjoint de la perception, l'action et la phonologie, 2018.

E. Todorov, Optimality principles in sensorimotor control, Nature Neuroscience, vol.7, issue.9, p.907, 2004.
DOI : 10.1038/nn1309
URL : http://europepmc.org/articles/pmc1488877?pdf=render

P. F. MacNeilage, B. L. Davis, and C. L. Matyear, Babbling and first words: Phonetic similarities and differences, Speech Communication, vol.22, issue.2-3, pp.269-277, 1997.

M. R. Schroeder, B. S. Atal, and J. Hall, Objective measure of certain speech signal degradations based on masking properties of human auditory perception, Frontiers of speech commu, pp.217-229, 1979.