Modeling the concurrent development of speech perception and production in a Bayesian framework: COSMO, a Bayesian computational model of speech communication: Assessing the role of sensory vs. motor knowledge in speech perception

Abstract : It is now widely accepted that there is a functional relationship between the speech perception and production systems in the human brain. However, the precise mechanisms and role of this relationship still remain debated. The question of invariance and robustness in categorization are set at the center of the debate: how is stable information extracted from the variable sensory input in order to achieve speech comprehension? In this context, auditory (resp. motor, perceptuo-motor) theories propose that speech is categorized thanks to auditory (resp. motor, perceptuo-motor) processes. However, experimental evidence is still scarce and does not allow to clearly distinguish between the current theories and determine whether invariance in speech perception is of an auditory or motor type. This is why we developed COSMO, a Bayesian model comparing sensory and motor processes in the form of probability distributions which enable both theoretical developments and quantitative simulations. A first significant result in COSMO is an indistinguishability theorem: it is only by simulations of adverse conditions or partial learning that the specificity of sensory vs. motor processing can emerge and provide a basis for evaluation of the specific role of each sub-system. We present the COSMO model, and how its sensory and motor sub-systems are learned, then we describe simulations exploring the way these sub-systems differ during speech categorization. We discuss the experimental results in the light of a “narrowband vs. wideband” interpretation: the sensory sub-system is more precisely tuned to the frequently learned sensory input and hence more efficient in recognizing these inputs, providing a “narrowband” system. Conversely, the motor sub-system is less accurate to recognize learned sensory inputs but it has better generalization properties, making it more robust to unexpected variability which would provide it with “wideband” characteristics.
Type de document :
Poster
EPIROB-ICDL, Aug 2015, Providence, United States. 2015, 〈http://www.icdl-epirob.org/〉
Liste complète des métadonnées

https://hal.archives-ouvertes.fr/hal-01202418
Contributeur : Marie-Lou Barnaud <>
Soumis le : mercredi 23 septembre 2015 - 15:32:59
Dernière modification le : vendredi 31 août 2018 - 09:13:02
Document(s) archivé(s) le : mardi 29 décembre 2015 - 08:53:44

Fichier

Poster.pdf
Fichiers produits par l'(les) auteur(s)

Identifiants

  • HAL Id : hal-01202418, version 1

Collections

Citation

Marie-Lou Barnaud, Julien Diard, Pierre Bessière, Jean-Luc Schwartz. Modeling the concurrent development of speech perception and production in a Bayesian framework: COSMO, a Bayesian computational model of speech communication: Assessing the role of sensory vs. motor knowledge in speech perception. EPIROB-ICDL, Aug 2015, Providence, United States. 2015, 〈http://www.icdl-epirob.org/〉. 〈hal-01202418〉

Partager

Métriques

Consultations de la notice

695

Téléchargements de fichiers

63