Modeling the concurrent development of speech perception and production in a Bayesian framework

Marie-Lou Barnaud; Julien Diard; Pierre Bessière; Jean-Luc Schwartz

Communication Dans Un Congrès Année : 2015

Modeling the concurrent development of speech perception and production in a Bayesian framework

(1) , (2) , (3, 4) , (1)

1
2
3
4

Marie-Lou Barnaud

Fonction : Auteur correspondant
PersonId : 970444

Connectez-vous pour contacter l'auteur

GIPSA - Perception, Contrôle, Multimodalité et Dynamiques de la parole

Julien Diard

Fonction : Auteur
PersonId : 172176
IdHAL : julien-diard
ORCID : 0000-0003-0673-477X
IdRef : 072644672

Laboratoire de Psychologie et NeuroCognition

Pierre Bessière

Fonction : Auteur
PersonId : 3434
IdHAL : pierre-bessiere
ORCID : 0000-0002-8620-2505
IdRef : 072644702

Institut des Systèmes Intelligents et de Robotique

AMAC

Jean-Luc Schwartz

Fonction : Auteur
PersonId : 1160
IdHAL : jean-luc-schwartz
ORCID : 0000-0001-8969-9185
IdRef : 033230374

GIPSA - Perception, Contrôle, Multimodalité et Dynamiques de la parole

Résumé

It is now widely accepted that there is a functional relationship between the speech perception and production systems in the human brain. However, the precise mechanisms and role of this relationship still remain debated. The question of invariance and robustness in categorization are set at the center of the debate: how is stable information extracted from the variable sensory input in order to achieve speech comprehension? In this context, auditory (resp. motor, perceptuo-motor) theories propose that speech is categorized thanks to auditory (resp. motor, perceptuo-motor) processes. However, experimental evidence is still scarce and does not allow to clearly distinguish between the current theories and determine whether invariance in speech perception is of an auditory or motor type. This is why we developed COSMO, a Bayesian model comparing sensory and motor processes in the form of probability distributions which enable both theoretical developments and quantitative simulations. A first significant result in COSMO is an indistinguishability theorem: it is only by simulations of adverse conditions or partial learning that the specificity of sensory vs. motor processing can emerge and provide a basis for evaluation of the specific role of each sub-system. We present the COSMO model, and how its sensory and motor sub-systems are learned, then we describe simulations exploring the way these sub-systems differ during speech categorization. We discuss the experimental results in the light of a “narrowband vs. wideband” interpretation: the sensory sub-system is more precisely tuned to the frequently learned sensory input and hence more efficient in recognizing these inputs, providing a “narrowband” system. Conversely, the motor sub-system is less accurate to recognize learned sensory inputs but it has better generalization properties, making it more robust to unexpected variability which would provide it with “wideband” characteristics.

Mots clés

Social Motor Development of Perceptual Emotional Cognitive Statistical Learning and Communication Skills in Biological Systems and Robots General Principles of Development and Learning

Domaines

Informatique Sciences cognitives

Fichier principal

Poster.pdf (947.2 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Marie-Lou Barnaud : Connectez-vous pour contacter le contributeur

https://hal.science/hal-01202418

Soumis le : mercredi 23 septembre 2015-15:32:59

Dernière modification le : jeudi 4 avril 2024-21:08:28

Archivage à long terme le : mardi 29 décembre 2015-08:53:44

Dates et versions

hal-01202418 , version 1 (23-09-2015)

Identifiants

HAL Id : hal-01202418 , version 1

Citer

Marie-Lou Barnaud, Julien Diard, Pierre Bessière, Jean-Luc Schwartz. Modeling the concurrent development of speech perception and production in a Bayesian framework: COSMO, a Bayesian computational model of speech communication: Assessing the role of sensory vs. motor knowledge in speech perception. ICDL-EpiRob 2015 - 5th International Conference on Development and Learning and on Epigenetic Robotics, Aug 2015, Providence, United States. ⟨hal-01202418⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

UNIV-SAVOIE UPMC UGA CNRS GIPSA GIPSA-DPC GIPSA-PCMD ISIR LPNC SORBONNE-UNIVERSITE SU-SCIENCES ISIR_AMAC

894 Consultations

130 Téléchargements

Modeling the concurrent development of speech perception and production in a Bayesian framework

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager