Characterization of inter-speaker articulatory variability: A two-level multi-speaker modelling approach based on MRI data

Antoine Serrurier; Pierre Badin; Laurent Lamalle; Christiane Neuschaefer-Rube

doi:10.1121/1.5096631

Article Dans Une Revue Journal of the Acoustical Society of America Année : 2019

Characterization of inter-speaker articulatory variability: A two-level multi-speaker modelling approach based on MRI data

(1) , (2) , (3) , (1)

1
2
3

Antoine Serrurier

Fonction : Auteur

Rheinisch-Westfälische Technische Hochschule Aachen University

Pierre Badin

Fonction : Auteur
PersonId : 4918
IdHAL : pierrebadin
ORCID : 0000-0001-7440-820X
IdRef : 117976687

GIPSA - Cognitive Robotics, Interactive Systems, & Speech Processing

Laurent Lamalle

Fonction : Auteur
PersonId : 18741
IdHAL : laurent-lamalle
IdRef : 223755451

IRMaGe

Christiane Neuschaefer-Rube

Fonction : Auteur

Rheinisch-Westfälische Technische Hochschule Aachen University

Résumé

Speech communication relies on articulatory and acoustic codes shared between speakers and listeners despite inter-individual differences in morphology and idiosyncratic articulatory strategies. This study addresses the long-standing problem of characterizing and modelling speaker-independent articulatory strategies and inter-speaker articulatory variability. It explores a multi-speaker modelling approach based on two levels: statistically-based linear articulatory models, which capture the speakerspecific articulatory variability on the one hand, are in turn controlled by a speaker model, which captures the inter-speaker variability on the other hand. A low dimensionality speaker model is obtained by taking advantage of the inter-speaker correlations between morphology and strategy. To validate this approach, contours of the vocal tract articulators were manually segmented on midsagittal MRI data recorded from 11 French speakers uttering 62 vowels and consonants. Using these contours, multi-speaker models with 14 articulatory components and two morphology and strategy components led to overall variance explanations of 66%–69% and root-mean-square errors of 0.36–0.38 cm obtained in leave-one-out procedure over the speakers. Results suggest that inter-speaker variability is more related to the morphology than to the idiosyncratic strategies and illustrate the adaptation of the articulatory components to the morphology.

Mots clés

multi-speaker models MRI articulatory modelling vocal tract inter-speaker variability

Domaines

Sciences de l'information et de la communication

Fichier principal

Serrurier_Badin_Lamalle_Neuschaefer_ModelsOfModels_JASA_2019.pdf (5.84 Mo)

Origine : Fichiers éditeurs autorisés sur une archive ouverte

Pierre Badin : Connectez-vous pour contacter le contributeur

https://hal.science/hal-02106595

Soumis le : jeudi 26 novembre 2020-20:12:25

Dernière modification le : mercredi 17 avril 2024-08:12:03

Archivage à long terme le : samedi 27 février 2021-18:03:41

Dates et versions

hal-02106595 , version 1 (26-11-2020)

Identifiants

HAL Id : hal-02106595 , version 1
DOI : 10.1121/1.5096631

Citer

Antoine Serrurier, Pierre Badin, Laurent Lamalle, Christiane Neuschaefer-Rube. Characterization of inter-speaker articulatory variability: A two-level multi-speaker modelling approach based on MRI data. Journal of the Acoustical Society of America, 2019, 145 (4), pp.2149-2170. ⟨10.1121/1.5096631⟩. ⟨hal-02106595⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

INSERM UGA CNRS GIPSA GIPSA-DPC GIPSA-CRISSP ANR

179 Consultations

86 Téléchargements

Characterization of inter-speaker articulatory variability: A two-level multi-speaker modelling approach based on MRI data

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Altmetric

Partager