Extraction of an articulatory model from a three-way analysis of cineradiographs of a speaker

Martine Cadot 1 Yves Laprie 1
1 MULTISPEECH - Speech Modeling for Facilitating Oral-Based Communication
Inria Nancy - Grand Est, LORIA - NLPKD - Department of Natural Language Processing & Knowledge Discovery
Abstract: Sequences of radiographs of a person talking are difficult to analyze for several reasons. The first is technical: the data are images annotated at several locations and times, either semi-automatically or manually. The second is representational: the movements of the articulators during speech (tongue, jaw, etc.) are hard to describe because of their many mechanical and dynamic interdependencies. When speaking, a speaker sets in motion a complex set of articulators: the jaw, which opens more or less; the tongue, which takes many shapes and positions; the lips, which let air escape more or less abruptly; and so on. The best-known articulatory model is that of Maeda (1990), derived from a Principal Component Analysis of arrays of coordinates of points on the articulators of a speaker talking. We propose a three-way analysis of the same type of data, after converting the tables into distances. We validate our model by predicting the spoken sounds; its predictions proved almost as good as those of the acoustic model, and even better when coarticulation is taken into account.
Document type:
Journal article
Revue des Nouvelles Technologies de l'Information, Hermann, 2016, Fouille de Données Complexes (RNTI-E-31), pp.73-92

https://hal.archives-ouvertes.fr/hal-01346987
Contributor: Martine Cadot
Submitted on: Wednesday, 20 July 2016 - 10:16:26
Last modified on: Tuesday, 18 December 2018 - 16:38:02

License

Copyright (All rights reserved)

Identifiers

  • HAL Id : hal-01346987, version 1

Citation

Martine Cadot, Yves Laprie. Extraction d’un modèle articulatoire à partir d’une analyse tri-directionnelle de cinéradiographies d’un locuteur. Revue des Nouvelles Technologies de l'Information, Hermann, 2016, Fouille de Données Complexes (RNTI-E-31), pp.73-92. 〈hal-01346987〉
