Articulatory Speech Synthesis from Static Context-Aware Articulatory Targets

Abstract : The aim of this work is to develop an algorithm for controlling the articulators (the jaw, the tongue, the lips, the velum, the larynx and the epiglottis) to produce given speech sounds, syllables and phrases. This control has to take into account coarticulation and be flexible enough to be able to vary strategies for speech production. The data for the algorithm are 97 static MRI images capturing the articulation of French vowels and blocked consonant-vowel syllables. The results of this synthesis are evaluated visually, acoustically and perceptually, and the problems encountered are broken down by their origin: the dataset, its modeling, the algorithm for managing the vocal tract shapes, their translation to the area functions, and the acoustic simulation.
Type de document :
Communication dans un congrès
ISSP 2017 - 11th International Seminar on Speech Production, Oct 2017, Tianjin, China
Liste complète des métadonnées

Littérature citée [18 références]  Voir  Masquer  Télécharger

https://hal.archives-ouvertes.fr/hal-01643487
Contributeur : Anastasiia Tsukanova <>
Soumis le : mardi 21 novembre 2017 - 14:32:45
Dernière modification le : mardi 18 décembre 2018 - 16:38:02

Fichier

ISSP2017Tsukanova.pdf
Fichiers produits par l'(les) auteur(s)

Identifiants

  • HAL Id : hal-01643487, version 1

Citation

Anastasiia Tsukanova, Benjamin Elie, Yves Laprie. Articulatory Speech Synthesis from Static Context-Aware Articulatory Targets. ISSP 2017 - 11th International Seminar on Speech Production, Oct 2017, Tianjin, China. 〈hal-01643487〉

Partager

Métriques

Consultations de la notice

299

Téléchargements de fichiers

204