Evaluation of the Impact of Corpus Phonetic Alignment on the HMM-Based Speech Synthesis Quality - Archive ouverte HAL Accéder directement au contenu
Communication Dans Un Congrès Année : 2015

Evaluation of the Impact of Corpus Phonetic Alignment on the HMM-Based Speech Synthesis Quality

Résumé

This study investigates the impact of phonetization and phonetic segmentation of training corpora on the quality of HMM-based TTS synthesis. HMM-TTS requires phonetic symbols aligned to the speech corpus in order to train the models used for synthesis. Phonetic annotation is a complex task, since pronunciation usually differs from spelling, as well as differing among regional accents. In this paper, the infrastructure of a French TTS system is presented. A corpus whose phonetic label occurrences were systematically modified (number of schwas and liaisons) and label boundaries were displaced, was used to train several systems, one for each condition. A perceptual evaluation of the influence of labeling accuracy on synthetic speech quality was conducted. Despite the degree of annotation changes, the synthetic speech quality of the five best systems remained close to that of the reference system, built upon the corpus whose labels were manually corrected.
Fichier non déposé

Dates et versions

hal-01621844 , version 1 (23-10-2017)

Identifiants

  • HAL Id : hal-01621844 , version 1

Citer

Marc Evrard, Albert Rilliard, Christophe d'Alessandro. Evaluation of the Impact of Corpus Phonetic Alignment on the HMM-Based Speech Synthesis Quality. International Conference on Statistical Language and Speech Processing (SLSP 2015), 2015, Budapest, Hungary. pp.62-72. ⟨hal-01621844⟩
85 Consultations
0 Téléchargements

Partager

Gmail Facebook X LinkedIn More