Evaluation of the Impact of Corpus Phonetic Alignment on the HMM-Based Speech Synthesis Quality

Marc Evrard; Albert Rilliard; Christophe d'Alessandro

Communication Dans Un Congrès Année : 2015

Evaluation of the Impact of Corpus Phonetic Alignment on the HMM-Based Speech Synthesis Quality

(1) , (1) , (1)

Marc Evrard

Fonction : Auteur
PersonId : 1220707
IdHAL : marcevrard

Laboratoire d'Informatique pour la Mécanique et les Sciences de l'Ingénieur

Albert Rilliard

Fonction : Auteur
PersonId : 6796
IdHAL : albert-rilliard
ORCID : 0000-0001-6490-2386
IdRef : 184193273

Laboratoire d'Informatique pour la Mécanique et les Sciences de l'Ingénieur

Christophe d'Alessandro

Fonction : Auteur
PersonId : 16760
IdHAL : christophe-dalessandro
ORCID : 0000-0002-2629-8752
IdRef : 05971638X

Laboratoire d'Informatique pour la Mécanique et les Sciences de l'Ingénieur

Résumé

This study investigates the impact of phonetization and phonetic segmentation of training corpora on the quality of HMM-based TTS synthesis. HMM-TTS requires phonetic symbols aligned to the speech corpus in order to train the models used for synthesis. Phonetic annotation is a complex task, since pronunciation usually differs from spelling, as well as differing among regional accents. In this paper, the infrastructure of a French TTS system is presented. A corpus whose phonetic label occurrences were systematically modified (number of schwas and liaisons) and label boundaries were displaced, was used to train several systems, one for each condition. A perceptual evaluation of the influence of labeling accuracy on synthetic speech quality was conducted. Despite the degree of annotation changes, the synthetic speech quality of the five best systems remained close to that of the reference system, built upon the corpus whose labels were manually corrected.

Mots clés

HTS HMM-based speech synthesis TTS Subjective evaluation MOS Phonetic labeling Phonetic alignment French speech synthesis

Domaines

Linguistique

Albert Rilliard : Connectez-vous pour contacter le contributeur

https://hal.science/hal-01621844

Soumis le : lundi 23 octobre 2017-19:45:44

Dernière modification le : mercredi 7 février 2024-03:35:15

Dates et versions

hal-01621844 , version 1 (23-10-2017)

Identifiants

HAL Id : hal-01621844 , version 1

Citer

Marc Evrard, Albert Rilliard, Christophe d'Alessandro. Evaluation of the Impact of Corpus Phonetic Alignment on the HMM-Based Speech Synthesis Quality. International Conference on Statistical Language and Speech Processing (SLSP 2015), 2015, Budapest, Hungary. pp.62-72. ⟨hal-01621844⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

CNRS LIMSI UNIV-PARIS-SACLAY SORBONNE-UNIVERSITE LISN GS-COMPUTER-SCIENCE GS-SPORT-HUMAN-MOVEMENT

85 Consultations

0 Téléchargements

Evaluation of the Impact of Corpus Phonetic Alignment on the HMM-Based Speech Synthesis Quality

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager