HMM-based TTS for Hanoi Vietnamese: issues in design and evaluation - Archive ouverte HAL Accéder directement au contenu
Communication Dans Un Congrès Année : 2013

HMM-based TTS for Hanoi Vietnamese: issues in design and evaluation

Résumé

This paper presents the development and evaluation of an HMM-based TTS system for the modern Hanoi dialect of Northern Vietnamese, a tonal language. A study of specific phonetic and prosodic features of Hanoi Vietnamese is discussed. Consequences on the design of an HMM-based TTS system are derived. Using this knowledge, a TTS system, called VTed, is then developed under the Mary TTS platform. The second part of the paper is devoted to perceptual evaluations of Vietnamese speech synthesis. Three kinds of evaluations are considered necessary for quality assessment of this tonal language. The general MOS assessment, utterance- level intelligibility, and tone-level intelligibility tests are conducted on the VTed system under a “natural speech reference” condition. The results show 1.21 points difference between natural and synthetic speech for the MOS test, a 0.2% – 0.9% difference for the utterance-level intelligibility test, 23% on average and – depending on the tone type – from 0% to 37% difference for the tone-level intelligibility test. These results demonstrate the need for more specific works on tonal/prosodic level to improve automatic synthesis of Vietnamese and other tonal languages.
Fichier non déposé

Dates et versions

hal-01621853 , version 1 (23-10-2017)

Identifiants

  • HAL Id : hal-01621853 , version 1

Citer

Thi Thu Trang Nguyen, Christophe d'Alessandro, Albert Rilliard, Do Dat Tran. HMM-based TTS for Hanoi Vietnamese: issues in design and evaluation. Annual Conference of the International Speech Communication Association (INTERSPEECH 2013), 2013, Lyon, France. pp.2311-2315. ⟨hal-01621853⟩
167 Consultations
0 Téléchargements

Partager

Gmail Facebook X LinkedIn More