Journal articles

Can we Generate Emotional Pronunciations for Expressive Speech Synthesis?

Abstract: In the field of expressive speech synthesis, much work has been conducted on suprasegmental prosodic features, while little has been done on pronunciation variants. However, prosody is highly related to the sequence of phonemes to be expressed. This article raises two issues in the generation of emotional pronunciations for TTS systems. The first is the design of an automatic method for generating pronunciations from text, while the second concerns the very existence of emotional pronunciations, examined through experiments conducted on emotional speech. To this end, the article presents an innovative pronunciation adaptation method that automatically adapts canonical phonemes first to those labeled in the corpus used to create a synthetic voice, then to those labeled in an expressive corpus. The method trains conditional random field pronunciation models on prosodic, linguistic, phonological and articulatory features. The analysis of emotional pronunciations reveals strong dependencies between prosody and phoneme assimilations or elisions. According to perception tests, the double adaptation makes it possible to synthesize expressive speech samples of good quality, but emotion-specific pronunciations are too subtle to be perceived by testers.

Contributor: Marie Tahon
Submitted on: Monday, September 10, 2018 - 10:23:48 AM
Last modification on: Saturday, June 25, 2022 - 9:16:28 AM
Long-term archiving on: Tuesday, December 11, 2018 - 1:34:34 PM





Marie Tahon, Gwénolé Lecorvé, Damien Lolive. Can we Generate Emotional Pronunciations for Expressive Speech Synthesis?. IEEE Transactions on Affective Computing, Institute of Electrical and Electronics Engineers, 2020, 11 (4), pp.684-695. ⟨10.1109/TAFFC.2018.2828429⟩. ⟨hal-01802463⟩


