Synthesis and expressive transformation of singing voice

Luc Ardaillon 1
1 Analyse et synthèse sonores [Paris]
STMS - Sciences et Technologies de la Musique et du Son
Abstract : This thesis aimed at conducting research on the synthesis and expressive transformations of the singing voice, towards the development of a high-quality synthesizer that can generate a natural and expressive singing voice automatically from a given score and lyrics. Mainly 3 research directions can be identified: the methods for modelling the voice signal to automatically generate an intelligible and natural-sounding voice according to the given lyrics; the control of the synthesis to render an adequate interpretation of a given score while conveying some expressivity related to a specific singing style; the transformation of the voice signal to improve its naturalness and add expressivity by varying the timbre adequately according to the pitch, intensity and voice quality. This thesis provides some contributions in each of those 3 directions. First, a fully-functional synthesis system has been developed, based on diphones concatenations. The modular architecture of this system allows to integrate and compare different signal modeling approaches. Then, the question of the control is addressed, encompassing the automatic generation of the f0, intensity, and phonemes durations. The modeling of specific singing styles has also been addressed by learning the expressive variations of the modeled control parameters on commercial recordings of famous French singers. Finally, some investigations on expressive timbre transformations have been conducted, for a future integration into our synthesizer. This mainly concerns methods related to intensity transformation, considering the effects of both the glottal source and vocal tract, and the modeling of vocal roughness.
Complete list of metadatas

Cited literature [353 references]  Display  Hide  Download
Contributor : Abes Star <>
Submitted on : Monday, June 18, 2018 - 10:20:09 AM
Last modification on : Saturday, December 21, 2019 - 3:54:44 AM
Long-term archiving on: Wednesday, September 19, 2018 - 2:26:48 PM


Version validated by the jury (STAR)


  • HAL Id : tel-01710926, version 2


Luc Ardaillon. Synthesis and expressive transformation of singing voice. Signal and Image processing. Université Pierre et Marie Curie - Paris VI, 2017. English. ⟨NNT : 2017PA066511⟩. ⟨tel-01710926v2⟩



Record views


Files downloads