Statistical parametric speech synthesis, Speech Communication, pp.1039-1064, 2009. ,
DOI : 10.1016/j.specom.2009.04.004
URL : https://hal.archives-ouvertes.fr/hal-00746106
Wavenet: A generative model for raw audio, 2016. ,
Tacotron: Towards End-to-End Speech Synthesis, Interspeech 2017, 2017. ,
DOI : 10.21437/Interspeech.2017-1452
Deep voice: Real-time neural text-to-speech, 2017. ,
Deep voice 2: Multispeaker neural text-to-speech, 2017. ,
Lia phon : un système complet de phonétisation de textes, Traitement Automatique des Langues (TAL), pp.47-67, 2001. ,
Joint-sequence models for grapheme-to-phoneme conversion, Speech Communication, pp.434-451, 2008. ,
DOI : 10.1016/j.specom.2008.01.002
URL : https://hal.archives-ouvertes.fr/hal-00499203
Pronunciation of proper names with a joint n-gram model for bi-directional grapheme-to-phoneme conversion, Proceedings of InterSpeech, 2002. ,
Grapheme to phoneme conversion using an smt system, Proceedings of InterSpeech, 2009. ,
URL : https://hal.archives-ouvertes.fr/hal-01451534
Graphemeto-phoneme conversion using long short-term memory recurrent neural networks, IEEE International Conference on Acoustics, Speech and Signal Processing, pp.2015-4225 ,
DOI : 10.1109/icassp.2015.7178767
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.674.6326
Sequence-to-sequence neural net models for grapheme-to-phoneme conversion, Proceedings of InterSpeech, 2015. ,
Speech synthesis in various communicative situations: Impact of pronunciation variations, Proceedings of InterSpeech, 2014. ,
Testing the consistency assumption: Pronunciation variant forced alignment in read and spontaneous speech synthesis, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp.2016-51555159 ,
DOI : 10.1109/ICASSP.2016.7472660
Neural machine translation by jointly learning to align and translate, International Conference on Learning Representations (ICLR), 2015. ,
Nmtpy: A flexible toolkit for advanced neural machine translation systems, 2017. ,
The kaldi speech recognition toolkit, Proceedings of IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU), 2011. ,
Improving wfst-based g2p conversion with alignment constraints and rnnlm n-best rescoring, Proceedings of InterSpeech, 2012. ,
Failure transitions for joint n-gram models and g2p conversion, Proceedings of InterSpeech, 2013. ,
Srilm ? an extensible language modeling toolkit, Proceedings of InterSpeech, 2002. ,
Srilm at sixteen: Update and outlook, Proceedings of IEEE Workshop on Automatic Speech Recognition and Understanding, 2011. ,