The Zero Resource Speech Challenge 2015: Proposed approaches and results, Procedia Computer Science, vol.81, pp.67-72, 2016. ,
The Zero Resource Speech Challenge, pp.323-330, 2017. ,
URL : https://hal.archives-ouvertes.fr/hal-01687504
Automatic discovery of a phonetic inventory for unwritten languages for statistical speech synthesis, IEEE International Conference on Acoustics, Speech and Signal Processing, pp.2594-2598, 2014. ,
Linguistic unit discovery from multi-modal inputs in unwritten languages: Summary of the, pp.4979-4983 ,
URL : https://hal.archives-ouvertes.fr/hal-01709578
Variational inference for acoustic unit discovery, SLTU, ser. Procedia Computer Science, vol.81, pp.80-86, 2016. ,
Merlin: An open source neural network speech synthesis system, Speech Synthesis Workshop. ISCA, pp.202-207, 2016. ,
An autoencoder based approach to unsupervised learning of subword units, ICASSP, pp.7634-7638, 2014. ,
Partitioning of posteriorgrams using siamese models for unsupervised acoustic modelling, International Workshop on Grounding Language Understanding (GLU) ,
Feature optimized DPGMM clustering for unsupervised subword modeling: A contribution to ZeroSpeech 2017, pp.740-746, 2017. ,
Wavenet: A generative model for raw audio, SSW. ISCA, p.125, 2016. ,
SampleRNN: An unconditional end-to-end neural audio generation model, CoRR, 2016. ,
Natural TTS synthesis by conditioning wavenet on MEL spectrogram predictions, ICASSP, pp.4779-4783, 2018. ,
Deep voice 3: 2000-speaker neural text-to-speech, CoRR, 2017. ,
Close to human quality TTS with transformer, CoRR, 2018. ,
Listening while speaking: Speech chain by deep learning, pp.301-308, 2017. ,
Parallel-data-free voice conversion using cycle-consistent adversarial networks, CoRR, 2017. ,
Voice conversion from non-parallel corpora using variational auto-encoder, Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, pp.1-6, 2016. ,
Multi-target voice conversion without parallel data by adversarially learning disentangled audio representations, CoRR, 2018. ,
Voice impersonation using generative adversarial networks, ICASSP, pp.2506-2510, 2018. ,
Neural discrete representation learning, Advances in Neural Information Processing Systems, pp.6306-6315, 2017. ,
Unsupervised speech representation learning using wavenet autoencoders, 2019. ,
Development of HMM-based Indonesian speech synthesis, Proc. Oriental COCOSDA, pp.215-219, 2008. ,
Development of Indonesian large vocabulary continuous speech recognition system within A-STAR project, Proceedings of the Workshop on Technologies and Corpora for AsiaPacific Speech Translation (TCAST), 2008. ,
Bayesian models for unit discovery on a very low resource language, 2018 IEEE International Conference on Acoustics, Speech and Signal Processing, pp.5939-5943, 2018. ,
URL : https://hal.archives-ouvertes.fr/hal-01709589
The Kaldi speech recognition toolkit, IEEE Signal Processing Society, Tech. Rep, 2011. ,
Zero Resource Speech Synthesis Using Transcripts Derived from Perceptual Acoustic Units, 2019. ,
Combining Adversarial Training and Disentangled Speech Representation for Robust Zero-Resource Subword Modeling, 2019. ,
Unsupervised End-to-End Learning of Discrete Linguistic Units for Voice Conversion, 2019. ,
Unsupervised acoustic unit discovery for speech synthesis using discrete latent-variable neural networks, 2019. ,
Virtual Phone Discovery for Speech Synthesis, 2019. ,
,
Temporally-Aware Acoustic Unit Discovery for Zerospeech 2019 Challenge, 2019. ,
VQVAE Unsupervised Unit Discovery and Multi-scale Code2Spec Inverter for Zerospeech Challenge, 2019. ,