The information bottleneck method, The 37th annual Allerton Conference on Communication, Control, and Computing, pp.368-377, 1999. ,
The language-independent bottleneck features, Proc. Spoken Language Technology Workshop (SLT), pp.336-341, 2012. ,
Improved bottleneck features using pretrained deep neural networks, 2011. ,
Auto-encoding variational bayes, 2013. ,
, Neural Discrete Representation Learning, 2017.
Unsupervised speech representation learning using wavenet autoencoders, Speech, and Language Processing, vol.27, pp.2041-2053, 2019. ,
The Zero Resource Speech Challenge 2019: TTS Without T, Proc. Interspeech, pp.1088-1092, 2019. ,
URL : https://hal.archives-ouvertes.fr/hal-02274112
VQVAE Unsupervised Unit Discovery and Multi-Scale Code2Spec Inverter for Zerospeech Challenge 2019, Proc. Interspeech, pp.1118-1122, 2019. ,
Unsupervised Acoustic Unit Discovery for Speech Synthesis Using Discrete Latent-Variable Neural Networks, Proc. Interspeech, pp.1103-1107, 2019. ,
Categorical Reparameterization with Gumbel-Softmax, 2016. ,
Estimating or Propagating Gradients Through Stochastic Neurons, 2013. ,
Unsupervised neural segmentation and clustering for unit discovery in sequential data, Perception as Generative Reasoning Workshop, 2019. ,
URL : https://hal.archives-ouvertes.fr/hal-02399138
Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift, 2015. ,
Long short-term memory recurrent neural network architectures for large scale acoustic modeling, INTERSPEECH 2014, 15th Annual Conference of the International Speech Communication Association, pp.338-342, 2014. ,
K-means++: The advantages of careful seeding, Proceedings of the Eighteenth Annual ACM-SIAM Symposium on Discrete Algorithms, pp.1027-1035, 2007. ,
Random sampling with a reservoir, ACM Trans. Math. Softw, vol.11, issue.1, pp.37-57, 1985. ,
Arvind Neelakantan, and Niki Parmar, Theory and Experiments on Vector Quantized Autoencoders, 2018. ,
Generating Diverse High-Fidelity Images with VQ-VAE-2, 2019. ,
The "ScribbleLens" Dutch historical handwriting corpus, Under review for: International Conference on Frontiers of Handwriting Recognition (ICFHR), 2020. ,
Deep speech 2 : End-to-end speech recognition in english and mandarin, Proceedings of The 33rd International Conference on Machine Learning, vol.48, pp.173-182, 2016. ,
Connectionist temporal classification: Labelling unsegmented sequence data with recurrent neural networks, Proceedings of the 23rd International Conference on Machine Learning, pp.369-376, 2006. ,
The kaldi speech recognition toolkit, IEEE 2011 Workshop on Automatic Speech Recognition and Understanding, p.11, 2011. ,
, Open Speech and Language Resources
Pixel recurrent neural networks, Proceedings of the 33rd International Conference on International Conference on Machine Learning, vol.48, pp.1747-1756, 2016. ,
Adam: A Method for Stochastic Optimization, 2014. ,
Acceleration of stochastic approximation by averaging, SIAM journal on control and optimization, vol.30, issue.4, pp.838-855, 1992. ,
Emergence of simple-cell receptive field properties by learning a sparse code for natural images, Nature, vol.381, issue.6583, pp.607-609, 1996. ,
Learning the parts of objects by non-negative matrix factorization, Nature, vol.401, issue.6755, pp.788-791, 1999. ,
Deep Variational Information Bottleneck, 2016. ,
, Variational Information Bottleneck on Vector Quantized Autoencoders, 2018.
Simple statistical gradient-following algorithms for connectionist reinforcement learning, Machine Learning, vol.8, issue.3-4, pp.229-256, 1992. ,
REBAR: Low-variance, unbiased gradient estimates for discrete latent variable models, Advances in Neural Information Processing Systems, vol.30, pp.2627-2636, 2017. ,
Continuous relaxation training of discrete latent variable image models ,
A Nonparametric Bayesian Approach to Acoustic Model Discovery, Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, vol.1, pp.40-49, 2012. ,
Variational Inference for Acoustic Unit Discovery, Procedia Computer Science, vol.81, pp.80-86, 2016. ,
Hidden Markov Model Variational Autoencoder for Acoustic Unit Discovery, pp.488-492, 2017. ,
Full Bayesian Hidden Markov Model Variational Autoencoder for Acoustic Unit Discovery, pp.2688-2692, 2018. ,
Composing graphical models with neural networks for structured representations and fast inference, 2016. ,