, References

J. Glass, Towards unsupervised speech processing, 2012 11th International Conference on Information Science, Signal Processing and their Applications (ISSPA), pp.1-4, 2012.
DOI : 10.1109/ISSPA.2012.6310546

A. Jansen, E. Dupoux, S. Goldwater, M. Johnson, S. Khudanpur et al., A summary of the 2012 JH CLSP Workshop on zero resource speech technologies and models of early language acquisition, Proc. ICASSP, 2013.

E. Dunbar, X. Nga-cao, J. Benjumea, J. Karadayi, M. Bernard et al., The zero resource speech challenge 2017, 2017 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), p.2017
DOI : 10.1109/ASRU.2017.8268953

URL : https://hal.archives-ouvertes.fr/hal-01687504

L. Besacier, B. Zhou, and Y. Gao, TOWARDS SPEECH TRANSLATION OF NON WRITTEN LANGUAGES, 2006 IEEE Spoken Language Technology Workshop, pp.222-225, 2006.
DOI : 10.1109/SLT.2006.326795

L. Duong, A. Anastasopoulos, D. Chiang, S. Bird, and T. Cohn, An Attentional Model for Speech Translation Without Transcription, Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pp.949-959, 2016.
DOI : 10.18653/v1/N16-1109

E. Dupoux, Cognitive science in the era of artificial intelligence: A roadmap for reverse-engineering the infant language-learner, Cognition, vol.173, pp.43-59, 2018.
DOI : 10.1016/j.cognition.2017.11.008

URL : https://hal.archives-ouvertes.fr/hal-01888694

G. Adda, S. Stüker, M. Adda-decker, O. Ambouroue, L. Besacier et al., Breaking the Unwritten Language Barrier: The BULB Project, Procedia Computer Science, vol.81, pp.8-14, 2016.
DOI : 10.1016/j.procs.2016.04.023

URL : https://hal.archives-ouvertes.fr/halshs-01428027

D. Blachon, E. Gauthier, L. Besacier, G. Kouarata, M. Adda-decker et al., Parallel Speech Collection for Under-resourced Language Studies Using the Lig-Aikuma Mobile Device App, Procedia Computer Science, vol.81, pp.61-66, 2016.
DOI : 10.1016/j.procs.2016.04.030

URL : https://hal.archives-ouvertes.fr/hal-01350065

M. Z. Boito, A. Berard, A. Villavicencio, and L. Besacier, Unwritten languages demand attention too! Word discovery with encoder-decoder models, 2017 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), p.2017
DOI : 10.1109/ASRU.2017.8268972

URL : https://hal.archives-ouvertes.fr/hal-01592091

S. Goldwater, T. L. Griffiths, and M. Johnson, A Bayesian framework for word segmentation: Exploring the effects of context, Cognition, vol.112, issue.1, pp.21-54, 2009.
DOI : 10.1016/j.cognition.2009.03.008

A. Bérard, O. Pietquin, C. Servan, and L. Besacier, Listen and translate: A proof of concept for end-to-end speech-to-text translation, NIPS workshop on End-to-end Learning for Speech and Audio Processing, 2016.

R. J. Weiss, J. Chorowski, N. Jaitly, Y. Wu, and Z. Chen, Sequence-to-sequence models can directly transcribe foreign speech, 2017.
DOI : 10.21437/interspeech.2017-503

URL : http://arxiv.org/pdf/1703.08581

P. Godard, G. Adda, M. Adda-decker, J. Benjumea, L. Besacier et al., A Very Low Resource Language Speech Corpus for Computational Language Documentation Experiments, Proc. LREC, 2018.
URL : https://hal.archives-ouvertes.fr/hal-01807093

D. Bahdanau, K. Cho, and Y. Bengio, Neural machine translation by jointly learning to align and translate, Proc. ICLR, 2015.

L. Ondel, L. Burget, and J. Cernock, Variational Inference for Acoustic Unit Discovery, Procedia Computer Science, vol.81, pp.80-86, 2016.
DOI : 10.1016/j.procs.2016.04.033

URL : https://doi.org/10.1016/j.procs.2016.04.033

Y. W. Teh and M. I. Jordan, Hierarchical Bayesian nonparametric models with applications, Bayesian Nonparametrics: Principles and Practice, 2010.
DOI : 10.1017/CBO9780511802478.006

URL : http://www.stat.berkeley.edu/tech-reports/770.pdf

K. Kurihara, M. Welling, and Y. W. Teh, Collapsed variational Dirichlet process mixture models, Proc. IJCAI, pp.2796-2801, 2007.

M. Johnson, Composing graphical models with neural networks for structured representations and fast inference, Advances in Neural Information Processing Systems, pp.2946-2954, 2016.

D. Kingma and M. Welling, Auto-encoding variational Bayes, Proc. ICLR, Banff, 2014.

M. Hoffman, Stochastic variational inference, Journal of Machine Learning Research, vol.14, pp.1303-1347, 2013.

F. Grézl and M. Karafiát, Adapting multilingual neural network hierarchy to a new language, Proc. SLTU, pp.39-45, 2014.

B. Ludusan, M. Versteegh, A. Jansen, G. Gravier, X. Cao et al., Bridging the gap between speech technology and natural language processing: an evaluation toolbox for term discovery systems, Proc. LREC, 2014.
URL : https://hal.archives-ouvertes.fr/hal-01026368

A. Jansen and B. Van-durme, Efficient spoken term discovery using randomized algorithms, 2011 IEEE Workshop on Automatic Speech Recognition & Understanding, pp.401-406, 2011.
DOI : 10.1109/ASRU.2011.6163965

URL : http://www.cs.jhu.edu/%7Evandurme/papers/JansenVanDurmeASRU11.pdf

L. Ondel, P. Godard, L. Besacier, E. Larsen, M. Hasegawa-johnson et al., Bayesian Models for Unit Discovery on a Very Low Resource Language, Proc. ICASSP, 2018.
URL : https://hal.archives-ouvertes.fr/hal-01709589

C. Lee, T. J. Donnell, and J. Glass, Unsupervised lexicon discovery from acoustic input, Transactions of the Association for Computational Linguistics, vol.3, pp.389-403, 2015.

C. Bartels, W. Wang, V. Mitra, C. Richey, A. Kathol et al., Toward human-assisted lexical unit discovery without text resources, 2016 IEEE Spoken Language Technology Workshop (SLT), pp.2016-64
DOI : 10.1109/SLT.2016.7846246

M. Elsner, S. Goldwater, N. Feldman, and F. Wood, A joint learning model of word segmentation, lexical acquisition, and phonetic variability, Proc. EMNLP. Association for Computational Linguistics, pp.42-54, 2013.

P. Godard, G. Adda, M. Adda-decker, A. Allauzen, L. Besacier et al., Preliminary Experiments on Unsupervised Word Discovery in Mboshi, Interspeech 2016, 2016.
DOI : 10.21437/Interspeech.2016-886

URL : https://hal.archives-ouvertes.fr/hal-01350119

S. Stüker, Towards human translations guided language discovery for ASR systems, Proc. SLTU, 2008.

S. Stüker, L. Besacier, and A. Waibel, Human Translations Guided Language Discovery for ASR Systems, Proc. Interspeech . Brighton (UK): Eurasip, pp.1-4, 2009.

F. Stahlberg, T. Schlippe, S. Vogel, and T. Schultz, Word segmentation through cross-lingual word-to-phoneme alignment, 2012 IEEE Spoken Language Technology Workshop (SLT), pp.2012-85
DOI : 10.1109/SLT.2012.6424202

A. Anastasopoulos, S. Bansal, D. Chiang, S. Goldwater, and A. Lopez, Spoken Term Discovery for Language Documentation using Translations, Proceedings of the Workshop on Speech-Centric Natural Language Processing, pp.53-58, 2017.
DOI : 10.18653/v1/W17-4607

URL : https://doi.org/10.18653/v1/w17-4607