L. Qin and A. Rudnicky, OOV word detection using hybrid models with mixed types of fragments, Interspeech, pp.2450-2453, 2012.

W. Chen, S. Ananthakrishnan, R. Prasad, and P. Natarajan, Variable-span out-of-vocabulary named entity detection, Interspeech, pp.3761-3765, 2013.

J. Li, G. Ye, R. Zhao, J. Droppo, and Y. Gong, Acoustic-To-Word Model Without OOV, 2017.

M. Sun, Y.-N. Chen, and A. I. Rudnicky, Learning OOV through semantic relatedness in spoken dialog systems, pp.1453-1457, 2015.

P. Maergner, A. Waibel, and I. Lane, Unsupervised vocabulary selection for real-time speech recognition of lectures, IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp.4417-4420, 2012.

S. Oger, G. Linares, F. Bechet, and P. Nocera, On-demand new word learning using world wide web, ICASSP, 2008.
URL : https://hal.archives-ouvertes.fr/hal-01319857

K. Ohtsuki, N. Hiroshima, M. Oku, and A. Imamura, Unsupervised vocabulary expansion for automatic transcription of broadcast news, 2005.

D. M. Blei, A. Y. Ng, and M. I. Jordan, Latent Dirichlet allocation, Journal of Machine Learning Research, vol.3, pp.993-1022, 2003.

M. Iyyer, V. Manjunatha, J. Boyd-Graber, and H. Daumé III, Deep unordered composition rivals syntactic methods for text classification, Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics, vol.1, pp.1681-1691, 2015.

I. Sheikh, I. Illina, D. Fohr, and G. Linares, Improved neural bag-of-words model to retrieve out-of-vocabulary words in speech recognition, Interspeech, pp.675-679, 2016.

I. Sheikh, D. Fohr, I. Illina, and G. Linares, Modelling Semantic Context of OOV Words in Large Vocabulary Continuous Speech Recognition, IEEE/ACM Transactions on Audio, Speech and Language Processing, vol.25, issue.3, pp.598-610, 2017.
URL : https://hal.archives-ouvertes.fr/hal-01461617

I. Sheikh, I. Illina, D. Fohr, and G. Linares, Document level semantic context for retrieving OOV proper names, IEEE International Conference on Acoustics, Speech and Signal Processing, pp.6050-6054, 2016.
URL : https://hal.archives-ouvertes.fr/hal-01331716

Q. Le and T. Mikolov, Distributed representations of sentences and documents, International Conference on Machine Learning, pp.1188-1196, 2014.

T. Mikolov, K. Chen, G. Corrado, and J. Dean, Efficient estimation of word representations in vector space, 2013.

Y. Kim, Convolutional Neural Networks for Sentence Classification, EMNLP, 2014.

Y. Bengio, P. Simard, and P. Frasconi, Learning long-term dependencies with gradient descent is difficult, IEEE Transactions on Neural Networks, pp.157-166, 1994.

R. Jozefowicz, W. Zaremba, and I. Sutskever, An empirical exploration of recurrent network architectures, International Conference on Machine Learning, pp.2342-2350, 2015.

K. Cho, B. van Merriënboer, C. Gülçehre, D. Bahdanau, F. Bougares et al., Learning Phrase Representations using RNN Encoder-Decoder for Statistical Machine Translation, Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP), 2014.
URL : https://hal.archives-ouvertes.fr/hal-01433235

A. Graves, A. Mohamed, and G. Hinton, Speech recognition with deep recurrent neural networks, IEEE International Conference on Acoustics, Speech and Signal Processing, pp.6645-6649, 2013.

A. Allauzen and H. Bonneau-Maynard, Training and evaluation of POS taggers on the French MULTITAG corpus, Proceedings of the Sixth International Conference on Language Resources and Evaluation (LREC), 2008.

I. Sheikh, I. Illina, and D. Fohr, How diachronic text corpora affect context based retrieval of OOV proper names for audio news, Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC), pp.3851-3855, 2016.
URL : https://hal.archives-ouvertes.fr/hal-01331714

J. Weston, S. Chopra, and K. Adams, #TagSpace: Semantic embeddings from hashtags, Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP), pp.1822-1827, 2014.

C. D. Manning, P. Raghavan, and H. Schütze, Introduction to Information Retrieval, 2008.

D. Povey,

M. Bisani and H. Ney, Joint-sequence models for grapheme-to-phoneme conversion, Speech Communication, vol.50, issue.5, pp.434-451, 2008.
URL : https://hal.archives-ouvertes.fr/hal-00499203

I. Illina, D. Fohr, and D. Jouvet, Multiple Pronunciation Generation using Grapheme-to-Phoneme Conversion based on Conditional Random Fields, XIV International Conference «Speech and Computer» SPECOM, 2011.
URL : https://hal.archives-ouvertes.fr/inria-00616325

P. Bojanowski, E. Grave, A. Joulin, and T. Mikolov, Enriching Word Vectors with Subword Information, Transactions of the Association for Computational Linguistics, vol.5, issue.1, 2017.

D. P. Kingma and J. Ba, Adam: A method for stochastic optimization, Proceedings of the 3rd International Conference on Learning Representations (ICLR), 2015.