S. Ghannay, A. Caubrière, Y. Estève, N. Camelin, E. Simonnet et al., End-to-end named entity and semantic concept extraction from speech, IEEE Spoken Language Technology Workshop, pp.692-699, 2018.
URL : https://hal.archives-ouvertes.fr/hal-01987740

N. Tomashenko, A. Caubrière, and Y. Estève, Investigating adaptation and transfer learning for end-to-end spoken language understanding from speech, 2019.
URL : https://hal.archives-ouvertes.fr/hal-02307811

L. Lugosch, M. Ravanelli, P. Ignoto, V. S. Tomar, and Y. Bengio, Speech model pre-training for end-to-end spoken language understanding, 2019.

P. Wang, L. Wei, Y. Cao, J. Xie, and Z. Nie, Large-scale unsupervised pre-training for end-to-end spoken language understanding, ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp.7999-8003, 2020.

R. Price, End-to-end spoken language understanding without matched language speech model pretraining data, ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp.7979-7983, 2020.

A. Caubrière, N. Tomashenko, A. Laurent, E. Morin, N. Camelin et al., Curriculum-based transfer learning for an effective end-to-end spoken language understanding and domain portability, 2019.

F. Béchet and C. Raymond, Benchmarking benchmarks: introducing new automatic indicators for benchmarking spoken language understanding corpora, 2019.

T. J. Hazen, S. Seneff, and J. Polifroni, Recognition confidence scoring and its use in speech understanding systems, Computer Speech & Language, vol.16, issue.1, pp.49-67, 2002.

D. Hakkani-Tür, F. Béchet, G. Riccardi, and G. Tur, Beyond ASR 1-best: Using word confusion networks in spoken language understanding, Computer Speech & Language, vol.20, issue.4, pp.495-514, 2006.

C. Raymond, Y. Estève, F. Béchet, R. De Mori, and G. Damnati, Belief confirmation in spoken dialog systems using confidence measures, 2003 IEEE Workshop on Automatic Speech Recognition and Understanding, pp.150-155, 2003.
URL : https://hal.archives-ouvertes.fr/hal-01434556

B. Minescu, G. Damnati, F. Béchet, and R. De Mori, Conditional use of word lattices, confusion networks and 1-best string hypotheses in a sequential interpretation strategy, Eighth Annual Conference of the International Speech Communication Association, 2007.
URL : https://hal.archives-ouvertes.fr/hal-01312935

E. Simonnet, S. Ghannay, N. Camelin, Y. Estève, and R. De Mori, ASR error management for improving spoken language understanding, 2017.
URL : https://hal.archives-ouvertes.fr/hal-01526298

A. Caubrière, S. Ghannay, N. Tomashenko, R. De Mori, A. Laurent et al., Error analysis applied to end-to-end spoken language understanding, ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp.8514-8518, 2020.

D. Amodei, S. Ananthanarayanan, R. Anubhai, J. Bai, E. Battenberg et al., Deep speech 2: End-to-end speech recognition in English and Mandarin, International Conference on Machine Learning, pp.173-182, 2016.

A. Graves, S. Fernández, F. Gomez, and J. Schmidhuber, Connectionist temporal classification: labelling unsegmented sequence data with recurrent neural networks, Proceedings of the 23rd international conference on Machine learning, pp.369-376, 2006.

V. Vukotic, C. Raymond, and G. Gravier, Is it time to switch to word embedding and recurrent neural networks for spoken language understanding?, 2015.
URL : https://hal.archives-ouvertes.fr/hal-01196915

L. Devillers, H. Maynard, S. Rosset, P. Paroubek, K. McTait et al., The French MEDIA/EVALDA project: the evaluation of the understanding capability of spoken language dialogue systems, LREC, 2004.

Y. Belinkov and J. Glass, Analyzing hidden representations in end-to-end automatic speech recognition systems, Advances in Neural Information Processing Systems, pp.2441-2451, 2017.

D. P. Kingma and J. Ba, Adam: A method for stochastic optimization, 2014.

G. Evermann and P. Woodland, Posterior probability decoding, confidence estimation and system combination, Proc. Speech Transcription Workshop, vol.27, pp.78-81, 2000.