A. D. Références, S. Ananthanarayanan, . Anubhai-r, J. Bai, E. Battenberg et al., Deep speech 2 : End-to-end speech recognition in english and mandarin, Proceedings of ICML'16, pp.173-182, 2016.

J. M. Ben, O. Galibert, . Adda-decker-m.-&-rosset-s.-;-caubrière-a, N. Tomashenko, . Laurent-a et al., How to evaluate asr output for named entity recognition ? In Interspeech, 2015.

, Curriculum-based transfer learning for an effective end-to-end spoken language understanding and domain portability, Interspeech

P. Deléglise, Y. Esteve, and . Meignier-s.-&-merlin-t, Improvements to the lium french asr system based on cmu sphinx : what helps to significantly reduce the word error rate ? In Interspeech, 2009.

. Galibert-o, J. Leixa, G. Adda, . Choukri-k.-&-gravier-g.-;-ghannay-s, . Caubrière-a et al., Connectionist temporal classification : labelling unsegmented sequence data with recurrent neural networks, Language Resources Evaluation Conference (LREC), 2006.

G. Gravier, G. Adda, N. Paulsson, M. Carré, and . Giraudel-a.-&-galibert-o, The etape corpus for the evaluation of speech-based tv content processing in the french language, LREC, 2012.
URL : https://hal.archives-ouvertes.fr/hal-00712591

C. Grouin, S. Rosset, P. Zweigenbaum, . Fort-k, and . Galibert-o.-&-quintard-l, , 2011.

, Proposal for an extension of traditional named entities : From guidelines to evaluation, an overview, Linguistic Annotation Workshop, pp.92-100

J. M. , Pcfg models of linguistic tree representations, Computational Linguistics, 1998.

J. Lafferty, . Mccallum-a, and . C. Pereira-f, Conditional random fields : Probabilistic models for segmenting and labeling sequence data, ICML, 2001.

G. Lample, M. Ballesteros, . Subramanian-s, and . Kawakami-k.-&-dyer-c, Neural architectures for named entity recognition, 2016.

. Lavergne-t, . &. Cappé-o, and . Yvon-f, Practical Very Large Scale CRFs, Annual Meeting of the Association for Computational Linguistics, pp.504-513, 2010.

. Lugosch-l, M. Ravanelli, P. Ignoto, . S. Tomar-v, and . Bengio-y, Speech model pre-training for end-to-end spoken language understanding, 2019.

M. X. Hovy-e, End-to-end sequence labeling via bi-directional lstm-cnns-crf, 2016.

J. Makhoul, F. Kubala, and . Schwartz-r.-&-weischedel-r, Performance measures for information extraction, DARPA Broadcast News Workshop, pp.249-252, 1999.

R. C. Vancouver, C. Rosset-s, and . Grouin-c.-&-zweigenbaum-p, Robust tree-structured named entities recognition from speech, Proceedings of the International Conference on Acoustic Speech and Signal Processing, 2011.

. Tomashenko-n, K. Vythelingum, and . Rousseau-a.-&-estève-y, Lium asr systems for the 2016 multi-genre broadcast arabic challenge, IEEE Spoken Language Technology Workshop, 2016.