Z. Elloumi, L. Besacier, O. Galibert, J. Kahn, and B. Lecouteux, Asr performance prediction on unseen broadcast programs using convolutional neural networks, IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2018.
URL : https://hal.archives-ouvertes.fr/hal-02102831

M. Negri, M. Turchi, J. Souza, and D. Falavigna, Quality estimation for automatic speech recognition, COLING, pp.1813-1823, 2014.

J. Souza, C. Buck, M. Turchi, and M. Negri, Fbk-uedin participation to the wmt13 quality estimation shared task, Proceedings of the eighth workshop on statistical machine translation, pp.352-358, 2013.

S. Jalalvand, M. Negri, F. Daniele, and M. Turchi, Driving rover with segment-based asr quality estimation, Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing, vol.1, pp.1095-1105, 2015.

H. José-gc-de-souza, M. Zamani, M. Negri, F. Turchi, and . Daniele, Multitask learning for adaptive quality estimation of automatically transcribed utterances, Proceedings of the 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pp.714-724, 2015.

S. Jalalvand, M. Negri, M. Turchi, J. Souza, D. Falavigna et al., Transcrater: a tool for automatic speech recognition quality estimation, Proceedings of ACL-2016 System Demonstrations, pp.43-48, 2016.

G. Gravier, G. Adda, N. Paulson, M. Carré, A. Giraudel et al., The etape corpus for the evaluation of speech-based tv content processing in the french language, LRECEighth international conference on Language Resources and Evaluation, p.p. na, 2012.
URL : https://hal.archives-ouvertes.fr/hal-00712591

S. Galliano, E. Geoffrois, D. Mostefa, K. Choukri, J. Bonastre et al., The ester phase ii evaluation campaign for the rich transcription of french broadcast news.," in Interspeech, pp.1149-1152, 2005.

J. Kahn, O. Galibert, L. Quintard, M. Carré, A. Giraudel et al., A presentation of the repere challenge, Content-Based Multimedia Indexing (CBMI), 2012 10th International Workshop on, pp.1-6, 2012.

D. Povey, A. Ghoshal, G. Boulianne, L. Burget, O. Glembek et al., The kaldi speech recognition toolkit, IEEE 2011 workshop on automatic speech recognition and understanding, 2011.

A. Stolcke, Srilm-an extensible language modeling toolkit, Interspeech, p.2002, 2002.

M. De, C. , and G. Pérennou, Bdlex: a lexicon for spoken and written french, Proceedings of 1st International Conference on Langage Resources & Evaluation, pp.1129-1136, 1998.

O. Galibert, Methodologies for the evaluation of speaker diarization and automatic speech recognition in the presence of overlapping speech, pp.1131-1134, 2013.

P. Geurts, D. Ernst, and L. Wehenkel, Extremely randomized trees, Machine learning, vol.63, issue.1, pp.3-42, 2006.
DOI : 10.1007/s10994-006-6226-1

URL : https://hal.archives-ouvertes.fr/hal-00341932

N. Meinshausen and P. Bühlmann, Stability selection, Journal of the Royal Statistical Society: Series B (Statistical Methodology), vol.72, issue.4, pp.417-473, 2010.
DOI : 10.1111/j.1467-9868.2010.00740.x

URL : https://rss.onlinelibrary.wiley.com/doi/pdf/10.1111/j.1467-9868.2010.00740.x

H. Schmid, Treetagger-a language independent part-of-speech tagger, Institut für Maschinelle Sprachverarbeitung, Universität Stuttgart, vol.43, p.28, 1995.

Y. Kim, Convolutional neural networks for sentence classification, 2014.
DOI : 10.3115/v1/d14-1181

URL : https://doi.org/10.3115/v1/d14-1181

W. Dai, C. Dai, S. Qu, J. Li, and S. Das, Very deep convolutional neural networks for raw waveforms, Acoustics, Speech and Signal Processing (ICASSP, pp.421-425, 2017.

F. Chollet, Keras, 2015.