J. Barcenilla and J. M. Bastien, L'acceptabilité des nouvelles technologies: quelles relations avec l'ergonomie, l'utilisabilité et l'expérience utilisateur?, Le travail humain, vol.72, issue.4, pp.311-331, 2009.

H. Jiang, Confidence measures for speech recognition: A survey, Speech Communication, vol.45, issue.4, pp.455-470, 2005.

S. Ghannay, Y. Esteve, and N. Camelin, Word embeddings combination and neural networks for robustness in ASR error detection, IEEE 23rd European Signal Processing Conference (EUSIPCO), pp.1671-1675, 2015.
URL : https://hal.archives-ouvertes.fr/hal-01433210

B. T. Meyer, S. H. Mallidi, H. Kayser, and H. Hermansky, Predicting error rates for unknown data in automatic speech recognition, IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp.5330-5334, 2017.

A. Ali and S. Renals, Word error rate estimation for speech recognition: e-WER, Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, vol.2, pp.20-24, 2018.

H. Hermansky, E. Variani, and V. Peddinti, Mean temporal distance: Predicting ASR error from temporal properties of speech signal, IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp.7423-7426, 2013.

A. Varga and H. J. Steeneken, Assessment for automatic speech recognition: II. NOISEX-92: A database and an experiment to study the effect of additive noise on speech recognition systems, Speech Communication, vol.12, issue.3, pp.247-251, 1993.

D. Wang, On ideal binary mask as the computational goal of auditory scene analysis, Speech separation by humans and machines, pp.181-197, 2005.

E. Zwicker and H. Fastl, Psychoacoustics: Facts and Models, Springer Series in Information Sciences, 1990.

J. Garofolo, D. Graff, D. Paul, and D. Pallett, CSR-I (WSJ0) Complete LDC93S6A, DVD. Philadelphia: Linguistic Data Consortium, 1993.

J. Garofolo, D. Graff, D. Paul, and D. Pallett, CSR-II (WSJ1) Complete, 1994.

K. Veselý, A. Ghoshal, L. Burget, and D. Povey, Sequence-discriminative training of deep neural networks, Interspeech, vol.2013, pp.2345-2349, 2013.

J. Thiemann, N. Ito, and E. Vincent, DEMAND: a collection of multi-channel recordings of acoustic noise in diverse environments, Proc. Meetings Acoust, 2013.