W. Xiong, J. Droppo, X. Huang, F. Seide, M. Seltzer et al., Achieving human parity in conversational speech recognition, 2016.

G. Saonn, G. Kurata, T. Sercu, K. Audhkhasi, S. Thomas et al., English conversational telephone speech recognition by humans and machines, 2017.

J. Godfrey, E. Holliman, and J. Mcdaniel, SWITCHBOARD: Telephone speech corpus for research and development, Proc. ICASSP, vol.1, pp.517-520, 1992.

E. Delpech, M. Laignelet, C. Pimm, C. Raynal, M. Trzos et al., A real-life, french-accented corpus of Air Traffic Control communications, Proc. LREC, pp.2866-2870, 2018.
URL : https://hal.archives-ouvertes.fr/hal-01725882

R. , Speech in action: Fluency for Air Traffic Control, 2007.

R. Manual-of,

J. C. Segura, T. Ehrette, A. Potamianos, D. Fohr, I. Illina et al., The HIWIRE database, a noisy and non-native English speech corpus for cockpit communication, Web Download, 2007.

S. Pigeon, W. Shen, and D. Van-leeuwen, Design and characterization of the non-native military air traffic communications database (nnMATC), Proc. Interspeech, pp.2417-2420, 2007.

L. Graglia, B. Favennec, and A. Arnoux, Vocalise: Assessing the impact of data link technology on the r/t channel, The 24th Digital Avionics Systems Conference, vol.1, 2005.

S. Lopez, A. Condamines, A. Josselin-leray, M. Odonoghue, and R. Salmon, Linguistic analysis of english phraseology and plain language in air-ground communication, Journal of Air Transport Studies, vol.4, issue.1, pp.44-60, 2013.
URL : https://hal.archives-ouvertes.fr/halshs-00924821

K. Hofbauer, S. Petrik, and H. Hering, The ATCOSIM corpus of non-prompted clean Air Traffic Control speech, Proc. LREC, pp.2147-2152, 2008.

J. Godfrey, Air traffic control complete LDC94S14A, Linguistic Data Consortium, 1994.

L. ?mídl and P. Ircing, Air Traffic Control communications (ATCC) speech corpus, Web Download, 2014.

M. Adda-decker and L. Lamel, Do speech recognizers prefer female speakers, Proc. Interspeech, pp.2205-2208, 2005.

H. Hermansky, Perceptual linear predictive (PLP) analysis of speech, the Journal of the Acoustical Society of America, vol.87, issue.4, pp.1738-1752, 1990.

H. Hermansky and N. Morgan, RASTA processing of speech, IEEE transactions on speech and audio processing, vol.2, issue.4, pp.578-589, 1994.

A. Graves, S. Fernández, F. Gomez, and J. Schmidhuber, Connectionist temporal classification: labelling unsegmented sequence data with recurrent neural networks, Proc. ICML, pp.369-376, 2006.

A. Waibel, T. Hanazawa, G. Hinton, K. Shikano, and K. J. Lang, Phoneme recognition using time-delay neural networks, IEEE Transactions on Acoustics, Speech, and Signal Processing, vol.37, issue.3, pp.328-339, 1989.

D. Povey, A. Ghoshal, G. Boulianne, L. Burget, O. Glembek et al., The kaldi speech recognition toolkit, IEEE 2011 Workshop on Automatic Speech Recognition and Understanding, 2011.

V. Gupta and G. Boulianne, CRIM's system for the MGB-3 English multi-genre broadcast media transcription, Proc. Interspeech, pp.2653-2657, 2018.

D. Povey, G. Cheng, Y. Wang, K. Li, H. Xu et al., Semi-orthogonal low-rank matrix factorization for deep neural networks, Proc. Interspeech, pp.3743-3747, 2018.

L. ?mídl, J. ?vec, A. Pra?ák, and J. Trmal, Semi-supervised training of DNN-based acoustic model for ATC speech recognition, Proc. SPECOM, pp.646-655, 2018.