Achieving human parity in conversational speech recognition, 2016.
English conversational telephone speech recognition by humans and machines, 2017.
SWITCHBOARD: Telephone speech corpus for research and development, Proc. ICASSP, vol.1, pp.517-520, 1992.
A real-life, French-accented corpus of Air Traffic Control communications, Proc. LREC, pp.2866-2870, 2018.
URL : https://hal.archives-ouvertes.fr/hal-01725882
Speech in action: Fluency for Air Traffic Control, 2007.
The HIWIRE database, a noisy and non-native English speech corpus for cockpit communication, Web Download, 2007.
Design and characterization of the non-native military air traffic communications database (nnMATC), Proc. Interspeech, pp.2417-2420, 2007.
Vocalise: Assessing the impact of data link technology on the R/T channel, The 24th Digital Avionics Systems Conference, vol.1, 2005.
Linguistic analysis of English phraseology and plain language in air-ground communication, Journal of Air Transport Studies, vol.4, issue.1, pp.44-60, 2013.
URL : https://hal.archives-ouvertes.fr/halshs-00924821
The ATCOSIM corpus of non-prompted clean Air Traffic Control speech, Proc. LREC, pp.2147-2152, 2008.
Air traffic control complete LDC94S14A, Linguistic Data Consortium, 1994.
Air Traffic Control communications (ATCC) speech corpus, Web Download, 2014.
Do speech recognizers prefer female speakers?, Proc. Interspeech, pp.2205-2208, 2005.
Perceptual linear predictive (PLP) analysis of speech, The Journal of the Acoustical Society of America, vol.87, issue.4, pp.1738-1752, 1990.
RASTA processing of speech, IEEE Transactions on Speech and Audio Processing, vol.2, issue.4, pp.578-589, 1994.
Connectionist temporal classification: labelling unsegmented sequence data with recurrent neural networks, Proc. ICML, pp.369-376, 2006.
Phoneme recognition using time-delay neural networks, IEEE Transactions on Acoustics, Speech, and Signal Processing, vol.37, issue.3, pp.328-339, 1989.
The Kaldi speech recognition toolkit, IEEE 2011 Workshop on Automatic Speech Recognition and Understanding, 2011.
CRIM's system for the MGB-3 English multi-genre broadcast media transcription, Proc. Interspeech, pp.2653-2657, 2018.
Semi-orthogonal low-rank matrix factorization for deep neural networks, Proc. Interspeech, pp.3743-3747, 2018.
Semi-supervised training of DNN-based acoustic model for ATC speech recognition, Proc. SPECOM, pp.646-655, 2018.