LIUM ASR systems for the 2016 Multi-Genre Broadcast Arabic Challenge

Abstract : This paper describes the automatic speech recognition (ASR) systems developed by LIUM in the framework of the 2016 Multi-Genre Broadcast (MGB-2) Challenge in the Arabic language. LIUM participated in the first of the two proposed tasks, namely the speech-to-text transcription of Aljazeera recordings. We present the approaches and details found in our systems, as well as our results in the evaluation campaign: the primary LIUM ASR system attained the second position. The main aspects come from the use of GMM-derived features for training a DNN, combined with the use of time-delay neural networks for acoustic models, the use of two different approaches in order to automatically phonetize Arabic words, and finally, the training data selection strategy for acoustic and language models.
Type de document :
Communication dans un congrès
IEEE Workshop on Spoken Language Technology, Dec 2016, San Diego, CA, USA, United States. 2016, 〈http://www.slt2016.org〉. 〈10.1109/SLT.2016.7846278〉
Liste complète des métadonnées

https://hal.archives-ouvertes.fr/hal-01433188
Contributeur : Sylvain Meignier <>
Soumis le : jeudi 12 janvier 2017 - 15:13:20
Dernière modification le : lundi 9 avril 2018 - 15:35:46

Identifiants

Collections

Citation

Natalia Tomashenko, Kévin Vythelingum, Anthony Rousseau, Yannick Estève. LIUM ASR systems for the 2016 Multi-Genre Broadcast Arabic Challenge. IEEE Workshop on Spoken Language Technology, Dec 2016, San Diego, CA, USA, United States. 2016, 〈http://www.slt2016.org〉. 〈10.1109/SLT.2016.7846278〉. 〈hal-01433188〉

Partager

Métriques

Consultations de la notice

153