Skip to Main content Skip to Navigation
Conference papers

LIUM ASR systems for the 2016 Multi-Genre Broadcast Arabic Challenge

Abstract : This paper describes the automatic speech recognition (ASR) systems developed by LIUM in the framework of the 2016 Multi-Genre Broadcast (MGB-2) Challenge in the Arabic language. LIUM participated in the first of the two proposed tasks, namely the speech-to-text transcription of Aljazeera recordings. We present the approaches and details found in our systems, as well as our results in the evaluation campaign: the primary LIUM ASR system attained the second position. The main aspects come from the use of GMM-derived features for training a DNN, combined with the use of time-delay neural networks for acoustic models, the use of two different approaches in order to automatically phonetize Arabic words, and finally, the training data selection strategy for acoustic and language models.
Document type :
Conference papers
Complete list of metadatas

https://hal.archives-ouvertes.fr/hal-01433188
Contributor : Sylvain Meignier <>
Submitted on : Thursday, January 12, 2017 - 3:13:20 PM
Last modification on : Monday, April 9, 2018 - 3:35:46 PM

Identifiers

Collections

Citation

Natalia Tomashenko, Kévin Vythelingum, Anthony Rousseau, Yannick Estève. LIUM ASR systems for the 2016 Multi-Genre Broadcast Arabic Challenge. IEEE Workshop on Spoken Language Technology, Dec 2016, San Diego, CA, USA, United States. ⟨10.1109/SLT.2016.7846278⟩. ⟨hal-01433188⟩

Share

Metrics

Record views

243